Martin, Randy – 1988
Reasons for administering tests fall into two categories--decision-making and promoting learning. The two bases of tests are learning objectives and the level of learning at which training is developed. Test development involves a number of steps. The best way to tie objectives to test items is through the use of a table of specifications, which…
Descriptors: Elementary Secondary Education, Item Analysis, Item Banks, Postsecondary Education
Coffman, William E. – 1988
Four papers presented at the National Council on Measurement in Education meeting are critiqued. G. R. Mandeville and H. Heidari (1988), in a study of unusually effective schools, asked if correlations among cohorts of schools differ depending on the method of analysis used. It is suggested that it would be better to ask whether schools identified…
Descriptors: Educational Research, Elementary Secondary Education, Higher Education, Measurement Techniques
Hirst, Cyntha C.; And Others – 1986
The purpose of this project was to develop a motor performance test for preschool children that would be economical to administer, valid, and reliable. Four test items--standing long jump, hopping within an 18-inch square, balancing on one foot, and a timed gross agility task--were selected from published tests to assess children's strength,…
Descriptors: Age Differences, Guidelines, Individualized Education Programs, Physical Education
Brown, William R. – 1988
The evaluation tools written by teachers are rarely valid or reliable. One teaching aid that can help in the creation of an effective evaluation instrument is called a test map. A test map is a systematic method to consider variables that are important in the construction of the format of a test. Five variables that are discussed in the test…
Descriptors: Elementary Secondary Education, Evaluation Methods, Higher Education, Student Evaluation
Coleman, Geraldine J., Ed. – 1983
This pamphlet presents answers to the most frequently asked questions about the Michigan Educational Assessment Program (MEAP). They include questions about the history of MEAP, its costs, subject coverage, test validity, the type of tests, administration of the tests, and use of MEAP results. (BW)
Descriptors: Educational Assessment, Educational Objectives, Educational Testing, Elementary Secondary Education
Ligon, Glynn; Wilkinson, David – 1984
Inspired by four recent decisions to change achievement tests used in the Austin Independent School District, the separate forms used and procedures followed have been combined into a systematic approach intended for use in future achievement test selections. A rating scale (Attachment 1) was developed to expedite a systematic comparison among…
Descriptors: Achievement Tests, Cost Effectiveness, Elementary Secondary Education, Evaluation Criteria
Oosterhof, Albert C.; Salisbury, David F. – 1984
The Assessment Resource Center (ARC) at Florida State University provides computer assisted testing (CAT) for approximately 4,000 students each term. Computer capabilities permit a small proctoring staff to administer tests simultaneously to large numbers of students. Programs provide immediate feedback for students and generate a variety of…
Descriptors: Computer Assisted Testing, Criterion Referenced Tests, Feedback, Higher Education
Rose, Janet S.; Popham, W. James – 1984
This presentation describes the rationale and major steps in the development of the Teacher's Test of Language Skills (TTLS) to be administered to selected certificated teachers in the Charleston County School District, South Carolina. The paper recounts the factors underlying the establishment of the School Board's policy, then traces the major…
Descriptors: Board of Education Policy, Elementary Secondary Education, Political Issues, Reading Skills
Gottfredson, Linda S. – 1983
The Skills Map, a comprehensive classification of occupations based on their competency requirements, was developed to assess the employability of individuals and of various groups of individuals in different types of occupations. The data on which it was based were the ratings of required worker traits as given by the Dictionary of Occupational…
Descriptors: Adults, Aptitude, Career Choice, Career Education
Austin Independent School District, TX. Office of Research and Evaluation. – 1984
Public opinion polls show that most Americans (including a majority of teachers) favor merit pay for teachers. Teachers' and administrators' organizations generally oppose merit pay because there is no fair way to evaluate teachers and because the merit pay issue diverts attention from the fact that all teachers are underpaid. A review of recent…
Descriptors: Elementary Secondary Education, Evaluation Methods, Incentives, Merit Pay
Santa Rosa Junior Coll., CA. – 1984
A study was conducted to compare scores on the main placement tests used at Santa Rosa Junior College (i.e., the Diagnostic Reading Test and pre-calculus and pre-algebra tests) and scores on the American College Testing Service's ASSET battery of tests with student course grades to see if any of the tests acted as a reliable predictor for success.…
Descriptors: Community Colleges, Counseling Effectiveness, Diagnostic Tests, Educational Counseling
Harvill, Leo M. – 1984
The objectives for this study were to: (1) develop a valid, reliable measure of test-wiseness with equivalent forms for use with students in the health sciences; and (2) determine the level of test-wiseness of entering medical students. The test-wiseness areas included in this study were: similar options, umbrella term, item give-away, convergence…
Descriptors: Higher Education, Measurement Techniques, Medical Students, Multiple Choice Tests
MacPhee, David – 1983
As data on the reliability and validity of ratings of infant temperament have accumulated, researchers have begun to ask what caregiver ratings really measure. An argument has been made that ratings of social behavior are less a reflection of enduring individual differences than a measure of rater characteristics and error variance. This study…
Descriptors: Error of Measurement, Experimenter Characteristics, Infants, Knowledge Level
Watson, Betty U.; And Others – 1983
The Hiskey-Nebraska Test of Learning Aptitude (H-NTLA) (Hiskey, 1966) has been regarded by reviewers as one of the best instruments for assessing the learning abilities of hearing-impaired children. However, there has been a paucity of research on the validity of this test. Further, there is no established test-retest reliability, and questions…
Descriptors: Academic Ability, Academic Achievement, Academic Aptitude, Adolescents
Sax, Gilbert; Reiter, Pauline B. – 1980
Despite the popularity of both multiple-choice (MC) and true-false (TF) items, most investigations comparing the two formats have done so to determine the optimum number of choices to be given to students within a given time period. The purpose of this investigation was to compare the reliabilities and the validities of both formats when the items…
Descriptors: Analysis of Variance, Correlation, Higher Education, Item Analysis


