Publication Date
| In 2026 | 0 |
| Since 2025 | 8 |
| Since 2022 (last 5 years) | 36 |
| Since 2017 (last 10 years) | 115 |
| Since 2007 (last 20 years) | 378 |
Descriptor
| Test Theory | 1166 |
| Test Items | 262 |
| Test Reliability | 252 |
| Test Construction | 246 |
| Test Validity | 245 |
| Psychometrics | 183 |
| Scores | 176 |
| Item Response Theory | 168 |
| Foreign Countries | 160 |
| Item Analysis | 141 |
| Statistical Analysis | 134 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Location
| United States | 17 |
| United Kingdom (England) | 15 |
| Canada | 14 |
| Australia | 13 |
| Turkey | 12 |
| Sweden | 8 |
| United Kingdom | 8 |
| Netherlands | 7 |
| Texas | 7 |
| New York | 6 |
| Taiwan | 6 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Elementary and Secondary… | 3 |
| Individuals with Disabilities… | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Reuman, David A.; And Others – 1982
According to classical test theory, the presence of random measurement error in a psychological test has important implications for validation studies. The more comprehensive application of classical test theory in construct validation is distinguished from that in criterion-oriented validation. Critics of thematic apperceptive measurement of the…
Descriptors: Academic Achievement, Achievement Need, Adults, Error of Measurement
Oxford-Carpenter, Rebecca L.; Schultz-Shiner, Linda J. – 1985
This paper addresses practical Army problems in reading assessment from a theory base reflecting the most recent research on reading comprehension. Military and occupational research shows that reading proficiency is related to job performance. Reading assessment is a key issue in the Army due to changes in the reading ability levels of the Army…
Descriptors: Armed Forces, Military Personnel, Postsecondary Education, Psychometrics
Hutchinson, T. P. – 1985
For over 50 years, the overwhelming weight of evidence has been that subjects are able to make use of partial information when responding to multiple-choice items. The subject chooses the alternative which has given rise to the lowest mismatch, except that if this minimum mismatch is larger than some threshold, the question is left unanswered.…
Descriptors: Guessing (Tests), Multiple Choice Tests, Predictive Measurement, Science Tests
Fremer, John J. – 1985
The author proposes a greater professional association role in establishing standards for quality assurance in testing. He presents his views as a test developer who dislikes the legal model for resolving professional issues. The use of publications and informational activities to make people aware of the professional standards and how they can be…
Descriptors: Professional Associations, Professional Continuing Education, Quality Control, Standards
Holmes, Susan E. – 1982
The purpose of the present study was to examine the accuracy of indirect trait estimates, i.e., estimates of some primary trait obtained from a second measure which have been equated to the first. The California Achievement Test in Reading was the primary measure and the Prescriptive Reading Inventory was the indirect measure. Four kinds of…
Descriptors: Content Analysis, Elementary Education, Equated Scores, Item Analysis
Shaycoft, Marion F. – 1979
Focusing on the use of "paper and pencil" criterion-referenced tests in educational measurement, and to correct misconceptions, the definitions of basic terms and historical antecedents are discussed. Classifications of the tests are compared with other achievement tests. The phases in developing criterion-referenced tests are presented with the…
Descriptors: Achievement Tests, Criterion Referenced Tests, Educational Testing, Evaluation Methods
Yen, Wendy M. – 1982
Test scores that are not perfectly reliable cannot be strictly equated unless they are strictly parallel. This fact implies that tau equivalence can be lost if an equipercentile equating is applied to observed scores that are not strictly parallel. Thirty-six simulated data sets are produced to simulate equating tests with different difficulties…
Descriptors: Difficulty Level, Equated Scores, Latent Trait Theory, Methods
Strasler, Gregg M. – 1980
The relationship between classical discrimination indices (CDI) and criterion-referenced discrimination indices (CRDI) and the appropriateness of each for use on criterion-referenced tests are investigated. A CRDI is proposed that attempts to separate those who master material from those who do not master material. A 26 item multiple-choice…
Descriptors: Criterion Referenced Tests, Discriminant Analysis, Higher Education, Mastery Learning
Ellett, Frederick S., Jr. – 1981
Basic issues in criterion-referenced measurement are addressed. In section II, issues involved in determining what a person does and can do are considered. A preliminary analysis of "can" is given which shows that there are several important senses of "can". In section III, results of an analysis of "ability" are…
Descriptors: Academic Ability, Behavior Theories, Criterion Referenced Tests, Induction
Peer reviewedSamejima, Fumiko – Applied Psychological Measurement, 1977
Several important implications in latent trait theory, with implications for individualized or tailored testing, are pointed out. A way of using the information function in tailored testing in connection with the standard error estimation of the ability level using maximum likelihood estimation is suggested. (Author/JKS)
Descriptors: Adaptive Testing, Career Development, Error of Measurement, Item Analysis
Peer reviewedBudescu, David – Journal of Educational Measurement, 1985
An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)
Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests
Peer reviewedTallmadge, G. Kasten – Journal of Educational Measurement, 1985
Support for the validity of the equipercentile assumption is presented in contrast with the conclusion of Powers, Slaughter, and Helmick (EJ 289 091). Observed "gains" from pre- to posttests are better attributed to stakeholder bias, posttests that match curriculum content too closely, or a combination of these factors. (Author/DWH)
Descriptors: Data Interpretation, Evaluation Methods, Norm Referenced Tests, Predictive Measurement
Peer reviewedTuman, Myron C.; Miles, Thomas H. – Teaching English in the Two-Year College, 1987
Indicates that if the cloze scores of students in a small group are well distributed, then it is possible to identify which essays would be selected as best and worst by an English professor. Shows that cloze testing constitutes a relatively effective placement instrument when the readers are unschooled. Includes statistical tables. (JD)
Descriptors: Cloze Procedure, Reading Tests, Student Placement, Test Reliability
Peer reviewedDivgi, D. R. – Journal of Educational Measurement, 1986
This paper discusses various issues involved in using the Rasch Model with multiple-choice tests and questions the suitability of this model for multiple-choice items. Results of some past studies supporting the model are shown to be irrelevant. The effects of the model's misfit on test equating are demonstrated. (Author JAZ)
Descriptors: Equated Scores, Goodness of Fit, Latent Trait Theory, Mathematical Models
Peer reviewedAngoff, William H.; Cowell, William R. – Journal of Educational Measurement, 1986
Linear conversions were developed relating scores on recent forms of the Graduate Record Examinations. Conversions based on specially selected subpopulations were compared with total-group conversions and evaluated. Conclusions indicated that the data clearly support the assumption of population independence for homogenoeous tests, but not quite…
Descriptors: College Entrance Examinations, Equated Scores, Groups, Higher Education


