Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 4 |
Descriptor
Equated Scores | 9 |
Sampling | 9 |
Testing Problems | 9 |
College Entrance Examinations | 4 |
Error of Measurement | 4 |
Test Items | 3 |
Test Validity | 3 |
Accuracy | 2 |
Evaluation Problems | 2 |
Item Analysis | 2 |
Item Response Theory | 2 |
More ▼ |
Source
Applied Measurement in… | 2 |
Educational Measurement:… | 1 |
Educational Testing Service | 1 |
Journal of Educational… | 1 |
Author
Angoff, William H. | 1 |
Cowell, William R. | 1 |
Diao, Hongyu | 1 |
Haberman, Shelby J. | 1 |
Hicks, Marilyn M. | 1 |
Keller, Lisa | 1 |
Kim, Sooyeon | 1 |
Lord, Frederic M. | 1 |
Phillips, Gary W. | 1 |
Ree, Malcolm James | 1 |
Wainer, Howard | 1 |
More ▼ |
Publication Type
Reports - Research | 8 |
Journal Articles | 4 |
Reports - Evaluative | 1 |
Education Level
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
Armed Services Vocational… | 1 |
Graduate Record Examinations | 1 |
SAT (College Admission Test) | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022
Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…
Descriptors: Ability, Tests, Equated Scores, Testing Problems
Diao, Hongyu; Keller, Lisa – Applied Measurement in Education, 2020
Examinees who attempt the same test multiple times are often referred to as "repeaters." Previous studies suggested that repeaters should be excluded from the total sample before equating because repeater groups are distinguishable from non-repeater groups. In addition, repeaters might memorize anchor items, causing item drift under a…
Descriptors: Licensing Examinations (Professions), College Entrance Examinations, Repetition, Testing Problems
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Haberman, Shelby J. – Educational Testing Service, 2010
Sampling errors limit the accuracy with which forms can be linked. Limitations on accuracy are especially important in testing programs in which a very large number of forms are employed. Standard inequalities in mathematical statistics may be used to establish lower bounds on the achievable inking accuracy. To illustrate results, a variety of…
Descriptors: Testing Programs, Equated Scores, Sampling, Accuracy
Lord, Frederic M. – 1981
Transformations or equating of raw test scores on two or more forms of the same test are made interchangeable by empirical procedures deriving the standard error of an equipercentile equating for four different situations. Some numerical results are checked by Monte Carlo methods. Numerical standard errors are computed for two sets of real data.…
Descriptors: Educational Testing, Equated Scores, Error of Measurement, Mathematical Formulas

Wainer, Howard – Journal of Educational Measurement, 1986
Describes recent research attempts to draw inferences about the relative standing of the states on the basis of mean SAT scores. This paper identifies five serious errors that call into question the validity of such inferences. Some plausible ways to avoid the errors are described. (Author/LMO)
Descriptors: College Entrance Examinations, Equated Scores, Mathematical Models, Predictor Variables
Hicks, Marilyn M. – 1984
Six methods of equating Test of English as a Foreign Language (TOEFL) test scores for samples consisting of the usual groups of examinees and groups controlled for native language representation were evaluated in terms of scale stability. The equating methods included three item response theory (IRT) variants (fixed b's scaling, a one-parameter…
Descriptors: College Entrance Examinations, Comparative Analysis, English (Second Language), Equated Scores
Angoff, William H.; Cowell, William R. – 1985
Linear and equipercentile equating conversions were developed for two forms of the Graduate Record Examinations (GRE) quantitative test and the verbal-plus-quantitative test. From a very large sample of students taking the GRE in October 1981, subpopulations were selected with respect to race, sex, field of study, and level of performance (defined…
Descriptors: Aptitude Tests, College Entrance Examinations, Equated Scores, Error of Measurement
Wegner, Toni Giuliano; Ree, Malcolm James – 1985
In the late 1970s, the Department of Defense requested that the reference population for the Armed Services Vocational Aptitude Battery (ASVAB) be changed and updated to reflect the current youth population. Analyses of new data collected in 1980 indicated that speeded subtest scores of the new sample were atypically low and that the sample might…
Descriptors: Adults, Answer Sheets, Armed Forces, Data Analysis