ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	4

Descriptor

Equated Scores	9
Sampling	9
Testing Problems	9
College Entrance Examinations	4
Error of Measurement	4
Test Items	3
Test Validity	3
Accuracy	2
Evaluation Problems	2
Item Analysis	2
Item Response Theory	2
Mathematical Models	2
Sample Size	2
Statistical Analysis	2
Test Bias	2
Test Interpretation	2
Test Reliability	2
Testing Programs	2
Weighted Scores	2
Ability	1
Adults	1
Answer Sheets	1
Aptitude Tests	1
Armed Forces	1
Comparative Analysis	1
More ▼

Source

Applied Measurement in…	2
Educational Measurement:…	1
Educational Testing Service	1
Journal of Educational…	1

Author

Angoff, William H.	1
Cowell, William R.	1
Diao, Hongyu	1
Haberman, Shelby J.	1
Hicks, Marilyn M.	1
Keller, Lisa	1
Kim, Sooyeon	1
Lord, Frederic M.	1
Phillips, Gary W.	1
Ree, Malcolm James	1
Wainer, Howard	1
Walker, Michael E.	1
Wegner, Toni Giuliano	1
More ▼

Publication Type

Reports - Research	8
Journal Articles	4
Reports - Evaluative	1

Education Level

Higher Education	1
Postsecondary Education	1

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Armed Services Vocational…	1
Graduate Record Examinations	1
SAT (College Admission Test)	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Adjusting for Ability Differences of Equating Samples When Randomization Is Suboptimal

Peer reviewed

Direct link

Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022

Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…

Descriptors: Ability, Tests, Equated Scores, Testing Problems

Investigating Repeater Effects on Small Sample Equating: Include or Exclude?

Peer reviewed

Direct link

Diao, Hongyu; Keller, Lisa – Applied Measurement in Education, 2020

Examinees who attempt the same test multiple times are often referred to as "repeaters." Previous studies suggested that repeaters should be excluded from the total sample before equating because repeater groups are distinguishable from non-repeater groups. In addition, repeaters might memorize anchor items, causing item drift under a…

Descriptors: Licensing Examinations (Professions), College Entrance Examinations, Repetition, Testing Problems

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Limits on the Accuracy of Linking. Research Report. ETS RR-10-22

Download full text

Haberman, Shelby J. – Educational Testing Service, 2010

Sampling errors limit the accuracy with which forms can be linked. Limitations on accuracy are especially important in testing programs in which a very large number of forms are employed. Standard inequalities in mathematical statistics may be used to establish lower bounds on the achievable inking accuracy. To illustrate results, a variety of…

Descriptors: Testing Programs, Equated Scores, Sampling, Accuracy

The Standard Error of Equipercentile Equating.

Download full text

Lord, Frederic M. – 1981

Transformations or equating of raw test scores on two or more forms of the same test are made interchangeable by empirical procedures deriving the standard error of an equipercentile equating for four different situations. Some numerical results are checked by Monte Carlo methods. Numerical standard errors are computed for two sets of real data.…

Descriptors: Educational Testing, Equated Scores, Error of Measurement, Mathematical Formulas

Five Pitfalls Encountered While Trying to Compare States on Their SAT Scores.

Peer reviewed

Wainer, Howard – Journal of Educational Measurement, 1986

Describes recent research attempts to draw inferences about the relative standing of the states on the basis of mean SAT scores. This paper identifies five serious errors that call into question the validity of such inferences. Some plausible ways to avoid the errors are described. (Author/LMO)

Descriptors: College Entrance Examinations, Equated Scores, Mathematical Models, Predictor Variables

A Comparative Study of Methods of Equating TOEFL Test Scores.

Download full text

Hicks, Marilyn M. – 1984

Six methods of equating Test of English as a Foreign Language (TOEFL) test scores for samples consisting of the usual groups of examinees and groups controlled for native language representation were evaluated in terms of scale stability. The equating methods included three item response theory (IRT) variants (fixed b's scaling, a one-parameter…

Descriptors: College Entrance Examinations, Comparative Analysis, English (Second Language), Equated Scores

An Examination of the Assumption that the Equating of Parallel Forms is Population-Independent.

Download full text

Angoff, William H.; Cowell, William R. – 1985

Linear and equipercentile equating conversions were developed for two forms of the Graduate Record Examinations (GRE) quantitative test and the verbal-plus-quantitative test. From a very large sample of students taking the GRE in October 1981, subpopulations were selected with respect to race, sex, field of study, and level of performance (defined…

Descriptors: Aptitude Tests, College Entrance Examinations, Equated Scores, Error of Measurement

Armed Services Vocational Aptitude Battery: Correcting the Speeded Subtests for the 1980 Youth Population.

Download full text

Wegner, Toni Giuliano; Ree, Malcolm James – 1985

In the late 1970s, the Department of Defense requested that the reference population for the Armed Services Vocational Aptitude Battery (ASVAB) be changed and updated to reflect the current youth population. Analyses of new data collected in 1980 indicated that speeded subtest scores of the new sample were atypically low and that the sample might…

Descriptors: Adults, Answer Sheets, Armed Forces, Data Analysis