ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	5

Descriptor

Equated Scores	12
Sampling	12
Test Format	12
Test Items	5
Item Response Theory	4
Simulation	4
Statistical Analysis	3
Comparative Analysis	2
Evaluation Methods	2
Multiple Choice Tests	2
Regression (Statistics)	2
Scores	2
Statistical Distributions	2
Statistical Inference	2
Test Bias	2
Test Construction	2
Achievement Tests	1
Criteria	1
Cutting Scores	1
Data Collection	1
Difficulty Level	1
Education Majors	1
Equations (Mathematics)	1
Error of Measurement	1
Estimation (Mathematics)	1
More ▼

Source

Applied Psychological…	3
Journal of Educational and…	2
Applied Measurement in…	1
Educational Testing Service	1
ProQuest LLC	1

Author

Hanson, Bradley A.	2
Kim, Sooyeon	2
Baker, Frank B.	1
Chason, Walter M.	1
Dorans, Neil J.	1
Eignor, Daniel R.	1
Hammond, Shelby	1
Harris, Deborah J.	1
Little, Roderick J. A.	1
Liu, Jinghua	1
Livingston, Samuel A.	1
Motika, Robert T.	1
Rubin, Donald B.	1
Sunnassee, Devdass	1
Walker, Michael	1
Walker, Michael E.	1
Wang, Tianyou	1
van der Linden, Wim J.	1
More ▼

Publication Type

Reports - Evaluative	8
Journal Articles	6
Reports - Research	3
Speeches/Meeting Papers	2
Dissertations/Theses -…	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	2
Armed Services Vocational…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

What Is Actually Equated in "Test Equating"? A Didactic Note

Peer reviewed

Direct link

van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022

The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…

Descriptors: Equated Scores, Test Items, Scores, Probability

Determining the Anchor Composition for a Mixed-Format Test: Evaluation of Subpopulation Invariance of Linking Functions

Peer reviewed

Direct link

Kim, Sooyeon; Walker, Michael – Applied Measurement in Education, 2012

This study examined the appropriateness of the anchor composition in a mixed-format test, which includes both multiple-choice (MC) and constructed-response (CR) items, using subpopulation invariance indices. Linking functions were derived in the nonequivalent groups with anchor test (NEAT) design using two types of anchor sets: (a) MC only and (b)…

Descriptors: Multiple Choice Tests, Test Format, Test Items, Equated Scores

Examining Two Strategies to Link Mixed-Format Tests Using Multiple-Choice Anchors. Research Report. ETS RR-10-18

Download full text

Walker, Michael E.; Kim, Sooyeon – Educational Testing Service, 2010

This study examined the use of an all multiple-choice (MC) anchor for linking mixed format tests containing both MC and constructed-response (CR) items, in a nonequivalent groups design. An MC-only anchor could effectively link two such test forms if either (a) the MC and CR portions of the test measured the same construct, so that the MC anchor…

Descriptors: Equated Scores, Test Format, Multiple Choice Tests, Statistical Analysis

Conditions Affecting the Accuracy of Classical Equating Methods for Small Samples under the NEAT Design: A Simulation Study

Direct link

Sunnassee, Devdass – ProQuest LLC, 2011

Small sample equating remains a largely unexplored area of research. This study attempts to fill in some of the research gaps via a large-scale, IRT-based simulation study that evaluates the performance of seven small-sample equating methods under various test characteristic and sampling conditions. The equating methods considered are typically…

Descriptors: Test Length, Test Format, Sample Size, Simulation

Anchor Test Type and Population Invariance: An Exploration across Subpopulations and Test Administrations

Peer reviewed

Direct link

Dorans, Neil J.; Liu, Jinghua; Hammond, Shelby – Applied Psychological Measurement, 2008

This exploratory study was built on research spanning three decades. Petersen, Marco, and Stewart (1982) conducted a major empirical investigation of the efficacy of different equating methods. The studies reported in Dorans (1990) examined how different equating methods performed across samples selected in different ways. Recent population…

Descriptors: Test Format, Equated Scores, Sampling, Evaluation Methods

The Effectiveness of Circular Equating as a Criterion for Evaluating Equating.

Download full text

Wang, Tianyou; Hanson, Bradley A.; Harris, Deborah J. – 1998

Equating a test form to itself through a chain of equatings, commonly referred to as circular equating, has been widely used as a criterion to evaluate the adequacy of equating. This paper uses both analytical methods and simulation methods to show that this criterion is in general invalid in serving this purpose. For the random groups design done…

Descriptors: Equated Scores, Evaluation Methods, Heuristics, Sampling

An Investigation of the Sampling Distributions of Equating Coefficients.

Peer reviewed

Baker, Frank B. – Applied Psychological Measurement, 1996

Using the characteristic curve method for dichotomously scored test items, the sampling distributions of equating coefficients were examined. Simulations indicate that for the equating conditions studied, the sampling distributions of the equating coefficients appear to have acceptable characteristics, suggesting confidence in the values obtained…

Descriptors: Equated Scores, Item Response Theory, Sampling, Statistical Distributions

The Effects on Observed- and True-Score Equating Procedures of Matching on a Fallible Criterion: A Simulation with Test Variation.

Download full text

Eignor, Daniel R.; And Others – 1995

Two recent simulation studies were conducted to aid in the diagnosis and interpretation of equating differences found between random and matched (nonrandom) samples for four commonly used equating procedures: (1) Tucker; (2) Levine equally reliable; (3) Chained equipercentile observed-score; and (4) three-parameter, item response theory true-score…

Descriptors: Criteria, Equated Scores, Item Response Theory, Raw Scores

Standard Errors of Levine Linear Equating.

Peer reviewed

Hanson, Bradley A.; And Others – Applied Psychological Measurement, 1993

The delta method was used to derive standard errors (SES) of the Levine observed score and Levine true score linear test equating methods using data from two test forms. SES derived without the normality assumption and bootstrap SES were very close. The situation with skewed score distributions is also discussed. (SLD)

Descriptors: Equated Scores, Equations (Mathematics), Error of Measurement, Sampling

Test Equating from Biased Samples, with Application to the Armed Services Vocational Aptitude Battery.

Peer reviewed

Little, Roderick J. A.; Rubin, Donald B. – Journal of Educational and Behavioral Statistics, 1994

Equating a new standard test to an old reference test is considered when samples for equating are not randomly selected from the target population of test takers, identifying two problems from equating from biased samples. An empirical example with data from the Armed Services Vocational Aptitude Battery illustrates the approach. (SLD)

Descriptors: Equated Scores, Military Personnel, Sampling, Statistical Analysis

What Combination of Sampling and Equating Methods Works Best? Revised.

Download full text

Livingston, Samuel A.; And Others – 1989

Combinations of five methods of equating test forms and two methods of selecting samples of students for equating were compared for accuracy. The two sampling methods were representative sampling from the population and matching samples on the anchor test score. The equating methods were: (1) the Tucker method; (2) the Levine method; (3) the…

Descriptors: Comparative Analysis, Data Collection, Equated Scores, High School Students

Performance of Angoff Model IV Linear Test Equating Using Total Test and Content Dimensional Sub-Test Designs in Small Groups of Examinees.

Download full text

Motika, Robert T.; Chason, Walter M. – 1995

Test data from 200 examinees from the Spanish Teacher Certification Examination and 75 examinees from the French Teacher Certification Examination were used in a study of scale drift in sequentially equated test forms. Using sampling with replacement, 1,000 samples of 100 examinees each for Spanish and 1,000 samples of 50 each for French were…

Descriptors: Education Majors, Equated Scores, Estimation (Mathematics), French