ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	3

Descriptor

Comparative Testing	8
Equated Scores	8
Test Items	8
Item Response Theory	4
College Entrance Examinations	3
Comparative Analysis	2
Evaluation Methods	2
Higher Education	2
Item Analysis	2
Mathematical Models	2
Mathematics Tests	2
Sampling	2
Scoring	2
Test Bias	2
Test Construction	2
Ability Grouping	1
Academic Ability	1
Accuracy	1
Achievement Tests	1
Art Education	1
Bayesian Statistics	1
Biology	1
Chemistry	1
Citizenship Education	1
College Students	1
More ▼

Source

Journal of Educational…	2
ETS Research Report Series	1
Educational Research and…	1
Journal of Educational…	1

Author

Carey, Jill	1
Cook, Linda L.	1
Curley, Edward	1
Du Bose, Pansy	1
Fraillon, Julian	1
Kim, Sooyeon	1
Kramer, Gene A.	1
Kromrey, Jeffrey D.	1
Li, Yuan H.	1
Lissitz, Robert W.	1
Liu, Jinghua	1
Mazzeo, John	1
McHale, Frederick	1
Miao, Chang Y.	1
Schulz, Wolfram	1
Walker, Michael E.	1
Yamamoto, Kentaro	1
Zu, Jiyun	1
More ▼

Publication Type

Reports - Evaluative	6
Journal Articles	5
Speeches/Meeting Papers	3
Reports - Research	2

Education Level

Elementary Secondary Education	2
Higher Education	2
Postsecondary Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

College Board Achievement…	1
National Assessment of…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Test Score Equating Using Discrete Anchor Items versus Passage-Based Anchor Items: A Case Study Using "SAT"® Data. Research Report. ETS RR-14-14

Peer reviewed
PDF on ERIC

Download full text

Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill – ETS Research Report Series, 2014

The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data.This study compares an "SAT"® critical reading anchor that contains more discrete items proportionally, compared to the total tests to be equated, to another anchor that…

Descriptors: Equated Scores, Test Items, College Entrance Examinations, Comparative Analysis

Comparisons among Designs for Equating Mixed-Format Tests in Large-Scale Assessments

Peer reviewed

Direct link

Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010

In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…

Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias

The Analysis of Measurement Equivalence in International Studies Using the Rasch Model

Peer reviewed

Direct link

Schulz, Wolfram; Fraillon, Julian – Educational Research and Evaluation, 2011

When comparing data derived from tests or questionnaires in cross-national studies, researchers commonly assume measurement invariance in their underlying scaling models. However, different cultural contexts, languages, and curricula can have powerful effects on how students respond in different countries. This article illustrates how the…

Descriptors: Citizenship Education, International Studies, Item Response Theory, International Education

Applications of the Analytically Derived Asymptotic Standard Errors of Item Response Theory Item Parameter Estimates

Peer reviewed

Direct link

Li, Yuan H.; Lissitz, Robert W. – Journal of Educational Measurement, 2004

The analytically derived asymptotic standard errors (SEs) of maximum likelihood (ML) item estimates can be approximated by a mathematical function without examinees' responses to test items, and the empirically determined SEs of marginal maximum likelihood estimation (MMLE)/Bayesian item estimates can be obtained when the same set of items is…

Descriptors: Test Items, Computation, Item Response Theory, Error of Measurement

Detecting Differential Item Functioning Using the Rasch Model with Equivalent-Group Cross-Validation.

Download full text

Miao, Chang Y.; Kramer, Gene A. – 1992

An approach to detecting differential item functioning using the Rasch model with equivalent-group cross-validation was investigated. College students taking the Dental Admission Test, were divided by gender (936 females and 1,537 males) into 2 different samples. Rasch analyses were performed on both samples. Data were recalibrated after…

Descriptors: College Entrance Examinations, College Students, Comparative Testing, Dental Schools

Item Response Theory Scale Linking in NAEP.

Peer reviewed

Yamamoto, Kentaro; Mazzeo, John – Journal of Educational Statistics, 1992

The need for scale linking in the National Assessment of Educational Progress (NAEP) is discussed, and the specific procedures used to carry out the linking in the context of the major analyses of the 1990 NAEP mathematics assessment are described. Issues remaining to be addressed are outlined. (SLD)

Descriptors: Comparative Testing, Educational Assessment, Elementary Secondary Education, Equated Scores

An Empirical Investigation of Equating Stability in a Single and a Double Linkage Design with Small Sample Sizes Using Angoff Model IV.

Download full text

Du Bose, Pansy; Kromrey, Jeffrey D. – 1993

Empirical evidence is presented of the relative efficiency of two potential linkage plans to be used when equivalent test forms are being administered. Equating is a process by which scores on one form of a test are converted to scores on another form of the same test. A Monte Carlo study was conducted to examine equating stability and statistical…

Descriptors: Art Education, Comparative Testing, Computer Simulation, Equated Scores

Equating Achievement Tests Using Samples Matched on Ability. College Board Report No. 90-2.

Download full text

Cook, Linda L.; And Others – 1990

The equating of reasonably parallel forms of College Board Achievement Tests in biology, chemistry, mathematics level II, American history and social studies, and French is discussed. Results of the following five equating methods are compared: (1) Tucker; (2) Levine equally reliable; (3) Levine unequally reliable; (4) frequency estimation…

Descriptors: Academic Ability, Achievement Tests, Biology, Chemistry