ERIC - Search Results

Showing 1 to 15 of 297 results

Item Parameter Estimation of the 2PL IRT Model with Fixed Ability Estimates: Choices of Ability Estimation Methods and Priors on Slopes

Peer reviewed

Jianbin Fu; TsungHan Ho; Xuan Tan – Practical Assessment, Research & Evaluation, 2025

Item parameter estimation using an item response theory (IRT) model with fixed ability estimates is useful in equating with small samples on anchor items. The current study explores the impact of three ability estimation methods (weighted likelihood estimation [WLE], maximum a posteriori [MAP], and posterior ability distribution estimation [PST])…

Descriptors: Item Response Theory, Test Items, Computation, Equated Scores

Impact of Multidimensionality on Unidimensional IRT Linking and Equating Methods

Uk Hyun Cho – ProQuest LLC, 2024

The present study investigates the influence of multidimensionality on linking and equating in a unidimensional IRT. Two hypothetical multidimensional scenarios are explored under a nonequivalent group common-item equating design. The first scenario examines test forms designed to measure multiple constructs, while the second scenario examines a…

Descriptors: Item Response Theory, Classification, Correlation, Test Format

Digital Module 29: Multidimensional Item Response Theory Equating

Peer reviewed

Kim, Stella Y. – Educational Measurement: Issues and Practice, 2022

In this digital ITEMS module, Dr. Stella Kim provides an overview of multidimensional item response theory (MIRT) equating. Traditional unidimensional item response theory (IRT) equating methods impose the sometimes untenable restriction on data that only a single ability is assessed. This module discusses potential sources of multidimensionality…

Descriptors: Item Response Theory, Models, Equated Scores, Evaluation Methods

Several Variations of Simple-Structure MIRT Equating

Peer reviewed

Kim, Stella Y.; Lee, Won-Chan – Journal of Educational Measurement, 2023

The current study proposed several variants of simple-structure multidimensional item response theory equating procedures. Four distinct sets of data were used to demonstrate feasibility of proposed equating methods for two different equating designs: a random groups design and a common-item nonequivalent groups design. Findings indicated some…

Descriptors: Item Response Theory, Equated Scores, Monte Carlo Methods, Research Methodology

IRT Observed-Score Equating for Rater-Mediated Assessments Using a Hierarchical Rater Model

Peer reviewed

Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025

While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…

Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity

What Affects the Quality of Score Transformations? Potential Issues in True-Score Equating Using the Partial Credit Model

Peer reviewed

Fellinghauer, Carolina; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

This simulation study investigated to what extent departures from construct similarity as well as differences in the difficulty and targeting of scales impact the score transformation when scales are equated by means of concurrent calibration using the partial credit model with a common person design. Practical implications of the simulation…

Descriptors: True Scores, Equated Scores, Test Items, Sample Size

Impact of Differential Item Functioning on Item Model Fit Using Concurrent Equating Method

Peer reviewed

Zeynep Uzun; Tuncay Ögretmen – Large-scale Assessments in Education, 2025

This study aimed to evaluate the item model fit by equating the forms of the PISA 2018 mathematics subtest with concurrent common items equating in samples from Türkiye, the UK, and Italy. The answers given in mathematics subtest Forms 2, 8, and 12 were used in this context. Analyses were performed using the Dichotomous Rasch Model in the WINSTEPS…

Descriptors: Item Response Theory, Test Items, Foreign Countries, Mathematics Tests

Detecting Item Parameter Drift in Small Sample Rasch Equating

Peer reviewed

Daniel Jurich; Chunyan Liu – Applied Measurement in Education, 2023

Screening items for parameter drift helps protect against serious validity threats and ensure score comparability when equating forms. Although many high-stakes credentialing examinations operate with small sample sizes, few studies have investigated methods to detect drift in small sample equating. This study demonstrates that several newly…

Descriptors: High Stakes Tests, Sample Size, Item Response Theory, Equated Scores

Testing Differential Item Functioning without Predefined Anchor Items Using Robust Regression

Peer reviewed

Wang, Weimeng; Liu, Yang; Liu, Hongyun – Journal of Educational and Behavioral Statistics, 2022

Differential item functioning (DIF) occurs when the probability of endorsing an item differs across groups for individuals with the same latent trait level. The presence of DIF items may jeopardize the validity of an instrument; therefore, it is crucial to identify DIF items in routine operations of educational assessment. While DIF detection…

Descriptors: Test Bias, Test Items, Equated Scores, Regression (Statistics)

Practical Considerations in Choosing an Anchor Test Form for Equating under the Random Groups Design

Peer reviewed

Cui, Zhongmin; He, Yong – Measurement: Interdisciplinary Research and Perspectives, 2023

Careful considerations are necessary when there is a need to choose an anchor test form from a list of old test forms for equating under the random groups design. The choice of the anchor form potentially affects the accuracy of equated scores on new test forms. Few guidelines, however, can be found in the literature on choosing the anchor form.…

Descriptors: Test Format, Equated Scores, Best Practices, Test Construction

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

Which Assessment Is Harder? Some Limits of Statistical Linking

Benton, Tom; Williamson, Joanna – Research Matters, 2022

Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…

Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment

A Comparison of IRT Linking Approaches under the Nonequivalent Groups Anchor Test Design

Jiajing Huang – ProQuest LLC, 2022

The nonequivalent-groups anchor-test (NEAT) data-collection design is commonly used in large-scale assessments. Under this design, different test groups take different test forms. Each test form has its own unique items and all test forms share a set of common items. If item response theory (IRT) models are applied to analyze the test data, the…

Descriptors: Item Response Theory, Test Format, Test Items, Test Construction

Comparing Drift Detection Methods for Accurate Rasch Equating in Different Sample Sizes

Peer reviewed

Alahmadi, Sarah; Jones, Andrew T.; Barry, Carol L.; Ibáñez, Beatriz – Applied Measurement in Education, 2023

Rasch common-item equating is often used in high-stakes testing to maintain equivalent passing standards across test administrations. If unaddressed, item parameter drift poses a major threat to the accuracy of Rasch common-item equating. We compared the performance of well-established and newly developed drift detection methods in small and large…

Descriptors: Equated Scores, Item Response Theory, Sample Size, Test Items

An Exploration of Comparability Issues in Educational Research: Scale Linking, Equating, and Propensity Score Weighting

Wu, Tong – ProQuest LLC, 2023

This three-article dissertation aims to address three methodological challenges to ensure comparability in educational research, including scale linking, test equating, and propensity score (PS) weighting. The first study intends to improve test scale comparability by evaluating the effect of six missing data handling approaches, including…

Descriptors: Educational Research, Comparative Analysis, Equated Scores, Weighted Scores
