ERIC - Search Results

Publication Date

In 2025	2
Since 2024	2
Since 2021 (last 5 years)	17
Since 2016 (last 10 years)	33
Since 2006 (last 20 years)	81

Descriptor

Equated Scores	114
Item Response Theory	114
Test Items	114
Comparative Analysis	29
Difficulty Level	29
Test Format	24
Simulation	20
Sample Size	18
Statistical Analysis	18
Test Construction	18
Psychometrics	16
Computation	14
Models	14
Error of Measurement	13
Foreign Countries	13
Multiple Choice Tests	13
Scaling	13
Evaluation Methods	12
Item Analysis	12
Mathematics Tests	12
Accuracy	11
Scores	11
True Scores	11
Scoring	10
Computer Assisted Testing	9
More ▼

Publication Type

Journal Articles	71
Reports - Research	70
Reports - Evaluative	26
Speeches/Meeting Papers	19
Dissertations/Theses -…	10
Numerical/Quantitative Data	8
Reports - Descriptive	4
Information Analyses	2
Books	1
Collected Works - General	1
Guides - Classroom - Learner	1
Reference Materials -…	1
Tests/Questionnaires	1
More ▼

Education Level

Secondary Education	11
Elementary Education	7
Higher Education	6
Elementary Secondary Education	5
High Schools	4
Junior High Schools	4
Middle Schools	4
Grade 6	3
Postsecondary Education	3
Grade 4	2
Grade 7	2
Grade 8	2
Intermediate Grades	2
Grade 12	1
Grade 3	1
Grade 5	1
More ▼

Audience

Practitioners	2
Researchers	1
Students	1

Location

Florida	2
Australia	1
Denmark	1
Germany	1
Italy	1
Japan	1
Netherlands	1
Nigeria	1
Saudi Arabia	1
South Korea	1
Spain	1
Turkey	1
United Kingdom	1
United Kingdom (England)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	4
Advanced Placement…	4
Graduate Record Examinations	4
National Assessment of…	3
Program for International…	3
Law School Admission Test	2
SAT (College Admission Test)	2
Armed Services Vocational…	1
College Level Examination…	1
Comprehensive Tests of Basic…	1
Florida Comprehensive…	1
General Aptitude Test Battery	1
Medical College Admission Test	1
Pre Professional Skills Tests	1
Preliminary Scholastic…	1
Test of English as a Foreign…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 114 results Save | Export

Item Parameter Estimation of the 2PL IRT Model with Fixed Ability Estimates: Choices of Ability Estimation Methods and Priors on Slopes

Peer reviewed
PDF on ERIC

Download full text

Jianbin Fu; TsungHan Ho; Xuan Tan – Practical Assessment, Research & Evaluation, 2025

Item parameter estimation using an item response theory (IRT) model with fixed ability estimates is useful in equating with small samples on anchor items. The current study explores the impact of three ability estimation methods (weighted likelihood estimation [WLE], maximum a posteriori [MAP], and posterior ability distribution estimation [PST])…

Descriptors: Item Response Theory, Test Items, Computation, Equated Scores

What Affects the Quality of Score Transformations? Potential Issues in True-Score Equating Using the Partial Credit Model

Peer reviewed

Direct link

Fellinghauer, Carolina; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

This simulation study investigated to what extent departures from construct similarity as well as differences in the difficulty and targeting of scales impact the score transformation when scales are equated by means of concurrent calibration using the partial credit model with a common person design. Practical implications of the simulation…

Descriptors: True Scores, Equated Scores, Test Items, Sample Size

Impact of Differential Item Functioning on Item Model Fit Using Concurrent Equating Method

Peer reviewed

Direct link

Zeynep Uzun; Tuncay Ögretmen – Large-scale Assessments in Education, 2025

This study aimed to evaluate the item model fit by equating the forms of the PISA 2018 mathematics subtest with concurrent common items equating in samples from Türkiye, the UK, and Italy. The answers given in mathematics subtest Forms 2, 8, and 12 were used in this context. Analyzes were performed using the Dichotomous Rasch Model in the WINSTEPS…

Descriptors: Item Response Theory, Test Items, Foreign Countries, Mathematics Tests

Detecting Item Parameter Drift in Small Sample Rasch Equating

Peer reviewed

Direct link

Daniel Jurich; Chunyan Liu – Applied Measurement in Education, 2023

Screening items for parameter drift helps protect against serious validity threats and ensure score comparability when equating forms. Although many high-stakes credentialing examinations operate with small sample sizes, few studies have investigated methods to detect drift in small sample equating. This study demonstrates that several newly…

Descriptors: High Stakes Tests, Sample Size, Item Response Theory, Equated Scores

Testing Differential Item Functioning without Predefined Anchor Items Using Robust Regression

Peer reviewed

Direct link

Wang, Weimeng; Liu, Yang; Liu, Hongyun – Journal of Educational and Behavioral Statistics, 2022

Differential item functioning (DIF) occurs when the probability of endorsing an item differs across groups for individuals with the same latent trait level. The presence of DIF items may jeopardize the validity of an instrument; therefore, it is crucial to identify DIF items in routine operations of educational assessment. While DIF detection…

Descriptors: Test Bias, Test Items, Equated Scores, Regression (Statistics)

A Comparison of IRT Linking Approaches under the Nonequivalent Groups Anchor Test Design

Direct link

Jiajing Huang – ProQuest LLC, 2022

The nonequivalent-groups anchor-test (NEAT) data-collection design is commonly used in large-scale assessments. Under this design, different test groups take different test forms. Each test form has its own unique items and all test forms share a set of common items. If item response theory (IRT) models are applied to analyze the test data, the…

Descriptors: Item Response Theory, Test Format, Test Items, Test Construction

Comparing Drift Detection Methods for Accurate Rasch Equating in Different Sample Sizes

Peer reviewed

Direct link

Alahmadi, Sarah; Jones, Andrew T.; Barry, Carol L.; Ibáñez, Beatriz – Applied Measurement in Education, 2023

Rasch common-item equating is often used in high-stakes testing to maintain equivalent passing standards across test administrations. If unaddressed, item parameter drift poses a major threat to the accuracy of Rasch common-item equating. We compared the performance of well-established and newly developed drift detection methods in small and large…

Descriptors: Equated Scores, Item Response Theory, Sample Size, Test Items

A Comparison of Kernel Equating and Item Response Theory Equating Methods

Peer reviewed
PDF on ERIC

Download full text

Akin-Arikan, Çigdem; Gelbal, Selahattin – Eurasian Journal of Educational Research, 2021

Purpose: This study aims to compare the performances of Item Response Theory (IRT) equating and kernel equating (KE) methods based on equating errors (RMSD) and standard error of equating (SEE) using the anchor item nonequivalent groups design. Method: Within this scope, a set of conditions, including ability distribution, type of anchor items…

Descriptors: Equated Scores, Item Response Theory, Test Items, Statistical Analysis

Detection of Outliers in Anchor Items Using Modified Rasch Fit Statistics

Peer reviewed

Direct link

Liu, Chunyan; Jurich, Daniel; Morrison, Carol; Grabovsky, Irina – Applied Measurement in Education, 2021

The existence of outliers in the anchor items can be detrimental to the estimation of examinee ability and undermine the validity of score interpretation across forms. However, in practice, anchor item performance can become distorted due to various reasons. This study compares the performance of modified "INFIT" and "OUTFIT"…

Descriptors: Equated Scores, Test Items, Item Response Theory, Difficulty Level

Comparison of MIRT Observed Score Equating Methods under the Common-Item Nonequivalent Groups Design

Direct link

Choi, Jiwon – ProQuest LLC, 2019

For equating tests that measure several distinct proficiencies, procedures that reflect the multidimensional structure of the data are needed. Although there exist a few equating procedures developed under the multidimensional item response theory (MIRT) framework, there is a need for further research in this area. Therefore, the primary…

Descriptors: Item Response Theory, Equated Scores, Accuracy, Test Items

Anchors Aweigh: How the Choice of Anchor Items Affects the Vertical Scaling of 3PL Data with the Rasch Model

Peer reviewed

Direct link

Waterbury, Glenn Thomas; DeMars, Christine E. – Educational Assessment, 2021

Vertical scaling is used to put tests of different difficulty onto a common metric. The Rasch model is often used to perform vertical scaling, despite its strict functional form. Few, if any, studies have examined anchor item choice when using the Rasch model to vertically scale data that do not fit the model. The purpose of this study was to…

Descriptors: Test Items, Equated Scores, Item Response Theory, Scaling

Efficient Estimation of Mean Ability Growth Using Vertical Scaling

Peer reviewed

Direct link

Bjermo, Jonas; Miller, Frank – Applied Measurement in Education, 2021

In recent years, the interest in measuring growth in student ability in various subjects between different grades in school has increased. Therefore, good precision in the estimated growth is of importance. This paper aims to compare estimation methods and test designs when it comes to precision and bias of the estimated growth of mean ability…

Descriptors: Scaling, Ability, Computation, Test Items

A Comparison of the Relative Performance of Four IRT Models on Equating Passage-Based Tests

Peer reviewed

Direct link

Kim, Kyung Yong; Lim, Euijin; Lee, Won-Chan – International Journal of Testing, 2019

For passage-based tests, items that belong to a common passage often violate the local independence assumption of unidimensional item response theory (UIRT). In this case, ignoring local item dependence (LID) and estimating item parameters using a UIRT model could be problematic because doing so might result in inaccurate parameter estimates,…

Descriptors: Item Response Theory, Equated Scores, Test Items, Models

Effect of Sample Size on Common Item Equating Using the Dichotomous Rasch Model

Peer reviewed

Direct link

O'Neill, Thomas R.; Gregg, Justin L.; Peabody, Michael R. – Applied Measurement in Education, 2020

This study addresses equating issues with varying sample sizes using the Rasch model by examining how sample size affects the stability of item calibrations and person ability estimates. A resampling design was used to create 9 sample size conditions (200, 100, 50, 45, 40, 35, 30, 25, and 20), each replicated 10 times. Items were recalibrated…

Descriptors: Sample Size, Equated Scores, Item Response Theory, Raw Scores

Effect of Item Parameter Drift in Mixed Format Common Items on Test Equating

Peer reviewed
PDF on ERIC

Download full text

Uysal, Ibrahim; Sahin-Kürsad, Merve; Kiliç, Abdullah Faruk – Participatory Educational Research, 2022

The aim of the study was to examine the common items in the mixed format (e.g., multiple-choices and essay items) contain parameter drifts in the test equating processes performed with the common item nonequivalent groups design. In this study, which was carried out using Monte Carlo simulation with a fully crossed design, the factors of test…

Descriptors: Test Items, Test Format, Item Response Theory, Equated Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Applied Measurement in…	11
ETS Research Report Series	10
Educational and Psychological…	10
ProQuest LLC	10
Applied Psychological…	8
Journal of Educational…	8
Online Submission	3
Practical Assessment,…	3
Educational Assessment	2
Educational Testing Service	2
Eurasian Journal of…	2
Journal of Applied Measurement	2
Measurement:…	2
Cambridge Assessment	1
College Board	1
Discover Education	1
Educational Measurement:…	1
Educational Research and…	1
Grantee Submission	1
International Journal of…	1
International Journal of…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Experimental…	1
Journal of Intelligence	1
More ▼

Lee, Won-Chan	3
Michaelides, Michalis P.	3
Yang, Wen-Ling	3
Baker, Frank B.	2
Cohen, Allan S.	2
Davey, Tim	2
DeMars, Christine E.	2
Hambleton, Ronald K.	2
He, Yong	2
Holland, Paul	2
Huggins-Manley, Anne Corinne	2
Kim, Seock-Ho	2
Kolen, Michael J.	2
Lee, Yi-Hsuan	2
Li, Yuan H.	2
Lissitz, Robert W.	2
Livingston, Samuel A.	2
Miller, G. Edward	2
O'Neill, Thomas R.	2
Paek, Insu	2
Schaeffer, Gary A.	2
Sinharay, Sandip	2
Strobl, Carolin	2
Wells, Craig S.	2
More ▼