ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	12
Since 2017 (last 10 years)	29
Since 2007 (last 20 years)	75

Descriptor

Comparative Analysis	96
Models	96
Test Items	96
Item Response Theory	53
Item Analysis	26
Simulation	25
Difficulty Level	20
Foreign Countries	19
Scores	18
Test Bias	18
Computer Assisted Testing	16
Goodness of Fit	14
Mathematics Tests	13
Statistical Analysis	12
Accuracy	11
Computation	11
Psychometrics	11
Correlation	10
English (Second Language)	10
Evaluation Methods	10
Factor Analysis	10
Scoring	10
Adaptive Testing	9
Equated Scores	9
Error of Measurement	9
More ▼

Publication Type

Journal Articles	77
Reports - Research	63
Reports - Evaluative	20
Dissertations/Theses -…	7
Speeches/Meeting Papers	6
Reports - Descriptive	3
Opinion Papers	2
Books	1
Tests/Questionnaires	1

Education Level

Higher Education	15
Postsecondary Education	10
Elementary Education	6
Elementary Secondary Education	5
Secondary Education	5
Grade 4	3
High Schools	3
Grade 7	2
Grade 8	2
Intermediate Grades	2
Junior High Schools	2
Middle Schools	2
Grade 12	1
Grade 3	1
More ▼

Audience

Practitioners	1
Researchers	1
Students	1

Location

United States	5
South Korea	3
Germany	2
Massachusetts	2
Netherlands	2
Africa	1
Argentina	1
Canada	1
China	1
France	1
Indonesia	1
Iran	1
Israel (Jerusalem)	1
Minnesota	1
Senegal	1
Sweden	1
Taiwan	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	6
National Assessment of…	3
Law School Admission Test	2
Program for International…	2
Test of English as a Foreign…	2
ACT Assessment	1
Advanced Placement…	1
Graduate Record Examinations	1
Raven Advanced Progressive…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 96 results Save | Export

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Detecting Differential Item Functioning with Multiple Causes: A Comparison of Three Methods

Peer reviewed

Direct link

Xiaowen Liu – International Journal of Testing, 2024

Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…

Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation

Comparing and Combining IRTree Models and Anchoring Vignettes in Addressing Response Styles

Peer reviewed

Direct link

Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025

Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…

Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes

Parameters and Models of Item Response Theory (IRT): A Review of Literature

Peer reviewed

Direct link

Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023

Introduction: Item response theory (IRT) has received much attention in validation of assessment instrument because it allows the estimation of students' ability from any set of the items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…

Descriptors: Item Response Theory, Models, Test Items, Difficulty Level

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Direct link

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Item Response Theory and Modeling with Stata

Peer reviewed

Direct link

Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2023

This software review discusses the capabilities of Stata to conduct item response theory modeling. The commands needed for fitting the popular one-, two-, and three-parameter logistic models are initially discussed. The procedure for testing the discrimination parameter equality in the one-parameter model is then outlined. The commands for fitting…

Descriptors: Item Response Theory, Models, Comparative Analysis, Item Analysis

Application of Item Response Tree (IRTree) Models on Testing Data: Comparing Its Performance with Binary and Polytomous Item Response Models

Direct link

Yixi Wang – ProQuest LLC, 2020

Binary item response theory (IRT) models are widely used in educational testing data. These models are not perfect because they simplify the individual item responding process, ignore the differences among different response patterns, cannot handle multidimensionality that lay behind options within a single item, and cannot manage missing response…

Descriptors: Item Response Theory, Educational Testing, Data, Models

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Diagnostic Classification Model for Forced-Choice Items and Noncognitive Tests

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2023

The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs)…

Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making

A Log-Linear Modeling Approach for Differential Item Functioning Detection in Polytomously Scored Items

Peer reviewed

Direct link

Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020

A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…

Descriptors: Simulation, Sample Size, Item Analysis, Scores

Classical Test Theory and Item Response Theory Comparison of the Brief Electricity and Magnetism Assessment and the Conceptual Survey of Electricity and Magnetism

Peer reviewed

Direct link

Eaton, Philip; Johnson, Keith; Barrett, Frank; Willoughby, Shannon – Physical Review Physics Education Research, 2019

For proper assessment selection understanding the statistical similarities amongst assessments that measure the same, or very similar, topics is imperative. This study seeks to extend the comparative analysis between the brief electricity and magnetism assessment (BEMA) and the conceptual survey of electricity and magnetism (CSEM) presented by…

Descriptors: Test Theory, Item Response Theory, Comparative Analysis, Energy

A Comparison of Estimation Techniques for IRT Models with Small Samples

Peer reviewed

Direct link

Finch, Holmes; French, Brian F. – Applied Measurement in Education, 2019

The usefulness of item response theory (IRT) models depends, in large part, on the accuracy of item and person parameter estimates. For the standard 3 parameter logistic model, for example, these parameters include the item parameters of difficulty, discrimination, and pseudo-chance, as well as the person ability parameter. Several factors impact…

Descriptors: Item Response Theory, Accuracy, Test Items, Difficulty Level

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

Application of Bi-Factor MIRT and Higher-Order CDM Models to an In-House EFL Listening Test for Diagnostic Purposes

Peer reviewed

Direct link

Min, Shangchao; Cai, Hongwen; He, Lianzhen – Language Assessment Quarterly, 2022

The present study examined the performance of the bi-factor multidimensional item response theory (MIRT) model and higher-order (HO) cognitive diagnostic models (CDM) in providing diagnostic information and general ability estimation simultaneously in a listening test. The data used were 1,611 examinees' item-level responses to an in-house EFL…

Descriptors: Listening Comprehension Tests, English (Second Language), Second Language Learning, Foreign Countries

Examining Power and Type 1 Error for Step and Item Level Tests of Invariance: Investigating the Effect of the Number of Item Score Levels

Direct link

Ayodele, Alicia Nicole – ProQuest LLC, 2017

Within polytomous items, differential item functioning (DIF) can take on various forms due to the number of response categories. The lack of invariance at this level is referred to as differential step functioning (DSF). The most common DSF methods in the literature are the adjacent category log odds ratio (AC-LOR) estimator and cumulative…

Descriptors: Statistical Analysis, Test Bias, Test Items, Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Educational and Psychological…	10
Journal of Educational…	10
ETS Research Report Series	9
Applied Psychological…	7
ProQuest LLC	7
International Journal of…	6
Journal of Educational and…	4
Applied Measurement in…	2
Journal of Memory and Language	2
Language Testing	2
Measurement:…	2
ACT, Inc.	1
Acta Educationis Generalis	1
Advances in Health Sciences…	1
Assessment in Education:…	1
British Journal of…	1
Developmental Psychology	1
EURASIA Journal of…	1
Educational Research and…	1
Hacettepe University Journal…	1
Intelligence	1
International Educational…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
More ▼

von Davier, Matthias	3
DeMars, Christine E.	2
Haladyna, Thomas M.	2
He, Wei	2
Jin, Ying	2
Lee, Young-Sun	2
Nandakumar, Ratna	2
Paek, Insu	2
Park, Yoon Soo	2
Sinharay, Sandip	2
Strobl, Carolin	2
Suh, Youngsuk	2
Wainer, Howard	2
Xu, Xueli	2
Zeileis, Achim	2
Zhang, Mo	2
van der Linden, Wim J.	2
Acquaye, Rosemary	1
Afsharrad, Mohammad	1
Ariel, Adelaide	1
Atar, Burcu	1
Ayodele, Alicia Nicole	1
Baghaei, Purya	1
Balota, David A.	1
More ▼