ERIC - Search Results

Publication Date

In 2025	4
Since 2024	12
Since 2021 (last 5 years)	42
Since 2016 (last 10 years)	93
Since 2006 (last 20 years)	203

Descriptor

Comparative Analysis	264
Item Response Theory	264
Test Items	264
Difficulty Level	72
Simulation	66
Foreign Countries	57
Item Analysis	53
Models	53
Scores	48
Statistical Analysis	41
Test Bias	38
Test Format	35
Test Construction	34
Computer Assisted Testing	32
Mathematics Tests	32
Sample Size	32
Correlation	31
Multiple Choice Tests	30
Computation	29
Equated Scores	29
Goodness of Fit	27
Accuracy	26
Evaluation Methods	26
Scoring	26
Achievement Tests	25
More ▼

Publication Type

Journal Articles	197
Reports - Research	184
Reports - Evaluative	50
Speeches/Meeting Papers	32
Dissertations/Theses -…	21
Numerical/Quantitative Data	5
Reports - Descriptive	5
Tests/Questionnaires	3
Books	2
Information Analyses	2
Guides - Non-Classroom	1
Non-Print Media	1
Opinion Papers	1
Reference Materials - General	1
Reports - General	1
More ▼

Education Level

Higher Education	32
Secondary Education	27
Postsecondary Education	26
Elementary Education	20
High Schools	13
Elementary Secondary Education	10
Junior High Schools	9
Middle Schools	9
Grade 4	7
Grade 8	7
Intermediate Grades	7
Grade 12	5
Grade 7	5
Early Childhood Education	4
Grade 3	4
Grade 5	4
Grade 9	4
Primary Education	3
Grade 11	1
Grade 6	1
Kindergarten	1
More ▼

Audience

Researchers	2
Practitioners	1
Students	1

Location

Turkey	8
United States	6
Germany	4
Japan	4
South Korea	4
Taiwan	4
Australia	3
Botswana	3
Canada	3
Indonesia	3
Netherlands	3
Nigeria	3
United Kingdom (England)	3
Belgium	2
Chile	2
China	2
France	2
Hong Kong	2
Massachusetts	2
Ohio	2
Philippines	2
Russia	2
Singapore	2
Africa	1
Arkansas	1
More ▼

Laws, Policies, & Programs

What Works Clearinghouse Rating

Showing 1 to 15 of 264 results Save | Export

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Detecting Differential Item Functioning with Multiple Causes: A Comparison of Three Methods

Peer reviewed

Direct link

Xiaowen Liu – International Journal of Testing, 2024

Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…

Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation

Comparing and Combining IRTree Models and Anchoring Vignettes in Addressing Response Styles

Peer reviewed

Direct link

Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025

Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…

Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes

Parameters and Models of Item Response Theory (IRT): A Review of Literature

Peer reviewed

Direct link

Gyamfi, Abraham; Acquaye, Rosemary – Acta Educationis Generalis, 2023

Introduction: Item response theory (IRT) has received much attention in validation of assessment instrument because it allows the estimation of students' ability from any set of the items. Item response theory allows the difficulty and discrimination levels of each item on the test to be estimated. In the framework of IRT, item characteristics are…

Descriptors: Item Response Theory, Models, Test Items, Difficulty Level

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Direct link

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Comparison of Item Response Theory Ability and Item Parameters According to Classical and Bayesian Estimation Methods

Peer reviewed
PDF on ERIC

Download full text

Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024

This research aims to compare the ability and item parameter estimations of Item Response Theory according to Maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on the changes in the priori distribution type, sample size, test length, and logistics model, the ability and item…

Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation

Item Response Theory and Modeling with Stata

Peer reviewed

Direct link

Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2023

This software review discusses the capabilities of Stata to conduct item response theory modeling. The commands needed for fitting the popular one-, two-, and three-parameter logistic models are initially discussed. The procedure for testing the discrimination parameter equality in the one-parameter model is then outlined. The commands for fitting…

Descriptors: Item Response Theory, Models, Comparative Analysis, Item Analysis

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Direct link

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

Maintaining Score Scales over Time: A Comparison of Five Scoring Methods

Peer reviewed

Direct link

Kim, Stella Yun; Lee, Won-Chan – Applied Measurement in Education, 2023

This study evaluates various scoring methods including number-correct scoring, IRT theta scoring, and hybrid scoring in terms of scale-score stability over time. A simulation study was conducted to examine the relative performance of five scoring methods in terms of preserving the first two moments of scale scores for a population in a chain of…

Descriptors: Scoring, Comparative Analysis, Item Response Theory, Simulation

The Impact of Cheating on Score Comparability via Pool-Based IRT Pre-Equating

Peer reviewed

Direct link

Liu, Jinghua; Becker, Kirk – Journal of Educational Measurement, 2022

For any testing programs that administer multiple forms across multiple years, maintaining score comparability via equating is essential. With continuous testing and high-stakes results, especially with less secure online administrations, testing programs must consider the potential for cheating on their exams. This study used empirical and…

Descriptors: Cheating, Item Response Theory, Scores, High Stakes Tests

A Comparative Study of AI-Human-Made and Human-Made Test Forms for a University TESOL Theory Course

Peer reviewed

Direct link

Kyung-Mi O. – Language Testing in Asia, 2024

This study examines the efficacy of artificial intelligence (AI) in creating parallel test items compared to human-made ones. Two test forms were developed: one consisting of 20 existing human-made items and another with 20 new items generated with ChatGPT assistance. Expert reviews confirmed the content parallelism of the two test forms.…

Descriptors: Comparative Analysis, Artificial Intelligence, Computer Software, Test Items

Hybrid Maximum Clique Algorithm Using Parallel Integer Programming for Uniform Test Assembly

Peer reviewed

Direct link

Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022

Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…

Descriptors: Simulation, Efficiency, Test Items, Educational Assessment

Content and Item Response Theory Analysis of ChatGPT-4-Generated Multiple-Choice Items

Peer reviewed

Direct link

Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025

Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality. The quality of AI-generated MCIs and human experts is comparable. However, whether the quality of AI-generated MCIs is equally good across various domain-…

Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks

An Evaluation of Fit Indices Used in Model Selection of Dichotomous Mixture IRT Models

Peer reviewed

Direct link

Sedat Sen; Allan S. Cohen – Educational and Psychological Measurement, 2024

A Monte Carlo simulation study was conducted to compare fit indices used for detecting the correct latent class in three dichotomous mixture item response theory (IRT) models. Ten indices were considered: Akaike's information criterion (AIC), the corrected AIC (AICc), Bayesian information criterion (BIC), consistent AIC (CAIC), Draper's…

Descriptors: Goodness of Fit, Item Response Theory, Sample Size, Classification

Application of Item Response Tree (IRTree) Models on Testing Data: Comparing Its Performance with Binary and Polytomous Item Response Models

Direct link

Yixi Wang – ProQuest LLC, 2020

Binary item response theory (IRT) models are widely used in educational testing data. These models are not perfect because they simplify the individual item responding process, ignore the differences among different response patterns, cannot handle multidimensionality that lay behind options within a single item, and cannot manage missing response…

Descriptors: Item Response Theory, Educational Testing, Data, Models

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 18

Educational and Psychological…	26
ProQuest LLC	21
Applied Psychological…	19
Journal of Educational…	18
ETS Research Report Series	17
Applied Measurement in…	15
International Journal of…	12
International Journal of…	5
Language Testing	5
Eurasian Journal of…	4
Grantee Submission	4
Online Submission	4
Practical Assessment,…	4
Educational Measurement:…	3
Educational Research and…	3
International Journal of…	3
Language Assessment Quarterly	3
Asia Pacific Education Review	2
Assessment & Evaluation in…	2
Educational Research and…	2
Educational Sciences: Theory…	2
IEEE Transactions on Learning…	2
International Journal of…	2
Journal of Educational and…	2
Journal of Speech, Language,…	2
More ▼

Cohen, Allan S.	6
Kim, Seock-Ho	5
von Davier, Matthias	5
Finch, Holmes	4
Lee, Won-Chan	4
Zhang, Jinming	4
Chang, Hua-Hua	3
DeBoer, George E.	3
DeMars, Christine E.	3
Hambleton, Ronald K.	3
Herrmann-Abell, Cari F.	3
Lee, Yi-Hsuan	3
Penfield, Randall D.	3
Rogers, W. Todd	3
Schumacker, Randall E.	3
Smith, Richard M.	3
Cai, Li	2
Cho, Sun-Joo	2
De Champlain, Andre F.	2
Dodd, Barbara G.	2
Elosua, Paula	2
Gattamorta, Karina A.	2
Hardcastle, Joseph	2
He, Wei	2
More ▼

Trends in International…	8
Program for International…	6
Progress in International…	4
SAT (College Admission Test)	4
Graduate Record Examinations	3
National Assessment of…	3
Test of English as a Foreign…	3
ACT Assessment	2
Advanced Placement…	2
Iowa Tests of Basic Skills	2
Defining Issues Test	1
Graduate Management Admission…	1
International English…	1
Law School Admission Test	1
Peabody Individual…	1
Peabody Picture Vocabulary…	1
State of Texas Assessments of…	1
United States Medical…	1
Work Keys (ACT)	1
More ▼