Showing 1 to 15 of 51 results
Peer reviewed
Direct link
Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025
Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable gives rise to the incidental parameter problem, since the number of parameters grows as data from new persons are included. Therefore, IRT models require a specific estimation method…
Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics
Peer reviewed
Download full text (PDF on ERIC)
Wan, Siyu; Keller, Lisa A. – Practical Assessment, Research & Evaluation, 2023
Statistical process control (SPC) charts have been widely used in the field of educational measurement. The cumulative sum (CUSUM) is an established SPC method for detecting aberrant responses on educational assessments. Many studies have investigated the performance of CUSUM in different test settings. This paper describes the CUSUM…
Descriptors: Visual Aids, Educational Assessment, Evaluation Methods, Item Response Theory
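In broad strokes, a CUSUM person-fit chart accumulates standardized item-score residuals for one examinee and flags the examinee when the running sum drifts past a control limit. The Python sketch below illustrates only that general idea; it is not the authors' implementation, and the item parameters, responses, and threshold h are invented for illustration.

```python
import numpy as np

def p_2pl(theta, a, b):
    """2PL probability of a correct response."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def cusum_flags(responses, theta, a, b, h=2.0):
    """Upper/lower CUSUM of standardized item residuals for one examinee.

    Flags the examinee as aberrant if either chart crosses the threshold h
    (illustrative value; operational limits are usually set by simulation).
    """
    p = p_2pl(theta, a, b)
    z = (responses - p) / np.sqrt(p * (1 - p))   # standardized residuals
    c_plus, c_minus = 0.0, 0.0
    upper, lower = [], []
    for zi in z:
        c_plus = max(0.0, c_plus + zi)
        c_minus = min(0.0, c_minus + zi)
        upper.append(c_plus)
        lower.append(c_minus)
    return max(upper) > h or min(lower) < -h, np.array(upper), np.array(lower)

# toy example: 10 items, a fitted ability of 0.3
a = np.array([1.2, 0.8, 1.0, 1.5, 0.9, 1.1, 1.3, 0.7, 1.0, 1.4])
b = np.array([-1.0, -0.5, 0.0, 0.2, 0.5, 0.8, 1.0, -0.2, 0.3, 1.2])
x = np.array([1, 1, 1, 0, 0, 1, 0, 1, 1, 0])
flag, up, low = cusum_flags(x, theta=0.3, a=a, b=b)
print("aberrant:", flag)
```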
Peer reviewed
Direct link
Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022
Educational assessments often require uniform test forms, in which each form has equivalent measurement accuracy but a different set of items. For uniform test assembly, an important issue is increasing the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…
Descriptors: Simulation, Efficiency, Test Items, Educational Assessment
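The maximum clique formulation mentioned above can be pictured as a graph whose nodes are candidate forms of comparable measurement accuracy, with an edge joining two forms that share no items; a clique is then a set of mutually non-overlapping uniform forms. A rough Python sketch of that general formulation (not the authors' method), using networkx and made-up candidate forms:

```python
import networkx as nx

# hypothetical candidate forms: each is a set of item IDs, assumed to have
# already passed a measurement-accuracy screening (omitted here)
forms = {
    "F1": {1, 2, 3, 4},
    "F2": {5, 6, 7, 8},
    "F3": {3, 9, 10, 11},
    "F4": {12, 13, 14, 15},
    "F5": {8, 16, 17, 18},
}

# nodes are forms; an edge means the two forms share no items
G = nx.Graph()
G.add_nodes_from(forms)
names = list(forms)
for i, u in enumerate(names):
    for v in names[i + 1:]:
        if forms[u].isdisjoint(forms[v]):
            G.add_edge(u, v)

# the largest clique is the largest set of mutually item-disjoint forms
largest = max(nx.find_cliques(G), key=len)
print("largest set of non-overlapping uniform forms:", largest)
```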
Peer reviewed
Direct link
Ranger, Jochen; Kuhn, Jörg-Tobias; Wolgast, Anett – Journal of Educational Measurement, 2021
Van der Linden's hierarchical model for responses and response times can be used to infer the ability and mental speed of test takers from their responses and response times on an educational test. A standard approach for this is maximum likelihood estimation. In real-world applications, the data of some test takers might be partly…
Descriptors: Models, Reaction Time, Item Response Theory, Tests
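For orientation, van der Linden's hierarchical model combines a bivariate normal distribution for ability and speed, an IRT model for responses (a 2PL in the sketch below), and a lognormal model for response times. A minimal data-generating sketch in Python, with invented parameter values, that ignores the incomplete-data issue the article addresses:

```python
import numpy as np

rng = np.random.default_rng(1)
n_persons, n_items = 500, 20

# person parameters: ability theta and speed tau, correlated
cov = np.array([[1.0, 0.3],
                [0.3, 0.25]])
theta, tau = rng.multivariate_normal([0.0, 0.0], cov, size=n_persons).T

# item parameters (illustrative values)
a = rng.uniform(0.8, 1.6, n_items)        # discrimination
b = rng.normal(0.0, 1.0, n_items)         # difficulty
alpha = rng.uniform(1.5, 2.5, n_items)    # time discrimination
beta = rng.normal(0.5, 0.3, n_items)      # time intensity

# responses: 2PL
p = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b)))
responses = rng.binomial(1, p)

# response times: lognormal with log-mean (beta - tau) and sd 1/alpha
log_rt = rng.normal(beta - tau[:, None], 1.0 / alpha)
rt = np.exp(log_rt)
print(responses.shape, rt.shape)
```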
Peer reviewed
Direct link
Student, Sanford R. – Educational Researcher, 2022
Empirical growth benchmarks, as introduced by Hill, Bloom, Black, and Lipsey (2008), are a well-known way to contextualize effect sizes in education research. Past work on these benchmarks, both positive and negative, has largely avoided confronting the role of vertical scales, yet technical issues with vertical scales trouble the use of such…
Descriptors: Computer Simulation, Benchmarking, Effect Size, Intervention
Peer reviewed
Direct link
Leventhal, Brian; Ames, Allison – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Brian Leventhal and Dr. Allison Ames provide an overview of "Monte Carlo simulation studies" (MCSS) in "item response theory" (IRT). MCSS are utilized for a variety of reasons, one of the most compelling being that they can be used when analytic solutions are impractical or nonexistent because…
Descriptors: Item Response Theory, Monte Carlo Methods, Simulation, Test Items
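The typical MCSS workflow is: fix "true" generating parameters, simulate data from them, estimate, and summarize recovery across replications. A schematic Python sketch of that loop follows; the estimator is only a crude stand-in (a real study would use an IRT calibration such as marginal maximum likelihood), and all values are invented:

```python
import numpy as np

rng = np.random.default_rng(42)
n_items, n_persons, n_reps = 20, 1000, 100

# "truth": known generating item parameters for a 2PL model
a_true = rng.uniform(0.8, 1.6, n_items)
b_true = rng.normal(0.0, 1.0, n_items)

b_hat = np.empty((n_reps, n_items))
for r in range(n_reps):
    theta = rng.normal(0.0, 1.0, n_persons)
    p = 1.0 / (1.0 + np.exp(-a_true * (theta[:, None] - b_true)))
    x = rng.binomial(1, p)
    # placeholder estimator: a rough logit transform of proportion correct
    # stands in for a proper IRT calibration step
    prop = x.mean(axis=0).clip(0.01, 0.99)
    b_hat[r] = -np.log(prop / (1 - prop))

# summarize parameter recovery across replications
bias = (b_hat - b_true).mean(axis=0)
rmse = np.sqrt(((b_hat - b_true) ** 2).mean(axis=0))
print("mean |bias|:", np.abs(bias).mean(), " mean RMSE:", rmse.mean())
```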
Peer reviewed
Direct link
Matta, Tyler H.; Rutkowski, Leslie; Rutkowski, David; Liaw, Yuan-Ling – Large-scale Assessments in Education, 2018
This article provides an overview of the R package lsasim, designed to facilitate the generation of data that mimic a large-scale assessment context. The package features functions for simulating achievement data according to a number of common IRT models with known parameters. A clear advantage of lsasim over other simulation software is that…
Descriptors: Measurement, Data, Simulation, Item Response Theory
Peer reviewed
Download full text (PDF on ERIC)
Fu, Jianbin – ETS Research Report Series, 2019
A maximum marginal likelihood estimation procedure with an expectation-maximization algorithm has been developed for estimating multigroup or mixture multidimensional item response theory models using the generalized partial credit function, the graded response function, and the 3-parameter logistic function. The procedure includes the estimation of item…
Descriptors: Maximum Likelihood Statistics, Mathematics, Item Response Theory, Expectation
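For reference, the 3-parameter logistic function named above, and the marginal likelihood that such an EM algorithm maximizes, can be written compactly. A small sketch assuming a standard normal latent trait and a simple quadrature grid (illustrative values only, not the report's estimation code, and showing only the unidimensional 3PL case):

```python
import numpy as np

def p_3pl(theta, a, b, c):
    """3PL: guessing c plus (1 - c) times a 2PL curve."""
    return c + (1.0 - c) / (1.0 + np.exp(-a * (theta - b)))

def marginal_loglik(x, a, b, c, n_quad=41):
    """Marginal log-likelihood of response patterns x (persons x items),
    integrating ability out over a normal prior on a quadrature grid."""
    nodes = np.linspace(-4, 4, n_quad)
    weights = np.exp(-0.5 * nodes ** 2)
    weights /= weights.sum()                      # normalized N(0,1) weights
    P = p_3pl(nodes[:, None], a, b, c)            # quad points x items
    # likelihood of each person's pattern at each quadrature point
    like = np.prod(np.where(x[:, None, :] == 1, P, 1 - P), axis=2)
    return np.log(like @ weights).sum()

# illustrative item parameters and simulated data
rng = np.random.default_rng(0)
a = rng.uniform(0.8, 1.6, 15)
b = rng.normal(0, 1, 15)
c = np.full(15, 0.2)
theta = rng.normal(0, 1, 300)
x = rng.binomial(1, p_3pl(theta[:, None], a, b, c))
print(marginal_loglik(x, a, b, c))
```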
Peer reviewed
Download full text (PDF on ERIC)
Lee, Sunbok; Choi, Youn-Jeng; Cohen, Allan S. – International Journal of Assessment Tools in Education, 2018
A simulation study is a useful tool in examining how validly item response theory (IRT) models can be applied in various settings. Typically, a large number of replications are required to obtain the desired precision. However, many standard software packages in IRT, such as MULTILOG and BILOG, are not well suited for a simulation study requiring…
Descriptors: Item Response Theory, Simulation, Replication (Evaluation), Automation
Peer reviewed
Direct link
Feinberg, Richard A.; Rubright, Jonathan D. – Educational Measurement: Issues and Practice, 2016
Simulation studies are fundamental to psychometric discourse and play a crucial role in operational and academic research. Yet, resources for psychometricians interested in conducting simulations are scarce. This Instructional Topics in Educational Measurement Series (ITEMS) module is meant to address this deficiency by providing a comprehensive…
Descriptors: Simulation, Psychometrics, Vocabulary, Research Design
Peer reviewed
Direct link
Luecht, Richard; Ackerman, Terry A. – Educational Measurement: Issues and Practice, 2018
Simulation studies are extremely common in the item response theory (IRT) research literature. This article presents a didactic discussion of "truth" and "error" in IRT-based simulation studies. We ultimately recommend that future research focus less on the simple recovery of parameters from a convenient generating IRT model,…
Descriptors: Item Response Theory, Simulation, Ethics, Error of Measurement
Peer reviewed
Direct link
Fujimoto, Ken A. – Journal of Educational Measurement, 2020
Multilevel bifactor item response theory (IRT) models are commonly used to account for features of the data that are related to the sampling and measurement processes used to gather those data. These models conventionally make assumptions about the portions of the data structure that represent these features. Unfortunately, when data violate these…
Descriptors: Bayesian Statistics, Item Response Theory, Achievement Tests, Secondary School Students
Peer reviewed
Direct link
Yoo, Hanwook; Hambleton, Ronald K. – Educational Measurement: Issues and Practice, 2019
Item analysis is an integral part of operational test development and is typically conducted within two popular statistical frameworks: classical test theory (CTT) and item response theory (IRT). In this digital ITEMS module, Hanwook Yoo and Ronald K. Hambleton provide an accessible overview of operational item analysis approaches within these…
Descriptors: Item Analysis, Item Response Theory, Guidelines, Test Construction
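On the CTT side of item analysis, the core statistics are item difficulty (the proportion correct) and a discrimination index such as the corrected point-biserial. A minimal sketch on a made-up scored response matrix (not taken from the module):

```python
import numpy as np

rng = np.random.default_rng(7)
n_persons, n_items = 200, 10

# made-up scored responses driven by a single ability factor
ability = rng.normal(size=(n_persons, 1))
difficulty_true = np.linspace(-1.5, 1.5, n_items)
x = rng.binomial(1, 1 / (1 + np.exp(-(ability - difficulty_true))))

total = x.sum(axis=1)
p_values = x.mean(axis=0)                  # CTT item difficulty (p-values)

# corrected point-biserial: correlate each item with the total score
# computed from the remaining items
rest = total[:, None] - x
r_pb = np.array(
    [np.corrcoef(x[:, j], rest[:, j])[0, 1] for j in range(n_items)]
)

for j in range(n_items):
    print(f"item {j + 1:2d}: p = {p_values[j]:.2f}, corrected r_pb = {r_pb[j]:.2f}")
```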
Peer reviewed
Direct link
Choi, Youn-Jeng; Asilkalkan, Abdullah – Measurement: Interdisciplinary Research and Perspectives, 2019
About 45 R packages to analyze data using item response theory (IRT) have been developed over the last decade. This article introduces these 45 R packages with their descriptions and features. It also describes possible advanced IRT models using R packages, as well as dichotomous and polytomous IRT models, and R packages that contain applications…
Descriptors: Item Response Theory, Data Analysis, Computer Software, Test Bias
Peer reviewed
Direct link
Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020
Although the root mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…
Descriptors: Test Items, Goodness of Fit, Probability, Accuracy
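In essence, the RMSD compares a group's observed item characteristic curve with the model-implied curve and averages the squared gap over that group's proficiency distribution, which is why the proficiency distribution can drive its sensitivity. A small sketch under those definitions, with invented curves and group means:

```python
import numpy as np

def rmsd(p_observed, p_model, density):
    """Root mean squared deviation between observed and model-implied
    item response curves, weighted by the group's ability density."""
    w = density / density.sum()
    return np.sqrt(np.sum(w * (p_observed - p_model) ** 2))

theta = np.linspace(-4, 4, 81)

# model-implied curve (international parameters, illustrative)
p_model = 1 / (1 + np.exp(-1.1 * (theta - 0.2)))
# "observed" country-specific curve with a uniform shift in difficulty
p_obs = 1 / (1 + np.exp(-1.1 * (theta - 0.2 - 0.5)))

# the same misfit registers differently under different proficiency distributions
for mu in (-1.0, 0.0, 1.0):
    dens = np.exp(-0.5 * (theta - mu) ** 2)
    print(f"group mean {mu:+.1f}: RMSD = {rmsd(p_obs, p_model, dens):.3f}")
```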