ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	16
Since 2006 (last 20 years)	31

Descriptor

Comparative Analysis	35
Item Response Theory	35
Computer Software	34
Models	17
Test Items	13
Simulation	9
Bayesian Statistics	8
Correlation	8
Item Analysis	8
Statistical Analysis	7
Computation	6
Foreign Countries	6
Monte Carlo Methods	6
Accuracy	5
Computer Assisted Testing	5
Factor Analysis	5
Educational Assessment	4
Markov Processes	4
Measurement	4
Measurement Techniques	4
Programming	4
Psychometrics	4
Scores	4
Test Length	4
Artificial Intelligence	3
More ▼

Source

Educational and Psychological…	6
Applied Psychological…	4
Grantee Submission	3
International Educational…	3
Educational Technology &…	2
IEEE Transactions on Learning…	2
Journal of Educational…	2
Measurement:…	2
ProQuest LLC	2
Applied Measurement in…	1
International Journal of…	1
Journal of Educational and…	1
Journal of Experimental…	1
Language Testing in Asia	1
Routledge, Taylor & Francis…	1
Structural Equation Modeling:…	1
Teaching of Psychology	1
More ▼

Publication Type

Journal Articles	27
Reports - Research	22
Reports - Evaluative	6
Books	2
Dissertations/Theses -…	2
Reports - Descriptive	2
Speeches/Meeting Papers	2
Collected Works - Proceedings	1
Reports - General	1

Education Level

Higher Education	6
Postsecondary Education	4
Elementary Secondary Education	3
Secondary Education	2
Elementary Education	1
Grade 6	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1
Kindergarten	1
Middle Schools	1
More ▼

Audience

Practitioners	1
Researchers	1
Students	1

Location

Taiwan	2
Canada	1
Finland	1
France	1
Japan	1

Laws, Policies, & Programs

Assessments and Surveys

National Education…	1
Program for International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 35 results Save | Export

Item Response Theory and Modeling with Stata

Peer reviewed

Direct link

Raykov, Tenko – Measurement: Interdisciplinary Research and Perspectives, 2023

This software review discusses the capabilities of Stata to conduct item response theory modeling. The commands needed for fitting the popular one-, two-, and three-parameter logistic models are initially discussed. The procedure for testing the discrimination parameter equality in the one-parameter model is then outlined. The commands for fitting…

Descriptors: Item Response Theory, Models, Comparative Analysis, Item Analysis

A Comparative Study of AI-Human-Made and Human-Made Test Forms for a University TESOL Theory Course

Peer reviewed

Direct link

Kyung-Mi O. – Language Testing in Asia, 2024

This study examines the efficacy of artificial intelligence (AI) in creating parallel test items compared to human-made ones. Two test forms were developed: one consisting of 20 existing human-made items and another with 20 new items generated with ChatGPT assistance. Expert reviews confirmed the content parallelism of the two test forms.…

Descriptors: Comparative Analysis, Artificial Intelligence, Computer Software, Test Items

Hybrid Maximum Clique Algorithm Using Parallel Integer Programming for Uniform Test Assembly

Peer reviewed

Direct link

Fuchimoto, Kazuma; Ishii, Takatoshi; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2022

Educational assessments often require uniform test forms, for which each test form has equivalent measurement accuracy but with a different set of items. For uniform test assembly, an important issue is the increase of the number of assembled uniform tests. Although many automatic uniform test assembly methods exist, the maximum clique algorithm…

Descriptors: Simulation, Efficiency, Test Items, Educational Assessment

The Comparison of Estimation Methods for the Four-Parameter Logistic Item Response Theory Model

Peer reviewed

Direct link

Kalkan, Ömür Kaya – Measurement: Interdisciplinary Research and Perspectives, 2022

The four-parameter logistic (4PL) Item Response Theory (IRT) model has recently been reconsidered in the literature due to the advances in the statistical modeling software and the recent developments in the estimation of the 4PL IRT model parameters. The current simulation study evaluated the performance of expectation-maximization (EM),…

Descriptors: Comparative Analysis, Sample Size, Test Length, Algorithms

Content and Item Response Theory Analysis of ChatGPT-4-Generated Multiple-Choice Items

Peer reviewed

Direct link

Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025

Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality. The quality of AI-generated MCIs and human experts is comparable. However, whether the quality of AI-generated MCIs is equally good across various domain-…

Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks

Bayesian Model Selection Methods for Multilevel IRT Models: A Comparison of Five DIC-Based Indices

Peer reviewed

Direct link

Zhang, Xue; Tao, Jian; Wang, Chun; Shi, Ning-Zhong – Journal of Educational Measurement, 2019

Model selection is important in any statistical analysis, and the primary goal is to find the preferred (or most parsimonious) model, based on certain criteria, from a set of candidate models given data. Several recent publications have employed the deviance information criterion (DIC) to do model selection among different forms of multilevel item…

Descriptors: Bayesian Statistics, Item Response Theory, Measurement, Models

Bayesian Model Selection Methods for Multilevel IRT Models: A Comparison of Five DIC-Based Indices

Peer reviewed
PDF on ERIC

Download full text

Direct link

Zhang, Xue; Tao, Jian; Wang, Chun; Shi, Ning-Zhong – Grantee Submission, 2019

Descriptors: Bayesian Statistics, Item Response Theory, Measurement, Models

Bayesian Comparison of Latent Variable Models: Conditional vs Marginal Likelihoods

Peer reviewed
PDF on ERIC

Download full text

Direct link

Merkle, E. C.; Furr, D.; Rabe-Hesketh, S. – Grantee Submission, 2019

Typical Bayesian methods for models with latent variables (or random effects) involve directly sampling the latent variables along with the model parameters. In high-level software code for model definitions (using, e.g., BUGS, JAGS, Stan), the likelihood is therefore specified as conditional on the latent variables. This can lead researchers to…

Descriptors: Bayesian Statistics, Comparative Analysis, Computer Software, Models

On Longitudinal Item Response Theory Models: A Didactic

Peer reviewed
PDF on ERIC

Download full text

Direct link

Wang, Chun; Nydick, Steven W. – Journal of Educational and Behavioral Statistics, 2020

Recent work on measuring growth with categorical outcome variables has combined the item response theory (IRT) measurement model with the latent growth curve model and extended the assessment of growth to multidimensional IRT models and higher order IRT models. However, there is a lack of synthetic studies that clearly evaluate the strength and…

Descriptors: Item Response Theory, Longitudinal Studies, Comparative Analysis, Models

Sparse Factor Autoencoders for Item Response Theory

Peer reviewed
PDF on ERIC

Download full text

PaaBen, Benjamin; Dywel, Malwina; Fleckenstein, Melanie; Pinkwart, Niels – International Educational Data Mining Society, 2022

Item response theory (IRT) is a popular method to infer student abilities and item difficulties from observed test responses. However, IRT struggles with two challenges: How to map items to skills if multiple skills are present? And how to infer the ability of new students that have not been part of the training data? Inspired by recent advances…

Descriptors: Item Response Theory, Test Items, Item Analysis, Inferences

LANA: Towards Personalized Deep Knowledge Tracing through Distinguishable Interactive Sequences

Peer reviewed
PDF on ERIC

Download full text

Zhou, Yuhao; Li, Xihua; Cao, Yunbo; Zhao, Xuemin; Ye, Qing; Lv, Jiancheng – International Educational Data Mining Society, 2021

In educational applications, "Knowledge Tracing" (KT) has been widely studied for decades as it is considered a fundamental task towards adaptive online learning. Among proposed KT methods, Deep Knowledge Tracing (DKT) and its variants are by far the most effective ones due to the high flexibility of the neural network. However, DKT…

Descriptors: Online Courses, Computer Assisted Instruction, Networks, Learning Analytics

On Longitudinal Item Response Theory Models: A Didactic

Peer reviewed
PDF on ERIC

Download full text

Direct link

Wang, Chun; Nydick, Steven W. – Grantee Submission, 2019

Recent work on measuring growth with categorical outcome variables has combined the item response theory (IRT) measurement model with the latent growth curve (LGC) model (e.g., McArdle, 1988) and extended the assessment of growth to multidimensional IRT models (e.g., Hsieh, von Eye, & Maier, 2010; Huang, 2013) and higher-order IRT models…

Descriptors: Longitudinal Studies, Item Response Theory, Comparative Analysis, Models

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

Comparative Analyses of MIRT Models and Software (BMIRT and flexMIRT)

Peer reviewed

Direct link

Yavuz, Guler; Hambleton, Ronald K. – Educational and Psychological Measurement, 2017

Application of MIRT modeling procedures is dependent on the quality of parameter estimates provided by the estimation software and techniques used. This study investigated model parameter recovery of two popular MIRT packages, BMIRT and flexMIRT, under some common measurement conditions. These packages were specifically selected to investigate the…

Descriptors: Item Response Theory, Models, Comparative Analysis, Computer Software

Using the Stan Program for Bayesian Item Response Theory

Peer reviewed

Direct link

Luo, Yong; Jiao, Hong – Educational and Psychological Measurement, 2018

Stan is a new Bayesian statistical software program that implements the powerful and efficient Hamiltonian Monte Carlo (HMC) algorithm. To date there is not a source that systematically provides Stan code for various item response theory (IRT) models. This article provides Stan code for three representative IRT models, including the…

Descriptors: Bayesian Statistics, Item Response Theory, Probability, Computer Software

Previous Page | Next Page »

Pages: 1 | 2 | 3

Wang, Chun	4
DeMars, Christine E.	2
Ishii, Takatoshi	2
Jiao, Hong	2
Nydick, Steven W.	2
Shi, Ning-Zhong	2
Tao, Jian	2
Ueno, Maomi	2
Zhang, Xue	2
Alexander Kah	1
Bowles, Ben	1
Cao, Yunbo	1
Chen, Li-Ju	1
Chen, Yan-Lin	1
Chou, Kun-Yi	1
Custer, Michael	1
Deng, Nina	1
Dywel, Malwina	1
Emily Courtney	1
Fleckenstein, Melanie	1
Fuchimoto, Kazuma	1
Furr, D.	1
Hambleton, Ronald K.	1
Harlow, Iain M.	1
Harwell, Michael R.	1
More ▼