ERIC - Search Results

Publication Date

In 2025	2
Since 2024	4
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	17
Since 2006 (last 20 years)	25

Descriptor

Bayesian Statistics	30
Item Response Theory	30
Sample Size	30
Simulation	13
Comparative Analysis	9
Maximum Likelihood Statistics	9
Models	9
Computation	7
Monte Carlo Methods	7
Test Items	7
Test Length	7
Correlation	6
Statistical Distributions	6
Estimation (Mathematics)	5
Item Analysis	5
Statistical Analysis	5
Accuracy	4
Error of Measurement	4
Evaluation Methods	4
Markov Processes	4
Statistical Bias	4
Statistical Inference	4
Ability	3
Computer Simulation	3
Data Analysis	3

Source

Educational and Psychological…	8
Applied Psychological…	4
Journal of Educational and…	4
Journal of Educational…	3
ProQuest LLC	3
International Journal of…	2
Measurement:…	2
Computers & Education	1
Journal of Experimental…	1
Psychometrika	1

Publication Type

Journal Articles	26
Reports - Research	20
Reports - Evaluative	5
Dissertations/Theses -…	3
Reports - Descriptive	1
Speeches/Meeting Papers	1

Education Level

Secondary Education	2
Elementary Education	1
Elementary Secondary Education	1
Grade 4	1
Grade 8	1
Junior High Schools	1
Middle Schools	1

Location

Armenia	1
Austria	1
Iran	1
Norway	1
Taiwan	1
Tunisia	1

Assessments and Surveys

Trends in International…	2
National Longitudinal Study…	1

Showing 1 to 15 of 30 results

Redefining Item Response Models for Small Samples

Peer reviewed

Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025

Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable represents the incidental parameter problem since the number of parameters increases when including data of new persons. Therefore, IRT models require a specific estimation method…

Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics

Item Parameter Recovery: Sensitivity to Prior Distribution

Peer reviewed

Christine E. DeMars; Paulius Satkus – Educational and Psychological Measurement, 2024

Marginal maximum likelihood, a common estimation method for item response theory models, is not inherently a Bayesian procedure. However, due to estimation difficulties, Bayesian priors are often applied to the likelihood when estimating 3PL models, especially with small samples. Little focus has been placed on choosing the priors for marginal…

Descriptors: Item Response Theory, Statistical Distributions, Error of Measurement, Bayesian Statistics

A Comparison of Common IRT Model-Selection Methods with Mixed-Format Tests

Peer reviewed

Luo, Yong – Measurement: Interdisciplinary Research and Perspectives, 2021

To date, only frequentist model-selection methods have been studied with mixed-format data in the context of IRT model-selection, and it is unknown how popular Bayesian model-selection methods such as DIC, WAIC, and LOO perform. In this study, we present the results of a comprehensive simulation study that compared the performances of eight…

Descriptors: Item Response Theory, Test Format, Selection, Methods

Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models

Peer reviewed

Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025

The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…

Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies

An Evaluation of Fit Indices Used in Model Selection of Dichotomous Mixture IRT Models

Peer reviewed

Sedat Sen; Allan S. Cohen – Educational and Psychological Measurement, 2024

A Monte Carlo simulation study was conducted to compare fit indices used for detecting the correct latent class in three dichotomous mixture item response theory (IRT) models. Ten indices were considered: Akaike's information criterion (AIC), the corrected AIC (AICc), Bayesian information criterion (BIC), consistent AIC (CAIC), Draper's…

Descriptors: Goodness of Fit, Item Response Theory, Sample Size, Classification

A General Bayesian Multidimensional Item Response Theory Model for Small and Large Samples

Peer reviewed

Fujimoto, Ken A.; Neugebauer, Sabina R. – Educational and Psychological Measurement, 2020

Although item response theory (IRT) models such as the bifactor, two-tier, and between-item-dimensionality IRT models have been devised to confirm complex dimensional structures in educational and psychological data, they can be challenging to use in practice. The reason is that these models are multidimensional IRT (MIRT) models and thus are…

Descriptors: Bayesian Statistics, Item Response Theory, Sample Size, Factor Structure

Bayesian Hierarchical Multidimensional Item Response Modeling of Small Sample, Sparse Data for Personalized Developmental Surveillance

Peer reviewed

Gilholm, Patricia; Mengersen, Kerrie; Thompson, Helen – Educational and Psychological Measurement, 2021

Developmental surveillance tools are used to closely monitor the early development of infants and young children. This study provides a novel implementation of a multidimensional item response model, using Bayesian hierarchical priors, to construct developmental profiles for a small sample of children (N = 115) with sparse data collected through…

Descriptors: Bayesian Statistics, Item Response Theory, Sample Size, Child Development

Rasch versus Classical Equating in the Context of Small Sample Sizes

Peer reviewed

Babcock, Ben; Hodge, Kari J. – Educational and Psychological Measurement, 2020

Equating and scaling in the context of small sample exams, such as credentialing exams for highly specialized professions, has received increased attention in recent research. Investigators have proposed a variety of both classical and Rasch-based approaches to the problem. This study attempts to extend past research by (1) directly comparing…

Descriptors: Item Response Theory, Equated Scores, Scaling, Sample Size

The Impact of Various Class-Distinction Features on Model Selection in the Mixture Rasch Model

Peer reviewed

Choi, In-Hee; Paek, Insu; Cho, Sun-Joo – Journal of Experimental Education, 2017

The purpose of the current study is to examine the performance of four information criteria (Akaike's information criterion [AIC], corrected AIC [AICC], Bayesian information criterion [BIC], sample-size adjusted BIC [SABIC]) for detecting the correct number of latent classes in the mixture Rasch model through simulations. The simulation study…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Simulation

An Improved Estimation Using Polya-Gamma Augmentation for Bayesian Structural Equation Models with Dichotomous Variables

Peer reviewed

Kim, Seohyun; Lu, Zhenqiu; Cohen, Allan S. – Measurement: Interdisciplinary Research and Perspectives, 2018

Bayesian algorithms have been used successfully in the social and behavioral sciences to analyze dichotomous data, particularly with complex structural equation models. In this study, we investigate the use of the Polya-Gamma data augmentation method with Gibbs sampling to improve estimation of structural equation models with dichotomous variables.…

Descriptors: Bayesian Statistics, Structural Equation Models, Computation, Social Science Research

Interval Estimation of Latent Variable Scores in Item Response Theory

Peer reviewed

Liu, Yang; Yang, Ji Seung – Journal of Educational and Behavioral Statistics, 2018

The uncertainty arising from item parameter estimation is often not negligible and must be accounted for when calculating latent variable (LV) scores in item response theory (IRT). It is particularly so when the calibration sample size is limited and/or the calibration IRT model is complex. In the current work, we treat two-stage IRT scoring as a…

Descriptors: Intervals, Scores, Item Response Theory, Bayesian Statistics

Mixture IRT Model with a Higher-Order Structure for Latent Traits

Peer reviewed

Huang, Hung-Yu – Educational and Psychological Measurement, 2017

Mixture item response theory (IRT) models have been suggested as an efficient method of detecting the different response patterns derived from latent classes when developing a test. In testing situations, multiple latent traits measured by a battery of tests can exhibit a higher-order structure, and mixtures of latent classes may occur on…

Descriptors: Item Response Theory, Models, Bayesian Statistics, Computation

Detecting Differential Item Discrimination (DID) and the Consequences of Ignoring DID in Multilevel Item Response Models

Peer reviewed

Lee, Woo-yeol; Cho, Sun-Joo – Journal of Educational Measurement, 2017

Cross-level invariance in a multilevel item response model can be investigated by testing whether the within-level item discriminations are equal to the between-level item discriminations. Testing the cross-level invariance assumption is important to understand constructs in multilevel data. However, in most multilevel item response model…

Descriptors: Test Items, Item Response Theory, Item Analysis, Simulation

Lord's Wald Test for Detecting DIF in Multidimensional IRT Models: A Comparison of Two Estimation Approaches

Peer reviewed

Lee, Soo; Suh, Youngsuk – Journal of Educational Measurement, 2018

Lord's Wald test for differential item functioning (DIF) has not been studied extensively in the context of the multidimensional item response theory (MIRT) framework. In this article, Lord's Wald test was implemented using two estimation approaches, marginal maximum likelihood estimation and Bayesian Markov chain Monte Carlo estimation, to detect…

Descriptors: Item Response Theory, Sample Size, Models, Error of Measurement

A Comparative Study of Online Item Calibration Methods in Multidimensional Computerized Adaptive Testing

Peer reviewed

Chen, Ping – Journal of Educational and Behavioral Statistics, 2017

Calibration of new items online has been an important topic in item replenishment for multidimensional computerized adaptive testing (MCAT). Several online calibration methods have been proposed for MCAT, such as multidimensional "one expectation-maximization (EM) cycle" (M-OEM) and multidimensional "multiple EM cycles"…

Descriptors: Test Items, Item Response Theory, Test Construction, Adaptive Testing

Author

Kim, Seock-Ho	3
Cho, Sun-Joo	2
Hong, Yuan	2
Swaminathan, Hariharan	2
de la Torre, Jimmy	2
Allan S. Cohen	1
Babcock, Ben	1
Chen, Ping	1
Choi, In-Hee	1
Christine E. DeMars	1
Cohen, Allan S.	1
Deng, Weiling	1
Desmet, Piet	1
Fujimoto, Ken A.	1
Gifford, Janice A.	1
Gilholm, Patricia	1
Glas, Cees A. W.	1
Hambleton, Ronald K.	1
Harwell, Michael R.	1
Hein, Serge F.	1
Hodge, Kari J.	1
Huang, Hung-Yu	1
Janosky, Janine E.	1
Jean-Paul Fox	1