Publication Date
In 2025 | 2 |
Since 2024 | 6 |
Since 2021 (last 5 years) | 17 |
Since 2016 (last 10 years) | 45 |
Since 2006 (last 20 years) | 105 |
Descriptor
Bayesian Statistics | 120 |
Item Response Theory | 120 |
Models | 120 |
Computation | 40 |
Simulation | 38 |
Monte Carlo Methods | 35 |
Test Items | 30 |
Markov Processes | 28 |
Psychometrics | 22 |
Comparative Analysis | 21 |
Goodness of Fit | 19 |
More ▼ |
Source
Author
Huang, Hung-Yu | 5 |
Glas, Cees A. W. | 4 |
Sinharay, Sandip | 4 |
Tao, Jian | 4 |
Wang, Wen-Chung | 4 |
Ames, Allison J. | 3 |
Cho, Sun-Joo | 3 |
Cohen, Allan S. | 3 |
Mislevy, Robert J. | 3 |
Revuelta, Javier | 3 |
Shi, Ning-Zhong | 3 |
More ▼ |
Publication Type
Education Level
Secondary Education | 9 |
Higher Education | 8 |
Middle Schools | 7 |
Elementary Education | 5 |
Junior High Schools | 5 |
Postsecondary Education | 5 |
Elementary Secondary Education | 4 |
Intermediate Grades | 4 |
Grade 4 | 3 |
Grade 5 | 2 |
Grade 8 | 2 |
More ▼ |
Audience
Researchers | 2 |
Practitioners | 1 |
Location
Taiwan | 4 |
Brazil | 3 |
Czech Republic | 1 |
Europe | 1 |
Florida | 1 |
Germany | 1 |
Netherlands | 1 |
North Carolina | 1 |
Saudi Arabia | 1 |
Uruguay | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Ken A. Fujimoto; Carl F. Falk – Educational and Psychological Measurement, 2024
Item response theory (IRT) models are often compared with respect to predictive performance to determine the dimensionality of rating scale data. However, such model comparisons could be biased toward nested-dimensionality IRT models (e.g., the bifactor model) when comparing those models with non-nested-dimensionality IRT models (e.g., a…
Descriptors: Item Response Theory, Rating Scales, Predictive Measurement, Bayesian Statistics
Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025
Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable represents the incidental parameter problem since the number of parameters increases when including data of new persons. Therefore, IRT models require a specific estimation method…
Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics
Sooyong Lee; Suhwa Han; Seung W. Choi – Journal of Educational Measurement, 2024
Research has shown that multiple-indicator multiple-cause (MIMIC) models can result in inflated Type I error rates in detecting differential item functioning (DIF) when the assumption of equal latent variance is violated. This study explains how the violation of the equal variance assumption adversely impacts the detection of nonuniform DIF and…
Descriptors: Factor Analysis, Bayesian Statistics, Test Bias, Item Response Theory
Justin L. Kern – Journal of Educational and Behavioral Statistics, 2024
Given the frequent presence of slipping and guessing in item responses, models for the inclusion of their effects are highly important. Unfortunately, the most common model for their inclusion, the four-parameter item response theory model, potentially has severe deficiencies related to its possible unidentifiability. With this issue in mind, the…
Descriptors: Item Response Theory, Models, Bayesian Statistics, Generalization
Yuqi Gu; Elena A. Erosheva; Gongjun Xu; David B. Dunson – Grantee Submission, 2023
Mixed Membership Models (MMMs) are a popular family of latent structure models for complex multivariate data. Instead of forcing each subject to belong to a single cluster, MMMs incorporate a vector of subject-specific weights characterizing partial membership across clusters. With this flexibility come challenges in uniquely identifying,…
Descriptors: Multivariate Analysis, Item Response Theory, Bayesian Statistics, Models
Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models
Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025
The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…
Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies
Bürkner, Paul-Christian – Journal of Intelligence, 2020
Raven's Standard Progressive Matrices (SPM) test and related matrix-based tests are widely applied measures of cognitive ability. Using Bayesian Item Response Theory (IRT) models, I reanalyzed data of an SPM short form proposed by Myszkowski and Storme (2018) and, at the same time, illustrate the application of these models. Results indicate that…
Descriptors: Intelligence Tests, Matrices, Bayesian Statistics, Item Response Theory
Lozano, José H.; Revuelta, Javier – Applied Measurement in Education, 2021
The present study proposes a Bayesian approach for estimating and testing the operation-specific learning model, a variant of the linear logistic test model that allows for the measurement of the learning that occurs during a test as a result of the repeated use of the operations involved in the items. The advantages of using a Bayesian framework…
Descriptors: Bayesian Statistics, Computation, Learning, Testing
Huang, Hung-Yu – Educational and Psychological Measurement, 2023
The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs)…
Descriptors: Test Items, Classification, Bayesian Statistics, Decision Making
Sarsa, Sami; Leinonen, Juho; Hellas, Arto – Journal of Educational Data Mining, 2022
New knowledge tracing models are continuously being proposed, even at a pace where state-of-the-art models cannot be compared with each other at the time of publication. This leads to a situation where ranking models is hard, and the underlying reasons of the models' performance -- be it architectural choices, hyperparameter tuning, performance…
Descriptors: Learning Processes, Artificial Intelligence, Intelligent Tutoring Systems, Memory
Shi Pu; Yu Yan; Brandon Zhang – Journal of Educational Data Mining, 2024
We propose a novel model, Wide & Deep Item Response Theory (Wide & Deep IRT), to predict the correctness of students' responses to questions using historical clickstream data. This model combines the strengths of conventional Item Response Theory (IRT) models and Wide & Deep Learning for Recommender Systems. By leveraging clickstream…
Descriptors: Prediction, Success, Data Analysis, Learning Analytics
Zhang, Xue; Tao, Jian; Wang, Chun; Shi, Ning-Zhong – Journal of Educational Measurement, 2019
Model selection is important in any statistical analysis, and the primary goal is to find the preferred (or most parsimonious) model, based on certain criteria, from a set of candidate models given data. Several recent publications have employed the deviance information criterion (DIC) to do model selection among different forms of multilevel item…
Descriptors: Bayesian Statistics, Item Response Theory, Measurement, Models
Zhang, Xue; Tao, Jian; Wang, Chun; Shi, Ning-Zhong – Grantee Submission, 2019
Model selection is important in any statistical analysis, and the primary goal is to find the preferred (or most parsimonious) model, based on certain criteria, from a set of candidate models given data. Several recent publications have employed the deviance information criterion (DIC) to do model selection among different forms of multilevel item…
Descriptors: Bayesian Statistics, Item Response Theory, Measurement, Models
Fujimoto, Ken A. – Educational and Psychological Measurement, 2019
Advancements in item response theory (IRT) have led to models for dual dependence, which control for cluster and method effects during a psychometric analysis. Currently, however, this class of models does not include one that controls for when the method effects stem from two method sources in which one source functions differently across the…
Descriptors: Bayesian Statistics, Item Response Theory, Psychometrics, Models
List, Marit Kristine; Köller, Olaf; Nagy, Gabriel – Educational and Psychological Measurement, 2019
Tests administered in studies of student achievement often have a certain amount of not-reached items (NRIs). The propensity for NRIs may depend on the proficiency measured by the test and on additional covariates. This article proposes a semiparametric model to study such relationships. Our model extends Glas and Pimentel's item response theory…
Descriptors: Educational Assessment, Item Response Theory, Multivariate Analysis, Test Items