ERIC - Search Results

Showing 1 to 15 of 19 results

Bayesian Diagnostic Classification Models for a Partially Known Q-Matrix

Peer reviewed

Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025

This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…

Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item and a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Peer reviewed

Jordan M. Wheeler; Allan S. Cohen; Shiyu Wang – Journal of Educational and Behavioral Statistics, 2024

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming…

Descriptors: Semantics, Educational Assessment, Evaluators, Reliability

Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models

Peer reviewed

Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025

The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…

Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies

A Comparison of Joint Model and Fully Conditional Specification Imputation for Multilevel Missing Data

Peer reviewed

Mistler, Stephen A.; Enders, Craig K. – Journal of Educational and Behavioral Statistics, 2017

Multiple imputation methods can generally be divided into two broad frameworks: joint model (JM) imputation and fully conditional specification (FCS) imputation. JM draws missing values simultaneously for all incomplete variables using a multivariate distribution, whereas FCS imputes variables one at a time from a series of univariate conditional…

Descriptors: Statistical Analysis, Comparative Analysis, Hierarchical Linear Modeling, Computer Simulation

Bayesian Multilevel Latent Class Models for the Multiple Imputation of Nested Categorical Data

Peer reviewed

Vidotto, Davide; Vermunt, Jeroen K.; van Deun, Katrijn – Journal of Educational and Behavioral Statistics, 2018

With this article, we propose using a Bayesian multilevel latent class (BMLC; or mixture) model for the multiple imputation of nested categorical data. Unlike recently developed methods that can only pick up associations between pairs of variables, the multilevel mixture model we propose is flexible enough to automatically deal with complex…

Descriptors: Bayesian Statistics, Multivariate Analysis, Data, Hierarchical Linear Modeling

Estimation of Expected Fisher Information for IRT Models

Peer reviewed

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019

In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principle, these procedures may be carried out using either the expected information or the observed information. However, in…

Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences

Testing Latent Variable Distribution Fit in IRT Using Posterior Residuals

Peer reviewed

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2021

This research proposes a new statistic for testing latent variable distribution fit for unidimensional item response theory (IRT) models. If the typical assumption of normality is violated, then item parameter estimates will be biased, and dependent quantities such as IRT score estimates will be adversely affected. The proposed statistic compares…

Descriptors: Item Response Theory, Simulation, Scores, Comparative Analysis

Person Fit Analysis in Computerized Adaptive Testing Using Tests for a Change Point

Peer reviewed

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2016

Meijer and van Krimpen-Stoop noted that the number of person-fit statistics (PFSs) that have been designed for computerized adaptive tests (CATs) is relatively modest. This article partially addresses that concern by suggesting three new PFSs for CATs. The statistics are based on tests for a change point and can be used to detect an abrupt change…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Goodness of Fit

Posterior Predictive Checks for Conditional Independence between Response Time and Accuracy

Peer reviewed

Bolsinova, Maria; Tijmstra, Jesper – Journal of Educational and Behavioral Statistics, 2016

Conditional independence (CI) between response time and response accuracy is a fundamental assumption of many joint models for time and accuracy used in educational measurement. In this study, posterior predictive checks (PPCs) are proposed for testing this assumption. These PPCs are based on three discrepancy measures reflecting different…

Descriptors: Reaction Time, Accuracy, Statistical Analysis, Robustness (Statistics)

Covariate Adjustment Strategy Increases Power in the Randomized Controlled Trial With Discrete-Time Survival Endpoints

Peer reviewed

Safarkhani, Maryam; Moerbeek, Mirjam – Journal of Educational and Behavioral Statistics, 2013

In a randomized controlled trial, a decision needs to be made about the total number of subjects for adequate statistical power. One way to increase the power of a trial is by including a predictive covariate in the model. In this article, the effects of various covariate adjustment strategies on increasing the power are studied for discrete-time…

Descriptors: Statistical Analysis, Scientific Methodology, Research Design, Sample Size

Design-Comparable Effect Sizes in Multiple Baseline Designs: A General Modeling Framework

Peer reviewed

Pustejovsky, James E.; Hedges, Larry V.; Shadish, William R. – Journal of Educational and Behavioral Statistics, 2014

In single-case research, the multiple baseline design is a widely used approach for evaluating the effects of interventions on individuals. Multiple baseline designs involve repeated measurement of outcomes over time and the controlled introduction of a treatment at different times for different individuals. This article outlines a general…

Descriptors: Hierarchical Linear Modeling, Effect Size, Maximum Likelihood Statistics, Computation

A Multilevel Mixture IRT Model with an Application to DIF

Peer reviewed

Cho, Sun-Joo; Cohen, Allan S. – Journal of Educational and Behavioral Statistics, 2010

Mixture item response theory models have been suggested as a potentially useful methodology for identifying latent groups formed along secondary, possibly nuisance dimensions. In this article, we describe a multilevel mixture item response theory (IRT) model (MMixIRTM) that allows for the possibility that this nuisance dimensionality may function…

Descriptors: Simulation, Mathematics Tests, Item Response Theory, Student Behavior

Metropolis-Hastings Robbins-Monro Algorithm for Confirmatory Item Factor Analysis

Peer reviewed

Cai, Li – Journal of Educational and Behavioral Statistics, 2010

Item factor analysis (IFA), already well established in educational measurement, is increasingly applied to psychological measurement in research settings. However, high-dimensional confirmatory IFA remains a numerical challenge. The current research extends the Metropolis-Hastings Robbins-Monro (MH-RM) algorithm, initially proposed for…

Descriptors: Simulation, Questionnaires, Measurement, Factor Analysis

Effects on Scale Linking of Different Definitions of Criterion Functions for the IRT Characteristic Curve Methods

Peer reviewed

Kim, Seonghoon; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2007

Under item response theory, the characteristic curve methods (Haebara and Stocking-Lord methods) are used to link two ability scales from separate calibrations. The linking methods use their respective criterion functions that can be defined differently according to the symmetry- and distribution-related schemes. The symmetry-related scheme…

Descriptors: Measures (Individuals), Item Response Theory, Simulation, Comparative Analysis
