ERIC - Search Results

Showing 1 to 15 of 27 results

Bayesian Diagnostic Classification Models for a Partially Known Q-Matrix

Peer reviewed

Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025

This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…

Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods

Analyzing Polytomous Test Data: A Comparison between an Information-Based IRT Model and the Generalized Partial Credit Model

Peer reviewed

Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024

Item response theory (IRT) models the relationship between the possible scores on a test item and a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…

Descriptors: Item Response Theory, Test Items, Models, Scoring

Cognitive Diagnosis Testlet Model for Multiple-Choice Items

Peer reviewed

Lei Guo; Wenjie Zhou; Xiao Li – Journal of Educational and Behavioral Statistics, 2024

The testlet design is very popular in educational and psychological assessments. This article proposes a new cognitive diagnosis model, the multiple-choice cognitive diagnostic testlet (MC-CDT) model for tests using testlets consisting of MC items. The MC-CDT model uses the original examinees' responses to MC items instead of dichotomously scored…

Descriptors: Multiple Choice Tests, Diagnostic Tests, Accuracy, Computer Software

Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models

Peer reviewed

Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025

The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…

Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies

Mixed-Effects Location Scale Models for Joint Modeling School Value-Added Effects on the Mean and Variance of Student Achievement

Peer reviewed

George Leckie; Richard Parker; Harvey Goldstein; Kate Tilling – Journal of Educational and Behavioral Statistics, 2024

School value-added models are widely applied to study, monitor, and hold schools to account for school differences in student learning. The traditional model is a mixed-effects linear regression of student current achievement on student prior achievement, background characteristics, and a school random intercept effect. The latter is referred to…

Descriptors: Academic Achievement, Value Added Models, Accountability, Institutional Characteristics

Combining Human and Automated Scoring Methods in Experimental Assessments of Writing: A Case Study Tutorial

Peer reviewed

Reagan Mozer; Luke Miratrix; Jackie Eunjung Relyea; James S. Kim – Journal of Educational and Behavioral Statistics, 2024

In a randomized trial that collects text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by human raters. An impact analysis can then be conducted to compare treatment and control groups, using the hand-coded scores as a measured outcome. This…

Descriptors: Scoring, Evaluation Methods, Writing Evaluation, Comparative Analysis

Estimation of Expected Fisher Information for IRT Models

Peer reviewed

Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019

In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principle, these procedures may be carried out using either the expected information or the observed information. However, in…

Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences

Does the Package Matter? A Comparison of Five Common Multilevel Modeling Software Packages

Peer reviewed

McCoach, D. Betsy; Rifenbark, Graham G.; Newton, Sarah D.; Li, Xiaoran; Kooken, Janice; Yomtov, Dani; Gambino, Anthony J.; Bellara, Aarti – Journal of Educational and Behavioral Statistics, 2018

This study compared five common multilevel software packages via Monte Carlo simulation: HLM 7, Mplus 7.4, R (lme4 V1.1-12), Stata 14.1, and SAS 9.4 to determine how the programs differ in estimation accuracy and speed, as well as convergence, when modeling multiple randomly varying slopes of different magnitudes. Simulated data…

Descriptors: Hierarchical Linear Modeling, Computer Software, Comparative Analysis, Monte Carlo Methods

The Validity and Precision of the Comparative Interrupted Time-Series Design: Three Within-Study Comparisons

Peer reviewed

St. Clair, Travis; Hallberg, Kelly; Cook, Thomas D. – Journal of Educational and Behavioral Statistics, 2016

We explore the conditions under which short, comparative interrupted time-series (CITS) designs represent valid alternatives to randomized experiments in educational evaluations. To do so, we conduct three within-study comparisons, each of which uses a unique data set to test the validity of the CITS design by comparing its causal estimates to…

Descriptors: Research Methodology, Randomized Controlled Trials, Comparative Analysis, Time

Determining Sample Sizes for Precise Contrast Analysis with Heterogeneous Variances

Peer reviewed

Jan, Show-Li; Shieh, Gwowen – Journal of Educational and Behavioral Statistics, 2014

The analysis of variance (ANOVA) is one of the most frequently used statistical analyses in practical applications. Accordingly, the single and multiple comparison procedures are frequently applied to assess the differences among mean effects. However, the underlying assumption of homogeneous variances may not always be tenable. This study…

Descriptors: Sample Size, Statistical Analysis, Computation, Probability

IRT Item Parameter Recovery with Marginal Maximum Likelihood Estimation Using Loglinear Smoothing Models

Peer reviewed

Casabianca, Jodi M.; Lewis, Charles – Journal of Educational and Behavioral Statistics, 2015

Loglinear smoothing (LLS) estimates the latent trait distribution while making fewer assumptions about its form and maintaining parsimony, thus leading to more precise item response theory (IRT) item parameter estimates than standard marginal maximum likelihood (MML). This article provides the expectation-maximization algorithm for MML estimation…

Descriptors: Item Response Theory, Maximum Likelihood Statistics, Computation, Comparative Analysis

Grade of Membership Response Time Model for Detecting Guessing Behaviors

Peer reviewed

Pokropek, Artur – Journal of Educational and Behavioral Statistics, 2016

A response model that is able to detect guessing behaviors and produce unbiased estimates in low-stake conditions using timing information is proposed. The model is a special case of the grade of membership model in which responses are modeled as partial members of a class that is affected by motivation and a class that responds only according to…

Descriptors: Reaction Time, Models, Guessing (Tests), Computation

Using Data-Dependent Priors to Mitigate Small Sample Bias in Latent Growth Models: A Discussion and Illustration Using Mplus

Peer reviewed

McNeish, Daniel M. – Journal of Educational and Behavioral Statistics, 2016

Mixed-effects models (MEMs) and latent growth models (LGMs) are often considered interchangeable save the discipline-specific nomenclature. Software implementations of these models, however, are not interchangeable, particularly with small sample sizes. Restricted maximum likelihood estimation that mitigates small sample bias in MEMs has not been…

Descriptors: Models, Statistical Analysis, Hierarchical Linear Modeling, Sample Size

Analyzing Regression-Discontinuity Designs with Multiple Assignment Variables: A Comparative Study of Four Estimation Methods

Peer reviewed

Wong, Vivian C.; Steiner, Peter M.; Cook, Thomas D. – Journal of Educational and Behavioral Statistics, 2013

In a traditional regression-discontinuity design (RDD), units are assigned to treatment on the basis of a cutoff score and a continuous assignment variable. The treatment effect is measured at a single cutoff location along the assignment variable. This article introduces the multivariate regression-discontinuity design (MRDD), where multiple…

Descriptors: Computation, Research Design, Regression (Statistics), Multivariate Analysis

Alternatives for Mixed-Effects Meta-Regression Models in the Reliability Generalization Approach: A Simulation Study

Peer reviewed

López-López, José Antonio; Botella, Juan; Sánchez-Meca, Julio; Marín-Martínez, Fulgencio – Journal of Educational and Behavioral Statistics, 2013

Since heterogeneity between reliability coefficients is usually found in reliability generalization studies, moderator analyses constitute a crucial step for that meta-analytic approach. In this study, different procedures for conducting mixed-effects meta-regression analyses were compared. Specifically, four transformation methods for the…

Descriptors: Reliability, Generalization, Meta Analysis, Regression (Statistics)
