Showing 1 to 15 of 59 results
Peer reviewed
James E. Pustejovsky; Man Chen – Journal of Educational and Behavioral Statistics, 2024
Meta-analyses of educational research findings frequently involve statistically dependent effect size estimates. Meta-analysts have often addressed dependence issues using ad hoc approaches that involve modifying the data to conform to the assumptions of models for independent effect size estimates, such as by aggregating estimates to obtain one…
Descriptors: Meta Analysis, Multivariate Analysis, Effect Size, Evaluation Methods
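The "aggregate, then analyze as independent" workaround the abstract critiques can be made concrete. A minimal pure-Python sketch with hypothetical effect sizes and variances (not the authors' proposed method; the within-study variance here deliberately ignores the covariances, which is exactly the simplification at issue):

```python
# Ad hoc "aggregate, then pool" meta-analysis of dependent effect sizes:
# collapse each study's estimates to one mean, then combine the
# study-level means with fixed-effect inverse-variance weights.

def aggregate_within_study(effects, variances):
    """Collapse one study's dependent estimates to a single mean effect.

    The variance of the mean treats the estimates as independent,
    omitting their covariance terms -- the ad hoc simplification.
    """
    k = len(effects)
    mean_effect = sum(effects) / k
    var_of_mean = sum(variances) / k ** 2  # omits 2 * cov terms
    return mean_effect, var_of_mean

def fixed_effect_pool(means, variances):
    """Inverse-variance weighted pooled effect across studies."""
    weights = [1.0 / v for v in variances]
    pooled = sum(w * m for w, m in zip(weights, means)) / sum(weights)
    return pooled, 1.0 / sum(weights)

# Two hypothetical studies, each contributing several dependent estimates.
studies = [
    ([0.30, 0.40], [0.04, 0.04]),
    ([0.10, 0.20, 0.15], [0.05, 0.05, 0.05]),
]
agg = [aggregate_within_study(e, v) for e, v in studies]
pooled, pooled_var = fixed_effect_pool([m for m, _ in agg], [v for _, v in agg])
```

Robust variance estimation and multilevel meta-analytic models are the kinds of principled alternatives this literature weighs against such shortcuts.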
Peer reviewed
Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025
Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable gives rise to the incidental parameter problem, since the number of parameters grows as data from new persons are included. Therefore, IRT models require a specific estimation method…
Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics
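The incidental parameter problem mentioned above is conventionally sidestepped by marginal maximum likelihood: the person parameter is integrated out rather than estimated. A minimal sketch of that marginalization for a Rasch model with a standard normal latent trait (naive grid quadrature and hypothetical item difficulties; not the estimation method the article itself develops):

```python
import math

def rasch_prob(theta, b):
    """P(correct | ability theta, item difficulty b) under the Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def marginal_likelihood(pattern, difficulties, n_points=61):
    """Marginal probability of a response pattern, integrating the
    N(0, 1) latent trait out on a simple grid over [-6, 6]."""
    lo, hi = -6.0, 6.0
    step = (hi - lo) / (n_points - 1)
    total = 0.0
    for k in range(n_points):
        theta = lo + k * step
        density = math.exp(-0.5 * theta ** 2) / math.sqrt(2.0 * math.pi)
        lik = 1.0
        for y, b in zip(pattern, difficulties):
            p = rasch_prob(theta, b)
            lik *= p if y == 1 else 1.0 - p
        total += lik * density * step
    return total

# Marginal probabilities of all response patterns on two items sum to ~1,
# with no person parameter ever estimated.
difficulties = [0.0, 1.0]
patterns = [(0, 0), (0, 1), (1, 0), (1, 1)]
probs = [marginal_likelihood(p, difficulties) for p in patterns]
```

The number of model parameters then stays fixed as persons are added, which is the point of marginalizing.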
Peer reviewed
Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025
This study proposes a Bayesian method for diagnostic classification models (DCMs) in a partially known Q-matrix setting, intermediate between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have prior knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…
Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods
Peer reviewed
Paganin, Sally; Paciorek, Christopher J.; Wehrhahn, Claudia; Rodríguez, Abel; Rabe-Hesketh, Sophia; de Valpine, Perry – Journal of Educational and Behavioral Statistics, 2023
Item response theory (IRT) models typically rely on a normality assumption for subject-specific latent traits, which is often unrealistic in practice. Semiparametric extensions based on Dirichlet process mixtures (DPMs) offer a more flexible representation of the unknown distribution of the latent trait. However, the use of such models in the IRT…
Descriptors: Bayesian Statistics, Item Response Theory, Guidance, Evaluation Methods
Peer reviewed
Sang-June Park; Youjae Yi – Journal of Educational and Behavioral Statistics, 2024
Previous research explicates ordinal and disordinal interactions through the concept of the "crossover point." This point is determined via simple regression models of a focal predictor at specific moderator values and signifies the intersection of these models. An interaction effect is labeled as disordinal (or ordinal) when the…
Descriptors: Interaction, Predictor Variables, Causal Models, Mathematical Models
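For the standard two-way interaction model y = b0 + b1*x + b2*m + b3*x*m, the simple regression lines of y on the focal predictor x at any two moderator values intersect at x* = -b2/b3 (set the two lines equal; the moderator values cancel). A minimal sketch with hypothetical coefficients:

```python
def crossover_point(b2, b3):
    """x* = -b2 / b3: where the simple regression lines of y on x,
    taken at any two moderator values, intersect."""
    if b3 == 0:
        raise ValueError("b3 = 0: no interaction, the lines are parallel")
    return -b2 / b3

def is_disordinal(b2, b3, x_min, x_max):
    """An interaction is disordinal over the observed range of x
    if the crossover point falls inside that range."""
    return x_min <= crossover_point(b2, b3) <= x_max

# Hypothetical coefficients: moderator main effect 1.0, interaction -0.5.
x_star = crossover_point(b2=1.0, b3=-0.5)  # crossover at x = 2.0
```

Whether the same interaction is labeled ordinal or disordinal thus depends on the observed predictor range, not on the coefficients alone.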
Peer reviewed
Zachary K. Collier; Minji Kong; Olushola Soyoye; Kamal Chawla; Ann M. Aviles; Yasser Payne – Journal of Educational and Behavioral Statistics, 2024
Asymmetric Likert-type items in research studies can present several challenges in data analysis, particularly concerning missing data. These items are often characterized by a skewed scaling, where either there is no neutral response option or an unequal number of possible positive and negative responses. The use of conventional techniques, such…
Descriptors: Likert Scales, Test Items, Item Analysis, Evaluation Methods
Peer reviewed
Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024
Item response theory (IRT) models the relationship between the possible scores on a test item and a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…
Descriptors: Item Response Theory, Test Items, Models, Scoring
Peer reviewed
Daniel Koretz – Journal of Educational and Behavioral Statistics, 2024
A critically important balance in educational measurement between practical concerns and matters of technique has atrophied in recent decades, and as a result, some important issues in the field have not been adequately addressed. I start with the work of E. F. Lindquist, who exemplified the balance that is now wanting. Lindquist was arguably the…
Descriptors: Educational Assessment, Evaluation Methods, Achievement Tests, Educational History
Peer reviewed
Mingya Huang; David Kaplan – Journal of Educational and Behavioral Statistics, 2025
The issue of model uncertainty has been gaining interest in the education and social sciences communities over the years, and the dominant methods for handling model uncertainty are based on Bayesian inference, particularly Bayesian model averaging. However, Bayesian model averaging assumes that the true data-generating model is within the…
Descriptors: Bayesian Statistics, Hierarchical Linear Modeling, Statistical Inference, Predictor Variables
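Bayesian model averaging, the baseline the abstract contrasts against, is often approximated with BIC-based posterior model weights. A minimal pure-Python sketch (hypothetical BIC values and estimates; assumes equal prior model probabilities and that the true model is in the candidate set, the very assumption the article questions):

```python
import math

def bma_weights(bics):
    """Approximate posterior model probabilities from BIC values,
    assuming equal prior probabilities across candidate models."""
    best = min(bics)
    rel = [math.exp(-0.5 * (b - best)) for b in bics]  # shift for stability
    total = sum(rel)
    return [r / total for r in rel]

def bma_estimate(estimates, bics):
    """Model-averaged estimate: each model's estimate weighted by
    its approximate posterior model probability."""
    return sum(w * e for w, e in zip(bma_weights(bics), estimates))

# Two hypothetical candidate models for the same target quantity.
weights = bma_weights([100.0, 102.0])
averaged = bma_estimate([0.5, 0.8], [100.0, 102.0])
```

The averaged estimate always lies inside the span of the candidate models' estimates, which is why misspecification of the candidate set matters.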
Peer reviewed
Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025
Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…
Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment
Peer reviewed
Liu, Jin – Journal of Educational and Behavioral Statistics, 2022
Longitudinal data analysis has been widely employed to examine between-individual differences in within-individual changes. One challenge of such analyses is that the rate-of-change is only available indirectly when change patterns are nonlinear with respect to time. Latent change score models (LCSMs), which can be employed to investigate the…
Descriptors: Longitudinal Studies, Individual Differences, Scores, Models
Peer reviewed
Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025
The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…
Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies
Peer reviewed
George Leckie; Richard Parker; Harvey Goldstein; Kate Tilling – Journal of Educational and Behavioral Statistics, 2024
School value-added models are widely applied to study, monitor, and hold schools to account for school differences in student learning. The traditional model is a mixed-effects linear regression of student current achievement on student prior achievement, background characteristics, and a school random intercept effect. The latter is referred to…
Descriptors: Academic Achievement, Value Added Models, Accountability, Institutional Characteristics
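The school random-intercept effect in the traditional model described here is typically reported as an empirical-Bayes prediction: the school's mean student residual, shrunken toward zero by a factor that depends on school size and the variance components. A minimal sketch with hypothetical numbers (not the authors' implementation):

```python
def shrunken_school_effect(residuals, sigma2, tau2):
    """Empirical-Bayes ("shrunken") school value-added effect.

    residuals: student-level residuals for one school, after regressing
        current achievement on prior achievement and background.
    sigma2: within-school (student-level) residual variance.
    tau2: between-school variance of the random intercepts.
    """
    n = len(residuals)
    raw_mean = sum(residuals) / n
    shrinkage = (n * tau2) / (n * tau2 + sigma2)  # -> 1 as n grows
    return shrinkage * raw_mean

# Hypothetical school of four students; equal variance components.
effect = shrunken_school_effect([0.2, 0.8, 0.4, 0.6], sigma2=1.0, tau2=1.0)
# raw mean 0.5 is shrunk by a factor of 4/5 to 0.4
```

Small schools are pulled strongly toward zero, large schools barely at all, which is why shrinkage matters for accountability comparisons.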
Peer reviewed
Reagan Mozer; Luke Miratrix; Jackie Eunjung Relyea; James S. Kim – Journal of Educational and Behavioral Statistics, 2024
In a randomized trial that collects text as an outcome, traditional approaches for assessing treatment impact require that each document first be manually coded for constructs of interest by human raters. An impact analysis can then be conducted to compare treatment and control groups, using the hand-coded scores as a measured outcome. This…
Descriptors: Scoring, Evaluation Methods, Writing Evaluation, Comparative Analysis
Peer reviewed
Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022
To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…
Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences