ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	7

Source

Journal of Educational and…	4
Grantee Submission	1
International Journal of…	1
Journal of Educational…	1
Practical Assessment,…	1
Psychometrika	1

Publication Type

Reports - Descriptive	10
Journal Articles	8

Education Level

Elementary Secondary Education	1
Higher Education	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…

What Works Clearinghouse Rating

Showing all 10 results Save | Export

Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection

Peer reviewed

Direct link

Chen, Yunxiao; Lee, Yi-Hsuan; Li, Xiaoou – Journal of Educational and Behavioral Statistics, 2022

In standardized educational testing, test items are reused in multiple test administrations. To ensure the validity of test scores, the psychometric properties of items should remain unchanged over time. In this article, we consider the sequential monitoring of test items, in particular, the detection of abrupt changes to their psychometric…

Descriptors: Standardized Tests, Test Items, Test Validity, Scores

Using JAGS for Bayesian Cognitive Diagnosis Modeling: A Tutorial

Peer reviewed

Direct link

Zhan, Peida; Jiao, Hong; Man, Kaiwen; Wang, Lijun – Journal of Educational and Behavioral Statistics, 2019

In this article, we systematically introduce the just another Gibbs sampler (JAGS) software program to fit common Bayesian cognitive diagnosis models (CDMs) including the deterministic inputs, noisy "and" gate model; the deterministic inputs, noisy "or" gate model; the linear logistic model; the reduced reparameterized unified…

Descriptors: Bayesian Statistics, Computer Software, Models, Test Items

A Bayesian Beta-Mixture Model for Nonparametric IRT (BBM-IRT)

Peer reviewed
PDF on ERIC

Download full text

Arenson, Ethan A.; Karabatsos, George – Grantee Submission, 2017

Item response models typically assume that the item characteristic (step) curves follow a logistic or normal cumulative distribution function, which are strictly monotone functions of person test ability. Such assumptions can be overly-restrictive for real item response data. We propose a simple and more flexible Bayesian nonparametric IRT model…

Descriptors: Bayesian Statistics, Item Response Theory, Nonparametric Statistics, Models

RIM: A Random Item Mixture Model to Detect Differential Item Functioning

Peer reviewed

Direct link

Frederickx, Sofie; Tuerlinckx, Francis; De Boeck, Paul; Magis, David – Journal of Educational Measurement, 2010

In this paper we present a new methodology for detecting differential item functioning (DIF). We introduce a DIF model, called the random item mixture (RIM), that is based on a Rasch model with random item difficulties (besides the common random person abilities). In addition, a mixture model is assumed for the item difficulties such that the…

Descriptors: Test Bias, Models, Test Items, Difficulty Level

Scoring and Classifying Examinees Using Measurement Decision Theory

Peer reviewed

Direct link

Rudner, Lawrence M. – Practical Assessment, Research & Evaluation, 2009

This paper describes and evaluates the use of measurement decision theory (MDT) to classify examinees based on their item response patterns. The model has a simple framework that starts with the conditional probabilities of examinees in each category or mastery state responding correctly to each item. The presented evaluation investigates: (1) the…

Descriptors: Classification, Scoring, Item Response Theory, Measurement

A Bayesian Method for Studying DIF: A Cautionary Tale Filled with Surprises and Delights

Peer reviewed

Direct link

Wang, Xiaohui; Bradlow, Eric T.; Wainer, Howard; Muller, Eric S. – Journal of Educational and Behavioral Statistics, 2008

In the course of screening a form of a medical licensing exam for items that function differentially (DIF) between men and women, the authors used the traditional Mantel-Haenszel (MH) statistic for initial screening and a Bayesian method for deeper analysis. For very easy items, the MH statistic unexpectedly often found DIF where there was none.…

Descriptors: Bayesian Statistics, Licensing Examinations (Professions), Medicine, Test Items

Covariates of the Rating Process in Hierarchical Models for Multiple Ratings of Test Items

Peer reviewed

Direct link

Mariano, Louis T.; Junker, Brian W. – Journal of Educational and Behavioral Statistics, 2007

When constructed response test items are scored by more than one rater, the repeated ratings allow for the consideration of individual rater bias and variability in estimating student proficiency. Several hierarchical models based on item response theory have been introduced to model such effects. In this article, the authors demonstrate how these…

Descriptors: Test Items, Item Response Theory, Rating Scales, Scoring

Calibration of Automatically Generated Items Using Bayesian Hierarchical Modeling.

Download full text

Johnson, Matthew S.; Sinharay, Sandip – 2003

For complex educational assessments, there is an increasing use of "item families," which are groups of related items. However, calibration or scoring for such an assessment requires fitting models that take into account the dependence structure inherent among the items that belong to the same item family. C. Glas and W. van der Linden…

Descriptors: Bayesian Statistics, Constructed Response, Educational Assessment, Estimation (Mathematics)

Specifying and Refining a Measurement Model for a Computer-Based Interactive Assessment

Peer reviewed

Direct link

Levy, Roy; Mislevy, Robert J. – International Journal of Testing, 2004

The challenges of modeling students' performance in computer-based interactive assessments include accounting for multiple aspects of knowledge and skill that arise in different situations and the conditional dependencies among multiple aspects of performance. This article describes a Bayesian approach to modeling and estimating cognitive models…

Descriptors: Computer Assisted Testing, Markov Processes, Computer Networks, Bayesian Statistics

Analysis of Distractor Difficulty in Multiple-Choice Items

Peer reviewed

Direct link

Revuelta, Javier – Psychometrika, 2004

Two psychometric models are presented for evaluating the difficulty of the distractors in multiple-choice items. They are based on the criterion of rising distractor selection ratios, which facilitates interpretation of the subject and item parameters. Statistical inferential tools are developed in a Bayesian framework: modal a posteriori…

Descriptors: Multiple Choice Tests, Psychometrics, Models, Difficulty Level

Bayesian Statistics	10
Test Items	10
Item Response Theory	6
Models	6
Markov Processes	3
Monte Carlo Methods	3
Accuracy	2
Difficulty Level	2
Multiple Choice Tests	2
Psychometrics	2
Scoring	2
Statistical Inference	2
Student Evaluation	2
Achievement Tests	1
Classification	1
Cognitive Measurement	1
Comparative Analysis	1
Computation	1
Computer Assisted Testing	1
Computer Networks	1
Computer Software	1
Constructed Response	1
Data Analysis	1
Decision Making	1
Educational Assessment	1
More ▼

Arenson, Ethan A.	1
Bradlow, Eric T.	1
Chen, Yunxiao	1
De Boeck, Paul	1
Frederickx, Sofie	1
Jiao, Hong	1
Johnson, Matthew S.	1
Junker, Brian W.	1
Karabatsos, George	1
Lee, Yi-Hsuan	1
Levy, Roy	1
Li, Xiaoou	1
Magis, David	1
Man, Kaiwen	1
Mariano, Louis T.	1
Mislevy, Robert J.	1
Muller, Eric S.	1
Revuelta, Javier	1
Rudner, Lawrence M.	1
Sinharay, Sandip	1
Tuerlinckx, Francis	1
Wainer, Howard	1
Wang, Lijun	1
Wang, Xiaohui	1
Zhan, Peida	1
More ▼