ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	3
Since 2017 (last 10 years)	6
Since 2007 (last 20 years)	13

Descriptor

Bayesian Statistics	30
Computer Assisted Testing	30
Item Response Theory	30
Adaptive Testing	20
Test Items	14
Maximum Likelihood Statistics	8
Simulation	7
Computation	6
Estimation (Mathematics)	6
Test Construction	6
Test Length	6
Classification	5
Ability	4
Accuracy	4
Item Banks	4
Mastery Tests	4
Mathematical Models	4
Measurement Techniques	4
Models	4
Test Bias	4
Computer Simulation	3
Difficulty Level	3
Educational Assessment	3
Efficiency	3
Equations (Mathematics)	3
More ▼

Source

Applied Psychological…	7
Educational and Psychological…	5
Journal of Educational and…	4
Journal of Educational…	2
ETS Research Report Series	1
International Journal of…	1
Journal of Speech, Language,…	1
Measurement:…	1
Psychological Methods	1
Psychometrika	1

Publication Type

Journal Articles	24
Reports - Research	17
Reports - Evaluative	8
Reports - Descriptive	4
Speeches/Meeting Papers	2
Information Analyses	1

Education Level

Elementary Secondary Education

Audience

Researchers

Location

Netherlands

Laws, Policies, & Programs

Assessments and Surveys

Law School Admission Test	1
MacArthur Communicative…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 30 results Save | Export

Detecting Compromised Items with Response Times Using a Bayesian Change-Point Approach

Peer reviewed

Direct link

Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025

Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…

Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment

On Bank Assembly and Block Selection in Multidimensional Forced-Choice Adaptive Assessments

Peer reviewed

Direct link

Kreitchmann, Rodrigo S.; Sorrel, Miguel A.; Abad, Francisco J. – Educational and Psychological Measurement, 2023

Multidimensional forced-choice (FC) questionnaires have been consistently found to reduce the effects of socially desirable responding and faking in noncognitive assessments. Although FC has been considered problematic for providing ipsative scores under the classical test theory, item response theory (IRT) models enable the estimation of…

Descriptors: Measurement Techniques, Questionnaires, Social Desirability, Adaptive Testing

Handling Extreme Scores in Vertically Scaled Fixed-Length Computerized Adaptive Tests

Peer reviewed

Direct link

Wyse, Adam E.; McBride, James R. – Measurement: Interdisciplinary Research and Perspectives, 2022

A common practical challenge is how to assign ability estimates to all incorrect and all correct response patterns when using item response theory (IRT) models and maximum likelihood estimation (MLE) since ability estimates for these types of responses equal -8 or +8. This article uses a simulation study and data from an operational K-12…

Descriptors: Scores, Adaptive Testing, Computer Assisted Testing, Test Length

Detecting Aberrant Behavior and Item Preknowledge: A Comparison of Mixture Modeling Method and Residual Method

Peer reviewed
PDF on ERIC

Download full text

Direct link

Wang, Chun; Xu, Gongjun; Shang, Zhuoran; Kuncel, Nathan – Journal of Educational and Behavioral Statistics, 2018

The modern web-based technology greatly popularizes computer-administered testing, also known as online testing. When these online tests are administered continuously within a certain "testing window," many items are likely to be exposed and compromised, posing a type of test security concern. In addition, if the testing time is limited,…

Descriptors: Computer Assisted Testing, Cheating, Guessing (Tests), Item Response Theory

A Bayesian-Inspired Item Response Theory-Based Framework to Produce Very Short Versions of MacArthur-Bates Communicative Development Inventories

Peer reviewed

Direct link

Chai, Jun Ho; Lo, Chang Huan; Mayor, Julien – Journal of Speech, Language, and Hearing Research, 2020

Purpose: This study introduces a framework to produce very short versions of the MacArthur-Bates Communicative Development Inventories (CDIs) by combining the Bayesian-inspired approach introduced by Mayor and Mani (2019) with an item response theory-based computerized adaptive testing that adapts to the ability of each child, in line with…

Descriptors: Bayesian Statistics, Item Response Theory, Measures (Individuals), Language Skills

A Comparative Study of Online Item Calibration Methods in Multidimensional Computerized Adaptive Testing

Peer reviewed

Direct link

Chen, Ping – Journal of Educational and Behavioral Statistics, 2017

Calibration of new items online has been an important topic in item replenishment for multidimensional computerized adaptive testing (MCAT). Several online calibration methods have been proposed for MCAT, such as multidimensional "one expectation-maximization (EM) cycle" (M-OEM) and multidimensional "multiple EM cycles"…

Descriptors: Test Items, Item Response Theory, Test Construction, Adaptive Testing

The Random-Threshold Generalized Unfolding Model and Its Application of Computerized Adaptive Testing

Peer reviewed

Direct link

Wang, Wen-Chung; Liu, Chen-Wei; Wu, Shiu-Lien – Applied Psychological Measurement, 2013

The random-threshold generalized unfolding model (RTGUM) was developed by treating the thresholds in the generalized unfolding model as random effects rather than fixed effects to account for the subjective nature of the selection of categories in Likert items. The parameters of the new model can be estimated with the JAGS (Just Another Gibbs…

Descriptors: Computer Assisted Testing, Adaptive Testing, Models, Bayesian Statistics

The Problem of Bias in Person Parameter Estimation in Adaptive Testing

Peer reviewed

Direct link

Doebler, Anna – Applied Psychological Measurement, 2012

It is shown that deviations of estimated from true values of item difficulty parameters, caused for example by item calibration errors, the neglect of randomness of item difficulty parameters, testlet effects, or rule-based item generation, can lead to systematic bias in point estimation of person parameters in the context of adaptive testing.…

Descriptors: Adaptive Testing, Computer Assisted Testing, Computation, Item Response Theory

Item Pool Design for an Operational Variable-Length Computerized Adaptive Test

Peer reviewed

Direct link

He, Wei; Reckase, Mark D. – Educational and Psychological Measurement, 2014

For computerized adaptive tests (CATs) to work well, they must have an item pool with sufficient numbers of good quality items. Many researchers have pointed out that, in developing item pools for CATs, not only is the item pool size important but also the distribution of item parameters and practical considerations such as content distribution…

Descriptors: Item Banks, Test Length, Computer Assisted Testing, Adaptive Testing

Variations on Stochastic Curtailment in Sequential Mastery Testing

Peer reviewed

Direct link

Finkelman, Matthew David – Applied Psychological Measurement, 2010

In sequential mastery testing (SMT), assessment via computer is used to classify examinees into one of two mutually exclusive categories. Unlike paper-and-pencil tests, SMT has the capability to use variable-length stopping rules. One approach to shortening variable-length tests is stochastic curtailment, which halts examination if the probability…

Descriptors: Mastery Tests, Computer Assisted Testing, Adaptive Testing, Test Length

14 Conversations about Three Things

Peer reviewed

Direct link

Wainer, Howard – Journal of Educational and Behavioral Statistics, 2010

In this essay, the author tries to look forward into the 21st century to divine three things: (i) What skills will researchers in the future need to solve the most pressing problems? (ii) What are some of the most likely candidates to be those problems? and (iii) What are some current areas of research that seem mined out and should not distract…

Descriptors: Research Skills, Researchers, Internet, Access to Information

Evaluating Cognitive Theory: A Joint Modeling Approach Using Responses and Response Times

Peer reviewed

Direct link

Klein Entink, Rinke H.; Kuhn, Jorg-Tobias; Hornke, Lutz F.; Fox, Jean-Paul – Psychological Methods, 2009

In current psychological research, the analysis of data from computer-based assessments or experiments is often confined to accuracy scores. Response times, although being an important source of additional information, are either neglected or analyzed separately. In this article, a new model is developed that allows the simultaneous analysis of…

Descriptors: Psychological Studies, Monte Carlo Methods, Markov Processes, Educational Assessment

Empirical Initialization of the Trait Estimator in Adaptive Testing.

Peer reviewed

van der Linden, Wim J. – Applied Psychological Measurement, 1999

Proposes a procedure for empirical initialization of the trait (theta) estimator in adaptive testing that is based on the statistical relation between theta and background variables known prior to test administration. Illustrates the procedure for an adaptive version of a test from the Dutch General Aptitude Battery. (SLD)

Descriptors: Adaptive Testing, Aptitude Tests, Bayesian Statistics, Computer Assisted Testing

Modeling Diagnostic Assessments with Bayesian Networks

Peer reviewed

Direct link

Almond, Russell G.; DiBello, Louis V.; Moulder, Brad; Zapata-Rivera, Juan-Diego – Journal of Educational Measurement, 2007

This paper defines Bayesian network models and examines their applications to IRT-based cognitive diagnostic modeling. These models are especially suited to building inference engines designed to be synchronous with the finer grained student models that arise in skills diagnostic assessment. Aspects of the theory and use of Bayesian network models…

Descriptors: Inferences, Models, Item Response Theory, Cognitive Measurement

Effects of Estimation Bias on Multiple-Category Classification with an IRT-Based Adaptive Classification Procedure

Peer reviewed

Direct link

Yang, Xiangdong; Poggio, John C.; Glasnapp, Douglas R. – Educational and Psychological Measurement, 2006

The effects of five ability estimators, that is, maximum likelihood estimator, weighted likelihood estimator, maximum a posteriori, expected a posteriori, and Owen's sequential estimator, on the performances of the item response theory-based adaptive classification procedure on multiple categories were studied via simulations. The following…

Descriptors: Classification, Computation, Simulation, Item Response Theory

Previous Page | Next Page »

Pages: 1 | 2

De Ayala, R. J.	3
Lewis, Charles	3
Glas, Cees A. W.	2
Sheehan, Kathleen	2
Wainer, Howard	2
van der Linden, Wim J.	2
Abad, Francisco J.	1
Almond, Russell G.	1
Berger, Martijn P. F.	1
Bradlow, Eric T.	1
Chai, Jun Ho	1
Chen, Ping	1
De Boeck, Paul	1
DiBello, Louis V.	1
Doebler, Anna	1
Finkelman, Matthew David	1
Fox, Jean-Paul	1
Glasnapp, Douglas R.	1
Hankins, Janette A.	1
He, Wei	1
Hornke, Lutz F.	1
Kim, JinGyu	1
Klein Entink, Rinke H.	1
Kreitchmann, Rodrigo S.	1
More ▼