ERIC - Search Results

Showing all 13 results

An Investigation of Item Calibration Methods in Multistage Testing

Peer reviewed

Cai, Liuhan; Albano, Anthony D.; Roussos, Louis A. – Measurement: Interdisciplinary Research and Perspectives, 2021

Multistage testing (MST), an adaptive test delivery mode that involves algorithmic selection of predefined item modules rather than individual items, offers a practical alternative to linear and fully computerized adaptive testing. However, interactions across stages between item modules and examinee groups can lead to challenges in item…

Descriptors: Adaptive Testing, Test Items, Item Response Theory, Test Construction

Item Parameter Drift in Computer Adaptive Testing Due to Lack of Content Knowledge

Peer reviewed

Aksu Dunya, Beyza – International Journal of Testing, 2018

This study was conducted to analyze potential item parameter drift (IPD) impact on person ability estimates and classification accuracy when drift affects an examinee subgroup. Using a series of simulations, three factors were manipulated: (a) percentage of IPD items in the CAT exam, (b) percentage of examinees affected by IPD, and (c) item pool…

Descriptors: Adaptive Testing, Classification, Accuracy, Computer Assisted Testing

The Sequential Probability Ratio Test and Binary Item Response Models

Peer reviewed

Nydick, Steven W. – Journal of Educational and Behavioral Statistics, 2014

The sequential probability ratio test (SPRT) is a common method for terminating item response theory (IRT)-based adaptive classification tests. To decide whether a classification test should stop, the SPRT compares a simple log-likelihood ratio, based on the classification bound separating two categories, to prespecified critical values. As has…

Descriptors: Probability, Item Response Theory, Models, Classification
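The stopping rule summarized in this abstract can be illustrated with a minimal sketch. It assumes a two-parameter logistic (2PL) item model, an indifference region of half-width `delta` around the classification bound `cut`, and Wald's usual critical values; the function names, parameter names, and default values are hypothetical choices for illustration and are not taken from the article.

```python
import math

def p_correct(theta, a, b):
    """Probability of a correct response under an assumed 2PL model."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def sprt_decision(responses, items, cut, delta=0.3, alpha=0.05, beta=0.05):
    """Return 'pass', 'fail', or 'continue' for a two-category SPRT.

    responses: list of 0/1 scored answers
    items:     list of (a, b) item parameters, same order as responses
    cut:       classification bound on the ability scale
    delta:     half-width of the indifference region (illustrative default)
    """
    theta_lo, theta_hi = cut - delta, cut + delta
    llr = 0.0  # log-likelihood ratio of theta_hi versus theta_lo
    for u, (a, b) in zip(responses, items):
        p_hi = p_correct(theta_hi, a, b)
        p_lo = p_correct(theta_lo, a, b)
        if u == 1:
            llr += math.log(p_hi / p_lo)
        else:
            llr += math.log((1.0 - p_hi) / (1.0 - p_lo))
    # Wald's critical values from the nominal error rates
    upper = math.log((1.0 - beta) / alpha)
    lower = math.log(beta / (1.0 - alpha))
    if llr >= upper:
        return "pass"
    if llr <= lower:
        return "fail"
    return "continue"
```

In an adaptive classification test, a function like this would be called after each response; a "continue" result means another item is administered, typically until a maximum test length forces a decision.
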

The Influence of Item Calibration Error on Variable-Length Computerized Adaptive Testing

Peer reviewed

Patton, Jeffrey M.; Cheng, Ying; Yuan, Ke-Hai; Diao, Qi – Applied Psychological Measurement, 2013

Variable-length computerized adaptive testing (VL-CAT) allows both items and test length to be "tailored" to examinees, thereby achieving the measurement goal (e.g., scoring precision or classification) with as few items as possible. Several popular test termination rules depend on the standard error of the ability estimate, which in turn depends…

Descriptors: Adaptive Testing, Computer Assisted Testing, Test Length, Ability
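One common form of a standard-error termination rule can be sketched as follows. This is an illustration only, assuming a 2PL model and the usual approximation of the standard error as the inverse square root of the test information; the function names, the target of 0.3, and the 50-item cap are hypothetical and do not come from the article.

```python
import math

def item_information(theta, a, b):
    """Fisher information of one item at theta under an assumed 2PL model."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

def standard_error(theta, items):
    """Approximate SE of the ability estimate: 1 / sqrt(test information)."""
    info = sum(item_information(theta, a, b) for a, b in items)
    return float("inf") if info == 0 else 1.0 / math.sqrt(info)

def should_stop(theta, items, se_target=0.3, max_items=50):
    """Stop when the SE target is met or the item cap is reached."""
    if len(items) >= max_items:
        return True
    return standard_error(theta, items) <= se_target
```

In practice the ability estimate `theta` would be recomputed after every response, and the item parameters passed in are themselves estimates from calibration.
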

Panel Design Variations in the Multistage Test Using the Mixed-Format Tests

Peer reviewed

Kim, Jiseon; Chung, Hyewon; Dodd, Barbara G.; Park, Ryoungsun – Educational and Psychological Measurement, 2012

This study compared various panel designs of the multistage test (MST) using mixed-format tests in the context of classification testing. Simulations varied the design of the first-stage module. The first stage was constructed according to three levels of test information functions (TIFs) with three different TIF centers. Additional computerized…

Descriptors: Test Format, Comparative Analysis, Computer Assisted Testing, Adaptive Testing

The Random-Threshold Generalized Unfolding Model and Its Application of Computerized Adaptive Testing

Peer reviewed

Wang, Wen-Chung; Liu, Chen-Wei; Wu, Shiu-Lien – Applied Psychological Measurement, 2013

The random-threshold generalized unfolding model (RTGUM) was developed by treating the thresholds in the generalized unfolding model as random effects rather than fixed effects to account for the subjective nature of the selection of categories in Likert items. The parameters of the new model can be estimated with the JAGS (Just Another Gibbs…

Descriptors: Computer Assisted Testing, Adaptive Testing, Models, Bayesian Statistics

Termination Criteria for Computerized Classification Testing

Peer reviewed

Thompson, Nathan A. – Practical Assessment, Research & Evaluation, 2011

Computerized classification testing (CCT) is an approach to designing tests with intelligent algorithms, similar to adaptive testing, but specifically designed for the purpose of classifying examinees into categories such as "pass" and "fail." Like adaptive testing for point estimation of ability, the key component is the…

Descriptors: Adaptive Testing, Computer Assisted Testing, Classification, Probability

A Comparison of Content-Balancing Procedures for Estimating Multiple Clinical Domains in Computerized Adaptive Testing: Relative Precision, Validity, and Detection of Persons with Misfitting Responses

Peer reviewed

Riley, Barth B.; Dennis, Michael L.; Conrad, Kendon J. – Applied Psychological Measurement, 2010

This simulation study sought to compare four different computerized adaptive testing (CAT) content-balancing procedures designed for use in a multidimensional assessment with respect to measurement precision, symptom severity classification, validity of clinical diagnostic recommendations, and sensitivity to atypical responding. The four…

Descriptors: Simulation, Computer Assisted Testing, Adaptive Testing, Comparative Analysis

Variations on Stochastic Curtailment in Sequential Mastery Testing

Peer reviewed

Finkelman, Matthew David – Applied Psychological Measurement, 2010

In sequential mastery testing (SMT), assessment via computer is used to classify examinees into one of two mutually exclusive categories. Unlike paper-and-pencil tests, SMT has the capability to use variable-length stopping rules. One approach to shortening variable-length tests is stochastic curtailment, which halts examination if the probability…

Descriptors: Mastery Tests, Computer Assisted Testing, Adaptive Testing, Test Length

Effects of Estimation Bias on Multiple-Category Classification with an IRT-Based Adaptive Classification Procedure

Peer reviewed

Yang, Xiangdong; Poggio, John C.; Glasnapp, Douglas R. – Educational and Psychological Measurement, 2006

The effects of five ability estimators, that is, maximum likelihood estimator, weighted likelihood estimator, maximum a posteriori, expected a posteriori, and Owen's sequential estimator, on the performances of the item response theory-based adaptive classification procedure on multiple categories were studied via simulations. The following…

Descriptors: Classification, Computation, Simulation, Item Response Theory

Stability of DIF Classification: An Alternative Representation of the Variability of the Mantel-Haenszel DIF Statistic.

Zwick, Rebecca – 1995

This paper describes a study, now in progress, of new methods for representing the sampling variability of Mantel-Haenszel differential item functioning (DIF) results, based on the system for categorizing the severity of DIF that is now in place at the Educational Testing Service. The methods, which involve a Bayesian elaboration of procedures…

Descriptors: Adaptive Testing, Bayesian Statistics, Classification, Computer Assisted Testing

Designing Item Pools for Computerized Adaptive Testing. Research Report 99-03.

Veldkamp, Bernard P.; van der Linden, Wim J. – 1999

A method of item pool design is proposed that uses an optimal blueprint for the item pool calculated from the test specifications. The blueprint is a document that specifies the attributes that the items in the computerized adaptive test (CAT) pool should have. The blueprint can be a starting point for the item writing process, and it can be used…

Descriptors: Ability, Adaptive Testing, Classification, Computer Assisted Testing

Assessing Disease Class-Specific Diagnostic Ability: A Practical Adaptive Test Approach.

Papa, Frank J.; Schumacker, Randall E. – 1995

Measures of the robustness of disease class-specific diagnostic concepts could play a central role in training programs designed to assure the development of diagnostic competence. In the pilot study, the authors used disease/sign-symptom conditional probability estimates, Monte Carlo procedures, and artificial intelligence (AI) tools to create…

Descriptors: Adaptive Testing, Artificial Intelligence, Classification, Clinical Diagnosis
