ERIC - Search Results

Showing all 13 results

A Nonparametric Approach to Estimate Classification Accuracy and Consistency

Peer reviewed

Lathrop, Quinn N.; Cheng, Ying – Journal of Educational Measurement, 2014

When cut scores for classifications occur on the total score scale, popular methods for estimating classification accuracy (CA) and classification consistency (CC) require assumptions about a parametric form of the test scores or about a parametric response model, such as item response theory (IRT). This article develops an approach to estimate CA…

Descriptors: Cutting Scores, Classification, Computation, Nonparametric Statistics

Computerized Classification Testing with the Rasch Model

Peer reviewed

Eggen, Theo J. H. M. – Educational Research and Evaluation, 2011

If classification in a limited number of categories is the purpose of testing, computerized adaptive tests (CATs) with algorithms based on sequential statistical testing perform better than estimation-based CATs (e.g., Eggen & Straetmans, 2000). In these computerized classification tests (CCTs), the Sequential Probability Ratio Test (SPRT) (Wald,…

Descriptors: Test Length, Adaptive Testing, Classification, Item Analysis

Computerized Classification Testing under the Generalized Graded Unfolding Model

Peer reviewed

Wang, Wen-Chung; Liu, Chen-Wei – Educational and Psychological Measurement, 2011

The generalized graded unfolding model (GGUM) has been recently developed to describe item responses to Likert items (agree-disagree) in attitude measurement. In this study, the authors (a) developed two item selection methods in computerized classification testing under the GGUM, the current estimate/ability confidence interval method and the cut…

Descriptors: Computer Assisted Testing, Adaptive Testing, Classification, Item Response Theory

Variations on Stochastic Curtailment in Sequential Mastery Testing

Peer reviewed

Finkelman, Matthew David – Applied Psychological Measurement, 2010

In sequential mastery testing (SMT), assessment via computer is used to classify examinees into one of two mutually exclusive categories. Unlike paper-and-pencil tests, SMT has the capability to use variable-length stopping rules. One approach to shortening variable-length tests is stochastic curtailment, which halts examination if the probability…

Descriptors: Mastery Tests, Computer Assisted Testing, Adaptive Testing, Test Length

Correcting Fallacies in Validity, Reliability, and Classification

Peer reviewed

Sijtsma, Klaas – International Journal of Testing, 2009

This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…

Descriptors: Construct Validity, Reliability, Classification, Test Theory

On the Consistency of Individual Classification Using Short Scales

Peer reviewed

Emons, Wilco H. M.; Sijtsma, Klaas; Meijer, Rob R. – Psychological Methods, 2007

Short tests containing at most 15 items are used in clinical and health psychology, medicine, and psychiatry for making decisions about patients. Because short tests have large measurement error, the authors ask whether they are reliable enough for classifying patients into a treatment and a nontreatment group. For a given certainty level,…

Descriptors: Psychiatry, Patients, Error of Measurement, Test Length

Estimating the Consistency and Accuracy of Classifications Based on Test Scores.

Peer reviewed

Livingston, Samuel A.; Lewis, Charles – Journal of Educational Measurement, 1995

A method is presented for estimating the accuracy and consistency of classifications based on test scores. The reliability of the score is used to estimate effective test length in terms of discrete items. The true-score distribution is estimated by fitting a four-parameter beta model. (SLD)

Descriptors: Classification, Estimation (Mathematics), Scores, Statistical Distributions

Estimating the Consistency and Accuracy of Classifications Based on Test Scores.

Livingston, Samuel A.; Lewis, Charles – 1993

This paper presents a method for estimating the accuracy and consistency of classifications based on test scores. The scores can be produced by any scoring method, including the formation of a weighted composite. The estimates use data from a single form. The reliability of the score is used to estimate its effective test length in terms of…

Descriptors: Classification, Error of Measurement, Estimation (Mathematics), Reliability

The Classification Accuracy of Shortened versus Full Length Tests with Number Correct Scoring.

Schulz, E. Matthew; Wang, Lin – 2001

In this study, items were drawn from a full-length test of 30 items in order to construct shorter tests for the purpose of making accurate pass/fail classifications with regard to a specific criterion point on the latent ability metric. A three-item parameter Item Response Theory (IRT) framework was used. The criterion point on the latent ability…

Descriptors: Ability, Classification, Item Response Theory, Pass Fail Grading

The Effect of Person Misfit on Classification Decisions

Peer reviewed

Hendrawan, Irene; Glas, Cees A. W.; Meijer, Rob R. – Applied Psychological Measurement, 2005

The effect of person misfit to an item response theory model on a mastery/nonmastery decision was investigated. Furthermore, it was investigated whether the classification precision can be improved by identifying misfitting respondents using person-fit statistics. A simulation study was conducted to investigate the probability of a correct…

Descriptors: Probability, Statistics, Test Length, Simulation

Testing at Higher Taxonomic Levels: Are We Jeopardizing Reliability by Increasing the Emphasis on Complexity?

Clements, Andrea D.; Rothenberg, Lori – Research in the Schools, 1996

Undergraduate psychology examinations from 48 schools were analyzed to determine the proportion of items at each level of Bloom's Taxonomy, item format, and test length. Analyses indicated significant relationships between item complexity and test length even when taking format into account. Use of higher items may be related to shorter tests,…

Descriptors: Classification, Difficulty Level, Educational Objectives, Higher Education

Passing Score and Length of a Mastery Test: An Old Problem Approached Anew. Twente Educational Report Number 11.

van der Linden, Wim J. – 1980

A classical problem in mastery testing is the choice of passing score and test length so that the mastery decisions are optimal. This problem has been addressed several times from a variety of viewpoints. In this paper, the usual indifference zone approach is adopted, with a new criterion for optimizing the passing score. Specifically,…

Descriptors: Classification, Cutting Scores, Error Patterns, Guessing (Tests)

A Minimax Sequential Procedure in the Context of Computerized Adaptive Mastery Testing.

Vos, Hans J. – 1997

The purpose of this paper is to derive optimal rules for variable-length mastery tests in case three mastery classification decisions (nonmastery, partial mastery, and mastery) are distinguished. In a variable-length or adaptive mastery test, the decision is to classify a subject as a master, a partial master, a nonmaster, or continuing sampling…

Descriptors: Adaptive Testing, Classification, Computer Assisted Testing, Concept Formation
