Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 2 |
Descriptor
Criteria | 8 |
Models | 4 |
Item Response Theory | 3 |
Classification | 2 |
Computer Assisted Testing | 2 |
Evaluation Methods | 2 |
Selection | 2 |
Test Construction | 2 |
Test Items | 2 |
Accuracy | 1 |
Adaptive Testing | 1 |
More ▼ |
Source
Applied Psychological… | 8 |
Author
Andrich, David | 1 |
Berger, Martijn P. F. | 1 |
Chang, Wanchen | 1 |
Chen, Shu-Ying | 1 |
Clauser, Brian E. | 1 |
Dodd, Barbara G. | 1 |
Hanson, Bradley A. | 1 |
Harris, Deborah J. | 1 |
Hsu, Chia-Ling | 1 |
Kolen, Michael | 1 |
Leucht, Richard M. | 1 |
More ▼ |
Publication Type
Journal Articles | 8 |
Reports - Descriptive | 4 |
Reports - Evaluative | 2 |
Reports - Research | 2 |
Education Level
Elementary Secondary Education | 1 |
Grade 12 | 1 |
Grade 4 | 1 |
Grade 8 | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
Iowa Tests of Basic Skills | 1 |
National Assessment of… | 1 |
What Works Clearinghouse Rating
Hsu, Chia-Ling; Wang, Wen-Chung; Chen, Shu-Ying – Applied Psychological Measurement, 2013
Interest in developing computerized adaptive testing (CAT) under cognitive diagnosis models (CDMs) has increased recently. CAT algorithms that use a fixed-length termination rule frequently lead to different degrees of measurement precision for different examinees. Fixed precision, in which the examinees receive the same degree of measurement…
Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Diagnostic Tests
Whittaker, Tiffany A.; Chang, Wanchen; Dodd, Barbara G. – Applied Psychological Measurement, 2012
When tests consist of multiple-choice and constructed-response items, researchers are confronted with the question of which item response theory (IRT) model combination will appropriately represent the data collected from these mixed-format tests. This simulation study examined the performance of six model selection criteria, including the…
Descriptors: Item Response Theory, Models, Selection, Criteria

Berger, Martijn P. F. – Applied Psychological Measurement, 1998
Reviews some results on optimal design of tests with items of dichotomous and polytomous response formats and offers rules and guidelines for optimal test assembly. Discusses the problem of optimal test design for two optimality criteria. (Author/SLD)
Descriptors: Criteria, Test Construction

Clauser, Brian E. – Applied Psychological Measurement, 2000
Provides a conceptual framework for the development of scoring procedures for performance assessments. The framework considers: (1) aspects of the performance to be scored; (2) criteria to evaluate aspects of the performance; (3) development of scoring criteria; and (4) application of scoring criteria. (SLD)
Descriptors: Criteria, Models, Performance Based Assessment, Scoring

Leucht, Richard M. – Applied Psychological Measurement, 1998
Presents a variation of a "greedy" algorithm that can be used in test-assembly problems. The algorithm, the normalized weighted absolute-deviation heuristic, selects items to have a locally optimal fit to a moving set of average criterion values. Demonstrates application of the model. (SLD)
Descriptors: Algorithms, Computer Assisted Testing, Criteria, Heuristics

Andrich, David – Applied Psychological Measurement, 1995
Two classes of graded response models, one based on the work of L. L. Thurstone and the other on the work of G. Rasch, are juxtaposed and shown to satisfy important but mutually incompatible criteria and to reflect different response processes. Implications of the choice between these models are discussed. (Author/SLD)
Descriptors: Classification, Criteria, Data Collection, Item Response Theory

Wang, Tianyou; Hanson, Bradley A.; Harris, Deborah J. – Applied Psychological Measurement, 2000
Studied whether circular equating could provide an adequate measure of various types of equating error when applied to different equating methods under different equating designs. Analyses and simluations show that circular equating is generally invalid as a criterion to evaluate the adequacy of equating. (SLD)
Descriptors: Criteria, Equated Scores, Error of Measurement, Evaluation Methods
Tong, Ye; Kolen, Michael – Applied Psychological Measurement, 2005
The performance of three equating methods--the presmoothed equipercentile method, the item response theory (IRT) true score method, and the IRT observed score method--were examined based on three equating criteria: the same distributions property, the first-order equity property, and the second-order equity property. The magnitude of the…
Descriptors: True Scores, Criteria, Raw Scores, Item Response Theory