ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	2

Source

Applied Psychological…

Author

Andrich, David	1
Berger, Martijn P. F.	1
Chang, Wanchen	1
Chen, Shu-Ying	1
Clauser, Brian E.	1
Dodd, Barbara G.	1
Hanson, Bradley A.	1
Harris, Deborah J.	1
Hsu, Chia-Ling	1
Kolen, Michael	1
Leucht, Richard M.	1
Tong, Ye	1
Wang, Tianyou	1
Wang, Wen-Chung	1
Whittaker, Tiffany A.	1
More ▼

Publication Type

Journal Articles	8
Reports - Descriptive	4
Reports - Evaluative	2
Reports - Research	2

Education Level

Elementary Secondary Education	1
Grade 12	1
Grade 4	1
Grade 8	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Iowa Tests of Basic Skills	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Variable-Length Computerized Adaptive Testing Based on Cognitive Diagnosis Models

Peer reviewed

Direct link

Hsu, Chia-Ling; Wang, Wen-Chung; Chen, Shu-Ying – Applied Psychological Measurement, 2013

Interest in developing computerized adaptive testing (CAT) under cognitive diagnosis models (CDMs) has increased recently. CAT algorithms that use a fixed-length termination rule frequently lead to different degrees of measurement precision for different examinees. Fixed precision, in which the examinees receive the same degree of measurement…

Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Diagnostic Tests

The Performance of IRT Model Selection Methods with Mixed-Format Tests

Peer reviewed

Direct link

Whittaker, Tiffany A.; Chang, Wanchen; Dodd, Barbara G. – Applied Psychological Measurement, 2012

When tests consist of multiple-choice and constructed-response items, researchers are confronted with the question of which item response theory (IRT) model combination will appropriately represent the data collected from these mixed-format tests. This simulation study examined the performance of six model selection criteria, including the…

Descriptors: Item Response Theory, Models, Selection, Criteria

Optimal Design of Tests with Dichotomous and Polytomous Items.

Peer reviewed

Berger, Martijn P. F. – Applied Psychological Measurement, 1998

Reviews some results on optimal design of tests with items of dichotomous and polytomous response formats and offers rules and guidelines for optimal test assembly. Discusses the problem of optimal test design for two optimality criteria. (Author/SLD)

Descriptors: Criteria, Test Construction

Recurrent Issues and Recent Advances in Scoring Performance Assessments.

Peer reviewed

Clauser, Brian E. – Applied Psychological Measurement, 2000

Provides a conceptual framework for the development of scoring procedures for performance assessments. The framework considers: (1) aspects of the performance to be scored; (2) criteria to evaluate aspects of the performance; (3) development of scoring criteria; and (4) application of scoring criteria. (SLD)

Descriptors: Criteria, Models, Performance Based Assessment, Scoring

Computer-Assisted Test Assembly Using Optimization Heuristics.

Peer reviewed

Leucht, Richard M. – Applied Psychological Measurement, 1998

Presents a variation of a "greedy" algorithm that can be used in test-assembly problems. The algorithm, the normalized weighted absolute-deviation heuristic, selects items to have a locally optimal fit to a moving set of average criterion values. Demonstrates application of the model. (SLD)

Descriptors: Algorithms, Computer Assisted Testing, Criteria, Heuristics

Distinctive and Incompatible Properties of Two Common Classes of IRT Models for Graded Responses.

Peer reviewed

Andrich, David – Applied Psychological Measurement, 1995

Two classes of graded response models, one based on the work of L. L. Thurstone and the other on the work of G. Rasch, are juxtaposed and shown to satisfy important but mutually incompatible criteria and to reflect different response processes. Implications of the choice between these models are discussed. (Author/SLD)

Descriptors: Classification, Criteria, Data Collection, Item Response Theory

The Effectiveness of Circular Equating as a Criterion for Evaluating Equating.

Peer reviewed

Wang, Tianyou; Hanson, Bradley A.; Harris, Deborah J. – Applied Psychological Measurement, 2000

Studied whether circular equating could provide an adequate measure of various types of equating error when applied to different equating methods under different equating designs. Analyses and simluations show that circular equating is generally invalid as a criterion to evaluate the adequacy of equating. (SLD)

Descriptors: Criteria, Equated Scores, Error of Measurement, Evaluation Methods

Assessing Equating Results on Different Equating Criteria

Peer reviewed

Direct link

Tong, Ye; Kolen, Michael – Applied Psychological Measurement, 2005

The performance of three equating methods--the presmoothed equipercentile method, the item response theory (IRT) true score method, and the IRT observed score method--were examined based on three equating criteria: the same distributions property, the first-order equity property, and the second-order equity property. The magnitude of the…

Descriptors: True Scores, Criteria, Raw Scores, Item Response Theory

Criteria	8
Models	4
Item Response Theory	3
Classification	2
Computer Assisted Testing	2
Evaluation Methods	2
Selection	2
Test Construction	2
Test Items	2
Accuracy	1
Adaptive Testing	1
Algorithms	1
Bayesian Statistics	1
Cognitive Tests	1
Data Analysis	1
Data Collection	1
Diagnostic Tests	1
Equated Scores	1
Error of Measurement	1
Heuristics	1
Item Analysis	1
Mathematics Tests	1
National Competency Tests	1
Performance Based Assessment	1
Psychological Testing	1
More ▼