ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	2

Descriptor

Scoring	5
Test Construction	5
Test Items	5
Test Length	5
Test Format	3
Adaptive Testing	2
Computer Assisted Testing	2
Item Analysis	2
Simulation	2
Accuracy	1
Algebra	1
Career Development	1
Classification	1
Comparative Analysis	1
Criterion Referenced Tests	1
Difficulty Level	1
Educational Assessment	1
Goodness of Fit	1
Guessing (Tests)	1
Information Security	1
Item Response Theory	1
Latent Trait Theory	1
Measurement	1
Models	1
Multiple Choice Tests	1

Source

Educational and Psychological…	1
Journal of Educational…	1
ProQuest LLC	1

Author

Cook, Linda L.	1
Hambleton, Ronald K.	1
Jing Ma	1
Liaw, Yuan-Ling	1
Rutkowski, David	1
Rutkowski, Leslie	1
Svetina, Dubravka	1
Wainer, Howard	1
Wilcox, Rand R.	1

Publication Type

Reports - Research	3
Journal Articles	2
Dissertations/Theses -…	1
Reports - Evaluative	1

Showing all 5 results

The Impact of Scoring Later on Mixed Format Adaptive Testing

Jing Ma – ProQuest LLC, 2024

This study investigated the impact of scoring polytomous items later on measurement precision, classification accuracy, and test security in mixed-format adaptive testing. Utilizing the shadow test approach, a simulation study was conducted across various test designs, test lengths, and numbers and locations of polytomous items. Results showed that while…

Descriptors: Scoring, Adaptive Testing, Test Items, Classification

Routing Strategies and Optimizing Design for Multistage Testing in International Large-Scale Assessments

Peer reviewed

Svetina, Dubravka; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2019

This study investigates the effect of several design and administration choices on item exposure and person/item parameter recovery under a multistage test (MST) design. In a simulation study, we examine whether number-correct (NC) or item response theory (IRT) methods are differentially effective at routing students to the correct next stage(s)…

Descriptors: Measurement, Item Analysis, Test Construction, Item Response Theory

Determining the Length of Multiple Choice Criterion-Referenced Tests When an Answer-Until-Correct Scoring Procedure Is Used.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1982

When determining criterion-referenced test length, problems of guessing are shown to be more serious than expected. A new method of scoring is presented that corrects for guessing without assuming that guessing is random. Empirical investigations of the procedure are examined. Test length can be substantially reduced. (Author/CM)

Descriptors: Criterion Referenced Tests, Guessing (Tests), Multiple Choice Tests, Scoring

Some Results on the Robustness of Latent Trait Models.

Hambleton, Ronald K.; Cook, Linda L. – 1978

The purpose of the present research was to study, systematically, the "goodness-of-fit" of the one-, two-, and three-parameter logistic models. We studied, using computer-simulated test data, the effects of four variables: variation in item discrimination parameters, the average value of the pseudo-chance level parameters, test length,…

Descriptors: Career Development, Difficulty Level, Goodness of Fit, Item Analysis

An Adaptive Algebra Test: A Testlet-Based, Hierarchically-Structured Test with Validity-Based Scoring. Technical Report No. 90-92.

Wainer, Howard; And Others – 1990

The initial development of a testlet-based algebra test was previously reported (Wainer and Lewis, 1990). This account provides the details of this excursion into the use of hierarchical testlets and validity-based scoring. A pretest of two 15-item hierarchical testlets was carried out in which examinees' performance on a 4-item subset of each…

Descriptors: Adaptive Testing, Algebra, Comparative Analysis, Computer Assisted Testing