Showing all 8 results
Peer reviewed
Slepkov, Aaron D.; Godfrey, Alan T. K. – Applied Measurement in Education, 2019
The answer-until-correct (AUC) method of multiple-choice (MC) testing involves test respondents making selections until the keyed answer is identified. Despite attendant benefits that include improved learning, broad student adoption, and facile administration of partial credit, the use of AUC methods for classroom testing has been extremely…
Descriptors: Multiple Choice Tests, Test Items, Test Reliability, Scores
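The abstract above does not state the specific partial-credit rule Slepkov and Godfrey study. As a minimal sketch only, one common declining-credit convention for answer-until-correct items (the function name and credit schedule here are illustrative assumptions, not the article's method) looks like this:

    def auc_partial_credit(attempts_to_correct, n_options=4):
        """Illustrative declining-credit rule for an answer-until-correct item.
        attempts_to_correct = 1 means the keyed answer was found on the first try."""
        if not 1 <= attempts_to_correct <= n_options:
            raise ValueError("attempts must lie between 1 and the number of options")
        # Lose an equal share of credit for each incorrect selection:
        # a 4-option item scores 1.0, 2/3, 1/3, 0 for 1-4 attempts.
        return (n_options - attempts_to_correct) / (n_options - 1)

    # e.g., three respondents who needed 1, 2, and 4 attempts:
    print([round(auc_partial_credit(a), 2) for a in (1, 2, 4)])  # [1.0, 0.67, 0.0]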
Peer reviewed
Papenberg, Martin; Musch, Jochen – Applied Measurement in Education, 2017
In multiple-choice tests, the quality of distractors may be more important than their number. We therefore examined the joint influence of distractor quality and quantity on test functioning by providing a sample of 5,793 participants with five parallel test sets consisting of items that differed in the number and quality of distractors.…
Descriptors: Multiple Choice Tests, Test Items, Test Validity, Test Reliability
Peer reviewed
Wan, Lei; Henly, George A. – Applied Measurement in Education, 2012
Many innovative item formats have been proposed over the past decade, but little empirical research has been conducted on their measurement properties. This study examines the reliability, efficiency, and construct validity of two innovative item formats--the figural response (FR) and constructed response (CR) formats used in a K-12 computerized…
Descriptors: Test Items, Test Format, Computer Assisted Testing, Measurement
Peer reviewed
Kettler, Ryan J.; Rodriguez, Michael C.; Bolt, Daniel M.; Elliott, Stephen N.; Beddow, Peter A.; Kurz, Alexander – Applied Measurement in Education, 2011
Federal policy on alternate assessment based on modified academic achievement standards (AA-MAS) inspired this research. Specifically, an experimental study was conducted to determine whether tests composed of modified items would have the same level of reliability as tests composed of original items, and whether these modified items helped reduce…
Descriptors: Multiple Choice Tests, Test Items, Alternative Assessment, Test Reliability
Peer reviewed
Feldt, Leonard S. – Applied Measurement in Education, 1993
The recommendation that the reliability of multiple-choice tests will be enhanced if the distribution of item difficulties is concentrated at approximately 0.50 is reinforced and extended in this article by viewing the 0/1 item scoring as a dichotomization of an underlying normally distributed ability score. (SLD)
Descriptors: Ability, Difficulty Level, Guessing (Tests), Mathematical Models
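Feldt's dichotomization argument can be sketched numerically: a 0/1 item with difficulty p has score variance p(1 - p), which peaks at p = 0.50, and the factor by which dichotomizing a normally distributed ability attenuates an item's correlation with that ability, phi(z_p) / sqrt(p(1 - p)), is also largest at p = 0.50. The snippet below is an illustrative calculation of these standard classical-test-theory quantities, not code or data from the article:

    from statistics import NormalDist

    def attenuation_factor(p):
        """phi(z_p) / sqrt(p(1-p)): fraction of an item's correlation with a
        normally distributed ability that survives 0/1 dichotomization at difficulty p."""
        nd = NormalDist()
        z = nd.inv_cdf(p)                      # latent threshold corresponding to difficulty p
        return nd.pdf(z) / ((p * (1 - p)) ** 0.5)

    for p in (0.1, 0.3, 0.5, 0.7, 0.9):
        print(f"p = {p:.1f}: item variance = {p * (1 - p):.2f}, "
              f"attenuation factor = {attenuation_factor(p):.3f}")

Both columns peak at p = 0.50, which is the sense in which mid-range difficulties support the highest test reliability.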
Peer reviewed
Sykes, Robert C.; Hou, Liling – Applied Measurement in Education, 2003
Weighting responses to Constructed-Response (CR) items has been proposed as a way to increase the contribution these items make to the test score when there is insufficient testing time to administer additional CR items. The effect of applying various types of weights to the items of an IRT-based mixed-format writing examination was investigated.…
Descriptors: Item Response Theory, Weighted Scores, Responses, Scores
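The abstract does not specify which weighting schemes Sykes and Hou compared, and their analysis is IRT-based. As a much simpler illustration of the general idea only, a raw-score composite that up-weights constructed-response points (the function name and the weight value are assumptions for this sketch) could look like:

    def weighted_composite(mc_points, cr_points, cr_weight=2.0):
        """Illustrative raw-score weighting for a mixed-format test:
        each CR point counts cr_weight times as much as an MC point."""
        return sum(mc_points) + cr_weight * sum(cr_points)

    # e.g., 40 MC items scored 0/1 and two CR items scored 0-6:
    print(weighted_composite(mc_points=[1] * 32 + [0] * 8,
                             cr_points=[5, 4], cr_weight=2.0))  # 32 + 18 = 50.0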
Peer reviewed
Wainer, Howard; Thissen, David – Applied Measurement in Education, 1993
Because assessment instruments of the future may well be composed of a combination of question types, a way to combine the resulting scores effectively is discussed. Two new graphic tools are presented, showing that it may not be practical to equalize the reliability of the different components. (SLD)
Descriptors: Constructed Response, Educational Assessment, Graphs, Item Response Theory
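One way to see the trade-off Wainer and Thissen discuss is the classical formula for the reliability of a weighted composite, which shows how a low-reliability component (e.g., an essay section) pulls down the mixed-format total unless it is down-weighted. The sketch below applies that standard formula under the usual assumption of uncorrelated errors; it is not the article's graphic tools, and the numbers are invented for illustration:

    def composite_reliability(weights, sds, reliabilities, correlations):
        """Reliability of a weighted sum of component scores:
        rho_c = 1 - sum(w_i^2 * sd_i^2 * (1 - rho_i)) / var(composite),
        assuming errors are uncorrelated across components."""
        k = len(weights)
        var_c = sum(weights[i] * weights[j] * sds[i] * sds[j] *
                    (1.0 if i == j else correlations[i][j])
                    for i in range(k) for j in range(k))
        error_var = sum(w * w * sd * sd * (1 - rho)
                        for w, sd, rho in zip(weights, sds, reliabilities))
        return 1 - error_var / var_c

    # e.g., an MC section (reliability .90) and an essay section (reliability .60)
    # correlated .70, equally weighted:
    print(round(composite_reliability(
        weights=[1.0, 1.0], sds=[10.0, 8.0], reliabilities=[0.90, 0.60],
        correlations=[[1.0, 0.7], [0.7, 1.0]]), 3))  # 0.871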
Peer reviewed
Millman, Jason – Applied Measurement in Education, 1991
Alternatives to multiple-choice tests for teacher licensing examinations are described, and their advantages are cited. Concerns are expressed in the areas of cost and practicality, reliability, corruptibility, and validity. A suggestion for reducing costs using multiple-choice responses calibrated to constructed-response tasks is proposed. (SLD)
Descriptors: Beginning Teachers, Constructed Response, Cost Effectiveness, Educational Assessment