ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Item Analysis	6
Multiple Choice Tests	6
Test Items	6
Foreign Countries	3
Test Construction	3
Correlation	2
Grade 7	2
Models	2
Reading Tests	2
Test Bias	2
Test Format	2
African American Culture	1
African American Students	1
Bayesian Statistics	1
Biotechnology	1
Cognitive Measurement	1
College Graduates	1
Computer Assisted Testing	1
Content Analysis	1
Criterion Referenced Tests	1
Difficulty Level	1
Diseases	1
Educational Indicators	1
Error Patterns	1
Gender Differences	1
More ▼

Source

Applied Measurement in…

Author

Abu-Ghazalah, Rashid M.	1
Banks, Kathleen	1
Boulais, André-Philippe	1
De Champlain, André	1
Dubins, David N.	1
Gierl, Mark J.	1
Hogan, Thomas P.	1
Lai, Hollis	1
Lee, Yoonsun	1
Murphy, Gavin	1
Musch, Jochen	1
Papenberg, Martin	1
Poon, Gregory M. K.	1
Pugh, Debra	1
Taylor, Catherine S.	1
Touchie, Claire	1
More ▼

Publication Type

Journal Articles	6
Reports - Research	4
Reports - Evaluative	2

Education Level

Elementary Secondary Education	2
Grade 7	2
Higher Education	2
Postsecondary Education	2
Elementary Education	1
Grade 10	1
Grade 4	1
Grade 5	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1
More ▼

Audience

Location

Canada	2
Germany	1

Laws, Policies, & Programs

Assessments and Surveys

TerraNova Multiple Assessments

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Dissecting Knowledge, Guessing, and Blunder in Multiple Choice Assessments

Peer reviewed

Direct link

Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023

Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…

Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models

Of Small Beauties and Large Beasts: The Quality of Distractors on Multiple-Choice Tests Is More Important than Their Quantity

Peer reviewed

Direct link

Papenberg, Martin; Musch, Jochen – Applied Measurement in Education, 2017

In multiple-choice tests, the quality of distractors may be more important than their number. We therefore examined the joint influence of distractor quality and quantity on test functioning by providing a sample of 5,793 participants with five parallel test sets consisting of items that differed in the number and quality of distractors.…

Descriptors: Multiple Choice Tests, Test Items, Test Validity, Test Reliability

Evaluating the Psychometric Characteristics of Generated Multiple-Choice Test Items

Peer reviewed

Direct link

Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André – Applied Measurement in Education, 2016

Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…

Descriptors: Psychometrics, Multiple Choice Tests, Test Items, Item Analysis

Gender DIF in Reading and Mathematics Tests with Mixed Item Formats

Peer reviewed

Direct link

Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2012

This was a study of differential item functioning (DIF) for grades 4, 7, and 10 reading and mathematics items from state criterion-referenced tests. The tests were composed of multiple-choice and constructed-response items. Gender DIF was investigated using POLYSIBTEST and a Rasch procedure. The Rasch procedure flagged more items for DIF than did…

Descriptors: Test Bias, Gender Differences, Reading Tests, Mathematics Tests

Are Inferential Reading Items More Susceptible to Cultural Bias than Literal Reading Items?

Peer reviewed

Direct link

Banks, Kathleen – Applied Measurement in Education, 2012

The purpose of this article is to illustrate a seven-step process for determining whether inferential reading items were more susceptible to cultural bias than literal reading items. The seven-step process was demonstrated using multiple-choice data from the reading portion of a reading/language arts test for fifth and seventh grade Hispanic,…

Descriptors: Reading Tests, Test Items, Standardized Tests, Test Bias

Recommendations for Preparing and Scoring Constructed-Response Items: What the Experts Say

Peer reviewed

Direct link

Hogan, Thomas P.; Murphy, Gavin – Applied Measurement in Education, 2007

We determined the recommendations for preparing and scoring constructed-response (CR) test items in 25 sources (textbooks and chapters) on educational and psychological measurement. The project was similar to Haladyna's (2004) analysis for multiple-choice items. We identified 12 recommendations for preparing CR items given by multiple sources,…

Descriptors: Test Items, Scoring, Test Construction, Educational Indicators