ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	3

Descriptor

Multiple Choice Tests	28
Test Validity	27
Test Reliability	19
Test Construction	11
Guessing (Tests)	8
Test Items	7
Scoring Formulas	6
Comparative Analysis	5
Higher Education	5
Scoring	5
Achievement Tests	4
Response Style (Tests)	4
Test Wiseness	4
Testing	4
Weighted Scores	4
Confidence Testing	3
Correlation	3
Foreign Countries	3
High School Students	3
High Schools	3
Objective Tests	3
Reading Comprehension	3
Reading Tests	3
Scores	3
Test Format	3
More ▼

Source

Journal of Educational…

Publication Type

Journal Articles	12
Reports - Research	12
Speeches/Meeting Papers	1

Education Level

Higher Education	1
Postsecondary Education	1
Secondary Education	1

Audience

Location

Canada	1
Jordan	1

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	1
Program for International…	1
Sequential Tests of…	1
Test of Standard Written…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 28 results Save | Export

Using Eye-Tracking Data as Part of the Validity Argument for Multiple-Choice Questions: A Demonstration

Peer reviewed

Direct link

Yaneva, Victoria; Clauser, Brian E.; Morales, Amy; Paniagua, Miguel – Journal of Educational Measurement, 2021

Eye-tracking technology can create a record of the location and duration of visual fixations as a test-taker reads test questions. Although the cognitive process the test-taker is using cannot be directly observed, eye-tracking data can support inferences about these unobserved cognitive processes. This type of information has the potential to…

Descriptors: Eye Movements, Test Validity, Multiple Choice Tests, Cognitive Processes

Gender Bias in Test Item Formats: Evidence from PISA 2009, 2012, and 2015 Math and Reading Tests

Peer reviewed

Direct link

Shear, Benjamin R. – Journal of Educational Measurement, 2023

Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…

Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests

Can We Learn from Student Mistakes in a Formative, Reading Comprehension Assessment?

Peer reviewed

Direct link

Liu, Bowen; Kennedy, Patrick C.; Seipel, Ben; Carlson, Sarah E.; Biancarosa, Gina; Davison, Mark L. – Journal of Educational Measurement, 2019

This article describes an ongoing project to develop a formative, inferential reading comprehension assessment of causal story comprehension. It has three features to enhance classroom use: equated scale scores for progress monitoring within and across grades, a scale score to distinguish among low-scoring students based on patterns of mistakes,…

Descriptors: Formative Evaluation, Reading Comprehension, Story Reading, Test Construction

The Effects of Guttman Weights on the Reliability and Predictive Validity of Objective Tests When Omissions Are Not Differentially Weighted

Peer reviewed

Raffeld, Paul – Journal of Educational Measurement, 1975

Results support the contention that a Guttman-weighted objective test can have psychometric properties that are superior to those of its unweighted counterpart, as long as omissions do not exist or are assigned a value equal to the mean of the k item alternative weights. (Author/BJG)

Descriptors: Multiple Choice Tests, Predictive Validity, Test Reliability, Test Validity

Incremental Reliability and Validity of Multiple-Choice Tests with an Answer-Until-Correct Procedure

Peer reviewed

Hanna, Gerald S. – Journal of Educational Measurement, 1975

An alternative to the conventional right-wrong scoring method used on multiple-choice tests was presented. In the experiment, the examinee continued to respond to a multiple-choice item until feedback signified a correct answer. Findings showed that experimental scores were more reliable but less valid than inferred conventional scores.…

Descriptors: Feedback, Higher Education, Multiple Choice Tests, Scoring

Overconfidence on Probabilistic Tests

Peer reviewed

Koehler, Roger A. – Journal of Educational Measurement, 1974

The purposes of the study were to develop a measure of overconfidence on probabilistic tests, to assess the measurement characteristics of such a measure, and to investigate the relationship of overconfidence on tests to knowledge and to risk-taking propensity. (Author/BB)

Descriptors: Confidence Testing, Measurement Techniques, Multiple Choice Tests, Risk

The Number of Alternatives for Optimum Test Reliability

Peer reviewed

Grier, J. Brown – Journal of Educational Measurement, 1975

The expected reliability of a multiple choice test is maximized by the use of three alternative items. (Author)

Descriptors: Achievement Tests, Multiple Choice Tests, Test Construction, Test Reliability

Can Teachers Write Good True-False Test Items?

Peer reviewed

Ebel, Robert L. – Journal of Educational Measurement, 1975

Descriptors: Comparative Analysis, Multiple Choice Tests, Objective Tests, Teachers

The Effects of Selected Poor Item-Writing Practices on Test Difficulty, Reliability and Validity

Peer reviewed

Board, Cynthia; Whitney, Douglas R. – Journal of Educational Measurement, 1972

For the principles studied here, poor item-writing practices serve to obscure (or attentuate) differences between good and poor students. (Authors)

Descriptors: College Students, Item Analysis, Multiple Choice Tests, Test Construction

Development and Evaluation of a Test of Information Storage During Reading

Peer reviewed

Carver, Ronald P.; Darby, Charles A., Jr. – Journal of Educational Measurement, 1971

Discusses a reading test using chunked" items -- groups of meaningfully related words in which certain groups are changed in meaning from the original passage. (Author)

Descriptors: Information Storage, Multiple Choice Tests, Reading Comprehension, Reading Tests

Elimination Scoring: An Empirical Evaluation

Peer reviewed

Collet, Leverne S. – Journal of Educational Measurement, 1971

The purpose of this paper was to provide an empirical test of the hypothesis that elimination scores are more reliable and valid than classical corrected-for-guessing scores or weighted-choice scores. The evidence presented supports the hypothesized superiority of elimination scoring. (Author)

Descriptors: Evaluation, Guessing (Tests), Multiple Choice Tests, Scoring Formulas

Multiple Choice Versus True-False: A Comparison of Reliabilities and Concurrent Validities

Peer reviewed

Frisbee, David A. – Journal of Educational Measurement, 1973

The purpose of this study was to gather empirical evidence to compare the reliabilities and concurrent validities of multiple choice and true-false tests that were written to measure understandings and relationships in the same content areas. (Author)

Descriptors: Achievement Tests, Correlation, High School Students, Measurement

Effects of Empirical Option Weighting on Reliability and Validity of an Academic Aptitude Test

Peer reviewed

Reilly, Richard R.; Jackson, Rex – Journal of Educational Measurement, 1973

The present study suggests that although the reliability of an academic aptitude test given under formula-score condition can be increased substantially through empirical option weighting, much of the increase is due to the capitalization of the keying procedure on omitting tendencies which are reliable but not valid. (Author)

Descriptors: Aptitude Tests, Correlation, Factor Analysis, Item Sampling

Assessment of Retarded Student Achievement with Standardized True/False and Multiple-Choice Tests.

Peer reviewed

Irvin, Larry K.; And Others – Journal of Educational Measurement, 1980

The relative efficacy of content-appropriate, orally administered true/false and multiple-choice testing was examined with retarded adolescents. Both approaches demonstrated utility and psychometric adequacy. Implications regarding test development for retarded students are briefly discussed. (Author)

Descriptors: High Schools, Mild Mental Retardation, Multiple Choice Tests, Objective Tests

A Preliminary Study of the Reliability and Validity of a Scoring Procedure Based Upon Confidence and Partial Information

Peer reviewed

Diamond, James J. – Journal of Educational Measurement, 1975

Investigates the reliability and validity of scores yielded from a new scoring formula. (Author/DEP)

Descriptors: Guessing (Tests), Multiple Choice Tests, Objective Tests, Scoring

Previous Page | Next Page »

Pages: 1 | 2

Hakstian, A. Ralph	2
Kansup, Wanlop	2
Bennett, Randy Elliot	1
Biancarosa, Gina	1
Board, Cynthia	1
Breland, Hunter M.	1
Carlson, Sarah E.	1
Carver, Ronald P.	1
Clauser, Brian E.	1
Collet, Leverne S.	1
Cross, Lawrence	1
Darby, Charles A., Jr.	1
Davison, Mark L.	1
Diamond, James J.	1
Ebel, Robert L.	1
Farr, Roger	1
Forsyth, Robert A.	1
Frary, Robert	1
Frary, Robert B.	1
Frisbee, David A.	1
Gaynor, Judith L.	1
Grier, J. Brown	1
Hanna, Gerald S.	1
Irvin, Larry K.	1
More ▼