Showing all 9 results
Peer reviewed
Wise, Steven L. – Educational Measurement: Issues and Practice, 2017
The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time
Peer reviewed
Liu, Ou Lydia; Brew, Chris; Blackmore, John; Gerard, Libby; Madhok, Jacquie; Linn, Marcia C. – Educational Measurement: Issues and Practice, 2014
Content-based automated scoring has been applied in a variety of science domains. However, many prior applications involved simplified scoring rubrics without considering rubrics representing multiple levels of understanding. This study tested a concept-based scoring tool for content-based scoring, c-rater™, for four science items with rubrics…
Descriptors: Science Tests, Test Items, Scoring, Automation
Peer reviewed
Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012
Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…
Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability
Peer reviewed
Raymond, Mark R.; Neustel, Sandra; Anderson, Dan – Educational Measurement: Issues and Practice, 2009
Examinees who take high-stakes assessments are usually given an opportunity to repeat the test if they are unsuccessful on their initial attempt. To prevent examinees from obtaining unfair score increases by memorizing the content of specific test items, testing agencies usually assign a different test form to repeat examinees. The use of multiple…
Descriptors: Test Results, Test Items, Testing, Aptitude Tests
Peer reviewed
Sykes, Robert C.; Ito, Kyoko; Wang, Zhen – Educational Measurement: Issues and Practice, 2008
Student responses to a large number of constructed response items in three Math and three Reading tests were scored on two occasions using three ways of assigning raters: single reader scoring, a different reader for each response (item-specific), and three readers each scoring a rater item block (RIB) containing approximately one-third of a…
Descriptors: Test Items, Mathematics Tests, Reading Tests, Scoring
Peer reviewed
Solano-Flores, Guillermo; Shavelson, Richard J. – Educational Measurement: Issues and Practice, 1997
Conceptual, practical, and logistical issues in the development of science performance assessments (SPAs) are discussed. The conceptual framework identifies task, response format, and scoring system as components, and conceives of SPAs as tasks that attempt to recreate conditions in which scientists work. Developing SPAs is a sophisticated effort…
Descriptors: Elementary Secondary Education, Performance Based Assessment, Science Education, Science Tests
Peer reviewed
Haladyna, Thomas M. – Educational Measurement: Issues and Practice, 1992
Context-dependent item sets, containing a subset of test items related to a passage or stimulus, are discussed. A brief review of methods for developing item sets reveals their potential for measuring high-level thinking. Theories and technologies for scoring item sets remain largely experimental. Research needs are discussed. (SLD)
Descriptors: Cognitive Tests, Educational Technology, Licensing Examinations (Professions), Problem Solving
Peer reviewed
Frisbie, David A. – Educational Measurement: Issues and Practice, 1992
Literature related to the multiple true-false (MTF) item format is reviewed. Each answer cluster of an MTF item may contain several true items, and the correctness of each is judged independently. MTF tests appear efficient and reliable, although they are somewhat harder than multiple-choice items for examinees. (SLD)
Descriptors: Achievement Tests, Difficulty Level, Literature Reviews, Multiple Choice Tests
Peer reviewed
Yen, Wendy M.; And Others – Educational Measurement: Issues and Practice, 1987
This paper discusses how to maintain the integrity of national normative information for achievement tests when the administered test has been customized to satisfy local needs and is not a test that has been nationally normed. Alternative procedures for item selection and calibration are examined. (Author/LMO)
Descriptors: Achievement Tests, Elementary Secondary Education, Goodness of Fit, Item Analysis