Showing 1 to 15 of 85 results
Peer reviewed
Rios, Joseph – Applied Measurement in Education, 2022
To mitigate the deleterious effects of rapid guessing (RG) on ability estimates, several rescoring procedures have been proposed. Underlying many of these procedures is the assumption that RG is accurately identified. At present, there have been minimal investigations examining the utility of rescoring approaches when RG is misclassified, and…
Descriptors: Accuracy, Guessing (Tests), Scoring, Classification
Jiayi Deng – ProQuest LLC, 2024
Test score comparability in international large-scale assessments (LSA) is of utmost importance in measuring the effectiveness of education systems and understanding the impact of education on economic growth. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic…
Descriptors: Item Response Theory, Scoring Rubrics, Scoring, Error of Measurement
Peer reviewed
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025
To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…
Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory
Peer reviewed
Wise, Steven; Kuhfeld, Megan – Applied Measurement in Education, 2021
Effort-moderated (E-M) scoring is intended to estimate how well a disengaged test taker would have performed had they been fully engaged. It accomplishes this adjustment by excluding disengaged responses from scoring and estimating performance from the remaining responses. The scoring method, however, assumes that the remaining responses are not…
Descriptors: Scoring, Achievement Tests, Identification, Validity
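The Wise and Kuhfeld abstract above describes the mechanics of effort-moderated (E-M) scoring: responses classified as disengaged are excluded, and performance is estimated from the remaining responses. The sketch below illustrates only that basic idea; the response-time threshold, data layout, and proportion-correct scoring are assumptions for illustration, not details taken from the article.

```python
# Minimal sketch of effort-moderated (E-M) scoring: drop responses flagged
# as rapid guesses, then score only the remaining (engaged) responses.

def effort_moderated_score(responses, rt_thresholds):
    """responses: list of (item_id, correct: bool, response_time_sec).
    rt_thresholds: dict item_id -> rapid-guessing time threshold (sec)."""
    engaged = [(item, correct) for item, correct, rt in responses
               if rt >= rt_thresholds[item]]          # keep only effortful responses
    if not engaged:
        return None                                   # nothing left to score
    return sum(correct for _, correct in engaged) / len(engaged)

# Example: item 3 answered in 1.2 s falls below its threshold and is excluded.
responses = [(1, True, 14.0), (2, False, 9.5), (3, True, 1.2)]
thresholds = {1: 3.0, 2: 3.0, 3: 3.0}
print(effort_moderated_score(responses, thresholds))  # 0.5
```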
Peer reviewed
Wise, Steven L.; Kuhfeld, Megan R. – Journal of Educational Measurement, 2021
There has been a growing research interest in the identification and management of disengaged test taking, which poses a validity threat that is particularly prevalent with low-stakes tests. This study investigated effort-moderated (E-M) scoring, in which item responses classified as rapid guesses are identified and excluded from scoring. Using…
Descriptors: Scoring, Data Use, Response Style (Tests), Guessing (Tests)
Peer reviewed
Gustafsson, Martin; Barakat, Bilal Fouad – Comparative Education Review, 2023
International assessments inform education policy debates, yet little is known about their floor effects: To what extent do they fail to differentiate between the lowest performers, and what are the implications of this? TIMSS, SACMEQ, and LLECE data are analyzed to answer this question. In TIMSS, floor effects have been reduced through the…
Descriptors: Achievement Tests, Elementary Secondary Education, International Assessment, Foreign Countries
Peer reviewed
Dimitrov, Dimiter M.; Atanasov, Dimitar V.; Luo, Yong – Measurement: Interdisciplinary Research and Perspectives, 2020
This study examines and compares four person-fit statistics (PFSs) in the framework of the "D"-scoring method (DSM): (a) van der Flier's "U3" statistic; (b) "Ud" statistic, as a modification of "U3" under the DSM; (c) "Zd" statistic, as a modification of the "Z3 (l_z)"…
Descriptors: Goodness of Fit, Item Analysis, Item Response Theory, Scoring
Peer reviewed (PDF full text available on ERIC)
Cesur, Kursat – Educational Policy Analysis and Strategic Research, 2019
Examinees' performances are assessed using a wide variety of techniques. Multiple-choice (MC) tests are among the most frequently used ones. Nearly all standardized achievement tests make use of MC test items, and there is a variety of ways to score these tests. The study compares number-right and liberal scoring (SAC) methods. Mixed…
Descriptors: Multiple Choice Tests, Scoring, Evaluation Methods, Guessing (Tests)
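The Cesur abstract notes that MC tests can be scored in several ways; the SAC (liberal scoring) procedure itself is not spelled out in the snippet, so the sketch below instead contrasts number-right scoring with classical formula scoring (a standard correction-for-guessing alternative), purely as an illustration of how scoring rules can diverge on the same answer pattern.

```python
# Illustrative comparison of number-right scoring with classical formula
# scoring (correction for guessing); this is NOT the SAC method from the article.

def number_right(n_correct, n_wrong, n_omitted, n_options):
    """Score is simply the count of correct answers."""
    return n_correct

def formula_score(n_correct, n_wrong, n_omitted, n_options):
    """Classical correction for guessing: R - W / (k - 1), where k is the
    number of options per item; omitted items are neither rewarded nor penalized."""
    return n_correct - n_wrong / (n_options - 1)

# 40 items with 4 options each: 25 right, 12 wrong, 3 omitted.
print(number_right(25, 12, 3, 4))   # 25
print(formula_score(25, 12, 3, 4))  # 21.0
```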
Peer reviewed
Sarac, Merve; Loken, Eric – International Journal of Testing, 2023
This study is an exploratory analysis of examinee behavior in a large-scale language proficiency test. Despite a number-right scoring system with no penalty for guessing, we found that 16% of examinees omitted at least one answer and that women were more likely than men to omit answers. Item-response theory analyses treating the omitted responses…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Second Language Learning
Peer reviewed
Lin, Chih-Kai – Language Assessment Quarterly, 2018
With multiple options to choose from, there is always a chance of lucky guessing by examinees on multiple-choice (MC) items, thereby potentially introducing bias in item difficulty estimates. Correct responses by random guessing thus pose threats to the validity of claims made from test performance on an MC test. Under the Rasch framework, the…
Descriptors: Guessing (Tests), Item Response Theory, Multiple Choice Tests, Language Tests
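The Lin abstract above concerns bias in Rasch item difficulty estimates when examinees answer correctly by lucky guessing. As background only, the sketch below contrasts the Rasch item response function with a three-parameter logistic (3PL) function whose lower asymptote models guessing; the parameter values are illustrative and not taken from the article.

```python
# Rasch item response function vs. a 3PL function with a guessing asymptote.
import math

def p_rasch(theta, b):
    """Rasch: P(correct) = 1 / (1 + exp(-(theta - b)))."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def p_3pl(theta, a, b, c):
    """3PL: P(correct) = c + (1 - c) / (1 + exp(-a * (theta - b)))."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))

# A low-ability examinee still has roughly a 20% success floor under the 3PL,
# which the Rasch model does not allow for.
print(p_rasch(-2.0, 0.0), p_3pl(-2.0, 1.0, 0.0, 0.20))
```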
Peer reviewed
Wise, Steven L. – Educational Measurement: Issues and Practice, 2017
The rise of computer-based testing has brought with it the capability to measure more aspects of a test event than simply the answers selected or constructed by the test taker. One behavior that has drawn much research interest is the time test takers spend responding to individual multiple-choice items. In particular, very short response…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Items, Reaction Time
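Wise's abstract above points to very short response times as a marker of rapid guessing. A minimal sketch of one common flagging convention is shown below (a normative threshold set at a fraction of each item's mean response time); the 10% fraction and the data layout are assumptions for illustration, not prescriptions from the article.

```python
# Flag rapid guesses with a normative response-time threshold per item
# (here: 10% of the item's mean response time; fraction is illustrative).

def rapid_guess_flags(times_by_item, fraction=0.10):
    """times_by_item: dict item_id -> list of response times (sec).
    Returns dict item_id -> list of booleans (True = flagged as rapid guess)."""
    flags = {}
    for item, times in times_by_item.items():
        threshold = fraction * (sum(times) / len(times))   # normative threshold
        flags[item] = [t < threshold for t in times]
    return flags

times = {"item_7": [12.0, 15.5, 0.9, 11.2]}
print(rapid_guess_flags(times))   # {'item_7': [False, False, True, False]}
```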
Peer reviewed (PDF full text available on ERIC)
Foley, Brett P. – Practical Assessment, Research & Evaluation, 2016
There is always a chance that examinees will answer multiple choice (MC) items correctly by guessing. Design choices in some modern exams have created situations where guessing at random through the full exam--rather than only for a subset of items where the examinee does not know the answer--can be an effective strategy to pass the exam. This…
Descriptors: Guessing (Tests), Multiple Choice Tests, Case Studies, Test Construction
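The Foley abstract describes exam designs in which guessing at random through the full exam can be an effective strategy for passing. A back-of-the-envelope way to check this for a given design is the binomial tail probability of reaching the cut score by chance alone; the item count, option count, and cut score below are hypothetical.

```python
# Probability of reaching a cut score by guessing at random on every MC item,
# modeled as a binomial tail. All numbers below are hypothetical.
from math import comb

def p_pass_by_guessing(n_items, n_options, cut_score):
    """Probability of >= cut_score correct when each item is an independent
    1/n_options random guess."""
    p = 1 / n_options
    return sum(comb(n_items, k) * p**k * (1 - p)**(n_items - k)
               for k in range(cut_score, n_items + 1))

# With a cut score close to chance-level performance, blind guessing
# passes surprisingly often.
print(p_pass_by_guessing(n_items=60, n_options=4, cut_score=18))
```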
Peer reviewed (PDF full text available on ERIC)
Long, Caroline; Engelbrecht, Johann; Scherman, Vanessa; Dunne, Tim – Pythagoras, 2016
The purpose of the South African Mathematics Olympiad is to generate interest in mathematics and to identify the most talented mathematical minds. Our focus is on how the handling of missing data affects the selection of the 'best' contestants. Two approaches to handling missing data, applying the Rasch model, are described. The issue of guessing is…
Descriptors: Foreign Countries, Competition, Secondary School Students, Talent Identification
Novacek, Paul – International Association for Development of the Information Society, 2013
Traditional knowledge assessments rely on multiple-choice type questions that only report a right or wrong answer. The education system's reliance on this technique implies that a student who provides a correct answer purely through guesswork possesses knowledge equivalent to that of a student who actually knows the correct answer. A more complete…
Descriptors: Adult Learning, Multiple Choice Tests, Guessing (Tests), Confidence Testing
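The Novacek abstract argues that right/wrong MC scoring cannot separate knowledge from lucky guessing and points toward confidence testing. One generic confidence-weighted scheme looks like the sketch below; the point values and two-level confidence scale are illustrative, not taken from the paper.

```python
# Generic confidence-weighted scoring: correct-and-confident answers earn the
# most credit, confident errors are penalized. Point values are illustrative.

CONFIDENCE_POINTS = {
    # (is_correct, confidence_level): points
    (True, "high"): 3, (True, "low"): 1,
    (False, "low"): 0, (False, "high"): -2,
}

def confidence_score(responses):
    """responses: list of (is_correct: bool, confidence: 'high' | 'low')."""
    return sum(CONFIDENCE_POINTS[(c, conf)] for c, conf in responses)

# A confident wrong answer costs points, unlike plain number-right scoring.
print(confidence_score([(True, "high"), (False, "high"), (True, "low")]))  # 2
```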
Peer reviewed
Fors, Uno G. H.; Gunning, William T. – Journal of Educational Computing Research, 2014
Virtual patient cases (VPs) are used for healthcare education and assessment. Most VP systems track user interactions to be used for assessment. Few studies have investigated how virtual exam cases should be scored and graded. We have applied eight different scoring models on a data set from 154 students. Issues studied included the impact of…
Descriptors: Scoring Rubrics, Health Education, Evaluation Methods, Case Method (Teaching Technique)