ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	10
Since 2006 (last 20 years)	15

Descriptor

Guessing (Tests)	74
Test Construction	74
Multiple Choice Tests	42
Test Items	33
Test Reliability	26
Test Validity	21
Item Analysis	17
Difficulty Level	15
Testing Problems	15
Scores	14
Scoring	14
Test Wiseness	12
Scoring Formulas	11
Foreign Countries	10
Higher Education	10
Objective Tests	10
Response Style (Tests)	10
Testing	10
Mathematical Models	9
Probability	9
Responses	9
Statistical Analysis	9
Test Format	9
Achievement Tests	8
Adaptive Testing	8
More ▼

Publication Type

Reports - Research	34
Journal Articles	30
Reports - Evaluative	11
Speeches/Meeting Papers	10
Reports - Descriptive	5
Opinion Papers	3
Tests/Questionnaires	2
Collected Works - Proceedings	1
Guides - Classroom - Teacher	1
Information Analyses	1
Numerical/Quantitative Data	1
More ▼

Education Level

Higher Education	3
Postsecondary Education	3
Adult Education	1
High Schools	1
Secondary Education	1

Audience

Researchers	4
Practitioners	2

Location

Canada	2
United Kingdom	2
Denmark	1
Indonesia	1
Japan	1
Nigeria	1
Pennsylvania	1
United Kingdom (England)	1
United Kingdom (Great Britain)	1

Laws, Policies, & Programs

Assessments and Surveys

SAT (College Admission Test)	2
California Achievement Tests	1
Graduate Record Examinations	1
Iowa Tests of Basic Skills	1
Preliminary Scholastic…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 74 results Save | Export

From Investigating the Alignment of a Priori Item Characteristics Based on the CTT and Four-Parameter Logistic (4-PL) IRT Models to Further Exploring the Comparability of the Two Models

Peer reviewed
PDF on ERIC

Download full text

Agus Santoso; Heri Retnawati; Timbul Pardede; Ibnu Rafi; Munaya Nikma Rosyada; Gulzhaina K. Kassymova; Xu Wenxin – Practical Assessment, Research & Evaluation, 2024

The test blueprint is important in test development, where it guides the test item writer in creating test items according to the desired objectives and specifications or characteristics (so-called a priori item characteristics), such as the level of item difficulty in the category and the distribution of items based on their difficulty level.…

Descriptors: Foreign Countries, Undergraduate Students, Business English, Test Construction

Towards a New Sophistication in Vocabulary Assessment

Peer reviewed

Direct link

Read, John – Language Testing, 2023

Published work on vocabulary assessment has grown substantially in the last 10 years, but it is still somewhat outside the mainstream of the field. There has been a recent call for those developing vocabulary tests to apply professional standards to their work, especially in validating their instruments for specified purposes before releasing them…

Descriptors: Language Tests, Vocabulary Development, Second Language Learning, Test Format

Dissecting Knowledge, Guessing, and Blunder in Multiple Choice Assessments

Peer reviewed

Direct link

Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023

Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…

Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models

Reconsidering the Assessment Policy: Practical Use of Liberal Multiple-Choice Tests (SAC Method)

Peer reviewed
PDF on ERIC

Download full text

Cesur, Kursat – Educational Policy Analysis and Strategic Research, 2019

Examinees' performances are assessed using a wide variety of different techniques. Multiple-choice (MC) tests are among the most frequently used ones. Nearly, all standardized achievement tests make use of MC test items and there is a variety of ways to score these tests. The study compares number right and liberal scoring (SAC) methods. Mixed…

Descriptors: Multiple Choice Tests, Scoring, Evaluation Methods, Guessing (Tests)

A Simulation-Based Method for Finding the Optimal Number of Options for Multiple-Choice Items on a Test. Research Report. ETS RR-18-22

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick – ETS Research Report Series, 2018

For a multiple-choice test under development or redesign, it is important to choose the optimal number of options per item so that the test possesses the desired psychometric properties. On the basis of available data for a multiple-choice assessment with 8 options, we evaluated the effects of changing the number of options on test properties…

Descriptors: Multiple Choice Tests, Test Items, Simulation, Test Construction

Randomize It: Fair Procedures When Constructing Multiple-Choice Test-Keys

Peer reviewed
PDF on ERIC

Download full text

Joseph, Dane Christian – Journal of Effective Teaching in Higher Education, 2019

Multiple-choice testing is a staple within the U.S. higher education system. From classroom assessments to standardized entrance exams such as the GRE, GMAT, or LSAT, test developers utilize a variety of validated and heuristic driven item-writing guidelines. One such guideline that has been given recent attention is to randomize the position of…

Descriptors: Test Construction, Multiple Choice Tests, Guessing (Tests), Test Wiseness

An Investigation into the Roles of Guessing and Partial Knowledge in the Vocabulary Size Test

Peer reviewed
PDF on ERIC

Download full text

Asquith, Steven – TESL-EJ, 2022

Although an accurate measure of vocabulary size is integral to understanding the proficiency of language learners, the validity of multiple-choice (M/C) vocabulary tests to determine this has been questioned due to users guessing correct answers which inflates scores. In this paper the nature of guessing and partial knowledge used when taking the…

Descriptors: Guessing (Tests), English (Second Language), Second Language Learning, Language Tests

Controlling Bias in Both Constructed Response and Multiple-Choice Items When Analyzed with the Dichotomous Rasch Model

Peer reviewed

Direct link

Andrich, David; Marais, Ida – Journal of Educational Measurement, 2018

Even though guessing biases difficulty estimates as a function of item difficulty in the dichotomous Rasch model, assessment programs with tests which include multiple-choice items often construct scales using this model. Research has shown that when all items are multiple-choice, this bias can largely be eliminated. However, many assessments have…

Descriptors: Multiple Choice Tests, Test Items, Guessing (Tests), Test Bias

Spoilt for Choice? Issues around the Use and Comparability of Optional Exam Questions

Peer reviewed

Direct link

Bramley, Tom; Crisp, Victoria – Assessment in Education: Principles, Policy & Practice, 2019

For many years, question choice has been used in some UK public examinations, with students free to choose which questions they answer from a selection (within certain parameters). There has been little published research on choice of exam questions in recent years in the UK. In this article we distinguish different scenarios in which choice…

Descriptors: Test Items, Test Construction, Difficulty Level, Foreign Countries

Getting Lucky: How Guessing Threatens the Validity of Performance Classifications

Peer reviewed
PDF on ERIC

Download full text

Foley, Brett P. – Practical Assessment, Research & Evaluation, 2016

There is always a chance that examinees will answer multiple choice (MC) items correctly by guessing. Design choices in some modern exams have created situations where guessing at random through the full exam--rather than only for a subset of items where the examinee does not know the answer--can be an effective strategy to pass the exam. This…

Descriptors: Guessing (Tests), Multiple Choice Tests, Case Studies, Test Construction

Construction of Valid and Reliable Test for Assessment of Students

Peer reviewed
PDF on ERIC

Download full text

Osadebe, P. U. – Journal of Education and Practice, 2015

The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…

Descriptors: Student Evaluation, Secondary School Students, Economics, Achievement Tests

A New Twist on Studying the Development of Dynamic Spatial Transformations: Mental Paper Folding in Young Children

Peer reviewed

Direct link

Harris, Justin; Newcombe, Nora S.; Hirsh-Pasek, Kathy – Mind, Brain, and Education, 2013

The relation of spatial skills to academic success in areas such as math and science has sparked discussion in early education around how spatial thinking skills might be included in early schooling. Planning and evaluating new curricula or interventions requires understanding these skills and having the means to assess them. Prior developmental…

Descriptors: Young Children, Spatial Ability, Thinking Skills, Cognitive Processes

New Directions in Vocabulary Testing

Direct link

Webb, Stuart A.; Sasao, Yosuke – RELC Journal: A Journal of Language Teaching and Research, 2013

There have been great strides made in research on vocabulary in the last 30 years. However, there has been relatively little progress in the development of new vocabulary tests. This may be due in some degree to the impressive contributions made by tests such as the Vocabulary Levels Test (Nation, 1983; Schmitt et al., 2001) and the Word…

Descriptors: Language Tests, Vocabulary Development, Second Language Instruction, Second Language Learning

Assessing Multiple Choice Question (MCQ) Tests--A Mathematical Perspective

Peer reviewed

Direct link

Scharf, Eric M.; Baldwin, Lynne P. – Active Learning in Higher Education: The Journal of the Institute for Learning and Teaching, 2007

The reasoning behind popular methods for analysing the raw data generated by multiple choice question (MCQ) tests is not always appreciated, occasionally with disastrous results. This article discusses and analyses three options for processing the raw data produced by MCQ tests. The article shows that one extreme option is not to penalize a…

Descriptors: Guessing (Tests), Test Items, Multiple Choice Tests, Questioning Techniques

On Keats' Generalization of the Rasch Model

Peer reviewed

Colonius, Hans – Psychometrika, 1977

Parameter estimation for Keats generalization of the Rasch model that takes account of guessing behavior is investigated. It is shown that no minimal sufficient statistics for the ability parameters independent of the difficulty parameters exist. (Author/JKS)

Descriptors: Guessing (Tests), Item Analysis, Test Construction, Test Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

Educational and Psychological…	6
Applied Measurement in…	3
Journal of Educational…	3
Language Testing	2
Practical Assessment,…	2
Active Learning in Higher…	1
American Journal of Physics	1
American Mathematical Monthly	1
Applied Psychological…	1
Assessment & Evaluation in…	1
Assessment in Education:…	1
ETS Research Report Series	1
Educ Psychol Meas	1
Educational Policy Analysis…	1
Educational Technology	1
English Language Teaching	1
Harvard Educational Review	1
International Journal of…	1
Journal of Education and…	1
Journal of Effective Teaching…	1
Journal of Geography in…	1
Medical Education	1
Medical Teacher	1
Mind, Brain, and Education	1
Nursing Outlook	1
More ▼

Lord, Frederic M.	3
Urry, Vern W.	3
Frary, Robert B.	2
Wise, Steven L.	2
Abu-Ghazalah, Rashid M.	1
Abu-Sayf, F. K.	1
Agus Santoso	1
Anderson, Paul S.	1
Andrich, David	1
Asquith, Steven	1
Austin, Joe Dan	1
Baldauf, Richard B., Jr.	1
Baldwin, Lynne P.	1
Berger, Martijn P. F.	1
Biran, Leonard A.	1
Bramley, Tom	1
Braswell, James S.	1
Brennan, Robert L,	1
Budescu, David V.	1
Burton, Richard F.	1
Cesur, Kursat	1
Choppin, Bruce	1
Clark, Cynthia L., Ed.	1
Clarke, Mark A.	1
More ▼