Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 15 |
Descriptor
Guessing (Tests) | 74 |
Test Construction | 74 |
Multiple Choice Tests | 42 |
Test Items | 33 |
Test Reliability | 26 |
Test Validity | 21 |
Item Analysis | 17 |
Difficulty Level | 15 |
Testing Problems | 15 |
Scores | 14 |
Scoring | 14 |
More ▼ |
Source
Author
Lord, Frederic M. | 3 |
Urry, Vern W. | 3 |
Frary, Robert B. | 2 |
Wise, Steven L. | 2 |
Abu-Ghazalah, Rashid M. | 1 |
Abu-Sayf, F. K. | 1 |
Agus Santoso | 1 |
Anderson, Paul S. | 1 |
Andrich, David | 1 |
Asquith, Steven | 1 |
Austin, Joe Dan | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 3 |
Postsecondary Education | 3 |
Adult Education | 1 |
High Schools | 1 |
Secondary Education | 1 |
Audience
Researchers | 4 |
Practitioners | 2 |
Location
Canada | 2 |
United Kingdom | 2 |
Denmark | 1 |
Indonesia | 1 |
Japan | 1 |
Nigeria | 1 |
Pennsylvania | 1 |
United Kingdom (England) | 1 |
United Kingdom (Great Britain) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
SAT (College Admission Test) | 2 |
California Achievement Tests | 1 |
Graduate Record Examinations | 1 |
Iowa Tests of Basic Skills | 1 |
Preliminary Scholastic… | 1 |
What Works Clearinghouse Rating
Agus Santoso; Heri Retnawati; Timbul Pardede; Ibnu Rafi; Munaya Nikma Rosyada; Gulzhaina K. Kassymova; Xu Wenxin – Practical Assessment, Research & Evaluation, 2024
The test blueprint is important in test development, where it guides the test item writer in creating test items according to the desired objectives and specifications or characteristics (so-called a priori item characteristics), such as the level of item difficulty in the category and the distribution of items based on their difficulty level.…
Descriptors: Foreign Countries, Undergraduate Students, Business English, Test Construction
Read, John – Language Testing, 2023
Published work on vocabulary assessment has grown substantially in the last 10 years, but it is still somewhat outside the mainstream of the field. There has been a recent call for those developing vocabulary tests to apply professional standards to their work, especially in validating their instruments for specified purposes before releasing them…
Descriptors: Language Tests, Vocabulary Development, Second Language Learning, Test Format
Abu-Ghazalah, Rashid M.; Dubins, David N.; Poon, Gregory M. K. – Applied Measurement in Education, 2023
Multiple choice results are inherently probabilistic outcomes, as correct responses reflect a combination of knowledge and guessing, while incorrect responses additionally reflect blunder, a confidently committed mistake. To objectively resolve knowledge from responses in an MC test structure, we evaluated probabilistic models that explicitly…
Descriptors: Guessing (Tests), Multiple Choice Tests, Probability, Models
Cesur, Kursat – Educational Policy Analysis and Strategic Research, 2019
Examinees' performances are assessed using a wide variety of different techniques. Multiple-choice (MC) tests are among the most frequently used ones. Nearly, all standardized achievement tests make use of MC test items and there is a variety of ways to score these tests. The study compares number right and liberal scoring (SAC) methods. Mixed…
Descriptors: Multiple Choice Tests, Scoring, Evaluation Methods, Guessing (Tests)
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick – ETS Research Report Series, 2018
For a multiple-choice test under development or redesign, it is important to choose the optimal number of options per item so that the test possesses the desired psychometric properties. On the basis of available data for a multiple-choice assessment with 8 options, we evaluated the effects of changing the number of options on test properties…
Descriptors: Multiple Choice Tests, Test Items, Simulation, Test Construction
Joseph, Dane Christian – Journal of Effective Teaching in Higher Education, 2019
Multiple-choice testing is a staple within the U.S. higher education system. From classroom assessments to standardized entrance exams such as the GRE, GMAT, or LSAT, test developers utilize a variety of validated and heuristic driven item-writing guidelines. One such guideline that has been given recent attention is to randomize the position of…
Descriptors: Test Construction, Multiple Choice Tests, Guessing (Tests), Test Wiseness
Asquith, Steven – TESL-EJ, 2022
Although an accurate measure of vocabulary size is integral to understanding the proficiency of language learners, the validity of multiple-choice (M/C) vocabulary tests to determine this has been questioned due to users guessing correct answers which inflates scores. In this paper the nature of guessing and partial knowledge used when taking the…
Descriptors: Guessing (Tests), English (Second Language), Second Language Learning, Language Tests
Andrich, David; Marais, Ida – Journal of Educational Measurement, 2018
Even though guessing biases difficulty estimates as a function of item difficulty in the dichotomous Rasch model, assessment programs with tests which include multiple-choice items often construct scales using this model. Research has shown that when all items are multiple-choice, this bias can largely be eliminated. However, many assessments have…
Descriptors: Multiple Choice Tests, Test Items, Guessing (Tests), Test Bias
Bramley, Tom; Crisp, Victoria – Assessment in Education: Principles, Policy & Practice, 2019
For many years, question choice has been used in some UK public examinations, with students free to choose which questions they answer from a selection (within certain parameters). There has been little published research on choice of exam questions in recent years in the UK. In this article we distinguish different scenarios in which choice…
Descriptors: Test Items, Test Construction, Difficulty Level, Foreign Countries
Foley, Brett P. – Practical Assessment, Research & Evaluation, 2016
There is always a chance that examinees will answer multiple choice (MC) items correctly by guessing. Design choices in some modern exams have created situations where guessing at random through the full exam--rather than only for a subset of items where the examinee does not know the answer--can be an effective strategy to pass the exam. This…
Descriptors: Guessing (Tests), Multiple Choice Tests, Case Studies, Test Construction
Osadebe, P. U. – Journal of Education and Practice, 2015
The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…
Descriptors: Student Evaluation, Secondary School Students, Economics, Achievement Tests
Harris, Justin; Newcombe, Nora S.; Hirsh-Pasek, Kathy – Mind, Brain, and Education, 2013
The relation of spatial skills to academic success in areas such as math and science has sparked discussion in early education around how spatial thinking skills might be included in early schooling. Planning and evaluating new curricula or interventions requires understanding these skills and having the means to assess them. Prior developmental…
Descriptors: Young Children, Spatial Ability, Thinking Skills, Cognitive Processes
Webb, Stuart A.; Sasao, Yosuke – RELC Journal: A Journal of Language Teaching and Research, 2013
There have been great strides made in research on vocabulary in the last 30 years. However, there has been relatively little progress in the development of new vocabulary tests. This may be due in some degree to the impressive contributions made by tests such as the Vocabulary Levels Test (Nation, 1983; Schmitt et al., 2001) and the Word…
Descriptors: Language Tests, Vocabulary Development, Second Language Instruction, Second Language Learning
Scharf, Eric M.; Baldwin, Lynne P. – Active Learning in Higher Education: The Journal of the Institute for Learning and Teaching, 2007
The reasoning behind popular methods for analysing the raw data generated by multiple choice question (MCQ) tests is not always appreciated, occasionally with disastrous results. This article discusses and analyses three options for processing the raw data produced by MCQ tests. The article shows that one extreme option is not to penalize a…
Descriptors: Guessing (Tests), Test Items, Multiple Choice Tests, Questioning Techniques

Colonius, Hans – Psychometrika, 1977
Parameter estimation for Keats generalization of the Rasch model that takes account of guessing behavior is investigated. It is shown that no minimal sufficient statistics for the ability parameters independent of the difficulty parameters exist. (Author/JKS)
Descriptors: Guessing (Tests), Item Analysis, Test Construction, Test Reliability