Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 9 |
Since 2016 (last 10 years) | 21 |
Since 2006 (last 20 years) | 35 |
Descriptor
Comparative Analysis | 68 |
Multiple Choice Tests | 68 |
Test Validity | 50 |
Test Reliability | 25 |
Foreign Countries | 22 |
Test Construction | 21 |
Test Items | 18 |
Language Tests | 16 |
Higher Education | 15 |
Test Format | 13 |
Validity | 13 |
More ▼ |
Source
Author
Coniam, David | 2 |
Ebel, Robert L. | 2 |
Frisbie, David A. | 2 |
Hakstian, A. Ralph | 2 |
Kansup, Wanlop | 2 |
Kibble, Jonathan D. | 2 |
Lee, Tony | 2 |
Aesaert, Koen | 1 |
Alemi, Minoo | 1 |
Anderson, Paul S. | 1 |
Arth, Thomas O. | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 16 |
Postsecondary Education | 14 |
Secondary Education | 9 |
Elementary Education | 5 |
High Schools | 5 |
Middle Schools | 3 |
Junior High Schools | 2 |
Elementary Secondary Education | 1 |
Grade 4 | 1 |
Grade 6 | 1 |
Grade 8 | 1 |
More ▼ |
Audience
Practitioners | 1 |
Location
Germany | 4 |
Australia | 2 |
China | 2 |
Europe | 2 |
Indonesia | 2 |
Taiwan | 2 |
Turkey | 2 |
United States | 2 |
Belgium | 1 |
California | 1 |
Chile | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Advanced Placement… | 1 |
National Assessment of… | 1 |
Personality Research Form | 1 |
Program for International… | 1 |
SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Katrin Klingbeil; Fabian Rösken; Bärbel Barzel; Florian Schacht; Kaye Stacey; Vicki Steinle; Daniel Thurm – ZDM: Mathematics Education, 2024
Assessing students' (mis)conceptions is a challenging task for teachers as well as for researchers. While individual assessment, for example through interviews, can provide deep insights into students' thinking, this is very time-consuming and therefore not feasible for whole classes or even larger settings. For those settings, automatically…
Descriptors: Multiple Choice Tests, Formative Evaluation, Mathematics Tests, Misconceptions
Coniam, David; Lee, Tony; Lampropoulou, Leda – English Language Teaching, 2021
This article explores the issue of identifying guessers -- with a specific focus on multiple-choice tests. Guessing has long been considered a problem due to the fact that it compromises validity. A test taker scoring higher than they should through guessing does not provide a picture of their actual ability. After an initial description of issues…
Descriptors: Language Tests, Guessing (Tests), English (Second Language), Second Language Learning
Yangqiuting Li; Chandralekha Singh – Physical Review Physics Education Research, 2025
Research-based multiple-choice questions implemented in class with peer instruction have been shown to be an effective tool for improving students' engagement and learning outcomes. Moreover, multiple-choice questions that are carefully sequenced to build on each other can be particularly helpful for students to develop a systematic understanding…
Descriptors: Physics, Science Instruction, Science Tests, Multiple Choice Tests
Deribo, Tobias; Goldhammer, Frank; Kroehne, Ulf – Educational and Psychological Measurement, 2023
As researchers in the social sciences, we are often interested in studying not directly observable constructs through assessments and questionnaires. But even in a well-designed and well-implemented study, rapid-guessing behavior may occur. Under rapid-guessing behavior, a task is skimmed shortly but not read and engaged with in-depth. Hence, a…
Descriptors: Reaction Time, Guessing (Tests), Behavior Patterns, Bias
Shear, Benjamin R. – Journal of Educational Measurement, 2023
Large-scale standardized tests are regularly used to measure student achievement overall and for student subgroups. These uses assume tests provide comparable measures of outcomes across student subgroups, but prior research suggests score comparisons across gender groups may be complicated by the type of test items used. This paper presents…
Descriptors: Gender Bias, Item Analysis, Test Items, Achievement Tests
Graben, Katharina; Doering, Bettina K.; Barke, Antonia – Education and Information Technologies, 2022
In this study, we investigated whether the use of smartphone games while reading a text reduces learning performance or reading speed. We also examined whether this is affected by push notifications. Ninety-three students were randomly assigned to three learning conditions. In the gaming group (G), participants played a game app for 20 s at 2-min…
Descriptors: Telecommunications, Handheld Devices, Computer Games, Reading Processes
Coniam, David; Lee, Tony; Milanovic, Michael; Pike, Nigel; Zhao, Wen – Language Education & Assessment, 2022
The calibration of test materials generally involves the interaction between empirical analysis and expert judgement. This paper explores the extent to which scale familiarity might affect expert judgement as a component of test validation in the calibration process. It forms part of a larger study that investigates the alignment of the…
Descriptors: Specialists, Language Tests, Test Validity, College Faculty
Smith, Mark D. – Theory and Research in Social Education, 2018
History education scholars have recognized the need for test validity research in recent years and have called for empirical studies that explore how to best measure historical thinking processes. The present study was designed to help answer this call and to provide a model that others can adapt to carry this line of research forward. It employed…
Descriptors: History Instruction, Multiple Choice Tests, Cognitive Tests, Protocol Analysis
Howe, Eric – ProQuest LLC, 2019
The study of art, especially perspective, involves the use of specialized vocabulary words. Vocabulary words can be difficult to comprehend, but when students learn to use the specialized vocabulary or academic language of a subject, the learner is better able to think about the content. While academic language is only a part of a visual art…
Descriptors: Metacognition, Learning Processes, Comparative Analysis, Art Education
Bardovi-Harlig, Kathleen; Su, Yunwen – TESL-EJ, 2021
This exploratory study examines the role of foreign and second language contexts in the acquisition of conventional expressions. A group of 21 ESL learners was compared to 25 EFL learners randomly selected from a larger pool. Both groups completed an aural multiple-choice discourse completion task (MC-DCT), which was developed from a previously…
Descriptors: Multiple Choice Tests, Second Language Learning, Second Language Instruction, English (Second Language)
Bourdeaud'Hui, Heleen; Aesaert, Koen; van Braak, Johan – Language Assessment Quarterly, 2021
Effective listening comprehension skills are an important prerequisite for the academic success of primary school students. However, the assessment of listening skills in the instructional language appears to have received only scant attention in the literature. Therefore, the goal of the present study was twofold. Firstly, a comprehensive…
Descriptors: Native Language, Indo European Languages, Second Language Learning, Test Items
Szulewski, Adam; Gegenfurtner, Andreas; Howes, Daniel W.; Sivilotti, Marco L. A.; van Merriënboer, Jeroen J. G. – Advances in Health Sciences Education, 2017
In general, researchers attempt to quantify cognitive load using physiologic and psychometric measures. Although the construct measured by both of these metrics is thought to represent overall cognitive load, there is a paucity of studies that compares these techniques to one another. The authors compared data obtained from one physiologic tool…
Descriptors: Physicians, Cognitive Processes, Difficulty Level, Physiology
Fauville, Géraldine; Strang, Craig; Cannady, Matthew A.; Chen, Ying-Fang – Environmental Education Research, 2019
The Ocean Literacy movement began in the U.S. in the early 2000s, and has recently become an international effort. The focus on marine environmental issues and marine education is increasing, and yet it has been difficult to show progress of the ocean literacy movement, in part, because no widely adopted measurement tool exists. The International…
Descriptors: Marine Education, Environmental Education, Comparative Analysis, Factor Structure
Steedle, Jeffrey T.; Ferrara, Steve – Applied Measurement in Education, 2016
As an alternative to rubric scoring, comparative judgment generates essay scores by aggregating decisions about the relative quality of the essays. Comparative judgment eliminates certain scorer biases and potentially reduces training requirements, thereby allowing a large number of judges, including teachers, to participate in essay evaluation.…
Descriptors: Essays, Scoring, Comparative Analysis, Evaluators
Krell, Moritz; Mathesius, Sabrina; van Driel, Jan; Vergara, Claudia; Krüger, Dirk – International Journal of Science Education, 2020
Scientific reasoning competencies are relevant science competencies and therefore the development of assessment instruments for scientific reasoning competencies has become an integral part of science education research. However, some authors have questioned the validity of the instruments available so far, since their psychometric quality has not…
Descriptors: Preservice Teachers, Science Teachers, Science Instruction, Psychometrics