Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 11 |
Since 2016 (last 10 years) | 21 |
Since 2006 (last 20 years) | 32 |
Descriptor
Difficulty Level | 46 |
Multiple Choice Tests | 46 |
Scores | 46 |
Test Items | 34 |
Test Format | 17 |
Test Construction | 15 |
Test Reliability | 14 |
Foreign Countries | 12 |
Item Analysis | 12 |
Higher Education | 9 |
Undergraduate Students | 9 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 41 |
Journal Articles | 35 |
Speeches/Meeting Papers | 9 |
Reports - Evaluative | 3 |
Tests/Questionnaires | 3 |
Dissertations/Theses -… | 1 |
Guides - General | 1 |
Information Analyses | 1 |
Education Level
Higher Education | 16 |
Postsecondary Education | 13 |
Elementary Education | 4 |
Intermediate Grades | 3 |
Secondary Education | 3 |
Grade 5 | 2 |
Middle Schools | 2 |
Early Childhood Education | 1 |
Grade 1 | 1 |
Grade 2 | 1 |
Grade 4 | 1 |
More ▼ |
Audience
Researchers | 3 |
Location
Turkey | 3 |
Colorado | 2 |
Canada | 1 |
Germany | 1 |
Germany (Berlin) | 1 |
India | 1 |
Indonesia | 1 |
Iran | 1 |
Morocco | 1 |
New Jersey | 1 |
North Dakota | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 2 |
Flesch Kincaid Grade Level… | 1 |
Flesch Reading Ease Formula | 1 |
Program for International… | 1 |
SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Thompson, Kathryn N. – ProQuest LLC, 2023
It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…
Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023
A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness
Herrmann-Abell, Cari F.; Hardcastle, Joseph; DeBoer, George E. – Grantee Submission, 2022
As implementation of the "Next Generation Science Standards" moves forward, there is a need for new assessments that can measure students' integrated three-dimensional science learning. The National Research Council has suggested that these assessments be multicomponent tasks that utilize a combination of item formats including…
Descriptors: Multiple Choice Tests, Conditioning, Test Items, Item Response Theory
Slepkov, A. D.; Van Bussel, M. L.; Fitze, K. M.; Burr, W. S. – SAGE Open, 2021
There is a broad literature in multiple-choice test development, both in terms of item-writing guidelines, and psychometric functionality as a measurement tool. However, most of the published literature concerns multiple-choice testing in the context of expert-designed high-stakes standardized assessments, with little attention being paid to the…
Descriptors: Foreign Countries, Undergraduate Students, Student Evaluation, Multiple Choice Tests
Saepuzaman, Duden; Istiyono, Edi; Haryanto – Pegem Journal of Education and Instruction, 2022
HOTS is one part of the skills that need to be developed in the 21st Century . This study aims to determine the characteristics of the Fundamental Physics Higher-order Thinking Skill (FundPhysHOTS) test for prospective physics teachers using Item Response Theory (IRT) analysis. This study uses a quantitative approach. 254 prospective physics…
Descriptors: Thinking Skills, Physics, Science Process Skills, Cognitive Tests
Graben, Katharina; Doering, Bettina K.; Barke, Antonia – Education and Information Technologies, 2022
In this study, we investigated whether the use of smartphone games while reading a text reduces learning performance or reading speed. We also examined whether this is affected by push notifications. Ninety-three students were randomly assigned to three learning conditions. In the gaming group (G), participants played a game app for 20 s at 2-min…
Descriptors: Telecommunications, Handheld Devices, Computer Games, Reading Processes
Ait bentaleb, Khalid; Dachraoui, Saddik; Hassouni, Taoufik; Alibrahmi, El mehdi; Chakir, Elmahjoub; Belboukhari, Aimad – European Journal of Educational Research, 2022
We developed a Quantum Mechanics Conceptual Understanding Survey (QMCUS) in this study. The survey was conducted using a quantitative methodology. A multiple-choice survey of 35 questions was administered to 338 undergraduate students. Three experienced quantum mechanics instructors examined the validity of the survey. The reliability of our…
Descriptors: Scientific Concepts, Concept Formation, Physics, Undergraduate Students
Guven Demir, Elif; Öksuz, Yücel – Participatory Educational Research, 2022
This research aimed to investigate animation-based achievement tests according to the item format, psychometric features, students' performance, and gender. The study sample consisted of 52 fifth-grade students in Samsun/Turkey in 2017-2018. Measures of the research were open-ended (OE), animation-based open-ended (AOE), multiple-choice (MC), and…
Descriptors: Animation, Achievement Tests, Test Items, Psychometrics
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick – ETS Research Report Series, 2018
For a multiple-choice test under development or redesign, it is important to choose the optimal number of options per item so that the test possesses the desired psychometric properties. On the basis of available data for a multiple-choice assessment with 8 options, we evaluated the effects of changing the number of options on test properties…
Descriptors: Multiple Choice Tests, Test Items, Simulation, Test Construction
Moro, Sérgio; Martins, António; Ramos, Pedro; Esmerado, Joaquim; Costa, Joana Martinho; Almeida, Daniela – Computers in the Schools, 2020
Many university programs include Microsoft Excel courses given their value as a scientific and technical tool. However, evaluating what is effectively learned by students is a challenging task. Considering multiple-choice written exams are a standard evaluation format, this study aimed to uncover the features influencing students' success in…
Descriptors: Multiple Choice Tests, Test Items, Spreadsheets, Computer Software
Wikle, Jocelyn S.; West, Richard E. – International Journal on E-Learning, 2019
Each year students spend thousands of hours participating in discussion forums. Yet there has not been sufficient scholarship on the observable effects of discussion forums on learning outcomes, and there is not a clear understanding of the role discussion forums play in transmitting knowledge of course concepts to students. This study…
Descriptors: Student Participation, Discussion Groups, Outcomes of Education, Difficulty Level
Lindner, Marlit A.; Schult, Johannes; Mayer, Richard E. – Journal of Educational Psychology, 2022
This classroom experiment investigates the effects of adding representational pictures to multiple-choice and constructed-response test items to understand the role of the response format for the multimedia effect in testing. Participants were 575 fifth- and sixth-graders who answered 28 science test items--seven items in each of four experimental…
Descriptors: Elementary School Students, Grade 5, Grade 6, Multimedia Materials
Yeager, Rebecca; Meyer, Zachary – International Journal of Listening, 2022
This study investigates the effects of adding stem preview to an English for Academic Purposes (EAP) multiple-choice listening assessment. In stem preview, listeners may view the item stems, but not response options, before listening. Previous research indicates that adding preview to an exam typically decreases difficulty, but raises concerns…
Descriptors: English for Academic Purposes, Second Language Learning, Second Language Instruction, Teaching Methods
Susanti, Yuni; Tokunaga, Takenobu; Nishikawa, Hitoshi; Obari, Hiroyuki – Research and Practice in Technology Enhanced Learning, 2017
The present study investigates the best factor for controlling the item difficulty of multiple-choice English vocabulary questions generated by an automatic question generation system. Three factors are considered for controlling item difficulty: (1) reading passage difficulty, (2) semantic similarity between the correct answer and distractors,…
Descriptors: Test Items, Difficulty Level, Computer Assisted Testing, Vocabulary Development
Eckerly, Carol; Smith, Russell; Sowles, John – Practical Assessment, Research & Evaluation, 2018
The Discrete Option Multiple Choice (DOMC) item format was introduced by Foster and Miller (2009) with the intent of improving the security of test content. However, by changing the amount and order of the content presented, the test taking experience varies by test taker, thereby introducing potential fairness issues. In this paper we…
Descriptors: Culture Fair Tests, Multiple Choice Tests, Testing, Test Items