Publication Date
| In 2026 | 0 |
| Since 2025 | 1 |
| Since 2022 (last 5 years) | 2 |
| Since 2017 (last 10 years) | 12 |
| Since 2007 (last 20 years) | 28 |
Descriptor
| Correlation | 52 |
| Multiple Choice Tests | 52 |
| Test Reliability | 36 |
| Test Validity | 25 |
| Foreign Countries | 19 |
| Test Construction | 16 |
| Scoring | 14 |
| Reliability | 13 |
| Test Items | 13 |
| Scores | 12 |
| Difficulty Level | 10 |
Author
| Frary, Robert B. | 2 |
| Frisbie, David A. | 2 |
| Hendrickson, Gerry F. | 2 |
| Steedle, Jeffrey T. | 2 |
| Alsma, Jelmer | 1 |
| Anatri Desstya | 1 |
| Anderson, Paul S. | 1 |
| Arth, Thomas O. | 1 |
| Attali, Yigal | 1 |
| Banta, Trudy W. | 1 |
| Beddow, Peter A. | 1 |
Education Level
| Higher Education | 14 |
| Postsecondary Education | 14 |
| Secondary Education | 8 |
| Elementary Education | 4 |
| High Schools | 4 |
| Grade 8 | 3 |
| Middle Schools | 3 |
| Junior High Schools | 2 |
| Elementary Secondary Education | 1 |
| Grade 11 | 1 |
| Grade 5 | 1 |
Audience
| Researchers | 2 |
| Practitioners | 1 |
| Teachers | 1 |
Location
| Germany | 3 |
| Iran | 3 |
| Canada | 2 |
| Netherlands | 2 |
| Turkey | 2 |
| Arizona | 1 |
| Finland | 1 |
| Indonesia | 1 |
| Japan | 1 |
| Montana | 1 |
| Pennsylvania | 1 |
Anatri Desstya; Ika Candra Sayekti; Muhammad Abduh; Sukartono – Journal of Turkish Science Education, 2025
This study aimed to develop a standardised instrument for diagnosing science misconceptions in primary school children. Following a developmental research approach using the 4-D model (Define, Design, Develop, Disseminate), 100 four-tier multiple choice items were constructed. Content validity was established through expert evaluation by six…
Descriptors: Test Construction, Science Tests, Science Instruction, Diagnostic Tests
Slepkov, A. D.; Van Bussel, M. L.; Fitze, K. M.; Burr, W. S. – SAGE Open, 2021
There is a broad literature in multiple-choice test development, both in terms of item-writing guidelines, and psychometric functionality as a measurement tool. However, most of the published literature concerns multiple-choice testing in the context of expert-designed high-stakes standardized assessments, with little attention being paid to the…
Descriptors: Foreign Countries, Undergraduate Students, Student Evaluation, Multiple Choice Tests
Kotoka, Love; Kriek, Jeanne – Journal of Baltic Science Education, 2022
Learners underperform in stoichiometry as they lack conceptual reasoning of the underlying concepts and the ability to solve stoichiometric problems. Therefore, it was necessary to determine if there is a statistical correlation between problem-solving skills and conceptual reasoning in stoichiometry and if so, whether one can significantly…
Descriptors: Prediction, Correlation, Science Instruction, Chemistry
Papenberg, Martin; Musch, Jochen – Applied Measurement in Education, 2017
In multiple-choice tests, the quality of distractors may be more important than their number. We therefore examined the joint influence of distractor quality and quantity on test functioning by providing a sample of 5,793 participants with five parallel test sets consisting of items that differed in the number and quality of distractors.…
Descriptors: Multiple Choice Tests, Test Items, Test Validity, Test Reliability
Yazdinejad, Anoushe; Zeraatpishe, Mitra – International Journal of Language Testing, 2019
In this study the validity of partial dictation as a measure of overall language proficiency was examined. Two partial dictation tests along with a C-Test, a cloze test, and a reading comprehension test, as criterion measures, were administered to a group of Iranian EFL learners. The coefficients of correlation between partial dictation and…
Descriptors: Test Validity, Verbal Communication, Language Proficiency, Language Tests
Cromley, Jennifer G.; Dai, Ting; Fechter, Tia; Nelson, Frank E.; Van Boekel, Martin; Du, Yang – Grantee Submission, 2021
Making inferences and reasoning with new scientific information is critical for successful performance in biology coursework. Thus, identifying students who are weak in these skills could allow the early provision of additional support and course placement recommendations to help students develop their reasoning abilities, leading to better…
Descriptors: Science Tests, Multiple Choice Tests, Logical Thinking, Inferences
Kalinowski, Steven T.; Willoughby, Shannon – Journal of Research in Science Teaching, 2019
We present a multiple-choice test, the Montana State University Formal Reasoning Test (FORT), to assess college students' scientific reasoning ability. The test defines scientific reasoning to be equivalent to formal operational reasoning. It contains 20 questions divided evenly among five types of problems: control of variables, hypothesis…
Descriptors: Science Tests, Test Construction, Science Instruction, Introductory Courses
Severo, Milton; Gaio, A. Rita; Povo, Ana; Silva-Pereira, Fernanda; Ferreira, Maria Amélia – Anatomical Sciences Education, 2015
In theory the formula scoring methods increase the reliability of multiple-choice tests in comparison with number-right scoring. This study aimed to evaluate the impact of the formula scoring method in clinical anatomy multiple-choice examinations, and to compare it with that from the number-right scoring method, hoping to achieve an…
Descriptors: Anatomy, Multiple Choice Tests, Scoring, Decision Making
Zare, Samaneh; Boori, Ali Akbar – International Journal of Language Testing, 2018
In this study, the cloze-elide test was developed and administered under time constraints. This research aimed to examine the validity and reliability of the speeded cloze-elide test and to investigate its relationship with reading comprehension, C-Test, and multiple-choice cloze tests. Processing speed is a vital indicator to distinguish high to…
Descriptors: Cloze Procedure, Timed Tests, Language Tests, English (Second Language)
Steedle, Jeffrey T.; Ferrara, Steve – Applied Measurement in Education, 2016
As an alternative to rubric scoring, comparative judgment generates essay scores by aggregating decisions about the relative quality of the essays. Comparative judgment eliminates certain scorer biases and potentially reduces training requirements, thereby allowing a large number of judges, including teachers, to participate in essay evaluation.…
Descriptors: Essays, Scoring, Comparative Analysis, Evaluators
Krell, Moritz – Cogent Education, 2017
This study evaluates a 12-item instrument for subjective measurement of mental load (ML) and mental effort (ME) by analysing different sources of validity evidence. The findings of an expert judgement (N = 8) provide "evidence based on test content" that the formulation of the items corresponds to the meaning of ML and ME. An empirical…
Descriptors: Cognitive Processes, Test Validity, Secondary School Students, Multiple Choice Tests
Kural, Faruk – Journal of Language and Linguistic Studies, 2018
The present paper, which is a study based on midterm exam results of 53 University English prep-school students, examines correlation between a direct writing test, measured holistically by multiple-trait scoring, and two indirect writing tests used in a competence exam, one of which is a multiple-choice cloze test and the other a rewrite test…
Descriptors: Writing Evaluation, Cloze Procedure, Comparative Analysis, Essays
Sener, Nilay; Tas, Erol – Journal of Education and Learning, 2017
The purpose of this study is to prepare a multiple-choice achievement test with high reliability and validity for the "Let's Solve the Puzzle of Our Body" unit. For this purpose, a multiple choice achievement test consisting of 46 items was applied to 178 fifth grade students in total. As a result of the test and material analysis…
Descriptors: Achievement Tests, Grade 5, Science Instruction, Biology
Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017
The subject of the research is to build methodologies to evaluate the student knowledge by testing. The author points to the importance of feedback about the mastering level in the learning process. Testing is considered as a tool. The object of the study is to create the test system models for defence practice problems. Special attention is paid…
Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation
Hauser, Peter C.; Paludneviciene, Raylene; Riddle, Wanda; Kurz, Kim B.; Emmorey, Karen; Contreras, Jessica – Journal of Deaf Studies and Deaf Education, 2016
The American Sign Language Comprehension Test (ASL-CT) is a 30-item multiple-choice test that measures ASL receptive skills and is administered through a website. This article describes the development and psychometric properties of the test based on a sample of 80 college students including deaf native signers, hearing native signers, deaf…
Descriptors: American Sign Language, Comprehension, Multiple Choice Tests, Receptive Language

