Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 14 |
Since 2006 (last 20 years) | 23 |
Descriptor
Item Analysis | 35 |
Multiple Choice Tests | 35 |
Psychometrics | 35 |
Test Items | 20 |
Test Construction | 17 |
Foreign Countries | 15 |
Test Reliability | 14 |
Test Validity | 13 |
Achievement Tests | 10 |
Difficulty Level | 9 |
Item Response Theory | 8 |
More ▼ |
Source
Author
Baghaei, Purya | 2 |
Gierl, Mark J. | 2 |
Adeleke, A. A. | 1 |
Ahmad, Mazalah | 1 |
Akarsu, Bayram | 1 |
Apino, Ezi | 1 |
Barniol, Pablo | 1 |
Bauer, Daniel | 1 |
Bolat, Mualla | 1 |
Boulais, André-Philippe | 1 |
Bulut, Okan | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 11 |
Secondary Education | 10 |
Postsecondary Education | 9 |
High Schools | 7 |
Grade 12 | 3 |
Elementary Education | 2 |
Middle Schools | 2 |
Elementary Secondary Education | 1 |
Grade 10 | 1 |
Grade 5 | 1 |
Grade 9 | 1 |
More ▼ |
Audience
Teachers | 2 |
Administrators | 1 |
Counselors | 1 |
Laws, Policies, & Programs
National Defense Education Act | 1 |
Assessments and Surveys
Comprehensive Tests of Basic… | 1 |
Embedded Figures Test | 1 |
General Educational… | 1 |
What Works Clearinghouse Rating
Baghaei, Purya; Christensen, Karl Bang – Language Testing, 2023
C-tests are gap-filling tests mainly used as rough and economical measures of second-language proficiency for placement and research purposes. A C-test usually consists of several short independent passages where the second half of every other word is deleted. Owing to their interdependent structure, C-test items violate the local independence…
Descriptors: Item Response Theory, Language Tests, Language Proficiency, Second Language Learning
Musa Adekunle Ayanwale – Discover Education, 2023
Examination scores obtained by students from the West African Examinations Council (WAEC), and National Business and Technical Examinations Board (NABTEB) may not be directly comparable due to differences in examination administration, item characteristics of the subject in question, and student abilities. For more accurate comparisons, scores…
Descriptors: Equated Scores, Mathematics Tests, Test Items, Test Format
Slepkov, A. D.; Van Bussel, M. L.; Fitze, K. M.; Burr, W. S. – SAGE Open, 2021
There is a broad literature in multiple-choice test development, both in terms of item-writing guidelines, and psychometric functionality as a measurement tool. However, most of the published literature concerns multiple-choice testing in the context of expert-designed high-stakes standardized assessments, with little attention being paid to the…
Descriptors: Foreign Countries, Undergraduate Students, Student Evaluation, Multiple Choice Tests
Nurussaniah Nurussaniah; Punaji Setyosari; Dedi Kuswandi; Saida Ulfa – Journal of Baltic Science Education, 2025
The accurate assessment of analytical thinking in physics, particularly in magnetism, poses substantial challenges due to the limitations of conventional tools in measuring higher-order cognitive skills. This study aimed to validate an analytical skills test in physics, based on Bloom's Revised Taxonomy, with an emphasis on the dimensions of…
Descriptors: Physics, Science Tests, Science Instruction, Thinking Skills
Deribo, Tobias; Goldhammer, Frank; Kroehne, Ulf – Educational and Psychological Measurement, 2023
As researchers in the social sciences, we are often interested in studying not directly observable constructs through assessments and questionnaires. But even in a well-designed and well-implemented study, rapid-guessing behavior may occur. Under rapid-guessing behavior, a task is skimmed shortly but not read and engaged with in-depth. Hence, a…
Descriptors: Reaction Time, Guessing (Tests), Behavior Patterns, Bias
Rafi, Ibnu; Retnawati, Heri; Apino, Ezi; Hadiana, Deni; Lydiati, Ida; Rosyada, Munaya Nikma – Pedagogical Research, 2023
This study describes the characteristics of the test and its items used in the national-standardized school examination by applying classical test theory and focusing on the item difficulty, item discrimination, test reliability, and distractor analysis. We analyzed response data of 191 12th graders from one of public senior high schools in…
Descriptors: Foreign Countries, National Competency Tests, Standardized Tests, Mathematics Tests
Chiavaroli, Neville – Practical Assessment, Research & Evaluation, 2017
Despite the majority of MCQ writing guides discouraging the use of negatively-worded multiple choice questions (NWQs), they continue to be regularly used both in locally produced examinations and commercially available questions. There are several reasons why the use of NWQs may prove resistant to sound pedagogical advice. Nevertheless, systematic…
Descriptors: Multiple Choice Tests, Test Construction, Test Items, Validity
Lahner, Felicitas-Maria; Lörwald, Andrea Carolin; Bauer, Daniel; Nouns, Zineb Miriam; Krebs, René; Guttormsen, Sissel; Fischer, Martin R.; Huwendiek, Sören – Advances in Health Sciences Education, 2018
Multiple true-false (MTF) items are a widely used supplement to the commonly used single-best answer (Type A) multiple choice format. However, an optimal scoring algorithm for MTF items has not yet been established, as existing studies yielded conflicting results. Therefore, this study analyzes two questions: What is the optimal scoring algorithm…
Descriptors: Scoring Formulas, Scoring Rubrics, Objective Tests, Multiple Choice Tests
Gierl, Mark J.; Bulut, Okan; Guo, Qi; Zhang, Xinxin – Review of Educational Research, 2017
Multiple-choice testing is considered one of the most effective and enduring forms of educational assessment that remains in practice today. This study presents a comprehensive review of the literature on multiple-choice testing in education focused, specifically, on the development, analysis, and use of the incorrect options, which are also…
Descriptors: Multiple Choice Tests, Difficulty Level, Accuracy, Error Patterns
Krell, Moritz; Mathesius, Sabrina; van Driel, Jan; Vergara, Claudia; Krüger, Dirk – International Journal of Science Education, 2020
Scientific reasoning competencies are relevant science competencies and therefore the development of assessment instruments for scientific reasoning competencies has become an integral part of science education research. However, some authors have questioned the validity of the instruments available so far, since their psychometric quality has not…
Descriptors: Preservice Teachers, Science Teachers, Science Instruction, Psychometrics
Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André – Applied Measurement in Education, 2016
Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…
Descriptors: Psychometrics, Multiple Choice Tests, Test Items, Item Analysis
Park, Mihwa; Liu, Xiufeng – Science Education, 2016
Energy is one of the most central and richly connected ideas across all science disciplines. The purpose of this study was to develop a measurement instrument for assessing students' understanding of the energy concept within and across different science disciplines. To achieve this goal, the Inter-Disciplinary Energy concept Assessment (IDEA) was…
Descriptors: Energy, Energy Education, Concept Teaching, Scientific Concepts
Sözen, Merve; Bolat, Mualla – Journal of Education and Learning, 2016
The purpose of this study is to develop an achievement test which includes the basic concepts about the subject of sound and its properties in middle school science lessons and which at the same time aims to reveal the alternative concepts that the students already have. During the process of the development of the test, studies in the field and…
Descriptors: Achievement Tests, Science Education, Acoustics, Test Construction
Osadebe, P. U. – World Journal of Education, 2014
The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…
Descriptors: Achievement Tests, Economics Education, Student Evaluation, Test Construction
Baghaei, Purya; Dourakhshan, Alireza – International Journal of Language Testing, 2016
The purpose of the present study is to compare the psychometric qualities of canonical single-response multiple-choice items with their double-response counterparts. Thirty, two-response fouroption grammar items for undergraduate students of English were constructed. A second version of the test was constructed by replacing one of the correct…
Descriptors: Language Tests, Multiple Choice Tests, Test Items, Factor Analysis