Publication Date
In 2025 | 4 |
Since 2024 | 8 |
Since 2021 (last 5 years) | 19 |
Since 2016 (last 10 years) | 34 |
Since 2006 (last 20 years) | 59 |
Descriptor
Difficulty Level | 151 |
Item Analysis | 151 |
Test Construction | 151 |
Test Items | 123 |
Multiple Choice Tests | 49 |
Test Validity | 47 |
Test Reliability | 41 |
Foreign Countries | 34 |
Achievement Tests | 30 |
Statistical Analysis | 27 |
Higher Education | 26 |
More ▼ |
Source
Author
Roid, Gale | 4 |
Hambleton, Ronald K. | 3 |
Lord, Frederic M. | 3 |
Bejar, Isaac I. | 2 |
Benjamin W. Domingue | 2 |
Benson, Jeri | 2 |
Bichi, Ado Abdu | 2 |
Bucak, S. Deniz | 2 |
Bulut, Okan | 2 |
Cook, Linda L. | 2 |
Haladyna, Tom | 2 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 12 |
Teachers | 2 |
Practitioners | 1 |
Location
Nigeria | 4 |
Turkey | 4 |
Australia | 3 |
Canada | 2 |
Florida | 2 |
Georgia | 2 |
Indonesia | 2 |
Taiwan | 2 |
Turkey (Istanbul) | 2 |
Alabama | 1 |
Arizona | 1 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Thompson, Kathryn N. – ProQuest LLC, 2023
It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…
Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores
Thayaamol Upapong; Apantee Poonputta – Educational Process: International Journal, 2025
Background/purpose: The purposes of this research are to develop a reliable and valid assessment tool for measuring systems thinking skills in upper primary students in Thailand and to establish a normative criterion for evaluating their systems thinking abilities based on educational standards. Materials/methods: The study followed a three-phase…
Descriptors: Thinking Skills, Elementary School Students, Measures (Individuals), Foreign Countries
Sophie Langhorne; Nora Uglik-Marucha; Charlotte Broadhurst; Elena Lieven; Amelia Pearson; Silia Vitoratou; Kathy Leadbitter – Journal of Autism and Developmental Disorders, 2025
Tools to measure autism knowledge are needed to assess levels of understanding within particular groups of people and to evaluate whether awareness-raising campaigns or interventions lead to improvements in understanding. Several such measures are in circulation, but, to our knowledge, there are no psychometrically-validated questionnaires that…
Descriptors: Foreign Countries, Autism Spectrum Disorders, Questionnaires, Psychometrics
Roger Young; Emily Courtney; Alexander Kah; Mariah Wilkerson; Yi-Hsin Chen – Teaching of Psychology, 2025
Background: Multiple-choice item (MCI) assessments are burdensome for instructors to develop. Artificial intelligence (AI, e.g., ChatGPT) can streamline the process without sacrificing quality. The quality of AI-generated MCIs and human experts is comparable. However, whether the quality of AI-generated MCIs is equally good across various domain-…
Descriptors: Item Response Theory, Multiple Choice Tests, Psychology, Textbooks
Kevser Arslan; Asli Görgülü Ari – Shanlax International Journal of Education, 2024
This study aimed to develop a valid and reliable multiple-choice achievement test for the subject area of ecology. The study was conducted within the framework of exploratory sequential design based on mixed research methods, and the study group consisted of a total of 250 middle school students studying at the sixth and seventh grade level. In…
Descriptors: Ecology, Science Tests, Test Construction, Multiple Choice Tests
Büsra Kilinç; Mehmet Diyaddin Yasar – Science Insights Education Frontiers, 2024
In this study, it was aimed to develop an achievement test taking into account the subject acquisitions of the sound and properties unit in the sixth-grade science course. In the test development phase, firstly, literature review for the study was conducted. Then, 30 multiple choice questions in align with the subject acquisition in the 2018…
Descriptors: Science Tests, Test Construction, Grade 6, Science Instruction
Joshua B. Gilbert; Luke W. Miratrix; Mridul Joshi; Benjamin W. Domingue – Journal of Educational and Behavioral Statistics, 2025
Analyzing heterogeneous treatment effects (HTEs) plays a crucial role in understanding the impacts of educational interventions. A standard practice for HTE analysis is to examine interactions between treatment status and preintervention participant characteristics, such as pretest scores, to identify how different groups respond to treatment.…
Descriptors: Causal Models, Item Response Theory, Statistical Inference, Psychometrics
Berger, Stéphanie; Verschoor, Angela J.; Eggen, Theo J. H. M.; Moser, Urs – Journal of Educational Measurement, 2019
Calibration of an item bank for computer adaptive testing requires substantial resources. In this study, we investigated whether the efficiency of calibration under the Rasch model could be enhanced by improving the match between item difficulty and student ability. We introduced targeted multistage calibration designs, a design type that…
Descriptors: Simulation, Computer Assisted Testing, Test Items, Difficulty Level
Joshua B. Gilbert; Luke W. Miratrix; Mridul Joshi; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2024
Analyzing heterogeneous treatment effects (HTE) plays a crucial role in understanding the impacts of educational interventions. A standard practice for HTE analysis is to examine interactions between treatment status and pre-intervention participant characteristics, such as pretest scores, to identify how different groups respond to treatment.…
Descriptors: Causal Models, Item Response Theory, Statistical Inference, Psychometrics
Aleyna Altan; Zehra Taspinar Sener – Online Submission, 2023
This research aimed to develop a valid and reliable test to be used to detect sixth grade students' misconceptions and errors regarding the subject of fractions. A misconception diagnostic test has been developed that includes the concept of fractions, different representations of fractions, ordering and comparing fractions, equivalence of…
Descriptors: Diagnostic Tests, Mathematics Tests, Fractions, Misconceptions
Mardiana – Eurasian Journal of Applied Linguistics, 2023
Written inquiries, which are more frequent and have less of a focus on complex thinking, are issues at school. Students are not taught how to respond to questions found in High-Level Thinking Skills (HOTS) tests, hence, their thinking abilities are generally weak. The issue for teachers is that neither they nor anyone else has been able to create…
Descriptors: Skill Development, Thinking Skills, Check Lists, Models
Gio Jay B. Aligway; Jo C. Delos Angeles; Angeli V. Collano; Eljoy P. Barroca; Anna Clarissa D. Aves; Juneflor F. Catubay; Jennifer T. Edjec; Ma. Diana A. Butaya; Sylvester T. Cortes – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2024
Biology education plays a vital role in nurturing the understanding of learners about the intricacy of life. Various efforts have emerged to strengthen learning biological concepts but there were still studies that showed that learners have low mastery in some aspects. To determine how well students understood various biological topics, including…
Descriptors: Validity, Reliability, Taxonomy, Concept Formation
Cifci, Musa; Kaplan, Kadir – Turkish Online Journal of Educational Technology - TOJET, 2020
An achievement test was prepared to determine students' caricature reading skills. In the first draft of the achievement test, 32 test items and four choices were prepared for each question. The item analysis of the data obtained from the pre-application was made and the internal consistency coefficient (KR-20) was calculated as 0.67 for the…
Descriptors: Reading Tests, Achievement Tests, Reading Skills, Literary Devices
Zhang, Lishan; VanLehn, Kurt – Interactive Learning Environments, 2021
Despite their drawback, multiple-choice questions are an enduring feature in instruction because they can be answered more rapidly than open response questions and they are easily scored. However, it can be difficult to generate good incorrect choices (called "distractors"). We designed an algorithm to generate distractors from a…
Descriptors: Semantics, Networks, Multiple Choice Tests, Teaching Methods
Rafi, Ibnu; Retnawati, Heri; Apino, Ezi; Hadiana, Deni; Lydiati, Ida; Rosyada, Munaya Nikma – Pedagogical Research, 2023
This study describes the characteristics of the test and its items used in the national-standardized school examination by applying classical test theory and focusing on the item difficulty, item discrimination, test reliability, and distractor analysis. We analyzed response data of 191 12th graders from one of public senior high schools in…
Descriptors: Foreign Countries, National Competency Tests, Standardized Tests, Mathematics Tests