Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 8 |
| Since 2017 (last 10 years) | 20 |
| Since 2007 (last 20 years) | 50 |
Descriptor
| Multiple Choice Tests | 72 |
| Reliability | 72 |
| Validity | 33 |
| Foreign Countries | 24 |
| Statistical Analysis | 19 |
| Test Items | 14 |
| Comparative Analysis | 13 |
| Correlation | 13 |
| Science Tests | 13 |
| Item Analysis | 11 |
| Psychometrics | 11 |
| More ▼ | |
Source
Author
| Alonzo, Julie | 2 |
| Anderson, Daniel | 2 |
| Attali, Yigal | 2 |
| Jamgochian, Elisa | 2 |
| Lai, Cheng-Fei | 2 |
| Nese, Joseph F. T. | 2 |
| Saez, Leilani | 2 |
| Tindal, Gerald | 2 |
| Ahmed Yaqinuddin | 1 |
| Ait bentaleb, Khalid | 1 |
| Alfa, Ahmadu S. | 1 |
| More ▼ | |
Publication Type
| Reports - Research | 59 |
| Journal Articles | 56 |
| Reports - Evaluative | 10 |
| Tests/Questionnaires | 8 |
| Speeches/Meeting Papers | 7 |
| Numerical/Quantitative Data | 2 |
| Reports - Descriptive | 1 |
Education Level
| Secondary Education | 18 |
| Higher Education | 15 |
| Postsecondary Education | 13 |
| High Schools | 10 |
| Elementary Education | 9 |
| Middle Schools | 6 |
| Grade 11 | 5 |
| Elementary Secondary Education | 4 |
| Grade 8 | 4 |
| Junior High Schools | 4 |
| Grade 4 | 2 |
| More ▼ | |
Audience
| Researchers | 2 |
Location
| Turkey | 6 |
| Canada | 2 |
| Nigeria | 2 |
| Pennsylvania | 2 |
| Arizona | 1 |
| China | 1 |
| Colorado | 1 |
| Finland | 1 |
| Florida | 1 |
| Germany | 1 |
| Greece | 1 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Claude, ChatGPT, Copilot, and Gemini Performance versus Students in Different Topics of Neuroscience
Volodymyr Mavrych; Ahmed Yaqinuddin; Olena Bolgova – Advances in Physiology Education, 2025
Despite extensive studies on large language models and their capability to respond to questions from various licensed exams, there has been limited focus on employing chatbots for specific subjects within the medical curriculum, specifically medical neuroscience. This research compared the performances of Claude 3.5 Sonnet (Anthropic), GPT-3.5 and…
Descriptors: Artificial Intelligence, Computer Software, Neurosciences, Medical Education
Volfson, Alexander; Eshach, Haim; Ben-Abu, Yuval – Physical Review Physics Education Research, 2021
Science knowledge is reflected in mental models that students tend to form when dealing with science phenomena. One way to identify students' mental models about scientific concepts is the use of diagnostic tests (inventories). Even though several statistical approaches and tools intended for the analysis of such inventories' results exist in the…
Descriptors: Schemata (Cognition), Diagnostic Tests, Scientific Concepts, Multiple Choice Tests
Thayaamol Upapong; Apantee Poonputta – Educational Process: International Journal, 2025
Background/purpose: The purposes of this research are to develop a reliable and valid assessment tool for measuring systems thinking skills in upper primary students in Thailand and to establish a normative criterion for evaluating their systems thinking abilities based on educational standards. Materials/methods: The study followed a three-phase…
Descriptors: Thinking Skills, Elementary School Students, Measures (Individuals), Foreign Countries
Kevser Arslan; Asli Görgülü Ari – Shanlax International Journal of Education, 2024
This study aimed to develop a valid and reliable multiple-choice achievement test for the subject area of ecology. The study was conducted within the framework of exploratory sequential design based on mixed research methods, and the study group consisted of a total of 250 middle school students studying at the sixth and seventh grade level. In…
Descriptors: Ecology, Science Tests, Test Construction, Multiple Choice Tests
Shan Lin; Jian Wang – Journal of Baltic Science Education, 2024
Scientific thinking constitutes a vital component of scientific competencies, crucial for citizens to adapt to the evolving societal landscape. To cultivate students' scientific thinking, teachers should possess an adequate professional knowledge foundation, which encompasses pedagogical content knowledge (PCK). Assessing teachers' PCK of…
Descriptors: Secondary School Teachers, Teacher Attitudes, Biology, Pedagogical Content Knowledge
Deribo, Tobias; Goldhammer, Frank; Kroehne, Ulf – Educational and Psychological Measurement, 2023
As researchers in the social sciences, we are often interested in studying not directly observable constructs through assessments and questionnaires. But even in a well-designed and well-implemented study, rapid-guessing behavior may occur. Under rapid-guessing behavior, a task is skimmed shortly but not read and engaged with in-depth. Hence, a…
Descriptors: Reaction Time, Guessing (Tests), Behavior Patterns, Bias
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
Ait bentaleb, Khalid; Dachraoui, Saddik; Hassouni, Taoufik; Alibrahmi, El mehdi; Chakir, Elmahjoub; Belboukhari, Aimad – European Journal of Educational Research, 2022
We developed a Quantum Mechanics Conceptual Understanding Survey (QMCUS) in this study. The survey was conducted using a quantitative methodology. A multiple-choice survey of 35 questions was administered to 338 undergraduate students. Three experienced quantum mechanics instructors examined the validity of the survey. The reliability of our…
Descriptors: Scientific Concepts, Concept Formation, Physics, Undergraduate Students
Lu, Huanhuan; Jiang, Yanxia; Bi, Hualin – Chemistry Education Research and Practice, 2020
The galvanic cell is a basic concept in electrochemistry. To assess mainland Chinese students' proficiency levels in galvanic cells, the Galvanic Cell Proficiency Level Assessment (GCPA) was developed based on the Rasch model. The GCPA was developed through a pilot test and consists of seven multiple-choice questions and four open questions. The…
Descriptors: Measurement Techniques, Science Instruction, High School Students, Grade 11
Pelánek, Radek; Effenberger, Tomáš; Kukucka, Adam – Journal of Educational Data Mining, 2022
We study the automatic identification of educational items worthy of content authors' attention. Based on the results of such analysis, content authors can revise and improve the content of learning environments. We provide an overview of item properties relevant to this task, including difficulty and complexity measures, item discrimination, and…
Descriptors: Item Analysis, Identification, Difficulty Level, Case Studies
Kotoka, Love; Kriek, Jeanne – Journal of Baltic Science Education, 2022
Learners underperform in stoichiometry as they lack conceptual reasoning of the underlying concepts and the ability to solve stoichiometric problems. Therefore, it was necessary to determine if there is a statistical correlation between problem-solving skills and conceptual reasoning in stoichiometry and if so, whether one can significantly…
Descriptors: Prediction, Correlation, Science Instruction, Chemistry
Putranta, Himawan; Supahar – Online Submission, 2019
This paper is based on the background of the problem of the low high order thinking skills in students, especially in the skills to think creatively and conceptual understanding. Conceptual understanding that students have in relation to physics learning material has an important role in developing students' high order thinking skills in solving…
Descriptors: Physics, Science Tests, Thinking Skills, Concept Formation
Sieke, Scott A.; McIntosh, Betsy B.; Steele, Matthew M.; Knight, Jennifer K. – CBE - Life Sciences Education, 2019
Understanding student ideas in large-enrollment biology courses can be challenging, because easy-to-administer multiple-choice questions frequently do not fully capture the diversity of student ideas. As part of the Automated Analysis of Constructed Responses (AACR) project, we designed a question prompting students to describe the possible…
Descriptors: Genetics, Scientific Concepts, Biology, Science Instruction
Tan, Kim Chwee Daniel; Taber, Keith S.; Liew, Yong Qiang; Teo, Kay Liang Alan – Chemistry Education Research and Practice, 2019
The internet is prevalent in society today, and user-friendly web-based productivity tools are readily available for developing diagnostic instruments. This study sought to determine the affordances of a web-based diagnostic instrument on ionisation energy (wIEDI) based on the pen-and-paper version, the Ionisation Energy Diagnostic Instrument…
Descriptors: Energy, Secondary School Science, Chemistry, Diagnostic Tests
Türkoguz, Suat – Anatolian Journal of Education, 2020
This study aimed to investigate the item "Response Time Fidelity scores" ("RTFs"), "KuderRichardson Reliability" ("KR[subscript 20]") and "Cronbach's Alpha Reliability" ("alpha") coefficients, calculate "KR[subscript 20]" coefficients with "RTFs" for 30 threshold…
Descriptors: Comparative Analysis, Reaction Time, Multiple Choice Tests, Scores

Peer reviewed
Direct link
