Publication Date
| In 2026 | 0 |
| Since 2025 | 13 |
| Since 2022 (last 5 years) | 39 |
| Since 2017 (last 10 years) | 130 |
| Since 2007 (last 20 years) | 179 |
Descriptor
| Science Tests | 211 |
| Test Reliability | 211 |
| Test Validity | 146 |
| Foreign Countries | 96 |
| Test Construction | 90 |
| Test Items | 82 |
| Multiple Choice Tests | 54 |
| Scientific Concepts | 52 |
| Science Instruction | 46 |
| Difficulty Level | 41 |
| Item Response Theory | 40 |
Author
| Bao, Lei | 5 |
| Xiao, Yang | 5 |
| Conoyer, Sarah J. | 4 |
| Han, Jing | 4 |
| Koenig, Kathleen | 4 |
| Barniol, Pablo | 3 |
| Ford, Jeremy W. | 3 |
| Hosp, John L. | 3 |
| Prasetyo, Zuhdan Kun | 3 |
| Sachin Nedungadi | 3 |
| Zavala, Genaro | 3 |
Audience
| Researchers | 7 |
| Practitioners | 4 |
| Teachers | 3 |
| Administrators | 1 |
Location
| Indonesia | 25 |
| Turkey | 14 |
| Germany | 7 |
| Nebraska | 5 |
| United Kingdom (England) | 4 |
| Australia | 3 |
| Canada | 3 |
| India | 3 |
| Iran | 3 |
| Israel | 3 |
| Malaysia | 3 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
| Elementary and Secondary… | 1 |
Meryem Konu Kadirhanogullari; Esra Özay Köse – Science Insights Education Frontiers, 2025
This study aims to develop a valid and reliable achievement test in accordance with the content framework of the 9th-grade Biology Course Curriculum published within the scope of the Turkish Century Maarif Model on the subject of "Organic Matter". The screening method was used for this purpose. The sample of the study consists of 258…
Descriptors: Science Tests, Test Construction, Grade 9, Biology
Ntumi, Simon; Agbenyo, Sheilla; Bulala, Tapela – Shanlax International Journal of Education, 2023
There is no need or point to testing of knowledge, attributes, traits, behaviours or abilities of an individual if information obtained from the test is inaccurate. However, by and large, it seems the estimation of psychometric properties of test items in classrooms has been completely ignored otherwise dying slowly in most testing environments. In…
Descriptors: Psychometrics, Accuracy, Test Validity, Factor Analysis
Raudlah Melinda Sidik; Ana Ratna Wulan; K. Kusnadi – Journal of Biological Education Indonesia (Jurnal Pendidikan Biologi Indonesia), 2025
The research developed and validated EKSAI (Epistemic Knowledge Science Assessment Instrument), an assessment tool for epistemic knowledge in science education. The background is that 21st-century challenges demand a transformation in science education, with a focus on understanding how scientific knowledge is developed and evaluated, which is…
Descriptors: Science Tests, Knowledge Level, Biology, Test Validity
Yuriko K. Sosa Paredes; Björn Andersson – Educational Assessment, Evaluation and Accountability, 2025
In international large-scale assessments, student performance comparisons across educational systems are frequently done to assess the state and development in different domains. These results often have a large impact on educational policy and on the perceptions of an educational system's performance. Early assessments, such as the First and…
Descriptors: Test Interpretation, International Assessment, Science Tests, Scores
E. B. Merki; S. I. Hofer; A. Vaterlaus; A. Lichtenberger – Physical Review Physics Education Research, 2025
When describing motion in physics, the selection of a frame of reference is crucial. The graph of a moving object can look quite different based on the frame of reference. In recent years, various tests have been developed to assess the interpretation of kinematic graphs, but none of these tests have specifically addressed differences in reference…
Descriptors: Graphs, Motion, Physics, Secondary School Students
Conoyer, Sarah J.; Therrien, William J.; White, Kristen K. – Assessment for Effective Intervention, 2022
Meta-analysis was used to examine curriculum-based measurement in the content areas of social studies and science. Nineteen studies between the years of 1998 and 2020 were reviewed to determine overall mean correlation for criterion validity and examine alternate-form reliability and slope coefficients. An overall mean correlation of 0.59 was…
Descriptors: Curriculum Based Assessment, Test Validity, Test Reliability, Science Tests
Grace C. Tetschner; Sachin Nedungadi – Chemistry Education Research and Practice, 2025
Many undergraduate chemistry students hold alternate conceptions related to resonance--an important and fundamental topic of organic chemistry. To help address these alternate conceptions, an organic chemistry instructor could administer the resonance concept inventory (RCI), which is a multiple-choice assessment that was designed to identify…
Descriptors: Scientific Concepts, Concept Formation, Item Response Theory, Scores
Achmad Rante Suparman; Eli Rohaeti; Sri Wening – Journal on Efficiency and Responsibility in Education and Science, 2024
This study focuses on developing a five-tier chemical diagnostic test based on a computer-based test with 11 assessment categories with an assessment score from 0 to 10. A total of 20 items produced were validated by education experts, material experts, measurement experts, and media experts, and an average index of the Aiken test > 0.70 was…
Descriptors: Chemistry, Diagnostic Tests, Computer Assisted Testing, Credits
Yalinkilic, Funda; Gul, Seyda – Science Insights Education Frontiers, 2023
The aim of this study is to develop a valid and reliable achievement test on the subject of 'Basic Compounds in the Structure of Living Things'. During the preparation of the draft form of the test, a 32 item-question pool was created by the researchers in the light of the relevant literature. Then, these questions were presented to expert opinion…
Descriptors: Test Construction, Science Achievement, Science Tests, Test Validity
Ruying Li; Gaofeng Li – International Journal of Science and Mathematics Education, 2025
Systems thinking (ST) is an essential competence for future life and biology learning. Appropriate assessment is critical for collecting sufficient information to develop ST in biology education. This research offers an ST framework based on a comprehensive understanding of biological systems, encompassing four skills across three complexity…
Descriptors: Test Construction, Test Validity, Science Tests, Cognitive Tests
Lyniesha Ward; Fridah Rotich; Jeffrey R. Raker; Regis Komperda; Sachin Nedungadi; Maia Popova – Chemistry Education Research and Practice, 2025
This paper describes the design and evaluation of the Organic chemistry Representational Competence Assessment (ORCA). Grounded in Kozma and Russell's representational competence framework, the ORCA measures the learner's ability to "interpret," "translate," and "use" six commonly used representations of molecular…
Descriptors: Organic Chemistry, Science Tests, Test Construction, Student Evaluation
Mensure Alkis Küçükaydin; Elçin Ayaz – International Journal of Science and Mathematics Education, 2025
Scientific reasoning competencies (SRC) are an area of competence emphasized in science education and are considered essential in the world of 21st Century skills. Developing these competencies is important for all levels of education, from primary school to university. However, to accurately measure them, measurement tools with validity and…
Descriptors: Science Tests, Cognitive Tests, Scientific Literacy, Thinking Skills
Sachin Nedungadi; Corina E. Brown; Sue Hyeon Paek – Journal of Chemical Education, 2022
The Fundamental Concepts for Organic Reaction Mechanisms Inventory (FC-ORMI) is a concept inventory with most items in a two-tier design in which an answer tier is followed by a reasoning tier. Statistical results provided strong evidence for the validity and reliability of the data obtained using the FC-ORMI. In this study, differential item…
Descriptors: Test Bias, Test Validity, Test Reliability, Gender Differences
Testing Anatomy: Dissecting Spatial and Non-Spatial Knowledge in Multiple-Choice Question Assessment
Julie Dickson; Darren J. Shaw; Andrew Gardiner; Susan Rhind – Anatomical Sciences Education, 2024
Limited research has been conducted on the spatial ability of veterinary students and how this is evaluated within anatomy assessments. This study describes the creation and evaluation of a split design multiple-choice question (MCQ) assessment (totaling 30 questions divided into 15 non-spatial MCQs and 15 spatial MCQs). Two cohorts were tested,…
Descriptors: Anatomy, Spatial Ability, Multiple Choice Tests, Factor Analysis
Irmak, Meltem; Inaltun, Hüseyin; Ercan-Dursun, Jale; Yanis-Kelleci, Hilal; Yuruk, Nejla – International Journal of Science and Mathematics Education, 2023
The purpose of this study is twofold: (1) to develop and validate a three-tier diagnostic test on work, power, and energy (WPE) concepts and (2) to identify Turkish pre-service science teachers' conceptual understanding through this test. The Work, Power, and Energy Concept Test (WPECT) was developed through interviews, read-aloud sessions, and…
Descriptors: Diagnostic Tests, Preservice Teachers, Science Teachers, Science Tests

