Publication Date
In 2025 | 2 |
Since 2024 | 6 |
Since 2021 (last 5 years) | 18 |
Since 2016 (last 10 years) | 52 |
Since 2006 (last 20 years) | 93 |
Descriptor
Student Evaluation | 136 |
Test Items | 136 |
Test Validity | 99 |
Test Construction | 72 |
Test Reliability | 58 |
Evaluation Methods | 35 |
Foreign Countries | 25 |
Scores | 25 |
Item Response Theory | 23 |
Psychometrics | 21 |
Multiple Choice Tests | 20 |
More ▼ |
Source
Author
Abedi, Jamal | 3 |
Filby, Nikola N. | 3 |
Breakstone, Joel | 2 |
Clauser, Brian E. | 2 |
Dishaw, Marilyn | 2 |
Erickson, Harley E. | 2 |
Griffin, Noelle | 2 |
Herman, Joan | 2 |
Herman, Joan L. | 2 |
Liu, Ou Lydia | 2 |
Miller, Patrick W. | 2 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 11 |
Teachers | 10 |
Administrators | 9 |
Policymakers | 3 |
Support Staff | 3 |
Community | 2 |
Parents | 2 |
Students | 2 |
Researchers | 1 |
Location
Australia | 3 |
Canada | 3 |
Washington | 3 |
Colorado | 2 |
Georgia | 2 |
Germany | 2 |
Illinois | 2 |
Indonesia | 2 |
Iran | 2 |
Massachusetts | 2 |
New Mexico | 2 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 5 |
Every Student Succeeds Act… | 3 |
No Child Left Behind Act 2001 | 3 |
Rehabilitation Act 1973… | 3 |
Job Training Partnership Act… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Endang Susantini; Yurizka Melia Sari; Prima Vidya Asteria; Muhammad Ilyas Marzuqi – Journal of Education and Learning (EduLearn), 2025
Assessing preservice' higher order thinking skills (HOTS) in science and mathematics is essential. Teachers' HOTS ability is closely related to their ability to create HOTS-type science and mathematics problems. Among various types of HOTS, one is Bloomian HOTS. To facilitate the preservice teacher to create problems in those subjects, an Android…
Descriptors: Content Validity, Mathematics Instruction, Decision Making, Thinking Skills
Pablo Robles-García; Stuart McLean; Jeffrey Stewart; Ji-young Shin; Claudia Helena Sánchez-Gutiérrez – Language Assessment Quarterly, 2024
Recent literature in the field of L2 vocabulary assessment has advocated for the development of written receptive vocabulary tests such as Vocabulary Levels Tests (VLTs) that use: (a) meaning-recall item formats, (b) a minimum of 40 item counts per 1,000-frequency band to improve level estimates, and (c) lemmas (not word-families) as the lexical…
Descriptors: Spanish, Test Validity, Test Construction, Vocabulary Development
Erlina Fatkur Rohmah; Sukarmin; Daru Wahyuningsih – Pegem Journal of Education and Instruction, 2024
The study aimed to analyze the content validation of the STEM-integrated on thermal and transport concept inventory instrument used to measure the problem-solving abilities of high school students. The instrument questions developed amounted to nine description questions. This type of study is development research. The steps in this research are…
Descriptors: Content Validity, Measures (Individuals), Concept Formation, STEM Education
Anani Sarab, Mohammad Reza; Rahmani, Simindokht – International Journal of Language Testing, 2023
Language testing and assessment have grown in popularity and gained significance in the last few decades, and there is a rising need for assessment literate stakeholders in the field of language education. As teachers play a major role in assessing students, there is a need to make sure they have the right level of assessment knowledge and skills…
Descriptors: Language Tests, Literacy, Second Language Learning, Factor Analysis
Malone, Kathy L.; Boone, William J.; Stammen, Andria; Schuchardt, Anita; Ding, Lin; Sabree, Zakee – EURASIA Journal of Mathematics, Science and Technology Education, 2021
Instruments for assessing secondary students' conceptual understanding of core concepts in biology are needed by educational practitioners and researchers alike. Most instruments available for secondary biology (years 9 to 12) focus only on highly specific biological concepts instead of multiple core concepts. This study describes the development…
Descriptors: Measures (Individuals), Test Construction, Construct Validity, Test Reliability
Lyniesha Ward; Fridah Rotich; Jeffrey R. Raker; Regis Komperda; Sachin Nedungadi; Maia Popova – Chemistry Education Research and Practice, 2025
This paper describes the design and evaluation of the Organic chemistry Representational Competence Assessment (ORCA). Grounded in Kozma and Russell's representational competence framework, the ORCA measures the learner's ability to "interpret," "translate," and "use" six commonly used representations of molecular…
Descriptors: Organic Chemistry, Science Tests, Test Construction, Student Evaluation
Simon, Molly N.; Prather, Edward E.; Buxner, Sanlyn R.; Impey, Chris D. – International Journal of Science Education, 2019
The discovery and characterisation of planets orbiting distant stars has shed light on the origin of our own Solar System. It is important that college-level introductory astronomy students have a general understanding of the planet formation process before they are able to draw parallels between extrasolar systems and our own Solar System. In…
Descriptors: Measures (Individuals), Test Validity, Test Reliability, Student Evaluation
Kevin Ackermans; Marjoke Bakker; Pierre Gorissen; Anne-Marieke Loon; Marijke Kral; Gino Camp – Journal of Computer Assisted Learning, 2024
Background: A practical test that measures the information and communication technology (ICT) skills students need for effectively using ICT in primary education has yet to be developed (Oh et al., 2021). This paper reports on the development, validation, and reliability of a test measuring primary school students' ICT skills required for…
Descriptors: Test Construction, Test Validity, Measures (Individuals), Elementary School Students
Karen Leary Duseau – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023
Assessment is a topic of concern to all stakeholders in our educational system. Pattern Based Questions are an assessment tool which is an alternative to the standardized assessment tool, and they are based on generative learning pedagogy, which shows promise in engaging all learners and usefulness in teaching and learning but validity has not yet…
Descriptors: Undergraduate Students, College Mathematics, Mathematics Skills, Thinking Skills
Balta, Nuri; Logman, Paul S. W. M. – Physics Education, 2022
The purpose of this study is to develop a test to assess students' level of counterintuitiveness in basic electric circuits. Data from four samples were gathered and used to develop and validate the counterintuitive basic electric circuit test (CBECT). The initial version of the CBECT was administered to the first sample and data collected from…
Descriptors: Science Tests, Test Construction, Student Evaluation, Intuition
Thomas Bickerton, Robert; Sangwin, Chris J. – International Journal of Mathematical Education in Science and Technology, 2022
We discuss a practical method for assessing mathematical proof online. We examine the use of faded worked examples and reading comprehension questions to understand proof. By breaking down a given proof, we formulate a checklist that can be used to generate comprehension questions which can be assessed automatically online. We then provide some…
Descriptors: Mathematics Instruction, Validity, Mathematical Logic, Evaluation Methods
Mamolo, Leo A. – Anatolian Journal of Education, 2021
The first batch of graduates in the country under the K-12 curriculum graduated in 2018. Thus, a call for an evaluation of students' acquired competency is essential. That's why, there is a need for the construction of assessment tools. In this study, a valid, reliable, and item quality achievement test in General Mathematics was developed. Eight…
Descriptors: Test Construction, Achievement Tests, Mathematics Achievement, Student Evaluation
Yaneva, Victoria; Clauser, Brian E.; Morales, Amy; Paniagua, Miguel – Journal of Educational Measurement, 2021
Eye-tracking technology can create a record of the location and duration of visual fixations as a test-taker reads test questions. Although the cognitive process the test-taker is using cannot be directly observed, eye-tracking data can support inferences about these unobserved cognitive processes. This type of information has the potential to…
Descriptors: Eye Movements, Test Validity, Multiple Choice Tests, Cognitive Processes
Tim Jacobbe; Bob delMas; Brad Hartlaub; Jeff Haberstroh; Catherine Case; Steven Foti; Douglas Whitaker – Numeracy, 2023
The development of assessments as part of the funded LOCUS project is described. The assessments measure students' conceptual understanding of statistics as outlined in the GAISE PreK-12 Framework. Results are reported from a large-scale administration to 3,430 students in grades 6 through 12 in the United States. Items were designed to assess…
Descriptors: Statistics Education, Common Core State Standards, Student Evaluation, Elementary School Students
Shu-Fen Lin; Wan-Chin Shie – International Journal of Science and Mathematics Education, 2024
Teachers lack effective curriculum-based instruments to assess their students' scientific competence that would provide information for modifying their inquiry instruction. The main purpose of this study was to develop and validate a Curriculum-Based Scientific Competence (CBSC) test to assess students' scientific competence in a 1-semester Grade…
Descriptors: Science Curriculum, Validity, Grade 9, Science Tests