Publication Date
In 2025 | 1 |
Since 2024 | 10 |
Since 2021 (last 5 years) | 43 |
Since 2016 (last 10 years) | 85 |
Since 2006 (last 20 years) | 119 |
Descriptor
Test Items | 121 |
Elementary School Students | 68 |
Grade 3 | 65 |
Difficulty Level | 42 |
Grade 2 | 41 |
Mathematics Tests | 40 |
Item Response Theory | 37 |
Test Construction | 36 |
Grade 4 | 32 |
Kindergarten | 30 |
Test Reliability | 29 |
More ▼ |
Source
Author
Alonzo, Julie | 12 |
Tindal, Gerald | 12 |
Anderson, Daniel | 9 |
Schoen, Robert C. | 9 |
Park, Bitnara Jasmine | 8 |
Irvin, P. Shawn | 7 |
Joshua B. Gilbert | 5 |
Luke W. Miratrix | 5 |
Saven, Jessica L. | 4 |
Bauduin, Charity | 3 |
James S. Kim | 3 |
More ▼ |
Publication Type
Education Level
Primary Education | 121 |
Early Childhood Education | 115 |
Elementary Education | 112 |
Grade 3 | 66 |
Grade 2 | 41 |
Intermediate Grades | 35 |
Grade 4 | 32 |
Kindergarten | 30 |
Grade 1 | 25 |
Grade 5 | 25 |
Middle Schools | 24 |
More ▼ |
Audience
Teachers | 2 |
Practitioners | 1 |
Students | 1 |
Location
Florida | 11 |
Germany | 5 |
New York | 5 |
Australia | 4 |
California | 4 |
South Africa | 4 |
China | 3 |
Illinois | 3 |
Maryland | 3 |
North Carolina | 3 |
Turkey | 3 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Wai Kei Chan; Li Zhang; Emily Pey-Tee Oon – International Journal of Assessment Tools in Education, 2023
We report the validity of a test instrument that assesses the arithmetic ability of primary students by (a) describing the theoretical model of arithmetic ability assessment using Wilson's (2004) four building blocks of constructing measures and (b) providing empirical evidence for the validation study. The instrument consists of 21…
Descriptors: Foreign Countries, Elementary School Students, Arithmetic, Grade 3
Sevgi Demirel; Hatice Cetin – International Electronic Journal of Elementary Education, 2023
This study aims to develop a new spatial visualization test (SVT) for second grade primary school students. The study employed the survey design, and the test was developed in accordance with the test development steps. According to the findings obtained as a result of the pilot study, the items were generally high difficulty levels, and they were…
Descriptors: Test Construction, Grade 2, Elementary School Students, Spatial Ability
Qilong Zhang; Weiying Wu; Ke Jiang – European Journal of Teacher Education, 2024
Teacher professional standards are a mechanism to safeguard quality teaching. In the context of Chinese early childhood education (ECE), this study developed a scale for self-assessing teacher competence against professional standards. The study adopted a three-phase design. In Phase 1, in accordance with Professional Standards for Kindergarten…
Descriptors: Standards, Self Evaluation (Individuals), Rating Scales, Preschool Teachers
Sarah Wellberg; Anthony Sparks; Leanne Ketterlin-Geller – Practical Assessment, Research & Evaluation, 2023
The early development of spatial reasoning skills has been linked to future success in mathematics (Wai, Lubinski, & Benbow, 2009), but research to date has mainly focused on the development of these skills within classroom settings rather than at home. The home environment is often the first place students are exposed to, and develop, early…
Descriptors: Test Construction, Test Validity, Measures (Individuals), Surveys
Nikola Ebenbeck; Markus Gebhardt – Journal of Special Education Technology, 2024
Technologies that enable individualization for students have significant potential in special education. Computerized Adaptive Testing (CAT) refers to digital assessments that automatically adjust their difficulty level based on students' abilities, allowing for personalized, efficient, and accurate measurement. This article examines whether CAT…
Descriptors: Computer Assisted Testing, Students with Disabilities, Special Education, Grade 3
Mumba, Brian – Journal on Educational Psychology, 2022
Researchers in educational measurement use Differential Item Functioning (DIF) to examine whether test items are functioning uniquely across subgroups of test participants while taking into account their ability level. DIF is essential for test validity arguments, thus making it a necessary part of validity studies. This study examines DIF across…
Descriptors: Test Bias, Test Items, Gender Differences, Grade 2
Bacon, Terrence E. – ProQuest LLC, 2023
The purpose of this study was to investigate developmental music aptitude with a broader sample in order to propose national norms. Research questions were: 1) To what extent are published Primary Measures of Music Aptitude (PMMA) norms different from those established using a current sample? 2) Are there comparative differences in PMMA item…
Descriptors: Psychometrics, Music, Aptitude Tests, Test Items
Gilbert, Joshua B.; Kim, James S.; Miratrix, Luke W. – Journal of Educational and Behavioral Statistics, 2023
Analyses that reveal how treatment effects vary allow researchers, practitioners, and policymakers to better understand the efficacy of educational interventions. In practice, however, standard statistical methods for addressing heterogeneous treatment effects (HTE) fail to address the HTE that may exist "within" outcome measures. In…
Descriptors: Test Items, Item Response Theory, Computer Assisted Testing, Program Effectiveness
Joshua B. Gilbert; Luke W. Miratrix; Mridul Joshi; Benjamin W. Domingue – Journal of Educational and Behavioral Statistics, 2025
Analyzing heterogeneous treatment effects (HTEs) plays a crucial role in understanding the impacts of educational interventions. A standard practice for HTE analysis is to examine interactions between treatment status and preintervention participant characteristics, such as pretest scores, to identify how different groups respond to treatment.…
Descriptors: Causal Models, Item Response Theory, Statistical Inference, Psychometrics
Kosh, Audra E. – Journal of Applied Testing Technology, 2021
In recent years, Automatic Item Generation (AIG) has increasingly shifted from theoretical research to operational implementation, a shift raising some unforeseen practical challenges. Specifically, generating high-quality answer choices presents several challenges such as ensuring that answer choices blend in nicely together for all possible item…
Descriptors: Test Items, Multiple Choice Tests, Decision Making, Test Construction
Alkis Küçükaydin, Mensure; Akkanat, Çigdem – Problems of Education in the 21st Century, 2022
Computational thinking is recognized as a vital skill related to problem-solving in technological and non-technological fields. The existence of different sub-domains related to this skill has been pointed out. Therefore, there is a need for tools that measure these different sub-domains. Because of its structure that includes different skills,…
Descriptors: Elementary School Students, Thinking Skills, Computation, Tests
Güntay Tasci – Science Insights Education Frontiers, 2024
Developing tools to identify students' misconceptions about basic biology concepts is necessary. Therefore, a two-tier diagnostic test was developed to determine such misconceptions in primary school (3rd-4th Grade) students. The test content includes two-tiered multiple-choice questions addressing common misconceptions found in the literature on…
Descriptors: Science Tests, Biology, Diagnostic Tests, Misconceptions
Dong, Yixiao; Dumas, Denis; Clements, Douglas H.; Day-Hess, Crystal A.; Sarama, Julie – Journal of Psychoeducational Assessment, 2023
Consequential validity (often referred to as "test fairness" in practice) is an essential aspect of educational measurement. This study evaluated the consequential validity of the Research-Based Early Mathematics Assessment (REMA). A sample of 627 children from PreK to second grade was collected using the short form of the REMA. We…
Descriptors: Mathematics Instruction, Mathematics Tests, Item Analysis, Test Items
Ali Türkdogan – Online Submission, 2023
This study was carried out in order to determine how the 3rd grade students of the Department of Elementary Mathematics Education structured their "if and only if propositions". The data were obtained by examining the students' answers given to the midterm exam questions and discussing the solutions with the students in the classroom.…
Descriptors: Mathematics Instruction, Teaching Methods, Difficulty Level, Questioning Techniques
Joshua B. Gilbert; Luke W. Miratrix; Mridul Joshi; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2024
Analyzing heterogeneous treatment effects (HTE) plays a crucial role in understanding the impacts of educational interventions. A standard practice for HTE analysis is to examine interactions between treatment status and pre-intervention participant characteristics, such as pretest scores, to identify how different groups respond to treatment.…
Descriptors: Causal Models, Item Response Theory, Statistical Inference, Psychometrics