Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 10 |
Since 2016 (last 10 years) | 21 |
Since 2006 (last 20 years) | 31 |
Descriptor
Scores | 31 |
Test Items | 31 |
Elementary School Students | 18 |
Grade 4 | 16 |
Foreign Countries | 15 |
Grade 5 | 15 |
Mathematics Tests | 14 |
Grade 6 | 12 |
Achievement Tests | 10 |
Item Analysis | 9 |
Item Response Theory | 9 |
Publication Type
Reports - Research | 24 |
Journal Articles | 23 |
Reports - Descriptive | 5 |
Numerical/Quantitative Data | 3 |
Dissertations/Theses -… | 2 |
Speeches/Meeting Papers | 1 |
Education Level
Elementary Education | 31 |
Intermediate Grades | 31 |
Middle Schools | 23 |
Grade 4 | 16 |
Grade 5 | 15 |
Grade 6 | 12 |
Junior High Schools | 9 |
Secondary Education | 9 |
Grade 7 | 6 |
High Schools | 5 |
Early Childhood Education | 4 |
Location
Germany | 4 |
Massachusetts | 3 |
Netherlands | 2 |
Ohio | 2 |
Turkey | 2 |
Arkansas | 1 |
Canada | 1 |
Colorado | 1 |
District of Columbia | 1 |
Idaho | 1 |
Illinois | 1 |
What Works Clearinghouse Rating
Does not meet standards | 1 |
Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024
In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…
Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment
Steinmann, Isa; Sánchez, Daniel; van Laar, Saskia; Braeken, Johan – Assessment in Education: Principles, Policy & Practice, 2022
Questionnaire scales that are mixed-worded, i.e. include both positively and negatively worded items, often suffer from issues like low reliability and more complex latent structures than intended. Part of the problem might be that some responders fail to respond consistently to the mixed-worded items. We investigated the prevalence and impact of…
Descriptors: Response Style (Tests), Test Items, Achievement Tests, Foreign Countries
Bulut, Okan; Bulut, Hatice Cigdem; Cormier, Damien C.; Ilgun Dibek, Munevver; Sahin Kursad, Merve – Educational Assessment, 2023
Some statewide testing programs allow students to receive corrective feedback and revise their answers during testing. Despite its pedagogical benefits, the effects of providing revision opportunities remain unknown in the context of alternate assessments. Therefore, this study examined student data from a large-scale alternate assessment that…
Descriptors: Error Correction, Alternative Assessment, Feedback (Response), Multiple Choice Tests
Sarwanto; Fajari, Laksmi Evasufi Widi; Chumdari – Malaysian Journal of Learning and Instruction, 2021
Purpose: This study aimed to examine elementary school students' critical thinking skills and their impact. Methodology: This research was a qualitative case study. The subjects of this study were 29 fifth-grade students and three teachers at an elementary school, chosen by a purposive sampling technique. Data were collected through observation,…
Descriptors: Critical Thinking, Thinking Skills, Skill Development, Correlation
Bianca Böhmer; Gabrielle Wills – Large-scale Assessments in Education, 2025
This paper examines the effect of COVID-19 on learning loss and learning inequality in South Africa using 2016 and 2021 Grade 4 PIRLS datasets. On average, South African Grade 4 reading achievement declined by 31 PIRLS points from 320 in 2016 to 288 in 2021, equivalent to a decline of 0.29 standard deviations or 50-60% of a year of learning. The…
Descriptors: COVID-19, Pandemics, Grade 4, Elementary School Students
Guven Demir, Elif; Öksuz, Yücel – Participatory Educational Research, 2022
This research aimed to investigate animation-based achievement tests according to the item format, psychometric features, students' performance, and gender. The study sample consisted of 52 fifth-grade students in Samsun, Turkey, in 2017-2018. The measures used in the research were open-ended (OE), animation-based open-ended (AOE), multiple-choice (MC), and…
Descriptors: Animation, Achievement Tests, Test Items, Psychometrics
Ping Wang – ProQuest LLC, 2021
According to the RAND model framework, reading comprehension test performance is influenced by readers' reading skills or reader characteristics, test properties, and their interactions. However, little empirical research has systematically compared the impacts of reader characteristics, test properties, and reader-test interactions across…
Descriptors: Reading Comprehension, Reading Tests, Reading Research, Test Items
Noble, Tracy; Sireci, Stephen G.; Wells, Craig S.; Kachchaf, Rachel R.; Rosebery, Ann S.; Wang, Yang Caroline – American Educational Research Journal, 2020
In this experimental study, 20 multiple-choice test items from the Massachusetts Grade 5 science test were linguistically simplified, and original and simplified test items were administered to 310 English learners (ELs) and 1,580 non-ELs in four Massachusetts school districts. This study tested the hypothesis that specific linguistic features of…
Descriptors: Science Tests, Language Usage, English Language Learners, School Districts
Lindner, Marlit A.; Schult, Johannes; Mayer, Richard E. – Journal of Educational Psychology, 2022
This classroom experiment investigates the effects of adding representational pictures to multiple-choice and constructed-response test items to understand the role of the response format for the multimedia effect in testing. Participants were 575 fifth- and sixth-graders who answered 28 science test items--seven items in each of four experimental…
Descriptors: Elementary School Students, Grade 5, Grade 6, Multimedia Materials
Buono, Stephanie; Jang, Eunice Eunhee – Educational Assessment, 2021
Increasing linguistic diversity in classrooms has led researchers to examine the validity and fairness of standardized achievement tests, specifically concerning whether test score interpretations are free of bias and score use is fair for all students. This study examined whether mathematics achievement test items that contain complex language…
Descriptors: English Language Learners, Standardized Tests, Achievement Tests, Culture Fair Tests
Shanmugam, S. Kanageswari Suppiah; Veloo, Arsaythamby; Md-Ali, Ruzlan – Diaspora, Indigenous, and Minority Education, 2021
This study examined the validity of a trilingual test as a test accommodation for assessing Indigenous pupils' mathematical performance in Malaysia. The study employed two tests: a BM-only test with items written in the Malay language (BM), and a trilingual test, which had items written in BM and English, and oral audio recording in their native Temiar…
Descriptors: Multilingualism, Testing Accommodations, Grade 5, Elementary School Students
Stöckert, Alexandra; Bogner, Franz X. – Education Sciences, 2020
Efficient waste management is a major prerequisite for reaching sustainability as every one of us produces waste. Thus, educational interventions need to offer promising assistance to reduce individual waste as much as possible to promote environmentally friendly behavior beyond stereotypical notions about waste disposal. Those who know about all…
Descriptors: Sanitation, Sustainability, Teaching Methods, Environmental Education
Reardon, Sean F.; Kalogrides, Demetra; Fahle, Erin M.; Podolsky, Anne; Zárate, Rosalía C. – Educational Researcher, 2018
Prior research suggests that males outperform females, on average, on multiple-choice items compared to their relative performance on constructed-response items. This paper characterizes the extent to which gender achievement gaps on state accountability tests across the United States are associated with those tests' item formats. Using roughly 8…
Descriptors: Test Items, Test Format, Gender Differences, Achievement Gap
Koretz, Daniel; Jennings, Jennifer L.; Ng, Hui Leng; Yu, Carol; Braslow, David; Langi, Meredith – Educational Assessment, 2016
Test-based accountability often produces score inflation. Most studies have evaluated inflation by comparing trends on a high-stakes test and a lower stakes audit test. However, Koretz and Beguin (2010) noted weaknesses of audit tests and suggested self-monitoring assessments (SMAs), which incorporate audit items into high-stakes tests. This…
Descriptors: Audits (Verification), Scores, Grade Inflation, Self Evaluation (Individuals)
Muijselaar, Marloes M. L.; Swart, Nicole M.; Steenbeek-Planting, Esther G.; Droop, Mienke; Verhoeven, Ludo; de Jong, Peter F. – Journal of Educational Psychology, 2017
Many recent studies have aimed to demonstrate that specific types of reading comprehension depend on different underlying cognitive abilities. In these studies, it is often implicitly assumed that reading comprehension is a multidimensional construct. The general aim of this study was to examine the dimensionality of a large pool of reading…
Descriptors: Reading Comprehension, Foreign Countries, Grade 4, Elementary School Students