Publication Date
In 2025 | 0 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 21 |
Since 2006 (last 20 years) | 59 |
Descriptor
Test Bias | 63 |
Grade 4 | 53 |
Test Items | 24 |
Mathematics Tests | 23 |
Elementary School Students | 21 |
Item Response Theory | 21 |
Grade 8 | 20 |
Foreign Countries | 18 |
Grade 5 | 17 |
Test Validity | 17 |
Achievement Tests | 16 |
Author
Zumbo, Bruno D. | 3 |
Ercikan, Kadriye | 2 |
French, Brian F. | 2 |
Janssen, Rianne | 2 |
Lee, Yoonsun | 2 |
Meyer, Patrick | 2 |
Middleton, Kyndra | 2 |
Roschmann, Sarina | 2 |
Steinberg, Jonathan | 2 |
Taylor, Catherine S. | 2 |
Witmer, Sara E. | 2 |
Publication Type
Journal Articles | 46 |
Reports - Research | 42 |
Reports - Evaluative | 11 |
Numerical/Quantitative Data | 8 |
Reports - Descriptive | 7 |
Dissertations/Theses -… | 3 |
Tests/Questionnaires | 1 |
Education Level
Grade 4 | 63 |
Elementary Education | 55 |
Intermediate Grades | 36 |
Grade 5 | 25 |
Middle Schools | 24 |
Grade 3 | 23 |
Grade 8 | 23 |
Grade 7 | 19 |
Junior High Schools | 19 |
Grade 6 | 18 |
Elementary Secondary Education | 17 |
Location
New York | 4 |
United States | 4 |
Belgium | 3 |
Germany | 3 |
Singapore | 3 |
Taiwan | 3 |
Australia | 2 |
Austria | 2 |
Canada | 2 |
Florida | 2 |
Hong Kong | 2 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 4 |
Individuals with Disabilities… | 1 |
Individuals with Disabilities… | 1 |
Rehabilitation Act 1973… | 1 |
Marjolein Muskens; Willem E. Frankenhuis; Lex Borghans – npj Science of Learning, 2024
In many countries, standardized math tests are important for achieving academic success. Here, we examine whether the content of items, the story that explains a mathematical question, biases performance of low-SES students. In a large-scale cohort study of Trends in International Mathematics and Science Study (TIMSS)--including data from 58…
Descriptors: Mathematics Tests, Standardized Tests, Test Items, Low Income Students
Yi-Hsin Chen – Applied Measurement in Education, 2024
This study aims to apply the differential item functioning (DIF) technique with the deterministic inputs, noisy "and" gate (DINA) model to validate the mathematics construct and diagnostic attribute profiles across American and Singaporean students. Even with the same ability level, every single item is expected to show uniform DIF…
Descriptors: Foreign Countries, Achievement Tests, Elementary Secondary Education, International Assessment
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
Witmer, Sara E.; Roschmann, Sarina – Measurement and Evaluation in Counseling and Development, 2020
It is critical to examine whether test accommodations function as intended in removing construct-irrelevant variance. The measurement comparability of a math test for students with emotional impairments and those without disabilities was examined. Results indicated the presence of limited differential item functioning (DIF) regardless of…
Descriptors: Testing Accommodations, Mathematics Tests, Emotional Disturbances, Students with Disabilities
Witmer, Sara E.; Roschmann, Sarina – Education and Training in Autism and Developmental Disabilities, 2020
Although it is critical for students with autism to be included in large-scale assessment and accountability systems, it is not clear how to best measure their underlying academic skills and knowledge. Additional empirically-supported guidance is necessary to assist school teams that need to make decisions about how to best include students with…
Descriptors: Testing Accommodations, Autism, Pervasive Developmental Disorders, Students with Disabilities
Paladino, Margaret – Journal for Leadership and Instruction, 2020
The opt-out movement, a grassroots coalition of opposition to high-stakes tests that are used to sort students, evaluate teachers, and rank schools, has the largest participation on Long Island, New York, where approximately 50% of the eligible students in grades three to eight opted out of the English Language Arts (ELA) and Mathematics tests in…
Descriptors: High Stakes Tests, Parent Attitudes, Racial Differences, Ethnicity
McLoud, Rachael – ProQuest LLC, 2019
An increasing number of parents are opting their children out of high-stakes tests. Accountability systems in education have used students' test scores to measure student learning, teacher effectiveness, and school district performance. Students who are opted out of high-stakes tests are not being evaluated by the state tests, making their level of…
Descriptors: Evaluation, High Stakes Tests, Parent Attitudes, Decision Making
Chen, Yi-Jui Iva; Wilson, Mark; Irey, Robin C.; Requa, Mary K. – Language Testing, 2020
Orthographic processing -- the ability to perceive, access, differentiate, and manipulate orthographic knowledge -- is essential when learning to recognize words. Despite its critical importance in literacy acquisition, the field lacks a tool to assess this essential cognitive ability. The goal of this study was to design a computer-based…
Descriptors: Orthographic Symbols, Spelling, Word Recognition, Reading Skills
Huggins-Manley, Anne Corinne – Educational and Psychological Measurement, 2017
This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…
Descriptors: Psychometrics, Test Items, Item Response Theory, Hypothesis Testing
Li, Sylvia; Meyer, Patrick – NWEA, 2019
This simulation study examines the measurement precision, item exposure rates, and the depth of the MAP® Growth™ item pools under various grade-level restrictions. Unlike most summative assessments, MAP Growth allows examinees to see items from any grade level, regardless of the examinee's actual grade level. It does not limit the test to items…
Descriptors: Achievement Tests, Item Banks, Test Items, Instructional Program Divisions
Carvajal-Espinoza, Jorge; Welch, Greg W. – Online Submission, 2016
When tests are translated into one or more languages, the question of the equivalence of items across language forms arises. This equivalence can be assessed at the scale level by means of a multiple group confirmatory factor analysis (CFA) in the context of structural equation modeling. This study examined the measurement equivalence of a Spanish…
Descriptors: Translation, Spanish, English, Mathematics Tests
Oliveri, María Elena; Ercikan, Kadriye; Zumbo, Bruno D.; Lawless, René – International Journal of Testing, 2014
In this study, we contrast results from two differential item functioning (DIF) approaches (manifest and latent class) by the number of items and sources of items identified as DIF using data from an international reading assessment. The latter approach yielded three latent classes, presenting evidence of heterogeneity in examinee response…
Descriptors: Test Bias, Comparative Analysis, Reading Tests, Effect Size
Li, Hongli; Qin, Qi; Lei, Pui-Wa – Educational Assessment, 2017
In recent years, students' test scores have been used to evaluate teachers' performance. The assumption underlying this practice is that students' test performance reflects teachers' instruction. However, this assumption is generally not empirically tested. In this study, we examine the effect of teachers' instruction on test performance at the…
Descriptors: Achievement Tests, Foreign Countries, Elementary Secondary Education, Mathematics Achievement
Finch, W. Holmes; Hernández Finch, Maria E.; French, Brian F. – International Journal of Testing, 2016
Differential item functioning (DIF) assessment is key in score validation. When DIF is present, scores may not accurately reflect the construct of interest for some groups of examinees, leading to incorrect conclusions from the scores. Given rising immigration, and the increased reliance of educational policymakers on cross-national assessments…
Descriptors: Test Bias, Scores, Native Language, Language Usage
Choi, Youn-Jeng; Alexeev, Natalia; Cohen, Allan S. – International Journal of Testing, 2015
The purpose of this study was to explore what may be contributing to differences in performance in mathematics on the Trends in International Mathematics and Science Study 2007. This was done by using a mixture item response theory modeling approach to first detect latent classes in the data and then to examine differences in performance on items…
Descriptors: Test Bias, Mathematics Achievement, Mathematics Tests, Item Response Theory