Publication Date
| In 2026 | 0 |
| Since 2025 | 36 |
| Since 2022 (last 5 years) | 223 |
| Since 2017 (last 10 years) | 568 |
| Since 2007 (last 20 years) | 1375 |
Audience
| Researchers | 110 |
| Practitioners | 107 |
| Teachers | 46 |
| Administrators | 25 |
| Policymakers | 24 |
| Counselors | 12 |
| Parents | 7 |
| Students | 7 |
| Support Staff | 4 |
| Community | 2 |
Location
| California | 61 |
| Canada | 60 |
| United States | 57 |
| Turkey | 47 |
| Australia | 43 |
| Florida | 34 |
| Germany | 26 |
| Texas | 26 |
| China | 25 |
| Netherlands | 25 |
| Iran | 22 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Moradi, Elahe; Ghabanchi, Zargham; Pishghadam, Reza – Language Testing in Asia, 2022
Given the significance of test fairness, this study aimed to investigate a reading comprehension test for evidence of differential item functioning (DIF) based on English as a Foreign Language (EFL) learners' gender and their mode of learning (conventional vs. distance learning). To this end, 514 EFL learners were asked to take a 30-item…
Descriptors: Reading Comprehension, Test Bias, Test Items, Second Language Learning
Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational Measurement, 2018
In typical differential item functioning (DIF) assessments, an item's DIF status is not influenced by its status in previous test administrations. An item that has shown DIF at multiple administrations may be treated the same way as an item that has shown DIF in only the most recent administration. Therefore, much useful information about the…
Descriptors: Test Bias, Testing, Test Items, Bayesian Statistics
Zheng, Xiaying; Yang, Ji Seung – Measurement: Interdisciplinary Research and Perspectives, 2021
The purpose of this paper is to briefly introduce the two most common applications of multiple group item response theory (IRT) models, namely differential item functioning (DIF) analysis and nonequivalent group score linking with a simultaneous calibration. We illustrate how to conduct those analyses using the "Stata" item…
Descriptors: Item Response Theory, Test Bias, Computer Software, Statistical Analysis
Flanagan, Agnes; Cormier, Damien C. – Communique, 2019
One of the areas subsumed under the data-based decision making and accountability practice identified in the National Association of School Psychologists' (NASP) "Model for Integrated School Psychological Services" is to collect information on psychological and educational variables to make decisions at a number of levels of service…
Descriptors: Test Bias, School Psychologists, Measurement, Data Collection
Tulek, Onder Kamil; Kose, Ibrahim Alper – Eurasian Journal of Educational Research, 2019
Purpose: This research investigates tests that include DIF items and tests that have been purified of DIF items. While doing this, the ability estimations and purified DIF items are compared to understand whether there is a correlation between the estimations. Method: The researcher used R 3.4.1 in order to compare the items and after this situation;…
Descriptors: Test Items, Item Analysis, Item Response Theory, Test Length
Ip, Edward H.; Strachan, Tyler; Fu, Yanyan; Lay, Alexandra; Willse, John T.; Chen, Shyh-Huei; Rutkowski, Leslie; Ackerman, Terry – Journal of Educational Measurement, 2019
Test items must often be broad in scope to be ecologically valid. It is therefore almost inevitable that secondary dimensions are introduced into a test during test development. A cognitive test may require one or more abilities besides the primary ability to correctly respond to an item, in which case a unidimensional test score overestimates the…
Descriptors: Test Items, Test Bias, Test Construction, Scores
Strait, Julia E.; Wright, Emma Kate C.; Decker, Scott L. – Psychology in the Schools, 2019
Performance on figure copying tasks is empirically linked to school readiness, learning, cognition, and neuropsychological functioning. These nonverbal tasks are frequently used to evaluate children from diverse backgrounds to minimize bias due to factors such as language, ethnicity, culture, or socioeconomic status on test performance. The…
Descriptors: Perceptual Motor Coordination, Psychological Testing, Test Bias, Whites
Lee, Hyung Rock; Lee, Sunbok; Sung, Jaeyun – International Journal of Assessment Tools in Education, 2019
Applying single-level statistical models to multilevel data typically produces underestimated standard errors, which may result in misleading conclusions. This study examined the impact of ignoring multilevel data structure on the estimation of item parameters and their standard errors of the Rasch, two-, and three-parameter logistic models in…
Descriptors: Item Response Theory, Computation, Error of Measurement, Test Bias
Frey, Meredith C. – Journal of Intelligence, 2019
Fifteen years ago, Frey and Detterman established that the SAT (and later, with Koenig, the ACT) was substantially correlated with measures of general cognitive ability and could be used as a proxy measure for intelligence (Frey and Detterman, 2004; Koenig, Frey, and Detterman, 2008). Since that finding, replicated many times and cited extensively…
Descriptors: College Entrance Examinations, Academic Aptitude, Academic Achievement, Prediction
Witmer, Sara E.; Roschmann, Sarina – Measurement and Evaluation in Counseling and Development, 2020
It is critical to examine whether test accommodations function as intended in removing construct-irrelevant variance. The measurement comparability of a math test for students with emotional impairments and those without disabilities was examined. Results indicated the presence of limited differential item functioning (DIF) regardless of…
Descriptors: Testing Accommodations, Mathematics Tests, Emotional Disturbances, Students with Disabilities
Witmer, Sara E.; Roschmann, Sarina – Education and Training in Autism and Developmental Disabilities, 2020
Although it is critical for students with autism to be included in large-scale assessment and accountability systems, it is not clear how to best measure their underlying academic skills and knowledge. Additional empirically-supported guidance is necessary to assist school teams that need to make decisions about how to best include students with…
Descriptors: Testing Accommodations, Autism, Pervasive Developmental Disorders, Students with Disabilities
Bygren, Magnus – Assessment & Evaluation in Higher Education, 2020
Group differences in average grades prior to and after a step-wise introduction of blinded examinations at Stockholm University are examined. Relative to students with 'native' names, students with 'foreign' names appear to experience weak positive bias in the grading of their examinations, but the estimated effect is sensitive to model…
Descriptors: Foreign Countries, College Students, Student Evaluation, Grading
Kuang, Huan; Sahin, Fusun – Large-scale Assessments in Education, 2023
Background: Examinees may not make enough effort when responding to test items if the assessment has no consequence for them. These disengaged responses can be problematic in low-stakes, large-scale assessments because they can bias item parameter estimates. However, the amount of bias, and whether this bias is similar across administrations, is…
Descriptors: Test Items, Comparative Analysis, Mathematics Tests, Reaction Time
Hodgson, Caroline G.; Bonifay, Wes; Yang, Wenxi; Herman, Keith C. – Grantee Submission, 2023
Background: Technically sound measures are necessary for accurately identifying youth at risk for depression, but many studies rely on classical test theory metrics or adult samples to evaluate measures. This study examined the use of the PHQ-8, a common and freely available pediatric depression screener, in an adolescent sample using item…
Descriptors: Depression (Psychology), Measurement, Screening Tests, Adolescents
Lane, Kathleen Lynne – School Psychology Review, 2022
In my brief commentary, I offer a look to the past to celebrate lessons learned in social, emotional, and behavioral assessment. Then, I respectfully pose considerations for researchers and practitioners committed to early detection, intervention, and assessment of students with and at risk for emotional and behavioral disorders. Specifically, I…
Descriptors: Functional Behavioral Assessment, Identification, Early Intervention, At Risk Students

