Publication Date
| In 2026 | 0 |
| Since 2025 | 38 |
| Since 2022 (last 5 years) | 225 |
| Since 2017 (last 10 years) | 570 |
| Since 2007 (last 20 years) | 1377 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 110 |
| Practitioners | 107 |
| Teachers | 46 |
| Administrators | 25 |
| Policymakers | 24 |
| Counselors | 12 |
| Parents | 7 |
| Students | 7 |
| Support Staff | 4 |
| Community | 2 |
Location
| California | 61 |
| Canada | 60 |
| United States | 57 |
| Turkey | 47 |
| Australia | 43 |
| Florida | 34 |
| Germany | 26 |
| Texas | 26 |
| China | 25 |
| Netherlands | 25 |
| Iran | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Wiberg, Marie – International Journal of Testing, 2009
The aim of this study was to examine log linear modelling (LLM) compared with logistic regression (LR) and Mantel-Haenszel (MH) test for detecting Differential Item Functioning (DIF) in a mastery test. The three methods were chosen because they have similar components. The results showed fairly high matching percentages together with high…
Descriptors: Test Bias, Mastery Tests, Comparative Analysis, Regression (Statistics)
Bowman, Nicholas A.; Hill, Patrick L. – New Directions for Institutional Research, 2011
Colleges and universities are increasingly using national surveys to assess their students' learning and development. Given the importance of the first year of college for student adjustment and retention (Tinto, 1993), some of these surveys are designed specifically to gauge the experiences and outcomes of first-year students. These large-scale…
Descriptors: Undergraduate Students, National Surveys, Measurement Techniques, Educational Experience
Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011
Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…
Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency
Randall, Jennifer; Cheong, Yuk Fai; Engelhard, George, Jr. – Educational and Psychological Measurement, 2011
To address whether or not modifications in test administration influence item functioning for students with disabilities on a high-stakes statewide problem-solving assessment, a sample of 868 students (with and without disabilities) from 74 Georgia schools were randomly assigned to one of three testing conditions (resource guide, calculator, or…
Descriptors: Item Response Theory, Models, Context Effect, Test Bias
Magis, David; Raiche, Gilles; Beland, Sebastien; Gerard, Paul – International Journal of Testing, 2011
We present an extension of the logistic regression procedure to identify dichotomous differential item functioning (DIF) in the presence of more than two groups of respondents. Starting from the usual framework of a single focal group, we propose a general approach to estimate the item response functions in each group and to test for the presence…
Descriptors: Language Skills, Identification, Foreign Countries, Evaluation Methods
Paek, Insu; Wilson, Mark – Educational and Psychological Measurement, 2011
This study elaborates the Rasch differential item functioning (DIF) model formulation under the marginal maximum likelihood estimation context. Also, the Rasch DIF model performance was examined and compared with the Mantel-Haenszel (MH) procedure in small sample and short test length conditions through simulations. The theoretically known…
Descriptors: Test Bias, Test Length, Statistical Inference, Geometric Concepts
Chakravarti, Amitav; Fang, Christina; Shapira, Zur – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2011
The ability to detect a change, to accurately assess the magnitude of the change, and to react to that change in a commensurate fashion are of critical importance in many decision domains. Thus, it is important to understand the factors that systematically affect people's reactions to change. In this article we document a novel effect: Decision…
Descriptors: Cognitive Processes, Change, Responses, Decision Making Skills
Styles, Irene; Wildy, Helen; Pepper, Vivienne; Faulkner, Joanne; Berman, Ye'Elah – International Research in Early Childhood Education, 2014
The assessment of literacy and numeracy skills of students as they enter school for the first time is not yet established nation-wide in Australia. However, a large proportion of primary schools have chosen to assess their starting students on the Performance Indicators in Primary Schools-Baseline Assessment (PIPS-BLA). This series of three…
Descriptors: Foreign Countries, Indigenous Knowledge, Performance Based Assessment, Test Bias
Geraghty Ward, Emily M.; Semken, Steven; Libarkin, Julie C. – Journal of Geoscience Education, 2014
We present a mixed-methods approach to community-based assessment design that engages tribal college and university faculty, students, and science educators, as well as experts in cultural knowledge from the Blackfeet and Diné (Navajo) nations. Information from cultural experts, gathered through a combination of sequential surveys and focus group…
Descriptors: Mixed Methods Research, Earth Science, Tribally Controlled Education, American Indian Students
Cawthon, Stephanie; Leppo, Rachel; Carr, Therese; Kopriva, Rebecca – Educational Assessment, 2013
When do item adaptations veer from their intent and, instead of increasing access, modify the construct being measured? This study analyzed early elementary student achievement data from a statewide field test containing both standard and adapted science items. Four student groups were included in this analysis: English language learners, students…
Descriptors: Testing Accommodations, Test Items, Adaptive Testing, Science Tests
Kim, Do-Hong; Lambert, Richard G.; Burts, Diane C. – Early Education and Development, 2013
Research Findings: This study examined the measurement equivalence of the "Teaching Strategies GOLD[R]" assessment system across subgroups of children based on their primary language and disability status. This study is based on teacher-collected assessment data for 3-, 4-, and 5-year-old children for the fall of 2010, winter of 2010, and spring…
Descriptors: English Language Learners, Teaching Methods, Educational Strategies, Special Needs Students
Thurlow, Martha L.; Liu, Kristin K.; Ward, Jenna M.; Christensen, Laurene L. – National Center on Educational Outcomes, 2013
The Improving the Validity of Assessment Results for English Language Learners with Disabilities (IVARED) project has identified essential principles of inclusive and valid assessments for English language learners (ELLs) with disabilities. These principles were developed from a Delphi expert review process with nationally recognized experts in…
Descriptors: English Language Learners, Students with Disabilities, Student Evaluation, Evaluation Methods
Sandilands, Debra; Oliveri, Maria Elena; Zumbo, Bruno D.; Ercikan, Kadriye – International Journal of Testing, 2013
International large-scale assessments of achievement often have a large degree of differential item functioning (DIF) between countries, which can threaten score equivalence and reduce the validity of inferences based on comparisons of group performances. It is important to understand potential sources of DIF to improve the validity of future…
Descriptors: Validity, Measures (Individuals), International Studies, Foreign Countries
Gattamorta, Karina A.; Penfield, Randall D.; Myers, Nicholas D. – International Journal of Testing, 2012
Measurement invariance is a common consideration in the evaluation of the validity and fairness of test scores when the tested population contains distinct groups of examinees, such as examinees receiving different forms of a translated test. Measurement invariance in polytomous items has traditionally been evaluated at the item-level,…
Descriptors: Foreign Countries, Psychometrics, Test Bias, Test Items
Dorans, Neil J. – Educational Testing Service, 2010
Santelices and Wilson (2010) claimed to have addressed technical criticisms of Freedle (2003) presented in Dorans (2004a) and elsewhere. Santelices and Wilson's abstract claimed that their study confirmed that SAT[R] verbal items do function differently for African American and White subgroups. In this commentary, I demonstrate that the…
Descriptors: College Entrance Examinations, Verbal Tests, Test Bias, Test Items

Peer reviewed
Direct link
