Publication Date
| In 2026 | 0 |
| Since 2025 | 38 |
| Since 2022 (last 5 years) | 225 |
| Since 2017 (last 10 years) | 570 |
| Since 2007 (last 20 years) | 1377 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 110 |
| Practitioners | 107 |
| Teachers | 46 |
| Administrators | 25 |
| Policymakers | 24 |
| Counselors | 12 |
| Parents | 7 |
| Students | 7 |
| Support Staff | 4 |
| Community | 2 |
Location
| California | 61 |
| Canada | 60 |
| United States | 57 |
| Turkey | 47 |
| Australia | 43 |
| Florida | 34 |
| Germany | 26 |
| Texas | 26 |
| China | 25 |
| Netherlands | 25 |
| Iran | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
Frisby, Craig L.; Henry, Betty – Contemporary School Psychology, 2016
A little over 35 years have passed since the original "Larry P." decision was handed down in 1979 by Robert Peckham, a federal judge for the US District Court for the Northern District of California. The "Larry P. case" is a shorthand moniker that refers to a class action lawsuit, supported by the Bay Area Association of Black…
Descriptors: Court Litigation, African American Students, Intellectual Disability, Disproportionate Representation
Finch, W. Holmes; Hernández Finch, Maria E.; French, Brian F. – International Journal of Testing, 2016
Differential item functioning (DIF) assessment is key in score validation. When DIF is present scores may not accurately reflect the construct of interest for some groups of examinees, leading to incorrect conclusions from the scores. Given rising immigration, and the increased reliance of educational policymakers on cross-national assessments…
Descriptors: Test Bias, Scores, Native Language, Language Usage
Cater, Melissa; Ferstel, Sarah D.; O'Neil, Carol E. – Journal of General Education, 2016
Student participation in undergraduate research (ugr) may be influenced by interest in research, future career and educational plans, perceived value of undergraduate research experiences, or perceived competence in research skills. The purpose of this study was to develop a questionnaire that could be used to validly and reliably assess students'…
Descriptors: Undergraduate Students, Student Experience, Questionnaires, Test Construction
Liu, Junhui; Brown, Terran; Chen, Jianshen; Ali, Usama; Hou, Likun; Costanzo, Kate – Partnership for Assessment of Readiness for College and Careers, 2016
The Partnership for Assessment of Readiness for College and Careers (PARCC) is a state-led consortium working to develop next-generation assessments that more accurately, compared to previous assessments, measure student progress toward college and career readiness. The PARCC assessments include both English Language Arts/Literacy (ELA/L) and…
Descriptors: Testing, Achievement Tests, Test Items, Test Bias
Goldhaber, Dan; Chaplin, Duncan Dunbar – Journal of Research on Educational Effectiveness, 2015
In an influential paper, Jesse Rothstein (2010) shows that standard value-added models (VAMs) suggest implausible and large future teacher effects on past student achievement. This is the basis of a falsification test that "appears" to indicate bias in typical VAM estimates of teacher contributions to student learning on standardized…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Teacher Influence, Models
Choi, Youn-Jeng; Alexeev, Natalia; Cohen, Allan S. – International Journal of Testing, 2015
The purpose of this study was to explore what may be contributing to differences in performance in mathematics on the Trends in International Mathematics and Science Study 2007. This was done by using a mixture item response theory modeling approach to first detect latent classes in the data and then to examine differences in performance on items…
Descriptors: Test Bias, Mathematics Achievement, Mathematics Tests, Item Response Theory
Williams, Jazz C. – English in Education, 2015
Several inference types serving distinct purposes are established in the literature on reading comprehension. Despite this highlighting that inference is a non-unitary construct, reading tests tend to treat it as a single ability. Consequently, different tests can assess different inferential abilities. Professionals, knowing what is implicitly…
Descriptors: Inferences, Sentences, Reading Comprehension, Reading Tests
Johnson, Brenda Webb – ProQuest LLC, 2017
Emotional intelligence (EI) has not been studied extensively within the Veterans' Health Administration (VHA). The VHA is the largest healthcare organization in America with over 360,000 employees and the organization invests heavily in competency development. The Tampa VA is a level 1 facility with over 5,000 employees in the Tampa Bay area. The…
Descriptors: Emotional Intelligence, Job Skills, Skill Development, Job Training
Banks, Kathleen; Jeddeeni, Ahmad; Walker, Cindy M. – International Journal of Testing, 2016
Differential bundle functioning (DBF) analyses were conducted to determine whether seventh and eighth grade second language learners (SLLs) had lower probabilities of answering bundles of math word problems correctly that had heavy language demands, when compared to non-SLLs of equal math proficiency. Math word problems on each of four test forms…
Descriptors: Middle School Students, English Language Learners, Second Language Learning, Grade 7
Liu, Yan; Zumbo, Bruno D.; Gustafson, Paul; Huang, Yi; Kroc, Edward; Wu, Amery D. – Practical Assessment, Research & Evaluation, 2016
A variety of differential item functioning (DIF) methods have been proposed and used for ensuring that a test is fair to all test takers in a target population in the situations of, for example, a test being translated to other languages. However, once a method flags an item as DIF, it is difficult to conclude that the grouping variable (e.g.,…
Descriptors: Test Items, Test Bias, Probability, Scores
Cockcroft, Kate; Bloch, Lauren; Moolla, Azra – Education as Change, 2016
This study investigated whether measures of verbal working memory are less sensitive to children's socioeconomic background than traditional vocabulary measures. Participants were 120 school beginners, divided into high and low socioeconomic groups. The groups contained equal numbers of English first-language and second-language speakers. All were…
Descriptors: Foreign Countries, Short Term Memory, Vocabulary, English (Second Language)
French, Brian F.; Finch, W. Holmes – Educational and Psychological Measurement, 2013
Multilevel data structures are ubiquitous in the assessment of differential item functioning (DIF), particularly in large-scale testing programs. There are a handful of DIF procures for researchers to select from that appropriately account for multilevel data structures. However, little, if any, work has been completed to extend a popular DIF…
Descriptors: Test Bias, Statistical Analysis, Comparative Analysis, Correlation
Betts, Donna – Art Therapy: Journal of the American Art Therapy Association, 2013
In an increasingly diverse society, and with the broadening scope of art therapy, the duty of art therapists to ensure responsible and appropriate assessment is ever more important. This article discusses considerations that are necessary for the successful adaptation and use of drawing-based assessments in cross-cultural and multicultural…
Descriptors: Art Therapy, Cultural Relevance, Measures (Individuals), Psychological Testing
Kim, Jihye; Oshima, T. C. – Educational and Psychological Measurement, 2013
In a typical differential item functioning (DIF) analysis, a significance test is conducted for each item. As a test consists of multiple items, such multiple testing may increase the possibility of making a Type I error at least once. The goal of this study was to investigate how to control a Type I error rate and power using adjustment…
Descriptors: Test Bias, Test Items, Statistical Analysis, Error of Measurement
Sireci, Stephen G.; Rios, Joseph A. – Educational Research and Evaluation, 2013
There are numerous statistical procedures for detecting items that function differently across subgroups of examinees that take a test or survey. However, in endeavouring to detect items that may function differentially, selection of the statistical method is only one of many important decisions. In this article, we discuss the important decisions…
Descriptors: Effect Size, Test Bias, Item Analysis, Statistical Analysis

Peer reviewed
Direct link
