Publication Date
| In 2026 | 0 |
| Since 2025 | 40 |
| Since 2022 (last 5 years) | 227 |
| Since 2017 (last 10 years) | 572 |
| Since 2007 (last 20 years) | 1379 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Researchers | 110 |
| Practitioners | 107 |
| Teachers | 46 |
| Administrators | 25 |
| Policymakers | 24 |
| Counselors | 12 |
| Parents | 7 |
| Students | 7 |
| Support Staff | 4 |
| Community | 2 |
Location
| California | 61 |
| Canada | 60 |
| United States | 57 |
| Turkey | 47 |
| Australia | 43 |
| Florida | 34 |
| Germany | 26 |
| Texas | 26 |
| China | 25 |
| Netherlands | 25 |
| Iran | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 1 |
| Does not meet standards | 1 |
College Board, 2010
This is the College Board's response to a research article by Drs. Maria Veronica Santelices and Mark Wilson in the Harvard Educational Review, entitled "Unfair Treatment? The Case of Freedle, the SAT, and the Standardization Approach to Differential Item Functioning" (see EJ930622).
Descriptors: Test Bias, College Entrance Examinations, Standardized Tests, Test Items
Arendasy, Martin E.; Sommer, Markus – Intelligence, 2010
In complex three-dimensional mental rotation tasks males have been reported to score up to one standard deviation higher than females. However, this effect size estimate could be compromised by the presence of gender bias at the item level, which calls the validity of purely quantitative performance comparisons into question. We hypothesized that…
Descriptors: Effect Size, Psychometrics, Gender Differences, Gender Bias
Mokkink, Lidwine B.; Knol, Dirk L.; van Nispen, Ruth M. A.; Kramer, Sophia E. – Journal of Speech, Language, and Hearing Research, 2010
Purpose: The aim of this study was to improve the quality and applicability of the 6 Dutch scales of the Communication Profile for the Hearing Impaired (CPHI; Demorest & Erdman, 1986, 1987, 1988) using item response theory (IRT). IRT modeling can produce precise, valid, and relatively brief instruments, resulting in minimal response burden (Edelen…
Descriptors: Hearing Impairments, Profiles, Measures (Individuals), Item Response Theory
Bennett, Randy Elliot – Educational Testing Service, 2011
CBAL, an acronym for Cognitively Based Assessment of, for, and as Learning, is a research initiative intended to create a model for an innovative K-12 assessment system that provides summative information for policy makers, as well as formative information for classroom instructional purposes. This paper summarizes empirical results from 16 CBAL…
Descriptors: Educational Assessment, Elementary Secondary Education, Summative Evaluation, Formative Evaluation
Rudner, Lawrence M. – Graduate Management Admission Council, 2011
To articulate a guiding principle at the Graduate Management Admission Council (GMAC), CEO Dave Wilson often quotes Harry Bosch, the protagonist of several Michael Connelly novels, who said, "Everybody matters, or no one matters." With management education now a global field, and with 52 percent of the GMAT (Graduate Management Admission Test)…
Descriptors: College Entrance Examinations, Graduate Study, Business Administration Education, Culture Fair Tests
Ong, Yoke Mooi; Williams, Julian Scott; Lamprianou, Iasonas – International Journal of Testing, 2011
The aims of this study are (a) to examine the sources of differential functioning by gender via differential bundle functioning (DBF) in mathematics assessment and (b) to use DBF to explore whether the differential functioning displayed is construct-relevant or construct-irrelevant. Three qualitatively different areas, namely curriculum domains,…
Descriptors: Test Bias, Gender Differences, Gender Bias, Mathematics Tests
Chulu, Bob Wajizigha; Sireci, Stephen G. – International Journal of Testing, 2011
Many examination agencies, policy makers, media houses, and the public at large make high-stakes decisions based on test scores. Unfortunately, in some cases educational tests are not statistically equated to account for test differences over time, which leads to inappropriate interpretations of students' performance. In this study we illustrate…
Descriptors: Classification, Foreign Countries, Item Response Theory, High Stakes Tests
Wetzel, Eunike; Hell, Benedikt; Passler, Katja – Journal of Career Assessment, 2012
Three test construction strategies are described and illustrated in the development of the Verb Interest Test (VIT), an inventory that assesses vocational interests using verbs. Verbs might be a promising alternative to the descriptions of occupational activities used in most vocational interest inventories because they are context-independent,…
Descriptors: Test Construction, Culture Fair Tests, Vocational Interests, Interest Inventories
Maydosz, Ann; Maydosz, Diane – Multicultural Learning and Teaching, 2013
Despite the fact that disability has been recognized as "a natural part of the human experience" (Developmental Disabilities Assistance and Bill of Rights Act of 2000) and that the Education for All Handicapped Children Act of 1975 and its later reauthorizations as the Individuals with Disabilities Education Act (IDEA) should have served…
Descriptors: Disabilities, Minority Group Students, Court Litigation, Laws
Liu, Kristin K.; Goldstone, Linda; Thurlow, Martha L.; Ward, Jenna; Hatten, James; Christensen, Laurene L. – National Center on Educational Outcomes, 2013
English language learners (ELLs) with disabilities are an increasing presence in schools in the United States. Title I and Title III of the Elementary and Secondary Education Act require that these students meet the same academic grade-level standards and participate in content assessments as their fluent-English speaking peers without…
Descriptors: English Language Learners, Disabilities, State Standards, Standardized Tests
Schatschneider, Christopher; Lane, Kathleen Lynne; Oakes, Wendy Peia; Kalberg, Jemma Robertson – Educational Assessment, 2014
Screening of students at risk for antisocial behaviors in school is an essential step in the implementation of evidence-based supports for academic, behavioral, and social domains at the first sign of concern. This study examined the measurement properties of a free-access systematic behavior screening tool: the Student Risk Screening Scale…
Descriptors: Test Bias, Screening Tests, Antisocial Behavior, At Risk Students
DiStefano, Christine; Greer, Fred W.; Kamphaus, R. W.; Brown, William H. – Journal of Early Intervention, 2014
A screening instrument used to identify young children at risk for behavioral and emotional difficulties, the Behavioral and Emotional Screening System Teacher Rating Scale-Preschool was examined. The Rasch Rating Scale Method was used to provide additional information about psychometric properties of items, respondents, and the response scale.…
Descriptors: Screening Tests, At Risk Persons, Test Validity, Rating Scales
Herman, Joan L.; Heritage, Margaret; Goldschmidt, Pete – Assessment and Accountability Comprehensive Center, 2011
States and districts across the country are grappling with how to incorporate assessments of student learning into their teacher evaluation systems. Sophisticated statistical models have been proposed to estimate the relative value individual teachers add to their students' assessment performance (hence the term teacher "value-added" measures).…
Descriptors: Teacher Evaluation, Testing, Test Selection, Test Construction
Herman, Joan L.; Heritage, Margaret; Goldschmidt, Pete – Assessment and Accountability Comprehensive Center, 2011
States and districts across the country are grappling with how to incorporate assessments of student learning into their teacher evaluation systems. Sophisticated statistical models have been proposed to estimate the relative value individual teachers add to their students' assessment performance (hence the term teacher "value-added" measures).…
Descriptors: Teacher Evaluation, Testing, Test Selection, Test Construction
Atar, Burcu; Kamata, Akihito – Hacettepe University Journal of Education, 2011
The Type I error rates and the power of IRT likelihood ratio test and cumulative logit ordinal logistic regression procedures in detecting differential item functioning (DIF) for polytomously scored items were investigated in this Monte Carlo simulation study. For this purpose, 54 simulation conditions (combinations of 3 sample sizes, 2 sample…
Descriptors: Test Bias, Sample Size, Monte Carlo Methods, Item Response Theory

Peer reviewed
Direct link
