Publication Date
In 2025 | 0
Since 2024 | 13
Since 2021 (last 5 years) | 70
Since 2016 (last 10 years) | 200
Since 2006 (last 20 years) | 487
Descriptor
Test Bias | 761
Test Items | 761
Item Response Theory | 198
Foreign Countries | 161
Item Analysis | 155
Statistical Analysis | 141
Test Construction | 139
Difficulty Level | 130
Comparative Analysis | 124
Test Validity | 123
Scores | 117
Author
Penfield, Randall D. | 15
Dorans, Neil J. | 11
Hambleton, Ronald K. | 10
Wang, Wen-Chung | 9
Magis, David | 8
Ercikan, Kadriye | 7
Oshima, T. C. | 7
Plake, Barbara S. | 7
Zumbo, Bruno D. | 7
De Boeck, Paul | 6
Raju, Nambury S. | 6
Audience
Researchers | 32
Teachers | 7
Practitioners | 6
Administrators | 5
Support Staff | 3
Counselors | 2
Students | 2
Community | 1
Parents | 1
Location
Canada | 20
Turkey | 20
United States | 18
Germany | 12
Florida | 10
Netherlands | 9
Taiwan | 9
Iran | 8
Singapore | 8
California | 7
Australia | 5
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1
Meets WWC Standards with or without Reservations | 1
Belzak, William C. M. – Educational Measurement: Issues and Practice, 2023
Test developers and psychometricians have historically examined measurement bias and differential item functioning (DIF) across a single categorical variable (e.g., gender), independently of other variables (e.g., race, age, etc.). This is problematic when more complex forms of measurement bias may adversely affect test responses and, ultimately,…
Descriptors: Test Bias, High Stakes Tests, Artificial Intelligence, Test Items
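The crossing of background variables that this entry motivates can be pictured with a minimal sketch: form joint groups from two categorical variables before running any standard DIF screen, so bias confined to specific combinations becomes visible. This is only the motivating idea, not Belzak's actual method, and all column names and data below are hypothetical.

```python
import pandas as pd

# Hypothetical item-response data; names and values are illustrative only.
df = pd.DataFrame({
    "correct": [1, 0, 1, 1, 0, 1, 0, 0],
    "gender":  ["F", "F", "M", "M", "F", "M", "F", "M"],
    "age_grp": ["young", "old", "young", "old", "young", "old", "young", "old"],
})

# Conventional single-variable DIF screens one grouping at a time:
by_gender = df.groupby("gender")["correct"].mean()

# Crossing the variables first exposes bias that appears only for
# specific combinations (e.g., older women):
df["joint_group"] = df["gender"] + "_" + df["age_grp"]
by_joint = df.groupby("joint_group")["correct"].mean()
print(by_gender, by_joint, sep="\n\n")
```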
Weese, James D.; Turner, Ronna C.; Ames, Allison; Crawford, Brandon; Liang, Xinya – Educational and Psychological Measurement, 2022
A simulation study was conducted to investigate the heuristics of the SIBTEST procedure and how it compares with ETS classification guidelines used with the Mantel-Haenszel procedure. Prior heuristics have been used for nearly 25 years, but they are based on a simulation study that was restricted due to computer limitations and that modeled item…
Descriptors: Test Bias, Heuristics, Classification, Statistical Analysis
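For readers unfamiliar with the Mantel-Haenszel procedure and the ETS guidelines this entry benchmarks against, a minimal sketch follows. The A/B/C cutoffs are the published thresholds on the delta scale, but the full guidelines add statistical-significance conditions omitted here, and the input data are synthetic.

```python
import numpy as np

def mantel_haenszel_delta(ref, foc, score, correct):
    """Mantel-Haenszel common odds ratio across matched total-score
    levels, expressed on the ETS delta scale (-2.35 * ln(alpha))."""
    num = den = 0.0
    for k in np.unique(score):
        at_k = score == k
        a = np.sum(ref & at_k & (correct == 1))   # reference, correct
        b = np.sum(ref & at_k & (correct == 0))   # reference, incorrect
        c = np.sum(foc & at_k & (correct == 1))   # focal, correct
        d = np.sum(foc & at_k & (correct == 0))   # focal, incorrect
        t = a + b + c + d
        if t > 0:
            num += a * d / t
            den += b * c / t
    return -2.35 * np.log(num / den)

def ets_class(delta):
    """Simplified A/B/C label from |delta| alone."""
    m = abs(delta)
    return "A (negligible)" if m < 1.0 else "B (moderate)" if m < 1.5 else "C (large)"

# Illustrative usage with tiny synthetic data:
rng = np.random.default_rng(0)
score = rng.integers(0, 5, 400)                # matching criterion
ref = rng.random(400) < 0.5                    # reference-group mask
correct = (rng.random(400) < 0.6).astype(int)  # 0/1 item responses
d = mantel_haenszel_delta(ref, ~ref, score, correct)
print(d, ets_class(d))
```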
Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Educational and Psychological Measurement, 2022
This study offers an approach to testing for differential item functioning (DIF) in a recently developed measurement framework, referred to as "D"-scoring method (DSM). Under the proposed approach, called "P-Z" method of testing for DIF, the item response functions of two groups (reference and focal) are compared by…
Descriptors: Test Bias, Methods, Test Items, Scoring
Randall, Jennifer – Educational Assessment, 2023
In a justice-oriented antiracist assessment process, attention to the disruption of white supremacy must occur at every stage--from construct articulation to score reporting. An important step in the assessment development process is the item review stage often referred to as Bias/Fairness and Sensitivity Review. I argue that typical approaches to…
Descriptors: Social Justice, Racism, Test Bias, Test Items
James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024
In this study, a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that is appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…

Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size
Finch, W. Holmes – Educational and Psychological Measurement, 2023
Psychometricians have devoted much research and attention to categorical item responses, leading to the development and widespread use of item response theory for the estimation of model parameters and identification of items that do not perform in the same way for examinees from different population subgroups (e.g., differential item functioning…
Descriptors: Test Bias, Item Response Theory, Computation, Methods
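As background for the estimation problems this entry discusses, a minimal sketch of how item response theory expresses DIF: fit an item characteristic curve per subgroup and summarize the gap between the curves. The parameter values below are invented for illustration.

```python
import numpy as np

def icc_2pl(theta, a, b):
    """Two-parameter-logistic item characteristic curve."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

# Hypothetical parameters for the same item estimated in two subgroups.
theta = np.linspace(-4, 4, 801)
p_ref = icc_2pl(theta, a=1.2, b=0.0)   # reference group
p_foc = icc_2pl(theta, a=1.2, b=0.4)   # focal group: item is harder

# Unsigned area between the curves, one common DIF-magnitude summary,
# approximated here with a simple Riemann sum.
area = float(np.sum(np.abs(p_ref - p_foc)) * (theta[1] - theta[0]))
print(f"unsigned area between ICCs: {area:.3f}")
```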
Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024
Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…
Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales
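Before any model-based correction of the kind compared above, extreme response style can be eyeballed with a simple descriptive index. The data below are invented, and the IRT-based ERS models the entry refers to go further by separating this tendency from the trait being measured rather than just counting.

```python
import numpy as np

# Hypothetical 5-point Likert responses (rows = respondents, cols = items).
responses = np.array([
    [1, 5, 1, 5, 5],
    [3, 3, 2, 4, 3],
    [2, 3, 3, 3, 2],
])

# Crude ERS indicator: each respondent's share of endorsements falling
# in the extreme categories (1 or 5).
ers_index = np.mean((responses == 1) | (responses == 5), axis=1)
print(ers_index)   # first respondent answers only in extremes -> 1.0
```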
Andrew D. Ho – Journal of Educational and Behavioral Statistics, 2024
I review opportunities and threats that widely accessible Artificial Intelligence (AI)-powered services present for educational statistics and measurement. Algorithmic and computational advances continue to improve approaches to item generation, scale maintenance, test security, test scoring, and score reporting. Predictable misuses of AI for…
Descriptors: Artificial Intelligence, Measurement, Educational Assessment, Technology Uses in Education
Paula Elosua – Language Assessment Quarterly, 2024
In sociolinguistic contexts where standardized languages coexist with regional dialects, the study of differential item functioning is a valuable tool for examining certain linguistic uses or varieties as threats to score validity. From an ecological perspective, this paper describes three stages in the study of differential item functioning…
Descriptors: Reading Tests, Reading Comprehension, Scores, Test Validity
Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022
Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…
Descriptors: Ability, Tests, Equated Scores, Testing Problems
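The baseline this study complicates is ordinary linear equating, sketched below under a random-groups assumption; when samples are not equivalent and no anchor items exist, exactly this shortcut breaks down. The scores are simulated, not operational data.

```python
import numpy as np

def linear_equate(x_scores, y_scores, x_new):
    """Map form-X scores onto the form-Y scale by matching means and
    standard deviations (defensible only for equivalent groups)."""
    mx, sx = np.mean(x_scores), np.std(x_scores)
    my, sy = np.mean(y_scores), np.std(y_scores)
    return my + (sy / sx) * (np.asarray(x_new) - mx)

# Simulated samples taking two different forms.
form_x = np.random.default_rng(0).normal(50, 10, 1000)
form_y = np.random.default_rng(1).normal(53, 9, 1000)
print(linear_equate(form_x, form_y, [40, 50, 60]))
```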
Bolt, Daniel M.; Liao, Xiangyi – Journal of Educational Measurement, 2021
We revisit the empirically observed positive correlation between DIF and difficulty studied by Freedle and commonly seen in tests of verbal proficiency when comparing populations of different mean latent proficiency levels. It is shown that a positive correlation between DIF and difficulty estimates is actually an expected result (absent any true…
Descriptors: Test Bias, Difficulty Level, Correlation, Verbal Tests
Marjolein Muskens; Willem E. Frankenhuis; Lex Borghans – npj Science of Learning, 2024
In many countries, standardized math tests are important for achieving academic success. Here, we examine whether the content of items, the story that frames a mathematical question, biases the performance of low-SES students. In a large-scale cohort study of the Trends in International Mathematics and Science Study (TIMSS)--including data from 58…
Descriptors: Mathematics Tests, Standardized Tests, Test Items, Low Income Students
Chalmers, R. Philip; Zheng, Guoguo – Applied Measurement in Education, 2023
This article presents generalizations of SIBTEST and crossing-SIBTEST statistics for differential item functioning (DIF) investigations involving more than two groups. After reviewing the original two-group setup for these statistics, a set of multigroup generalizations that support contrast matrices for joint tests of DIF are presented. To…
Descriptors: Test Bias, Test Items, Item Response Theory, Error of Measurement
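The contrast matrices this entry mentions can be pictured with a small sketch: a matrix C such that C @ beta = 0 encodes "the group-specific DIF effect is equal across all G groups", turning pairwise comparisons into one joint test. The construction below is a generic successive-difference contrast, not the article's specific statistics.

```python
import numpy as np

def successive_difference_contrasts(n_groups):
    """(G-1) x G matrix whose rows compare each group with the next;
    C @ beta == 0 iff the effect beta is constant across groups."""
    C = np.zeros((n_groups - 1, n_groups))
    for i in range(n_groups - 1):
        C[i, i], C[i, i + 1] = 1.0, -1.0
    return C

beta = np.array([0.05, 0.05, 0.05, 0.30])  # hypothetical group DIF effects
print(successive_difference_contrasts(4) @ beta)  # nonzero row flags group 4
```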
Wang, Weimeng; Liu, Yang; Liu, Hongyun – Journal of Educational and Behavioral Statistics, 2022
Differential item functioning (DIF) occurs when the probability of endorsing an item differs across groups for individuals with the same latent trait level. The presence of DIF items may jeopardize the validity of an instrument; therefore, it is crucial to identify DIF items in routine operations of educational assessment. While DIF detection…
Descriptors: Test Bias, Test Items, Equated Scores, Regression (Statistics)
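For orientation, the regression-based DIF screen the descriptors point to is typically the Swaminathan-Rogers logistic regression: regress the item response on the matching score, group membership, and their interaction, where the group term captures uniform DIF and the interaction nonuniform DIF. The simulation below is synthetic and is not the article's own procedure.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(7)
n = 2000
group = rng.integers(0, 2, n)            # 0 = reference, 1 = focal
theta = rng.normal(0, 1, n)              # latent trait
total = theta + rng.normal(0, 0.5, n)    # observed matching score

# Simulate an item with uniform DIF against the focal group.
p = 1.0 / (1.0 + np.exp(-(1.2 * theta - 0.6 * group)))
y = (rng.random(n) < p).astype(float)

# Matching score, group, and their interaction as predictors.
X = sm.add_constant(np.column_stack([total, group, total * group]))
fit = sm.Logit(y, X).fit(disp=0)
print(fit.params)  # clearly negative group coefficient signals uniform DIF
```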
Lanrong Li – ProQuest LLC, 2021
When developing a test, it is essential to ensure that the test is free of items with differential item functioning (DIF). DIF occurs when examinees of equal ability, but from different examinee subgroups, have different chances of getting the item correct. According to the multidimensional perspective, DIF occurs because the test measures more…
Descriptors: Test Bias, Test Items, Meta Analysis, Effect Size