Publication Date
In 2025: 0
Since 2024: 1
Since 2021 (last 5 years): 7
Since 2016 (last 10 years): 25
Since 2006 (last 20 years): 79
Descriptor
Computation: 79
Test Bias: 79
Item Response Theory: 37
Test Items: 32
Simulation: 25
Statistical Analysis: 25
Comparative Analysis: 19
Models: 19
Scores: 17
Sample Size: 15
Evaluation Methods: 14
Author
Dorans, Neil J.: 6
Sinharay, Sandip: 3
Strobl, Carolin: 3
Blew, Edwin O.: 2
Cai, Li: 2
Cho, Sun-Joo: 2
De Boeck, Paul: 2
Dimitrov, Dimiter M.: 2
Goldhaber, Dan: 2
Grant, Mary C.: 2
Guo, Hongwen: 2
Publication Type
Journal Articles: 75
Reports - Research: 60
Reports - Evaluative: 12
Reports - Descriptive: 6
Opinion Papers: 1
Education Level
Elementary Education: 8
Secondary Education: 7
Elementary Secondary Education: 5
Middle Schools: 5
Grade 4: 4
Grade 3: 3
Grade 5: 3
Intermediate Grades: 3
Junior High Schools: 3
Early Childhood Education: 2
Grade 8: 2
Dimitrov, Dimiter M.; Atanasov, Dimitar V. – Educational and Psychological Measurement, 2022
This study offers an approach to testing for differential item functioning (DIF) in a recently developed measurement framework, referred to as "D"-scoring method (DSM). Under the proposed approach, called "P-Z" method of testing for DIF, the item response functions of two groups (reference and focal) are compared by…
Descriptors: Test Bias, Methods, Test Items, Scoring
Finch, W. Holmes – Educational and Psychological Measurement, 2023
Psychometricians have devoted much research and attention to categorical item responses, leading to the development and widespread use of item response theory for the estimation of model parameters and identification of items that do not perform in the same way for examinees from different population subgroups (e.g., differential item functioning…
Descriptors: Test Bias, Item Response Theory, Computation, Methods
Chenchen Ma; Jing Ouyang; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Survey instruments and assessments are frequently used in many domains of social science. When the constructs that these assessments try to measure become multifaceted, multidimensional item response theory (MIRT) provides a unified framework and convenient statistical tool for item analysis, calibration, and scoring. However, the computational…
Descriptors: Algorithms, Item Response Theory, Scoring, Accuracy
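For orientation, the compensatory multidimensional 2PL model at the core of the MIRT framework mentioned above can be written as P(X_ij = 1) = logistic(a_j · theta_i + d_j). The following is a minimal sketch of that response function only; the function name, array shapes, and example values are assumptions of this illustration, not the estimation algorithm studied in the submission.

```python
import numpy as np

def mirt_2pl_prob(theta, slopes, intercepts):
    """Compensatory multidimensional 2PL response probabilities.

    theta      : (n_persons, n_dims) latent trait values
    slopes     : (n_items, n_dims) discrimination parameters
    intercepts : (n_items,) intercept parameters

    Returns an (n_persons, n_items) matrix of P(X_ij = 1 | theta_i).
    """
    logits = theta @ slopes.T + intercepts      # linear compensatory predictor
    return 1.0 / (1.0 + np.exp(-logits))

# Two examinees on two latent dimensions, three items (illustrative values only)
theta = np.array([[0.5, -0.2], [1.0, 0.3]])
slopes = np.array([[1.2, 0.0], [0.8, 0.9], [0.0, 1.5]])
intercepts = np.array([0.1, -0.4, 0.2])
print(mirt_2pl_prob(theta, slopes, intercepts))
```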
Quinn, David M.; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2021
The estimation of test score "gaps" and gap trends plays an important role in monitoring educational inequality. Researchers decompose gaps and gap changes into within- and between-school portions to generate evidence on the role schools play in shaping these inequalities. However, existing decomposition methods assume an equal-interval…
Descriptors: Scores, Tests, Achievement Gap, Equal Education
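As background for the within- and between-school decomposition described above, the sketch below implements one textbook-style additive identity for splitting an overall mean score gap: the within part weights school-level gaps by group A's enrollment shares, and the between part captures how differently the two groups are distributed across schools. The weighting choice is an assumption of this illustration, not necessarily the decomposition Quinn and Ho analyze.

```python
import numpy as np

def decompose_gap(mean_a, mean_b, n_a, n_b):
    """Split the overall A-minus-B mean score gap into a within-school part
    and a between-school (sorting) part.  All arrays are indexed by school."""
    w_a = n_a / n_a.sum()                     # group A's enrollment share per school
    w_b = n_b / n_b.sum()                     # group B's enrollment share per school
    within = np.sum(w_a * (mean_a - mean_b))  # school-level gaps, A-weighted
    between = np.sum((w_a - w_b) * mean_b)    # differential sorting across schools
    total = np.sum(w_a * mean_a) - np.sum(w_b * mean_b)
    return within, between, total             # within + between equals total

# Three schools with illustrative subgroup means and counts
mean_a = np.array([520.0, 505.0, 480.0])
mean_b = np.array([500.0, 495.0, 470.0])
n_a = np.array([120, 80, 40])
n_b = np.array([40, 80, 120])
print(decompose_gap(mean_a, mean_b, n_a, n_b))   # (15.0, 10.0, 25.0)
```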
Gu, Zhengguo; Emons, Wilco H. M.; Sijtsma, Klaas – Journal of Educational and Behavioral Statistics, 2021
Clinical, medical, and health psychologists use difference scores obtained from pretest-posttest designs employing the same test to assess intraindividual change possibly caused by an intervention addressing, for example, anxiety, depression, eating disorder, or addiction. Reliability of difference scores is important for interpreting observed…
Descriptors: Test Reliability, Scores, Pretests Posttests, Computation
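For context, the classical test theory reliability of a difference score D = X2 - X1 is (var1*rel1 + var2*rel2 - 2*r12*sd1*sd2) / (var1 + var2 - 2*r12*sd1*sd2). A minimal sketch of that formula follows; the function name and example values are illustrative only, not results from the study above.

```python
def difference_score_reliability(var1, var2, rel1, rel2, r12):
    """Classical test theory reliability of the difference score D = X2 - X1.

    var1, var2 : observed-score variances of pretest and posttest
    rel1, rel2 : reliabilities of pretest and posttest
    r12        : correlation between pretest and posttest scores
    """
    sd1, sd2 = var1 ** 0.5, var2 ** 0.5
    true_part = var1 * rel1 + var2 * rel2 - 2 * r12 * sd1 * sd2
    observed_part = var1 + var2 - 2 * r12 * sd1 * sd2
    return true_part / observed_part

# Two equally reliable tests (0.85) with equal variances, correlated 0.70:
# the difference score is far less reliable than either test.
print(difference_score_reliability(100.0, 100.0, 0.85, 0.85, 0.70))  # 0.5
```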
Ramsay, James; Wiberg, Marie; Li, Juan – Journal of Educational and Behavioral Statistics, 2020
Ramsay and Wiberg used a new version of item response theory that represents test performance over nonnegative closed intervals such as [0, 100] or [0, n] and demonstrated that optimal scoring of binary test data yielded substantial improvements in point-wise root-mean-squared error and bias over number right or sum scoring. We extend these…
Descriptors: Scoring, Weighted Scores, Item Response Theory, Intervals
Liu, Ivy; Suesse, Thomas; Harvey, Samuel; Gu, Peter Yongqi; Fernández, Daniel; Randal, John – Educational and Psychological Measurement, 2023
The Mantel-Haenszel estimator is one of the most popular techniques for measuring differential item functioning (DIF). A generalization of this estimator is applied to the context of DIF to compare items by taking the covariance of odds ratio estimators between dependent items into account. Unlike item response theory, the method does not rely…
Descriptors: Test Bias, Computation, Statistical Analysis, Achievement Tests
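For reference, the standard (single-item, non-generalized) Mantel-Haenszel common odds ratio and its ETS delta-scale transformation look like the sketch below. This is the baseline estimator the abstract builds on, not the covariance-aware generalization the authors propose, and the function and argument names are assumptions of this illustration.

```python
import numpy as np

def mantel_haenszel_dif(ref_correct, ref_wrong, foc_correct, foc_wrong):
    """Mantel-Haenszel common odds ratio for one studied item.

    Each argument is an array over matched total-score levels holding the
    counts of reference/focal examinees answering correctly/incorrectly.
    Returns (alpha_MH, MH D-DIF on the ETS delta scale).
    """
    total = ref_correct + ref_wrong + foc_correct + foc_wrong
    alpha = np.sum(ref_correct * foc_wrong / total) / np.sum(ref_wrong * foc_correct / total)
    mh_d_dif = -2.35 * np.log(alpha)   # negative values flag items harder for the focal group
    return alpha, mh_d_dif

# Three matched score levels with illustrative counts
alpha, d_dif = mantel_haenszel_dif(
    ref_correct=np.array([30.0, 50.0, 70.0]),
    ref_wrong=np.array([70.0, 50.0, 30.0]),
    foc_correct=np.array([20.0, 40.0, 60.0]),
    foc_wrong=np.array([80.0, 60.0, 40.0]),
)
print(alpha, d_dif)
```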
Diaz, Emily; Brooks, Gordon; Johanson, George – International Journal of Assessment Tools in Education, 2021
This Monte Carlo study assessed Type I error in differential item functioning analyses using Lord's chi-square (LC), the likelihood ratio test (LRT), and the Mantel-Haenszel (MH) procedure. Two research interests were investigated: item response theory (IRT) model specification in LC and the LRT, and continuity correction in the MH procedure. This study…
Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Comparative Analysis
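The continuity correction examined for the MH procedure enters the Mantel-Haenszel chi-square statistic roughly as in the generic textbook sketch below (the 0.5 subtraction); the names and the scipy dependency are assumptions of this illustration, not the study's simulation code.

```python
import numpy as np
from scipy.stats import chi2

def mh_chi_square(ref_correct, ref_wrong, foc_correct, foc_wrong, correction=True):
    """Mantel-Haenszel chi-square for DIF with an optional continuity correction.

    Arguments are arrays of counts over matched total-score levels."""
    n_ref = ref_correct + ref_wrong
    n_foc = foc_correct + foc_wrong
    m_right = ref_correct + foc_correct
    m_wrong = ref_wrong + foc_wrong
    total = n_ref + n_foc

    expected = n_ref * m_right / total
    variance = n_ref * n_foc * m_right * m_wrong / (total ** 2 * (total - 1))

    diff = abs(ref_correct.sum() - expected.sum())
    if correction:
        diff = max(diff - 0.5, 0.0)          # the continuity correction under study
    stat = diff ** 2 / variance.sum()
    return stat, chi2.sf(stat, df=1)         # statistic and its 1-df p value
```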
Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2019
We derive formulas for the differential item functioning (DIF) measures that two routinely used DIF statistics are designed to estimate. The DIF measures that match on observed scores are compared to DIF measures based on an unobserved ability (theta or true score) for items that are described by either the one-parameter logistic (1PL) or…
Descriptors: Scores, Test Bias, Statistical Analysis, Item Response Theory
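As a generic point of comparison for the theta-based DIF measures discussed in this report, the sketch below averages the difference between reference- and focal-group 2PL item response functions over an assumed normal focal-group ability distribution. It illustrates the general idea only, not the specific measures Guo and Dorans derive.

```python
import numpy as np

def irf_2pl(theta, a, b):
    """Two-parameter logistic item response function."""
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))

def theta_based_dif(a_ref, b_ref, a_foc, b_foc, mu_foc=0.0, sd_foc=1.0, n_grid=201):
    """Signed gap between group-specific IRFs, averaged over a normal
    focal-group ability distribution (simple grid quadrature)."""
    theta = np.linspace(mu_foc - 4 * sd_foc, mu_foc + 4 * sd_foc, n_grid)
    weights = np.exp(-0.5 * ((theta - mu_foc) / sd_foc) ** 2)
    weights /= weights.sum()
    gap = irf_2pl(theta, a_ref, b_ref) - irf_2pl(theta, a_foc, b_foc)
    return float(np.sum(weights * gap))

# Same discrimination, focal item 0.3 logits harder: positive theta-based DIF
print(theta_based_dif(a_ref=1.0, b_ref=0.0, a_foc=1.0, b_foc=0.3))
```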
Lee, Hyung Rock; Lee, Sunbok; Sung, Jaeyun – International Journal of Assessment Tools in Education, 2019
Applying single-level statistical models to multilevel data typically produces underestimated standard errors, which may result in misleading conclusions. This study examined the impact of ignoring multilevel data structure on the estimation of item parameters and their standard errors for the Rasch, two-, and three-parameter logistic models in…
Descriptors: Item Response Theory, Computation, Error of Measurement, Test Bias
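The standard-error underestimation described above is often summarized with the classical Kish design effect, DEFF = 1 + (m - 1) * ICC, where m is the cluster (e.g., school) size and ICC is the intraclass correlation; a naive single-level standard error is then roughly too small by a factor of sqrt(DEFF). The snippet below is that heuristic with made-up numbers, not the article's simulation design.

```python
def design_effect(cluster_size, icc):
    """Kish design effect: how much clustering inflates sampling variance
    relative to a simple random sample of the same size."""
    return 1.0 + (cluster_size - 1.0) * icc

naive_se = 0.05                                   # SE from a single-level model (illustrative)
deff = design_effect(cluster_size=25, icc=0.10)   # 25 students per school, ICC = .10
print(deff)                                       # 3.4
print(naive_se * deff ** 0.5)                     # ~0.092: the corrected SE nearly doubles
```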
Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2019
M-fluctuation tests are a recently proposed method for detecting differential item functioning in Rasch models. This article discusses a generalization of this method to two additional item response theory models: the two-parameter logistic model and the three-parameter logistic model with a common guessing parameter. The Type I error rate and…
Descriptors: Test Bias, Item Response Theory, Statistical Analysis, Maximum Likelihood Statistics
Komboz, Basil; Strobl, Carolin; Zeileis, Achim – Educational and Psychological Measurement, 2018
Psychometric measurement models are only valid if measurement invariance holds between test takers of different groups. Global model tests, such as the well-established likelihood ratio (LR) test, are sensitive to violations of measurement invariance, such as differential item functioning and differential step functioning. However, these…
Descriptors: Item Response Theory, Models, Tests, Measurement
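A minimal sketch of the global likelihood ratio test mentioned above, assuming the maximized log-likelihoods have already been obtained from some IRT fitting routine: the constrained fit forces item parameters to be equal across groups, the unconstrained fits let each group have its own parameters, and the statistic is referred to a chi-square distribution. The numbers below are purely illustrative.

```python
from scipy.stats import chi2

def likelihood_ratio_test(ll_constrained, ll_by_group, df_diff):
    """Global LR test of measurement invariance.

    ll_constrained : maximized log-likelihood with item parameters equal across groups
    ll_by_group    : maximized log-likelihoods from fitting each group separately
    df_diff        : number of extra free parameters in the group-specific fits
    """
    lr = 2.0 * (sum(ll_by_group) - ll_constrained)
    return lr, chi2.sf(lr, df=df_diff)

stat, p = likelihood_ratio_test(ll_constrained=-10450.0,
                                ll_by_group=[-5200.0, -5230.0],
                                df_diff=20)
print(stat, p)   # 40.0 and its p value
```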
Sachse, Karoline A.; Mahler, Nicole; Pohl, Steffi – Educational and Psychological Measurement, 2019
Mechanisms causing item nonresponses in large-scale assessments are often said to be nonignorable. Parameter estimates can be biased if nonignorable missing data mechanisms are not adequately modeled. In trend analyses, it is plausible for the missing data mechanism and the percentage of missing values to change over time. In this article, we…
Descriptors: International Assessment, Response Style (Tests), Achievement Tests, Foreign Countries
Sachse, Karoline A.; Haag, Nicole – Applied Measurement in Education, 2017
Standard errors computed according to the operational practices of international large-scale assessment studies such as the Programme for International Student Assessment (PISA) or the Trends in International Mathematics and Science Study (TIMSS) may be biased when cross-national differential item functioning (DIF) and item parameter drift are…
Descriptors: Error of Measurement, Test Bias, International Assessment, Computation
Cho, Sun-Joo; Suh, Youngsuk; Lee, Woo-yeol – Educational Measurement: Issues and Practice, 2016
The purpose of this ITEMS module is to provide an introduction to differential item functioning (DIF) analysis using mixture item response models. The mixture item response models for DIF analysis involve comparing item profiles across latent groups, instead of manifest groups. First, an overview of DIF analysis based on latent groups, called…
Descriptors: Test Bias, Research Methodology, Evaluation Methods, Models
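To give a flavor of the latent-group idea in this module, the sketch below computes marginal correct-response probabilities under a heavily simplified two-class mixture Rasch model in which item difficulty profiles differ by latent class. Real mixture IRT models also treat class membership and class-specific ability distributions as estimated quantities, so the function, parameter names, and values here are assumptions of this illustration, not the module's models.

```python
import numpy as np

def mixture_rasch_prob(theta, difficulty_by_class, class_weights):
    """Marginal P(correct) under a mixture Rasch model with class-specific
    item difficulties (the "item profiles" compared across latent groups).

    theta               : (n_persons,) ability values
    difficulty_by_class : (n_classes, n_items) item difficulties
    class_weights       : (n_classes,) mixing proportions summing to 1
    """
    logits = theta[None, :, None] - difficulty_by_class[:, None, :]   # class x person x item
    class_probs = 1.0 / (1.0 + np.exp(-logits))
    return np.tensordot(class_weights, class_probs, axes=1)           # person x item

# Two latent classes whose difficulty profiles diverge on the third item
theta = np.array([-0.5, 0.0, 1.0])
difficulty = np.array([[0.0, 0.5, -0.3],
                       [0.0, 0.5, 1.2]])
print(mixture_rasch_prob(theta, difficulty, np.array([0.6, 0.4])))
```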