ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	12

Descriptor

Simulation	13
Test Bias	13
Testing	13
Item Response Theory	9
Test Items	6
Error Patterns	4
Evaluation Methods	4
Measurement	4
Comparative Analysis	3
Error of Measurement	3
Computer Software	2
Effect Size	2
Maximum Likelihood Statistics	2
Measurement Techniques	2
Models	2
Multiple Regression Analysis	2
Psychometrics	2
Questionnaires	2
Regression (Statistics)	2
Sample Size	2
Statistical Analysis	2
Statistical Bias	2
Test Results	2
Tests	2
Achievement Tests	1
More ▼

Source

Applied Psychological…	3
Educational and Psychological…	2
International Journal of…	2
Journal of Educational…	2
ProQuest LLC	2
Language Testing	1
Multivariate Behavioral…	1

Author

Woods, Carol M.	4
Kim, Jihye	2
Oshima, T. C.	2
Bolt, Daniel M.	1
Brown, Richard S.	1
Gierl, Mark J.	1
Grimm, Kevin J.	1
Guler, Nese	1
Henning, Grant	1
Kim, Eun Sook	1
Lee, Taehun	1
Penfield, Randall D.	1
Villarreal, Julio C.	1
White, Nick	1
Wright, Keith	1
Wright, Keith D.	1
Yoon, Myeongsun	1
More ▼

Publication Type

Journal Articles	11
Reports - Evaluative	6
Reports - Research	5
Dissertations/Theses -…	2

Education Level

Elementary Secondary Education

Audience

Practitioners

Location

North Carolina

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Multiple-Group Noncompensatory Differential Item Functioning in Raju's Differential Functioning of Items and Tests

Peer reviewed

Direct link

Oshima, T. C.; Wright, Keith; White, Nick – International Journal of Testing, 2015

Raju, van der Linden, and Fleer (1995) introduced a framework for differential functioning of items and tests (DFIT) for unidimensional dichotomous models. Since then, DFIT has been shown to be a quite versatile framework as it can handle polytomous as well as multidimensional models both at the item and test levels. However, DFIT is still limited…

Descriptors: Test Bias, Item Response Theory, Test Items, Simulation

Effect of Multiple Testing Adjustment in Differential Item Functioning Detection

Peer reviewed

Direct link

Kim, Jihye; Oshima, T. C. – Educational and Psychological Measurement, 2013

In a typical differential item functioning (DIF) analysis, a significance test is conducted for each item. As a test consists of multiple items, such multiple testing may increase the possibility of making a Type I error at least once. The goal of this study was to investigate how to control a Type I error rate and power using adjustment…

Descriptors: Test Bias, Test Items, Statistical Analysis, Error of Measurement

Testing Measurement Invariance Using MIMIC: Likelihood Ratio Test with a Critical Value Adjustment

Peer reviewed

Direct link

Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012

Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…

Descriptors: Test Items, Simulation, Testing, Statistical Analysis

Testing for Nonuniform Differential Item Functioning with Multiple Indicator Multiple Cause Models

Peer reviewed

Direct link

Woods, Carol M.; Grimm, Kevin J. – Applied Psychological Measurement, 2011

In extant literature, multiple indicator multiple cause (MIMIC) models have been presented for identifying items that display uniform differential item functioning (DIF) only, not nonuniform DIF. This article addresses, for apparently the first time, the use of MIMIC models for testing both uniform and nonuniform DIF with categorical indicators. A…

Descriptors: Test Bias, Testing, Interaction, Item Response Theory

A Comparison of the Logistic Regression and Contingency Table Methods for Simultaneous Detection of Uniform and Nonuniform DIF

Peer reviewed

Direct link

Guler, Nese; Penfield, Randall D. – Journal of Educational Measurement, 2009

In this study, we investigate the logistic regression (LR), Mantel-Haenszel (MH), and Breslow-Day (BD) procedures for the simultaneous detection of both uniform and nonuniform differential item functioning (DIF). A simulation study was used to assess and compare the Type I error rate and power of a combined decision rule (CDR), which assesses DIF…

Descriptors: Test Bias, Simulation, Test Items, Measurement

Ramsay-Curve Differential Item Functioning

Peer reviewed

Direct link

Woods, Carol M. – Applied Psychological Measurement, 2011

Differential item functioning (DIF) occurs when an item on a test, questionnaire, or interview has different measurement properties for one group of people versus another, irrespective of true group-mean differences on the constructs being measured. This article is focused on item response theory based likelihood ratio testing for DIF (IRT-LR or…

Descriptors: Simulation, Item Response Theory, Testing, Questionnaires

Controlling Type I Error Rate in Evaluating Differential Item Functioning for Four DIF Methods: Use of Three Procedures for Adjustment of Multiple Item Testing

Direct link

Kim, Jihye – ProQuest LLC, 2010

In DIF studies, a Type I error refers to the mistake of identifying non-DIF items as DIF items, and a Type I error rate refers to the proportion of Type I errors in a simulation study. The possibility of making a Type I error in DIF studies is always present and high possibility of making such an error can weaken the validity of the assessment.…

Descriptors: Test Bias, Test Length, Simulation, Testing

Improvements for Differential Functioning of Items and Tests (DFIT): Investigating the Addition of Reporting an Effect Size Measure and Power

Direct link

Wright, Keith D. – ProQuest LLC, 2011

Standardized testing has been part of the American educational system for decades. Controversy from the beginning has plagued standardized testing, is plaguing testing today, and will continue to be controversial. Given the current federal educational policies supporting increased standardized testing, psychometricians, educators and policy makers…

Descriptors: Test Bias, Test Items, Simulation, Testing

Empirical Selection of Anchors for Tests of Differential Item Functioning

Peer reviewed

Direct link

Woods, Carol M. – Applied Psychological Measurement, 2009

Differential item functioning (DIF) occurs when items on a test or questionnaire have different measurement properties for one group of people versus another, irrespective of group-mean differences on the construct. Methods for testing DIF require matching members of different groups on an estimate of the construct. Preferably, the estimate is…

Descriptors: Test Results, Testing, Item Response Theory, Test Bias

Evaluation of MIMIC-Model Methods for DIF Testing with Comparison to Two-Group Analysis

Peer reviewed

Direct link

Woods, Carol M. – Multivariate Behavioral Research, 2009

Differential item functioning (DIF) occurs when an item on a test or questionnaire has different measurement properties for 1 group of people versus another, irrespective of mean differences on the construct. This study focuses on the use of multiple-indicator multiple-cause (MIMIC) structural equation models for DIF testing, parameterized as item…

Descriptors: Test Bias, Structural Equation Models, Item Response Theory, Testing

Correcting for Person Misfit in Aggregated Score Reporting

Peer reviewed

Direct link

Brown, Richard S.; Villarreal, Julio C. – International Journal of Testing, 2007

There has been considerable research regarding the extent to which psychometric sound assessments sometimes yield individual score estimates that are inconsistent with the response patterns of the individual. It has been suggested that individual response patterns may differ from expectations for a number of reasons, including subject motivation,…

Descriptors: Psychometrics, Test Bias, Testing, Simulation

Testing Features of Graphical DIF: Application of a Regression Correction to Three Nonparametric Statistical Tests

Peer reviewed

Direct link

Bolt, Daniel M.; Gierl, Mark J. – Journal of Educational Measurement, 2006

Inspection of differential item functioning (DIF) in translated test items can be informed by graphical comparisons of item response functions (IRFs) across translated forms. Due to the many forms of DIF that can emerge in such analyses, it is important to develop statistical tests that can confirm various characteristics of DIF when present.…

Descriptors: Regression (Statistics), Tests, Test Bias, Test Items

Accounting for Nonsystematic Error in Performance Ratings.

Peer reviewed

Henning, Grant – Language Testing, 1996

Analyzes simulated performance ratings on a six-point scale by two independent raters to account for nonsystematic error in performance ratings. Results suggest that rater agreement or covariance is not always a dependable estimate of score reliability and that the practice of seeking additional raters for adjudication of discrepant ratings is not…

Descriptors: Correlation, Error Patterns, Interrater Reliability, Language Tests