ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	26

Descriptor

Comparative Analysis	31
Models	31
Test Bias	31
Test Items	18
Item Response Theory	11
Scores	10
Statistical Analysis	9
Computation	8
Monte Carlo Methods	8
Simulation	7
Sample Size	6
Correlation	5
Evaluation Methods	5
Tests	5
Difficulty Level	4
Foreign Countries	4
Psychometrics	4
Ability	3
Accuracy	3
Error of Measurement	3
Goodness of Fit	3
Mathematics Tests	3
Maximum Likelihood Statistics	3
Probability	3
Regression (Statistics)	3

Source

Journal of Educational…	5
Educational and Psychological…	4
ProQuest LLC	4
ETS Research Report Series	2
International Journal of…	2
AERA Open	1
Advances in Health Sciences…	1
Applied Psychological…	1
Educational Measurement:…	1
Educational Sciences: Theory…	1
Hacettepe University Journal…	1
Journal of Educational…	1
Journal of Educational and…	1
Journal of Experimental…	1
Journal of Special Education	1
Large-scale Assessments in…	1
Policy Analysis for…	1
Structural Equation Modeling:…	1

Publication Type

Journal Articles	23
Reports - Research	21
Dissertations/Theses -…	4
Reports - Descriptive	2
Reports - Evaluative	2

Education Level

Higher Education	5
Elementary Secondary Education	2
Postsecondary Education	2
Elementary Education	1

Location

California	1
California (Fresno)	1
California (Long Beach)	1
California (Los Angeles)	1
California (Oakland)	1
California (Sacramento)	1
California (San Francisco)	1
California (Santa Ana)	1
Canada	1
Colombia	1
Turkey	1

Laws, Policies, & Programs

Defunis v Odegaard

Assessments and Surveys

Law School Admission Test	2
Program for International…	1
Trends in International…	1

Showing 1 to 15 of 31 results

Tree-Based Global Model Tests for Polytomous Rasch Models

Peer reviewed

Komboz, Basil; Strobl, Carolin; Zeileis, Achim – Educational and Psychological Measurement, 2018

Psychometric measurement models are only valid if measurement invariance holds between test takers of different groups. Global model tests, such as the well-established likelihood ratio (LR) test, are sensitive to violations of measurement invariance, such as differential item functioning and differential step functioning. However, these…

Descriptors: Item Response Theory, Models, Tests, Measurement

Examining Power and Type 1 Error for Step and Item Level Tests of Invariance: Investigating the Effect of the Number of Item Score Levels

Ayodele, Alicia Nicole – ProQuest LLC, 2017

Within polytomous items, differential item functioning (DIF) can take on various forms due to the number of response categories. The lack of invariance at this level is referred to as differential step functioning (DSF). The most common DSF methods in the literature are the adjacent category log odds ratio (AC-LOR) estimator and cumulative…

Descriptors: Statistical Analysis, Test Bias, Test Items, Scores

An NCME Instructional Module on Latent DIF Analysis Using Mixture Item Response Models

Peer reviewed

Cho, Sun-Joo; Suh, Youngsuk; Lee, Woo-yeol – Educational Measurement: Issues and Practice, 2016

The purpose of this ITEMS module is to provide an introduction to differential item functioning (DIF) analysis using mixture item response models. The mixture item response models for DIF analysis involve comparing item profiles across latent groups, instead of manifest groups. First, an overview of DIF analysis based on latent groups, called…

Descriptors: Test Bias, Research Methodology, Evaluation Methods, Models

Measuring Students' Social-Emotional Learning among California's CORE Districts: An IRT Modeling Approach. Working Paper

Meyer, Robert H.; Wang, Caroline; Rice, Andrew B. – Policy Analysis for California Education, PACE, 2018

With an increased appreciation of students' social-emotional skills among researchers and policy makers, many states and school districts are moving toward a systematic process to measure Social-Emotional Learning (SEL). In this study, we examine the measurement properties of California's CORE Districts' SEL survey administered to over 400,000…

Descriptors: Social Development, Emotional Development, Item Response Theory, Models

Hidden Item Variance in Multiple Mini-Interview Scores

Peer reviewed

Zaidi, Nikki L.; Swoboda, Christopher M.; Kelcey, Benjamin M.; Manuel, R. Stephen – Advances in Health Sciences Education, 2017

The extant literature has largely ignored a potentially significant source of variance in multiple mini-interview (MMI) scores by "hiding" the variance attributable to the sample of attributes used on an evaluation form. This potential source of hidden variance can be defined as rating items, which typically comprise an MMI evaluation…

Descriptors: Interviews, Scores, Generalizability Theory, Monte Carlo Methods

The Impact of Model Parameterization and Estimation Methods on Tests of Measurement Invariance with Ordered Polytomous Data

Peer reviewed

Koziol, Natalie A.; Bovaird, James A. – Educational and Psychological Measurement, 2018

Evaluations of measurement invariance provide essential construct validity evidence--a prerequisite for seeking meaning in psychological and educational research and ensuring fair testing procedures in high-stakes settings. However, the quality of such evidence is partly dependent on the validity of the resulting statistical conclusions. Type I or…

Descriptors: Computation, Tests, Error of Measurement, Comparative Analysis

Measuring Student Learning in Technical Programs: A Case Study from Colombia

Peer reviewed

Domingue, Benjamin W.; Lang, David; Cuevas, Martha; Castellanos, Melisa; Lopera, Carolina; Mariño, Julián P.; Molina, Adriana; Shavelson, Richard J. – AERA Open, 2017

Technical schools are an integral part of the education system, and yet, little is known about student learning at such institutions. We consider whether assessments of student learning can be jointly administered to both university and technical school students. We examine whether differential test functioning may bias inferences regarding the…

Descriptors: Academic Achievement, Foreign Countries, Vocational Schools, Test Bias

Anchor Selection Strategies for DIF Analysis: Review, Assessment, and New Approaches

Peer reviewed

Kopf, Julia; Zeileis, Achim; Strobl, Carolin – Educational and Psychological Measurement, 2015

Differential item functioning (DIF) indicates the violation of the invariance assumption, for instance, in models based on item response theory (IRT). For item-wise DIF analysis using IRT, a common metric for the item parameters of the groups that are to be compared (e.g., for the reference and the focal group) is necessary. In the Rasch model,…

Descriptors: Test Items, Equated Scores, Test Bias, Item Response Theory

Rasch Mixture Models for DIF Detection: A Comparison of Old and New Score Specifications

Peer reviewed

Frick, Hannah; Strobl, Carolin; Zeileis, Achim – Educational and Psychological Measurement, 2015

Rasch mixture models can be a useful tool when checking the assumption of measurement invariance for a single Rasch model. They provide advantages compared to manifest differential item functioning (DIF) tests when the DIF groups are only weakly correlated with the manifest covariates available. Unlike in single Rasch models, estimation of Rasch…

Descriptors: Item Response Theory, Test Bias, Comparative Analysis, Scores

An Odds Ratio Approach for Detecting DDF under the Nested Logit Modeling Framework

Peer reviewed

Terzi, Ragip; Suh, Youngsuk – Journal of Educational Measurement, 2015

An odds ratio approach (ORA) under the framework of a nested logit model was proposed for evaluating differential distractor functioning (DDF) in multiple-choice items and was compared with an existing ORA developed under the nominal response model. The performances of the two ORAs for detecting DDF were investigated through an extensive…

Descriptors: Test Bias, Multiple Choice Tests, Test Items, Comparative Analysis

Comparing DIF Methods for Data with Dual Dependency

Peer reviewed

Jin, Ying; Kang, Minsoo – Large-scale Assessments in Education, 2016

Background: The current study compared four differential item functioning (DIF) methods to examine their performances in terms of accounting for dual dependency (i.e., person and item clustering effects) simultaneously by a simulation study, which is not sufficiently studied under the current DIF literature. The four methods compared are logistic…

Descriptors: Comparative Analysis, Test Bias, Simulation, Regression (Statistics)

Complex versus Simple Modeling for Differential Item Functioning (DIF) Detection: When the Intraclass Correlation Coefficient (Rho) of the Studied Item Is Less than the Rho of the Total Score

Jin, Ying – ProQuest LLC, 2013

Previous research has demonstrated that DIF methods that do not account for multilevel data structure could result in too frequent rejection of the null hypothesis (i.e., no DIF) when the intraclass correlation coefficient (ρ) of the studied item was the same as ρ of the total score. The current study extended previous research by comparing the…

Descriptors: Test Bias, Models, Correlation, Test Items

Differential Item Functioning Assessment in Cognitive Diagnostic Modeling: Application of the Wald Test to Investigate DIF in the DINA Model

Peer reviewed

Hou, Likun; de la Torre, Jimmy; Nandakumar, Ratna – Journal of Educational Measurement, 2014

Analyzing examinees' responses using cognitive diagnostic models (CDMs) has the advantage of providing diagnostic information. To ensure the validity of the results from these models, differential item functioning (DIF) in CDMs needs to be investigated. In this article, the Wald test is proposed to examine DIF in the context of CDMs. This study…

Descriptors: Test Bias, Models, Simulation, Error Patterns

Comparing Performances (Type I Error and Power) of IRT Likelihood Ratio SIBTEST and Mantel-Haenszel Methods in the Determination of Differential Item Functioning

Peer reviewed

Atalay Kabasakal, Kübra; Arsan, Nihan; Gök, Bilge; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2014

This simulation study compared the performances (Type I error and power) of Mantel-Haenszel (MH), SIBTEST, and item response theory-likelihood ratio (IRT-LR) methods under certain conditions. Manipulated factors were sample size, ability differences between groups, test length, the percentage of differential item functioning (DIF), and underlying…

Descriptors: Comparative Analysis, Item Response Theory, Statistical Analysis, Test Bias

The MIMIC Model as a Tool for Differential Bundle Functioning Detection

Peer reviewed

Finch, W. Holmes – Applied Psychological Measurement, 2012

Increasingly, researchers interested in identifying potentially biased test items are encouraged to use a confirmatory, rather than exploratory, approach. One such method for confirmatory testing is rooted in differential bundle functioning (DBF), where hypotheses regarding potential differential item functioning (DIF) for sets of items (bundles)…

Descriptors: Test Bias, Test Items, Statistical Analysis, Models

Author

Strobl, Carolin	3
Zeileis, Achim	3
Cho, Sun-Joo	2
Jin, Ying	2
Magis, David	2
Suh, Youngsuk	2
Ariel, Adelaide	1
Arsan, Nihan	1
Atalay Kabasakal, Kübra	1
Atar, Burcu	1
Ayodele, Alicia Nicole	1
Beland, Sebastien	1
Bottge, Brian A.	1
Bovaird, James A.	1
Breland, Hunter M.	1
Castellanos, Melisa	1
Cohen, Allan S.	1
Cuevas, Martha	1
De Boeck, Paul	1
Domingue, Benjamin W.	1
DuVernet, Amy M.	1
Finch, W. Holmes	1
Frazer, William G.	1
Frederickx, Sofie	1