Showing 1 to 15 of 39 results
Peer reviewed
Suh, Youngsuk; Bolt, Daniel M. – Journal of Educational Measurement, 2011
In multiple-choice items, differential item functioning (DIF) in the correct response may or may not be caused by differentially functioning distractors. Identifying distractors as causes of DIF can provide valuable information for potential item revision or the design of new test items. In this paper, we examine a two-step approach based on…
Descriptors: Test Items, Test Bias, Multiple Choice Tests, Simulation
Peer reviewed
Elosua, Paula – Psicologica: International Journal of Methodology and Experimental Psychology, 2011
Assessing measurement equivalence in the framework of the common factor linear models (CFL) is known as factorial invariance. This methodology is used to evaluate the equivalence among the parameters of a measurement model among different groups. However, when dichotomous, Likert, or ordered responses are used, one of the assumptions of the CFL is…
Descriptors: Measurement, Models, Data, Factor Analysis
Peer reviewed
Woods, Carol M.; Grimm, Kevin J. – Applied Psychological Measurement, 2011
In extant literature, multiple indicator multiple cause (MIMIC) models have been presented for identifying items that display uniform differential item functioning (DIF) only, not nonuniform DIF. This article addresses, for apparently the first time, the use of MIMIC models for testing both uniform and nonuniform DIF with categorical indicators. A…
Descriptors: Test Bias, Testing, Interaction, Item Response Theory
Peer reviewed
Shih, Ching-Lin; Wang, Wen-Chung – Applied Psychological Measurement, 2009
The multiple indicators, multiple causes (MIMIC) method with a pure short anchor was proposed to detect differential item functioning (DIF). A simulation study showed that the MIMIC method with an anchor of 1, 2, 4, or 10 DIF-free items yielded a well-controlled Type I error rate even when such tests contained as many as 40% DIF items. In general,…
Descriptors: Test Bias, Simulation, Methods, Factor Analysis
Peer reviewed
Carvajal, Jorge; Skorupski, William P. – Educational and Psychological Measurement, 2010
This study is an evaluation of the behavior of the Liu-Agresti estimator of the cumulative common odds ratio when identifying differential item functioning (DIF) with polytomously scored test items using small samples. The Liu-Agresti estimator has been proposed by Penfield and Algina as a promising approach for the study of polytomous DIF but no…
Descriptors: Test Bias, Sample Size, Test Items, Computation
Peer reviewed
Lee, Young-Sun; Cohen, Allan; Toro, Maritsa – Asia Pacific Education Review, 2009
In this study, the effectiveness of detecting differential item functioning (DIF) and testlet DIF using SIBTEST and Poly-SIBTEST was examined in tests composed of testlets. An example using data from a reading comprehension test showed that results from SIBTEST and Poly-SIBTEST were not completely consistent in the detection of DIF and testlet…
Descriptors: Test Bias, Reading Comprehension, Simulation, Reading Tests
Peer reviewed
Guler, Nese; Penfield, Randall D. – Journal of Educational Measurement, 2009
In this study, we investigate the logistic regression (LR), Mantel-Haenszel (MH), and Breslow-Day (BD) procedures for the simultaneous detection of both uniform and nonuniform differential item functioning (DIF). A simulation study was used to assess and compare the Type I error rate and power of a combined decision rule (CDR), which assesses DIF…
Descriptors: Test Bias, Simulation, Test Items, Measurement
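The logistic regression screen examined by Guler and Penfield rests on comparisons of nested models, with a group main effect indicating uniform DIF and a score-by-group interaction indicating nonuniform DIF. A minimal sketch of that idea follows; it assumes Python with statsmodels and uses hypothetical function and variable names, and it is not the authors' implementation or their combined decision rule.

```python
# Sketch of logistic-regression DIF screening for one dichotomous item.
# Nested models are compared with likelihood-ratio chi-square tests.
import numpy as np
import statsmodels.api as sm
from scipy import stats

def lr_dif_test(item, total, group):
    """item: 0/1 responses; total: matching score; group: 0 = reference, 1 = focal."""
    base = sm.Logit(item, sm.add_constant(np.column_stack([total]))).fit(disp=0)
    unif = sm.Logit(item, sm.add_constant(np.column_stack([total, group]))).fit(disp=0)
    full = sm.Logit(item, sm.add_constant(
        np.column_stack([total, group, total * group]))).fit(disp=0)

    # Uniform DIF: adding the group main effect to the matching variable.
    g2_uniform = 2 * (unif.llf - base.llf)
    # Nonuniform DIF: adding the score-by-group interaction on top of that.
    g2_nonuniform = 2 * (full.llf - unif.llf)
    return stats.chi2.sf(g2_uniform, df=1), stats.chi2.sf(g2_nonuniform, df=1)
```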
Peer reviewed
Lopez Rivas, Gabriel E.; Stark, Stephen; Chernyshenko, Oleksandr S. – Applied Psychological Measurement, 2009
The purpose of this simulation study is to investigate the effects of anchor subtest composition on the accuracy of item response theory (IRT) likelihood ratio (LR) differential item functioning (DIF) detection (Thissen, Steinberg, & Wainer, 1988). Here, the IRT LR test was implemented with a free baseline approach wherein a baseline model was…
Descriptors: Simulation, Item Response Theory, Test Bias, Test Items
Baker, Eva L. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2010
This report provides an overview of what was known about alternative assessment at the time that the article was written in 1991. Topics include beliefs about assessment reform, overview of alternative assessment including research knowledge, evidence of assessment impact, and critical features of alternative assessment. The author notes that in…
Descriptors: Alternative Assessment, Evaluation Methods, Evaluation Research, Performance Based Assessment
Peer reviewed
Weitzman, R. A. – Educational and Psychological Measurement, 2009
Building on the Kelley and Gulliksen versions of classical test theory, this article shows that a logistic model having only a single item parameter can account for varying item discrimination, as well as difficulty, by using item-test correlations to adjust incorrect-correct (0-1) item responses prior to an initial model fit. The fit occurs…
Descriptors: Item Response Theory, Test Items, Difficulty Level, Test Bias
Peer reviewed
Puhan, Gautam; Moses, Timothy P.; Yu, Lei; Dorans, Neil J. – Journal of Educational Measurement, 2009
This study examined the extent to which log-linear smoothing could improve the accuracy of differential item functioning (DIF) estimates in small samples of examinees. Examinee responses from a certification test were analyzed using White examinees in the reference group and African American examinees in the focal group. Using a simulation…
Descriptors: Test Items, Reference Groups, Testing Programs, Raw Scores
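Log-linear smoothing of the kind Puhan et al. study replaces observed frequencies with fitted values from a polynomial Poisson model, which reduces sampling noise in small samples while preserving the leading moments of the distribution. The sketch below shows the simplest univariate case, smoothing a raw-score frequency distribution; the degree-3 polynomial, Python, and statsmodels are illustrative assumptions, not the study's software or settings.

```python
# Sketch of polynomial log-linear smoothing of a raw-score frequency distribution.
import numpy as np
import statsmodels.api as sm

def loglinear_smooth(freqs, degree=3):
    """freqs[k] = number of examinees with raw score k; returns smoothed frequencies."""
    scores = np.arange(len(freqs))
    # Design matrix: scores**0 (intercept), scores, scores**2, ..., scores**degree.
    X = np.column_stack([scores ** d for d in range(degree + 1)])
    fit = sm.GLM(freqs, X, family=sm.families.Poisson()).fit()
    # Fitted counts match the first `degree` moments of the observed distribution.
    return fit.fittedvalues
```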
Peer reviewed
Finch, W. Holmes; French, Brian F. – Educational and Psychological Measurement, 2008
A number of statistical methods exist for the detection of differential item functioning (DIF). The performance of DIF methods has been widely studied and generally found to be effective in the detection of both uniform and nonuniform DIF. Anecdotal reports suggest that these techniques may too often incorrectly detect the presence of one type of…
Descriptors: Test Bias, Simulation, Statistical Analysis, Probability
Peer reviewed
Woods, Carol M. – Applied Psychological Measurement, 2009
Differential item functioning (DIF) occurs when items on a test or questionnaire have different measurement properties for one group of people versus another, irrespective of group-mean differences on the construct. Methods for testing DIF require matching members of different groups on an estimate of the construct. Preferably, the estimate is…
Descriptors: Test Results, Testing, Item Response Theory, Test Bias
Peer reviewed
Woods, Carol M. – Multivariate Behavioral Research, 2009
Differential item functioning (DIF) occurs when an item on a test or questionnaire has different measurement properties for 1 group of people versus another, irrespective of mean differences on the construct. This study focuses on the use of multiple-indicator multiple-cause (MIMIC) structural equation models for DIF testing, parameterized as item…
Descriptors: Test Bias, Structural Equation Models, Item Response Theory, Testing
Peer reviewed
Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009
Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…
Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment
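The information similarity index of Wyse and Mapuranga compares item information functions obtained under the Rasch model in two groups. The sketch below computes Rasch item information for hypothetical reference- and focal-group difficulty estimates and summarizes their agreement with a simple overlap ratio; that ratio is a stand-in for illustration and is not necessarily the ISI as the authors define it.

```python
# Sketch: Rasch item information for one item in two groups, plus an overlap summary.
import numpy as np

def rasch_info(theta, b):
    """Rasch item information I(theta) = P(theta) * (1 - P(theta))."""
    p = 1.0 / (1.0 + np.exp(-(theta - b)))
    return p * (1.0 - p)

theta = np.linspace(-4, 4, 401)                # common ability grid
info_ref = rasch_info(theta, b=0.10)           # hypothetical reference-group difficulty
info_foc = rasch_info(theta, b=0.55)           # hypothetical focal-group difficulty

# Overlap of the two information curves on the grid (1.0 = identical curves).
overlap = np.minimum(info_ref, info_foc).sum() / np.maximum(info_ref, info_foc).sum()
print(round(overlap, 3))
```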