Showing 1 to 15 of 23 results
Peer reviewed
Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational Measurement, 2018
In typical differential item functioning (DIF) assessments, an item's DIF status is not influenced by its status in previous test administrations. An item that has shown DIF at multiple administrations may be treated the same way as an item that has shown DIF in only the most recent administration. Therefore, much useful information about the…
Descriptors: Test Bias, Testing, Test Items, Bayesian Statistics
Peer reviewed
Full text available on ERIC
Zwick, Rebecca; Ye, Lei; Isham, Steven – ETS Research Report Series, 2013
Differential item functioning (DIF) analysis is a key component in the evaluation of the fairness and validity of educational tests. Although it is often assumed that refinement of the matching criterion always provides more accurate DIF results, the actual situation proves to be more complex. To explore the effectiveness of refinement, we…
Descriptors: Test Bias, Statistical Analysis, Simulation, Educational Testing
Peer reviewed
Zwick, Rebecca; Ye, Lei; Isham, Steven – Journal of Educational and Behavioral Statistics, 2012
This study demonstrates how the stability of Mantel-Haenszel (MH) DIF (differential item functioning) methods can be improved by integrating information across multiple test administrations using Bayesian updating (BU). The authors conducted a simulation that showed that this approach, which is based on earlier work by Zwick, Thayer, and Lewis,…
Descriptors: Test Bias, Computation, Statistical Analysis, Bayesian Statistics
Peer reviewed
Sinharay, Sandip; Haberman, Shelby J.; Zwick, Rebecca – Measurement: Interdisciplinary Research and Perspectives, 2010
Several researchers (e.g., Klein, Hamilton, McCaffrey, & Stecher, 2000; Koretz & Barron, 1998; Linn, 2000) have asserted that test-based accountability, a crucial component of U.S. education policy, has resulted in score inflation. This inference has relied on comparisons with performance on other tests such as the National Assessment of…
Descriptors: Audits (Verification), Test Items, Scores, Measurement
Peer reviewed
Zwick, Rebecca; Thayer, Dorothy T.; Lewis, Charles – Journal of Educational Measurement, 1999
Developed an empirical Bayes enhancement to Mantel-Haenszel (MH) analysis of differential item functioning (DIF) in which it is assumed that the MH statistics are normally distributed and that the prior distribution of underlying DIF parameters is also normal. (Author/SLD)
Descriptors: Bayesian Statistics, Item Bias, Statistical Distributions, Test Items
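The empirical Bayes model this abstract describes is the conjugate normal-normal setup, which admits a closed-form posterior. Below is a minimal sketch, assuming an observed MH D-DIF statistic y with known sampling variance s2 and a normal prior N(mu, tau2) on the underlying DIF parameter; the function and variable names are illustrative, not taken from the paper.

```python
# Empirical Bayes shrinkage for a normal-normal model (illustrative sketch).
# Assumes an observed MH D-DIF value y with sampling variance s2, and a
# normal prior N(mu, tau2) on the underlying DIF parameter, as in the
# conjugate-normal setup the abstract describes.

def eb_posterior(y: float, s2: float, mu: float, tau2: float) -> tuple[float, float]:
    """Return the posterior mean and variance of the DIF parameter."""
    w = tau2 / (tau2 + s2)            # shrinkage weight on the observed statistic
    post_mean = w * y + (1 - w) * mu  # posterior mean pulls y toward the prior mean
    post_var = (tau2 * s2) / (tau2 + s2)
    return post_mean, post_var

# Example: an item with observed MH D-DIF of -1.2 and standard error 0.5,
# shrunk toward a prior centered at 0 with standard deviation 0.4.
mean, var = eb_posterior(y=-1.2, s2=0.5**2, mu=0.0, tau2=0.4**2)
print(round(mean, 3), round(var, 3))
```

Noisier observations (larger s2) are shrunk more strongly toward the prior mean, which is the stabilizing behavior the abstract refers to.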
Peer reviewed
Zwick, Rebecca; Thayer, Dorothy; Lewis, Charles – Journal of Educational and Behavioral Statistics, 2000
Studied a method for flagging differential item functioning (DIF) based on loss functions. Builds on earlier research that led to the development of an empirical Bayes enhancement to the Mantel-Haenszel DIF analysis. Tested the method through simulation and found its performance better than some commonly used DIF classification systems. (SLD)
Descriptors: Bayesian Statistics, Identification, Item Bias, Simulation
Peer reviewed
Zwick, Rebecca; Thayer, Dorothy T.; Mazzeo, John – Applied Measurement in Education, 1997
Differential item functioning (DIF) assessment procedures for items with more than two ordered score categories, referred to as polytomous items, were evaluated. Three descriptive statistics (standardized mean difference and two procedures based on the SIBTEST computer program) and five inferential procedures were used. Conditions under which the…
Descriptors: Item Bias, Research Methodology, Statistical Inference, Test Construction
Peer reviewed
Zwick, Rebecca – Educational and Psychological Measurement, 1997
Recent simulations have shown that, for a given sample size, the Mantel-Haenszel (MH) variances tend to be larger when items are administered to randomly selected examinees than when they are administered adaptively. Results suggest that adaptive testing may lead to more efficient application of MH differential item functioning analyses. (SLD)
Descriptors: Adaptive Testing, Item Bias, Sample Size, Simulation
Peer reviewed
Rudas, Tamas; Zwick, Rebecca – Journal of Educational and Behavioral Statistics, 1997
The mixture index of fit (T. Rudas et al., 1994) is used to estimate the fraction of a population for which differential item functioning (DIF) occurs, and this approach is compared to the Mantel-Haenszel test of DIF. The proposed noniterative procedure provides information about the portions of the data contributing to DIF. (SLD)
Descriptors: Comparative Analysis, Estimation (Mathematics), Item Bias, Maximum Likelihood Statistics
Peer reviewed
Zwick, Rebecca; Thayer, Dorothy T. – Applied Psychological Measurement, 2002
Used a simulation to investigate the applicability of a differential item functioning (DIF) analysis method to computerized adaptive test data. Results show the performance of this empirical Bayes enhancement of the Mantel-Haenszel DIF analysis method to be quite promising. (SLD)
Descriptors: Adaptive Testing, Bayesian Statistics, Computer Assisted Testing, Item Bias
Zwick, Rebecca – 1994
The Mantel-Haenszel (MH) procedure (Mantel & Haenszel, 1959), as adapted by Holland and Thayer (1988), is a well-established method for assessing differential item functioning (DIF). The formula for the variance of the MH DIF statistic is based on work by Phillips and Holland (1987) and Robins, Breslow, and Greenland (1986). Recent simulation studies showed that the MH variances…
Descriptors: Adaptive Testing, Evaluation Methods, Item Bias, Measurement Techniques
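Several of these entries reference the MH DIF statistic. As background, the ETS MH D-DIF index is a rescaled log of the Mantel-Haenszel common odds ratio, computed from a 2x2 table at each matched score level. The sketch below illustrates that computation; the table layout and names are illustrative, and the variance formula the paper discusses is not reproduced here.

```python
import math

# Mantel-Haenszel common odds ratio and MH D-DIF (illustrative sketch).
# Each table is (A, B, C, D) at one matched score level k:
#   A = reference group correct,  B = reference group incorrect,
#   C = focal group correct,      D = focal group incorrect.

def mh_d_dif(tables: list[tuple[int, int, int, int]]) -> float:
    num = den = 0.0
    for a, b, c, d in tables:
        n = a + b + c + d
        num += a * d / n
        den += b * c / n
    alpha_mh = num / den                 # MH common odds ratio
    return -2.35 * math.log(alpha_mh)    # ETS delta-scale MH D-DIF

# Example with three score levels; negative values indicate DIF against
# the focal group under the usual ETS sign convention.
tables = [(40, 10, 30, 20), (50, 25, 40, 35), (30, 30, 20, 40)]
print(round(mh_d_dif(tables), 3))
```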
Peer reviewed
Zwick, Rebecca; Thayer, Dorothy T. – Journal of Educational and Behavioral Statistics, 1996
Two possible standard error formulas for the polytomous differential item functioning index proposed by N. J. Dorans and A. P. Schmitt (1991) were derived. These standard errors, and associated hypothesis-testing procedures, were evaluated through simulated data. The standard error that performed better is based on N. Mantel's (1963)…
Descriptors: Error of Measurement, Evaluation Methods, Hypothesis Testing, Item Bias
Peer reviewed
Zwick, Rebecca; Senturk, Deniz; Wang, Joyce; Loomis, Susan Cooper – Educational Measurement: Issues and Practice, 2001
Compared four item mapping methods using data from the physical science test of the National Assessment of Educational Progress and studied the opinions of science content-area experts about the difficulty of the items through a survey completed by 148 science teachers or scientists. Results of model-based mapping methods were more concordant with…
Descriptors: Comparative Analysis, Physical Sciences, Science Teachers, Science Tests
Peer reviewed
Zwick, Rebecca – Journal of Educational Statistics, 1990
Use of the Mantel-Haenszel procedure as a test for differential item functioning under the Rasch model of item-response theory is examined. Results of the procedure cannot be generalized to the class of items for which item-response functions are monotonic and local independence holds. (TJH)
Descriptors: Demography, Equations (Mathematics), Error of Measurement, Item Bias
Zwick, Rebecca; And Others – 1993
Although the belief has been expressed that performance assessments are intrinsically more fair than multiple-choice measures, some forms of performance assessment may in fact be more likely than conventional tests to tap construct-irrelevant factors. As performance assessment grows in popularity, it will be increasingly important to monitor the…
Descriptors: Educational Assessment, Item Bias, Multiple Choice Tests, Performance Based Assessment