Showing 1 to 15 of 26 results
Peer reviewed
PDF full text available on ERIC
Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2019
We derive formulas for the differential item functioning (DIF) measures that two routinely used DIF statistics are designed to estimate. The DIF measures that match on observed scores are compared to DIF measures based on an unobserved ability (theta or true score) for items that are described by either the one-parameter logistic (1PL) or…
Descriptors: Scores, Test Bias, Statistical Analysis, Item Response Theory
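As background for the models named above (standard IRT definitions, not formulas taken from the report), the two-parameter logistic (2PL) item response function is

$$P_i(\theta) = \frac{1}{1 + \exp\{-a_i(\theta - b_i)\}},$$

where $b_i$ is the item's difficulty and $a_i$ its discrimination; the 1PL model is the special case in which $a_i$ is constant across items.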
Peer reviewed
PDF full text available on ERIC
Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2019
The Mantel-Haenszel delta difference (MH D-DIF) and the standardized proportion difference (STD P-DIF) are two observed-score methods that have been used to assess differential item functioning (DIF) at Educational Testing Service since the early 1990s. Latent-variable approaches to assessing measurement invariance at the item level have been…
Descriptors: Test Bias, Educational Testing, Statistical Analysis, Item Response Theory
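For reference, the standard ETS definitions of these two statistics (stated here as background, not quoted from the report) are, with $k$ indexing matched-score strata,

$$\text{MH D-DIF} = -2.35\,\ln\hat{\alpha}_{\mathrm{MH}}, \qquad \hat{\alpha}_{\mathrm{MH}} = \frac{\sum_k R_{rk} W_{fk}/T_k}{\sum_k W_{rk} R_{fk}/T_k},$$

$$\text{STD P-DIF} = \frac{\sum_k N_{fk}\,(P_{fk} - P_{rk})}{\sum_k N_{fk}},$$

where $R$ and $W$ are counts of right and wrong answers in the reference ($r$) and focal ($f$) groups, $T_k$ is the stratum total, $N_{fk}$ is the focal-group count, and $P_{fk}$, $P_{rk}$ are the groups' proportions correct at score level $k$.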
Peer reviewed
PDF full text available on ERIC
Dorans, Neil J. – ETS Research Report Series, 2013
Quantitative fairness procedures have been developed and modified by ETS staff over the past several decades. ETS has been a leader in fairness assessment, and its efforts are reviewed in this report. The first section deals with differential prediction and differential validity procedures that examine whether test scores predict a criterion, such…
Descriptors: Test Bias, Statistical Analysis, Test Validity, Scores
Tan, Xuan; Xiang, Bihua; Dorans, Neil J.; Qu, Yanxuan – Educational Testing Service, 2010
The nature of the matching criterion (usually the total score) in the study of differential item functioning (DIF) has been shown to impact the accuracy of different DIF detection procedures. One of the topics related to the nature of the matching criterion is whether the studied item should be included. Although many studies exist that suggest…
Descriptors: Test Bias, Test Items, Item Response Theory
Peer reviewed
Sinharay, Sandip; Dorans, Neil J. – Journal of Educational and Behavioral Statistics, 2010
The Mantel-Haenszel (MH) procedure (Mantel and Haenszel, 1959) is a popular method for estimating and testing a common two-factor association parameter in a 2 x 2 x K table. Holland, and later Holland and Thayer, described how to use the procedure to detect differential item functioning (DIF) for tests with dichotomously scored items. Wang, Bradlow, Wainer, and…
Descriptors: Test Bias, Statistical Analysis, Computation, Bayesian Statistics
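A minimal sketch of the classical (non-Bayesian) MH common odds ratio computation for such a 2 x 2 x K table of dichotomously scored items; this is illustrative only, not the authors' code:

```python
import math

def mh_d_dif(strata):
    """Mantel-Haenszel D-DIF from K matched-score strata.

    strata: list of (ref_right, ref_wrong, focal_right, focal_wrong)
    counts, one 2 x 2 table per level of the matching score.
    """
    num = sum(a * d / (a + b + c + d) for a, b, c, d in strata)
    den = sum(b * c / (a + b + c + d) for a, b, c, d in strata)
    alpha_mh = num / den                 # common odds ratio estimate
    # ETS delta scale: negative values flag items relatively harder
    # for the focal group.
    return -2.35 * math.log(alpha_mh)

# Example with three hypothetical score strata:
print(mh_d_dif([(40, 10, 35, 15), (30, 20, 25, 25), (15, 35, 10, 40)]))
```

Summing the cross-product terms over strata before forming the ratio is what keeps the estimator stable even when individual strata are sparse.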
Peer reviewed
Moses, Tim; Miao, Jing; Dorans, Neil J. – Journal of Educational and Behavioral Statistics, 2010
In this study, the accuracies of four strategies were compared for estimating conditional differential item functioning (DIF), including raw data, logistic regression, log-linear models, and kernel smoothing. Real data simulations were used to evaluate the estimation strategies across six items, DIF and No DIF situations, and four sample size…
Descriptors: Test Bias, Statistical Analysis, Computation, Comparative Analysis
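Of the four strategies, the logistic regression approach is conventionally specified (in the manner of Swaminathan and Rogers, 1990; given here as background rather than quoted from the article) as

$$\operatorname{logit} P(Y=1 \mid X, G) = \beta_0 + \beta_1 X + \beta_2 G + \beta_3 (X \times G),$$

where $X$ is the matching score and $G$ a group indicator; a nonzero $\beta_2$ indicates uniform DIF and a nonzero $\beta_3$ nonuniform DIF.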
Peer reviewed
Dorans, Neil J. – Harvard Educational Review, 2010
In his 2003 article in the "Harvard Educational Review" (HER), Freedle claimed that the SAT was both culturally and statistically biased and proposed a solution to ameliorate this bias. The author argued (Dorans, 2004a) that these claims were based on serious computational errors. In particular, he focused on how Freedle's table 2 was…
Descriptors: College Entrance Examinations, Test Bias, Test Items, Difficulty Level
Peer reviewed
Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012
Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…
Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability
Middleton, Kyndra; Dorans, Neil J. – Educational Testing Service, 2011
Extreme linkings are performed in settings in which neither equivalent groups nor anchor material is available to link scores on two assessments. Examples of extreme linkages include links between scores on tests administered in different languages or between scores on tests administered across disability groups. The strength of interpretation…
Descriptors: Equated Scores, Testing, Difficulty Level, Test Reliability
Peer reviewed
Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011
Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…
Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency
Dorans, Neil J. – Educational Testing Service, 2010
Santelices and Wilson (2010) claimed to have addressed technical criticisms of Freedle (2003) presented in Dorans (2004a) and elsewhere. Santelices and Wilson's abstract claimed that their study confirmed that SAT® verbal items do function differently for African American and White subgroups. In this commentary, I demonstrate that the…
Descriptors: College Entrance Examinations, Verbal Tests, Test Bias, Test Items
Peer reviewed
Sinharay, Sandip; Dorans, Neil J.; Grant, Mary C.; Blew, Edwin O. – Journal of Educational and Behavioral Statistics, 2009
Test administrators often face the challenge of detecting differential item functioning (DIF) with samples of size smaller than that recommended by experts. A Bayesian approach can incorporate, in the form of a prior distribution, existing information on the inference problem at hand, which yields more stable estimation, especially for small…
Descriptors: Test Bias, Computation, Bayesian Statistics, Data
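The stabilizing effect of a prior can be illustrated with the textbook normal-normal case (an illustration only, not necessarily the authors' exact model): if a sample DIF estimate $d$ has sampling variance $s^2$ and the prior is $N(\mu, \tau^2)$, the posterior mean

$$\hat{d}_{\text{post}} = \frac{\tau^2}{\tau^2 + s^2}\, d + \frac{s^2}{\tau^2 + s^2}\, \mu$$

shrinks a noisy small-sample estimate toward the prior mean, with more shrinkage the larger $s^2$ is.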
Peer reviewed
Puhan, Gautam; Moses, Timothy P.; Yu, Lei; Dorans, Neil J. – Journal of Educational Measurement, 2009
This study examined the extent to which log-linear smoothing could improve the accuracy of differential item functioning (DIF) estimates in small samples of examinees. Examinee responses from a certification test were analyzed using White examinees in the reference group and African American examinees in the focal group. Using a simulation…
Descriptors: Test Items, Reference Groups, Testing Programs, Raw Scores
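Log-linear smoothing in the Holland-Thayer tradition (stated as background, not taken from the article) fits the frequency $m_k$ at each score $k$ with a polynomial log-linear model,

$$\log m_k = \beta_0 + \sum_{i=1}^{C} \beta_i\, k^i,$$

where a degree-$C$ fit preserves the first $C$ moments of the observed score distribution while smoothing out sampling irregularities.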
Liang, Longjuan; Dorans, Neil J.; Sinharay, Sandip – Educational Testing Service, 2009
To ensure fairness, it is important to better understand the relationship of language proficiency with the standard procedures of psychometric analysis. This paper examines how equating results are affected by an increase in the proportion of examinees who report that English is not their first language, using the analysis samples for a…
Descriptors: Equated Scores, English (Second Language), Reading Tests, Mathematics Tests
Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Testing Service, 2009
To ensure fairness, it is important to better understand the relationship of language proficiency to standard psychometric analysis procedures. This paper examines how results of differential item functioning (DIF) analysis are affected by an increase in the proportion of examinees who report that English is not their first language in the…
Descriptors: Test Bias, Language Proficiency, English (Second Language), Measurement