Showing 1 to 15 of 65 results
Peer reviewed
James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024
In this study, a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that is appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…
Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size
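The entry above refers to the SIBTEST beta-uni statistic and a standardization of it. As a point of reference only, the sketch below computes the unadjusted beta-uni (a focal-group-weighted difference in mean studied-item scores across matching-score strata) and one plausible standardization by the item's standard deviation; the function names and the standardization choice are illustrative assumptions, not the formula proposed in the article.

```python
import numpy as np

def beta_uni(item, rest_scores, group):
    """Unadjusted SIBTEST beta-uni: weighted difference in mean studied-item
    scores between reference (group == 0) and focal (group == 1) examinees,
    taken within strata defined by the matching (rest) score."""
    item = np.asarray(item, float)
    rest = np.asarray(rest_scores)
    group = np.asarray(group)
    beta = 0.0
    n_focal = (group == 1).sum()
    for k in np.unique(rest):
        ref = item[(rest == k) & (group == 0)]
        foc = item[(rest == k) & (group == 1)]
        if len(ref) == 0 or len(foc) == 0:
            continue                      # stratum has no comparison; skip it
        weight = len(foc) / n_focal       # weight by focal-group proportion
        beta += weight * (ref.mean() - foc.mean())
    return beta

# Illustrative standardization (an assumption, not the article's rescaling):
# divide by the standard deviation of the studied item's scores.
def standardized_beta_uni(item, rest_scores, group):
    return beta_uni(item, rest_scores, group) / np.std(np.asarray(item, float))
```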
Peer reviewed
Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022
Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…
Descriptors: Ability, Tests, Equated Scores, Testing Problems
Peer reviewed
Luo, Yong; Liang, Xinya – Measurement: Interdisciplinary Research and Perspectives, 2019
Current methods that simultaneously model differential testlet functioning (DTLF) and differential item functioning (DIF) constrain the variances of latent ability and testlet effects to be equal between the focal and the reference groups. Such a constraint can be stringent and unrealistic with real data. In this study, we propose a multigroup…
Descriptors: Test Items, Item Response Theory, Test Bias, Models
Peer reviewed
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
Peer reviewed
El Masri, Yasmine H.; Baird, Jo-Anne; Graesser, Art – Assessment in Education: Principles, Policy & Practice, 2016
We investigate the extent to which language versions (English, French and Arabic) of the same science test are comparable in terms of item difficulty and demands. We argue that language is an inextricable part of the scientific literacy construct, whether or not the examiner intends it to be. This argument has considerable implications for methodologies…
Descriptors: International Assessment, Difficulty Level, Test Items, Language Variation
Peer reviewed
Emenogu, Barnabas C.; Falenchuk, Olesya; Childs, Ruth A. – Alberta Journal of Educational Research, 2010
Most implementations of the Mantel-Haenszel differential item functioning procedure delete records with missing responses or replace missing responses with scores of 0. These treatments of missing data make strong assumptions about the causes of the missing data. Such assumptions may be particularly problematic when groups differ in their patterns…
Descriptors: Foreign Countries, Test Bias, Test Items, Educational Testing
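For context, the two missing-data treatments mentioned in this abstract (deleting records with omits, or scoring omits as 0) can be made concrete with a minimal Mantel-Haenszel DIF sketch like the one below. It reports the common odds ratio on the ETS delta scale; the `missing` switch and the variable names are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def mh_dif(responses, group, item, missing="listwise"):
    """Mantel-Haenszel DIF for a dichotomous studied item.
    responses : (n_persons, n_items) 0/1 array with np.nan for omits
    group     : 0 = reference, 1 = focal
    missing   : 'listwise' drops examinees with any omitted response;
                'zero' scores omits as incorrect."""
    X = np.asarray(responses, float)
    g = np.asarray(group)
    if missing == "listwise":
        keep = ~np.isnan(X).any(axis=1)
        X, g = X[keep], g[keep]
    else:                                    # 'zero': treat omits as wrong
        X = np.nan_to_num(X, nan=0.0)
    y = X[:, item]                           # studied item
    strata = X.sum(axis=1) - y               # matching score = rest score
    num = den = 0.0
    for k in np.unique(strata):
        s = strata == k
        A = ((g == 0) & (y == 1) & s).sum()  # reference correct
        B = ((g == 0) & (y == 0) & s).sum()  # reference incorrect
        C = ((g == 1) & (y == 1) & s).sum()  # focal correct
        D = ((g == 1) & (y == 0) & s).sum()  # focal incorrect
        N = A + B + C + D
        if N == 0:
            continue
        num += A * D / N
        den += B * C / N
    if den == 0:
        return np.nan
    return -2.35 * np.log(num / den)         # ETS delta-DIF scale
```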
Peer reviewed
Penfield, Randall D.; Gattamorta, Karina; Childs, Ruth A. – Educational Measurement: Issues and Practice, 2009
Traditional methods for examining differential item functioning (DIF) in polytomously scored test items yield a single item-level index of DIF and thus provide no information concerning which score levels are implicated in the DIF effect. To address this limitation of DIF methodology, the framework of differential step functioning (DSF) has…
Descriptors: Test Bias, Test Items, Evaluation Methods, Scores
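Differential step functioning, as described in this entry, replaces the single item-level DIF index with one effect estimate per step of a polytomous item. The sketch below illustrates the general idea using adjacent-category dichotomizations and a stratified Mantel-Haenszel-style log-odds ratio per step; it is a generic illustration under those assumptions, not the estimator developed by the authors.

```python
import numpy as np

def step_log_odds(y, group, strata, step):
    """Among examinees scoring step-1 or step on the studied polytomous item,
    compare the odds of reaching `step` for the reference (0) vs. focal (1)
    group within matching-score strata (adjacent-categories step DIF)."""
    y, group, strata = map(np.asarray, (y, group, strata))
    use = np.isin(y, [step - 1, step])
    y1 = (y[use] == step).astype(int)
    g, s = group[use], strata[use]
    num = den = 0.0
    for k in np.unique(s):
        m = s == k
        A = ((g == 0) & (y1 == 1) & m).sum(); B = ((g == 0) & (y1 == 0) & m).sum()
        C = ((g == 1) & (y1 == 1) & m).sum(); D = ((g == 1) & (y1 == 0) & m).sum()
        N = A + B + C + D
        if N:
            num += A * D / N
            den += B * C / N
    return np.log(num / den) if den else np.nan

def dsf_profile(y, group, strata):
    """One log-odds DIF estimate per step of an item scored 0..m."""
    return [step_log_odds(y, group, strata, j)
            for j in range(1, int(np.max(y)) + 1)]
```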
Peer reviewed
Camilli, Gregory – Educational Research and Evaluation, 2013
In the attempt to identify or prevent unfair tests, both quantitative analyses and logical evaluation are often used. For the most part, fairness evaluation is a pragmatic attempt at determining whether procedural or substantive due process has been accorded to either a group of test takers or an individual. In both the individual and comparative…
Descriptors: Alternative Assessment, Test Bias, Test Content, Test Format
Peer reviewed
Robitzsch, Alexander; Rupp, Andre A. – Educational and Psychological Measurement, 2009
This article describes the results of a simulation study to investigate the impact of missing data on the detection of differential item functioning (DIF). Specifically, it investigates how four methods for dealing with missing data (listwise deletion, zero imputation, two-way imputation, response function imputation) interact with two methods of…
Descriptors: Test Bias, Simulation, Interaction, Effect Size
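Of the four missing-data treatments named in this abstract, two-way imputation is easy to show in miniature: each omitted response is replaced by person mean + item mean - grand mean (computed from the observed responses) and, for dichotomous items, rounded to 0 or 1. The sketch below is a generic illustration of that method, not the code used in the study.

```python
import numpy as np

def two_way_imputation(X):
    """Two-way imputation for a 0/1 response matrix with np.nan omits."""
    X = np.asarray(X, float)
    missing = np.isnan(X)
    person_mean = np.nanmean(X, axis=1, keepdims=True)   # row effects
    item_mean = np.nanmean(X, axis=0, keepdims=True)     # column effects
    grand_mean = np.nanmean(X)
    fill = person_mean + item_mean - grand_mean           # two-way prediction
    out = X.copy()
    out[missing] = np.clip(np.round(fill), 0, 1)[missing]  # dichotomize
    return out
```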
Goldstein, Harvey – 1989
The use of "bias elimination procedures" to reduce the racial bias of test items is discussed. These procedures were forwarded by G. R. Anrig (1988) and R. L. Linn and F. Drasgow (1987). Anrig stated that subjects who "know the same amount about a test item" should have a similar chance of answering it correctly…
Descriptors: Latent Trait Theory, Racial Bias, Test Bias, Test Construction
Diamond, Esther E. – 1981
As test standards and research literature in general indicate, definitions of test bias and item bias vary considerably, as do the results of existing methods of identifying biased items. The situation is further complicated by issues of content, context, construct, and criterion. In achievement tests, for example, content validity may impose…
Descriptors: Achievement Tests, Aptitude Tests, Psychometrics, Test Bias
Boldt, Robert F. – 1983
The project reported here consisted of a sensitivity review of the items of Forms 11, 12, and 13 of the Armed Services Vocational Aptitude Battery (ASVAB). Because administration of this battery is a required step in the accession process, it should be free from perceived bias or offensiveness that could detract from the measurement process. In…
Descriptors: Aptitude Tests, Attitudes, Military Personnel, Opinions
Peer reviewed
Weiss, John – Educational Measurement: Issues and Practice, 1987
Differences in test scores can be attributed to various causes, including genuine knowledge differences, test-taking abilities, and irrelevant and biased questions. The Golden Rule reform is a safeguard to ensure that standardized tests measure relevant knowledge differences between test takers and not irrelevant, culturally specific factors. (JAZ)
Descriptors: Culture Fair Tests, Minority Groups, Standardized Tests, Standards
Peer reviewed
Marks, Anthony M.; Cronje, Johannes C. – Educational Technology & Society, 2008
Computer-based assessments are becoming more commonplace, perhaps as a necessity for faculty to cope with large class sizes. These tests often occur in large computer testing venues in which test security may be compromised. In an attempt to limit the likelihood of cheating in such venues, randomised presentation of items is automatically…
Descriptors: Educational Assessment, Educational Testing, Research Needs, Test Items
Plake, Barbara S.; And Others – 1983
Differential test performance by undergraduate males and females enrolled in a developmental educational psychology course (n=167) was reported on a quantitative examination as a function of item arrangement. Males were expected to perform better than females on tests whose items were arranged from easy to hard. Plake and Ansorge (1982) speculated this may…
Descriptors: Difficulty Level, Feedback, Higher Education, Scoring