Russell, Michael; Szendey, Olivia; Li, Zhushan – Educational Assessment, 2022
Recent research provides evidence that an intersectional approach to defining reference and focal groups results in a higher percentage of comparisons flagged for potential DIF. The study presented here examined the generalizability of this pattern across methods for examining DIF. While the level of DIF detection differed among the four methods…
Descriptors: Comparative Analysis, Item Analysis, Test Items, Test Construction
Cho, Sun-Joo; Suh, Youngsuk; Lee, Woo-yeol – Educational Measurement: Issues and Practice, 2016
The purpose of this ITEMS module is to provide an introduction to differential item functioning (DIF) analysis using mixture item response models. The mixture item response models for DIF analysis involve comparing item profiles across latent groups, instead of manifest groups. First, an overview of DIF analysis based on latent groups, called…
Descriptors: Test Bias, Research Methodology, Evaluation Methods, Models
Wyse, Adam E. – Educational Measurement: Issues and Practice, 2017
This article illustrates five different methods for estimating Angoff cut scores using item response theory (IRT) models. These include maximum likelihood (ML), expected a priori (EAP), modal a priori (MAP), and weighted maximum likelihood (WML) estimators, as well as the most commonly used approach based on translating ratings through the test…
Descriptors: Cutting Scores, Item Response Theory, Bayesian Statistics, Maximum Likelihood Statistics
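The "translating ratings through the test characteristic curve" approach mentioned in the abstract can be sketched as follows: panelists' Angoff item ratings are summed to an expected raw score for a borderline examinee, and the ability value at which the test characteristic curve (TCC) equals that sum is taken as the cut. A minimal 2PL sketch; the item parameters and ratings below are made-up illustrations, not values from the article.

```python
import math

def p_2pl(theta, a, b):
    """2PL item response function: probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def tcc(theta, items):
    """Test characteristic curve: expected raw score at ability theta."""
    return sum(p_2pl(theta, a, b) for a, b in items)

def angoff_theta_cut(items, ratings, lo=-6.0, hi=6.0, tol=1e-8):
    """Find the theta where the TCC equals the summed Angoff ratings.

    The TCC is monotone increasing in theta (all a > 0), so simple
    bisection suffices.
    """
    target = sum(ratings)  # panelists' expected raw score for a borderline examinee
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if tcc(mid, items) < target:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0

# Hypothetical 2PL parameters (a, b) and Angoff ratings for a 4-item test
items = [(1.2, -0.5), (0.8, 0.0), (1.5, 0.4), (1.0, 1.1)]
ratings = [0.8, 0.6, 0.5, 0.4]
theta_cut = angoff_theta_cut(items, ratings)
```

The theta cut found this way can then be converted to a reported scale score; the ML, EAP, MAP, and WML estimators the article compares differ in how examinee ability itself is estimated, not in this TCC inversion step.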
Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018
Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…
Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education
Hou, Likun; de la Torre, Jimmy; Nandakumar, Ratna – Journal of Educational Measurement, 2014
Analyzing examinees' responses using cognitive diagnostic models (CDMs) has the advantage of providing diagnostic information. To ensure the validity of the results from these models, differential item functioning (DIF) in CDMs needs to be investigated. In this article, the Wald test is proposed to examine DIF in the context of CDMs. This study…
Descriptors: Test Bias, Models, Simulation, Error Patterns
Liu, Yan; Zumbo, Bruno D.; Gustafson, Paul; Huang, Yi; Kroc, Edward; Wu, Amery D. – Practical Assessment, Research & Evaluation, 2016
A variety of differential item functioning (DIF) methods have been proposed and used for ensuring that a test is fair to all test takers in a target population in the situations of, for example, a test being translated to other languages. However, once a method flags an item as DIF, it is difficult to conclude that the grouping variable (e.g.,…
Descriptors: Test Items, Test Bias, Probability, Scores
Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Stanford Center for Education Policy Analysis, 2017
There is no comprehensive database of U.S. district-level test scores that is comparable across states. We describe and evaluate a method for constructing such a database. First, we estimate linear, reliability-adjusted linking transformations from state test score scales to the scale of the National Assessment of Educational Progress (NAEP). We…
Descriptors: School Districts, Scores, Statistical Distributions, Database Design
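The linking step the abstract describes can be sketched generically: estimate a linear transformation that matches a state scale's mean and reliability-adjusted (true-score) standard deviation to the reference scale. The disattenuation shown here (shrinking the observed SD by the square root of reliability) is a standard textbook device, not necessarily the authors' exact estimator, and all scores and parameters below are invented for illustration.

```python
import statistics

def linear_link(state_scores, state_reliability, ref_mean, ref_sd):
    """Map state-scale scores onto a reference scale with a linear
    transformation that matches the mean and the reliability-adjusted
    (true-score) standard deviation. Generic sketch only."""
    m = statistics.mean(state_scores)
    s = statistics.stdev(state_scores)
    # True-score SD = observed SD * sqrt(reliability): observed variance
    # is inflated by measurement error, so deflate before matching.
    s_true = s * state_reliability ** 0.5
    slope = ref_sd / s_true
    intercept = ref_mean - slope * m
    return [slope * x + intercept for x in state_scores]

# Hypothetical district means on a state scale, linked to a NAEP-like scale
state = [310.0, 325.0, 340.0, 355.0, 370.0]
linked = linear_link(state, state_reliability=0.9, ref_mean=250.0, ref_sd=35.0)
```

Note that the linked scores' observed SD ends up slightly larger than the reference SD, since the transformation targets the error-free (true-score) spread.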
Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill – ETS Research Report Series, 2014
The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data. This study compares an "SAT"® critical reading anchor that contains proportionally more discrete items, relative to the total tests to be equated, to another anchor that…
Descriptors: Equated Scores, Test Items, College Entrance Examinations, Comparative Analysis
Sachse, Karoline A.; Roppelt, Alexander; Haag, Nicole – Journal of Educational Measurement, 2016
Trend estimation in international comparative large-scale assessments relies on measurement invariance between countries. However, cross-national differential item functioning (DIF) has been repeatedly documented. We ran a simulation study using national item parameters, which required trends to be computed separately for each country, to compare…
Descriptors: Comparative Analysis, Measurement, Test Bias, Simulation
Seybert, Jacob; Stark, Stephen – Applied Psychological Measurement, 2012
A Monte Carlo study was conducted to examine the accuracy of differential item functioning (DIF) detection using the differential functioning of items and tests (DFIT) method. Specifically, the performance of DFIT was compared using "testwide" critical values suggested by Flowers, Oshima, and Raju, based on simulations involving large numbers of…
Descriptors: Test Bias, Monte Carlo Methods, Form Classes (Languages), Simulation
Roth, Wolff-Michael; Oliveri, Maria Elena; Sandilands, Debra Dallie; Lyons-Thomas, Juliette; Ercikan, Kadriye – International Journal of Science Education, 2013
Even if national and international assessments are designed to be comparable, subsequent psychometric analyses often reveal differential item functioning (DIF). Central to achieving comparability is to examine the presence of DIF and, if DIF is found, to investigate its sources to ensure that differentially functioning items do not lead to bias.…
Descriptors: Test Bias, Evaluation Methods, Protocol Analysis, Science Achievement
Ferjencík, Ján; Slavkovská, Miriam; Kresila, Juraj – Journal of Pedagogy, 2015
The paper reports on the adaptation of a D-KEFS test battery for Slovakia. Drawing on concrete examples, it describes and illustrates the key issues relating to the transfer of test items from one socio-cultural environment to another. The standardisation sample of the population of Slovak pupils in the fourth year of primary school included 250…
Descriptors: Executive Function, Foreign Countries, Test Construction, Test Items
Cho, Sun-Joo; Bottge, Brian A.; Cohen, Allan S.; Kim, Seock-Ho – Journal of Special Education, 2011
Current methods for detecting growth of students' problem-solving skills in math focus mainly on analyzing changes in test scores. Score-level analysis, however, may fail to reflect subtle changes that might be evident at the item level. This article demonstrates a method for studying item-level changes using data from a multiwave experiment with…
Descriptors: Test Bias, Group Membership, Mathematics Skills, Ability
Magis, David; Raîche, Gilles; Béland, Sébastien; Gérard, Paul – International Journal of Testing, 2011
We present an extension of the logistic regression procedure to identify dichotomous differential item functioning (DIF) in the presence of more than two groups of respondents. Starting from the usual framework of a single focal group, we propose a general approach to estimate the item response functions in each group and to test for the presence…
Descriptors: Language Skills, Identification, Foreign Countries, Evaluation Methods
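The logistic-regression DIF procedure this entry extends is, in its standard two-group form (Swaminathan and Rogers), a likelihood-ratio comparison of nested models: item response predicted from the matching score alone versus from score, group, and their interaction. A minimal dependency-free sketch with simulated data; all numbers are illustrative, and the gradient-ascent fitter stands in for the maximum-likelihood routines a real analysis would use.

```python
import math
import random

def fit_logistic(X, y, iters=1500, lr=0.5):
    """Fit a logistic regression by batch gradient ascent; return the
    coefficients and the maximized log-likelihood.
    Rows of X must already include the intercept term (leading 1.0)."""
    n, k = len(X), len(X[0])
    w = [0.0] * k
    for _ in range(iters):
        grad = [0.0] * k
        for xi, yi in zip(X, y):
            p = 1.0 / (1.0 + math.exp(-sum(wj * xj for wj, xj in zip(w, xi))))
            for j in range(k):
                grad[j] += (yi - p) * xi[j]
        w = [wj + lr * gj / n for wj, gj in zip(w, grad)]
    ll = 0.0
    for xi, yi in zip(X, y):
        p = 1.0 / (1.0 + math.exp(-sum(wj * xj for wj, xj in zip(w, xi))))
        ll += yi * math.log(p) + (1.0 - yi) * math.log(1.0 - p)
    return w, ll

def lr_dif_statistic(scores, groups, responses):
    """Likelihood-ratio DIF test for one item: matching-score-only model
    versus one adding group and score-by-group terms (uniform plus
    nonuniform DIF). Roughly chi-square with 2 df under no DIF."""
    base = [[1.0, s] for s in scores]
    full = [[1.0, s, g, s * g] for s, g in zip(scores, groups)]
    _, ll_base = fit_logistic(base, responses)
    _, ll_full = fit_logistic(full, responses)
    return 2.0 * (ll_full - ll_base)

# Simulated item responses with uniform DIF against the focal group (g = 1)
random.seed(1)
scores = [random.gauss(0.0, 1.0) for _ in range(300)]
groups = [i % 2 for i in range(300)]
responses = [1 if random.random() < 1.0 / (1.0 + math.exp(-(s - 1.0 * g))) else 0
             for s, g in zip(scores, groups)]
stat = lr_dif_statistic(scores, groups, responses)  # large values flag DIF
```

The multi-group extension the article proposes generalizes the single 0/1 group indicator to several focal-group contrasts; the nested-model comparison itself is unchanged.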
Kim, Do-Hong; Lambert, Richard G.; Burts, Diane C. – Early Education and Development, 2013
Research Findings: This study examined the measurement equivalence of the "Teaching Strategies GOLD[R]" assessment system across subgroups of children based on their primary language and disability status. This study is based on teacher-collected assessment data for 3-, 4-, and 5-year-old children for the fall of 2010, winter of 2010, and spring…
Descriptors: English Language Learners, Teaching Methods, Educational Strategies, Special Needs Students