Publication Date
In 2025 | 0
Since 2024 | 0
Since 2021 (last 5 years) | 2
Since 2016 (last 10 years) | 9
Since 2006 (last 20 years) | 14
Descriptor
Comparative Analysis | 14
Test Bias | 14
Foreign Countries | 12
International Assessment | 9
Secondary School Students | 8
Achievement Tests | 7
Mathematics Tests | 7
Test Items | 6
Item Response Theory | 5
Science Tests | 4
Scores | 4
Author
Ercikan, Kadriye | 5
Haag, Nicole | 2
Lyons-Thomas, Juliette | 2
Oliveri, Maria Elena | 2
Sachse, Karoline A. | 2
Arikan, Serkan | 1
Berberoglu, Giray | 1
Chen, Michelle Y. | 1
Goodrich, Shawna | 1
Guo, Hongwen | 1
Hawker, David | 1
Publication Type
Journal Articles | 14
Reports - Research | 12
Reports - Evaluative | 2
Education Level
Secondary Education | 9
Elementary Secondary Education | 2
Location
Canada | 4
Turkey | 3
United States | 3
Australia | 1
China | 1
Europe | 1
North America | 1
Russia | 1
Singapore | 1
South Korea | 1
United Kingdom | 1
Assessments and Surveys
Program for International… | 14
Trends in International… | 3
National Assessment of… | 1
Robitzsch, Alexander; Lüdtke, Oliver – Journal of Educational and Behavioral Statistics, 2022
One of the primary goals of international large-scale assessments in education is the comparison of country means in student achievement. This article introduces a framework for discussing differential item functioning (DIF) for such mean comparisons. We compare three different linking methods: concurrent scaling based on full invariance,…
Descriptors: Test Bias, International Assessment, Scaling, Comparative Analysis
Kuang, Huan; Sahin, Fusun – Large-scale Assessments in Education, 2023
Background: Examinees may not make enough effort when responding to test items if the assessment has no consequence for them. These disengaged responses can be problematic in low-stakes, large-scale assessments because they can bias item parameter estimates. However, the amount of bias, and whether this bias is similar across administrations, is…
Descriptors: Test Items, Comparative Analysis, Mathematics Tests, Reaction Time
Uyar, Seyma – Eurasian Journal of Educational Research, 2020
Purpose: This study aimed to compare the performance of the latent class differential item functioning (DIF) approach and IRT-based DIF methods that use manifest grouping. The study also sought to draw attention to the need to carry out latent class DIF studies in Turkey. The purpose of this study was to examine DIF in the PISA 2015 science data set. Research…
Descriptors: Item Response Theory, Foreign Countries, Cross Cultural Studies, Item Analysis
Sachse, Karoline A.; Haag, Nicole – Applied Measurement in Education, 2017
Standard errors computed according to the operational practices of international large-scale assessment studies such as the Programme for International Student Assessment (PISA) or the Trends in International Mathematics and Science Study (TIMSS) may be biased when cross-national differential item functioning (DIF) and item parameter drift are…
Descriptors: Error of Measurement, Test Bias, International Assessment, Computation
Ercikan, Kadriye; Guo, Hongwen; He, Qiwei – Educational Assessment, 2020
Comparing groups is one of the key uses of large-scale assessment results, which are used to gain insights that inform policy and practice and to examine the comparability of scores and score meaning. Such comparisons typically focus on examinees' final answers and responses to test questions, ignoring response process differences groups may engage…
Descriptors: Data Use, Responses, Comparative Analysis, Test Bias
Oliveri, Maria Elena; Ercikan, Kadriye; Lyons-Thomas, Juliette; Holtzman, Steven – Applied Measurement in Education, 2016
Differential item functioning (DIF) analyses have been used as the primary method in large-scale assessments to examine fairness for subgroups. Currently, DIF analyses are conducted with manifest methods that group examinees by observed characteristics (gender and race/ethnicity). Homogeneity of item responses is assumed, denoting that…
Descriptors: Test Bias, Language Minorities, Effect Size, Foreign Countries
Ivanova, Alina; Kardanova, Elena; Merrell, Christine; Tymms, Peter; Hawker, David – Assessment in Education: Principles, Policy & Practice, 2018
Is it possible to compare the results in assessments of mathematics across countries with different curricula, traditions and age of starting school? As part of the iPIPS project, a Russian version of the iPIPS baseline assessment was developed and trial data were available from about 300 Russian children at the start and end of their first year…
Descriptors: Mathematics Instruction, Foreign Countries, Mathematics Tests, Item Response Theory
Arikan, Serkan; van de Vijver, Fons J. R.; Yagmur, Kutlay – Educational Assessment, Evaluation and Accountability, 2017
Lower reading and mathematics performance of Turkish immigrant students as compared to mainstream European students could reflect differential learning outcomes, differential socioeconomic backgrounds of the groups, differential mainstream language proficiency, and/or test bias. Using PISA reading and mathematics scores of these groups, we…
Descriptors: Foreign Countries, Achievement Tests, International Assessment, Secondary School Students
Ercikan, Kadriye; Chen, Michelle Y.; Lyons-Thomas, Juliette; Goodrich, Shawna; Sandilands, Debra; Roth, Wolff-Michael; Simon, Marielle – International Journal of Testing, 2015
The purpose of this research is to examine the comparability of mathematics and science scores for students from English language backgrounds (ELB) and non-English language backgrounds (NELB). We examine the relationship between English reading proficiency and performance on mathematics and science assessments in Australia, Canada, the United…
Descriptors: Scores, Mathematics Tests, Science Tests, Native Speakers
Sachse, Karoline A.; Roppelt, Alexander; Haag, Nicole – Journal of Educational Measurement, 2016
Trend estimation in international comparative large-scale assessments relies on measurement invariance between countries. However, cross-national differential item functioning (DIF) has been repeatedly documented. We ran a simulation study using national item parameters, which required trends to be computed separately for each country, to compare…
Descriptors: Comparative Analysis, Measurement, Test Bias, Simulation
Oliveri, Maria E.; Ercikan, Kadriye – Applied Measurement in Education, 2011
In this study, we examine the degree of construct comparability and possible sources of incomparability of the English and French versions of the Programme for International Student Assessment (PISA) 2003 problem-solving measure administered in Canada. Several approaches were used to examine construct comparability at the test- (examination of…
Descriptors: Foreign Countries, English, French, Tests
Oliveri, Maria Elena; Olson, Brent F.; Ercikan, Kadriye; Zumbo, Bruno D. – International Journal of Testing, 2012
In this study, the Canadian English and French versions of the Problem-Solving Measure of the Programme for International Student Assessment 2003 were examined to investigate their degree of measurement comparability at the item- and test-levels. Three methods of differential item functioning (DIF) were compared: parametric and nonparametric item…
Descriptors: Foreign Students, Test Bias, Speech Communication, Effect Size
Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009
Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when the data fit the Rasch model. Through simulations and an international…
Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment
Yildirim, Huseyin Husnu; Berberoglu, Giray – International Journal of Testing, 2009
Comparisons of human characteristics across different language groups and cultures become more important in today's educational assessment practices as evidenced by the increasing interest in international comparative studies. Within this context, the fairness of the results across different language and cultural groups draws the attention of…
Descriptors: Test Bias, Cross Cultural Studies, Comparative Analysis, Factor Analysis