Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 25 |
Since 2006 (last 20 years) | 65 |
Descriptor
Test Bias | 67 |
Grade 8 | 56 |
Mathematics Tests | 34 |
Test Items | 32 |
Foreign Countries | 23 |
Item Response Theory | 22 |
Grade 4 | 20 |
Achievement Tests | 19 |
Grade 7 | 18 |
Scores | 15 |
Comparative Analysis | 14 |
More ▼ |
Source
Author
Steinberg, Jonathan | 5 |
Ling, Guangming | 3 |
Young, John W. | 3 |
Cho, Yeonsuk | 2 |
Cline, Fred | 2 |
Fu, Jianbin | 2 |
Hambleton, Ronald K. | 2 |
Meyer, Patrick | 2 |
Sireci, Stephen G. | 2 |
Stone, Elizabeth | 2 |
Allen, Nancy | 1 |
More ▼ |
Publication Type
Journal Articles | 48 |
Reports - Research | 44 |
Reports - Evaluative | 14 |
Numerical/Quantitative Data | 7 |
Dissertations/Theses -… | 6 |
Reports - Descriptive | 3 |
Tests/Questionnaires | 2 |
Speeches/Meeting Papers | 1 |
Education Level
Grade 8 | 67 |
Elementary Education | 43 |
Middle Schools | 40 |
Junior High Schools | 37 |
Secondary Education | 36 |
Grade 7 | 24 |
Grade 4 | 23 |
Elementary Secondary Education | 20 |
Grade 5 | 18 |
Intermediate Grades | 17 |
Grade 6 | 15 |
More ▼ |
Audience
Location
Turkey | 10 |
New York | 5 |
United States | 5 |
Florida | 3 |
Australia | 2 |
California | 2 |
Singapore | 2 |
Texas | 2 |
Belgium | 1 |
Bosnia and Herzegovina | 1 |
Botswana | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Marjolein Muskens; Willem E. Frankenhuis; Lex Borghans – npj Science of Learning, 2024
In many countries, standardized math tests are important for achieving academic success. Here, we examine whether content of items, the story that explains a mathematical question, biases performance of low-SES students. In a large-scale cohort study of Trends in International Mathematics and Science Studies (TIMSS)--including data from 58…
Descriptors: Mathematics Tests, Standardized Tests, Test Items, Low Income Students
Saaatcioglu, Fatima Munevver – International Journal of Assessment Tools in Education, 2022
The aim of this study is to investigate the presence of DIF over the gender variable with the latent class modeling approach. The data were collected from 953 students who participated in the PISA 2018 8th-grade financial literacy assessment in the USA. Latent Class Analysis (LCA) approach was used to identify the latent classes, and the data fit…
Descriptors: International Assessment, Achievement Tests, Secondary School Students, Gender Differences
Russell, Michael; Szendey, Olivia; Kaplan, Larry – Educational Assessment, 2021
Differential Item Function (DIF) analysis is commonly employed to examine potential bias produced by a test item. Since its introduction DIF analyses have focused on potential bias related to broad categories of oppression, including gender, racial stratification, economic class, and ableness. More recently, efforts to examine the effects of…
Descriptors: Test Bias, Achievement Tests, Individual Characteristics, Disadvantaged
Soysal, Sumeyra; Yilmaz Kogar, Esin – International Journal of Assessment Tools in Education, 2021
In this study, whether item position effects lead to DIF in the condition where different test booklets are used was investigated. To do this the methods of Lord's chi-square and Raju's unsigned area with the 3PL model under with and without item purification were used. When the performance of the methods was compared, it was revealed that…
Descriptors: Item Response Theory, Test Bias, Test Items, Comparative Analysis
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
Gübes, Nese; Uyar, Seyma – International Journal of Progressive Education, 2020
This study aims to compare the performance of different small sample equating methods in the presence and absence of differential item functioning (DIF) in common items. In this research, Tucker linear equating, Levine linear equating, unsmoothed and pre-smoothed (C=4) chained equipercentile equating, and simplified circle arc equating methods…
Descriptors: Test Bias, Equated Scores, Test Items, Methods
Yildirim, Halime; Büyüköztürk, Sener – Educational Sciences: Theory and Practice, 2018
The aim of this study is to determine whether items from the mathematics section of the 2012 Level Determination Exam indicate item bias according to gender and school type. In particular, the process of item bias has been determined using the Delphi technique and focus group interviews. A two-stage mixed method research has been used for the…
Descriptors: Delphi Technique, Test Bias, Test Items, Mathematics Education
The Comparison of Differential Item Functioning Predicted through Experts and Statistical Techniques
Dogan, Nuri; Hambleton, Ronald K.; Yurtcu, Meltem; Yavuz, Sinan – Cypriot Journal of Educational Sciences, 2018
Validity is one of the psychometric properties of the achievement tests. To determine the validity, one of the examination is item bias studies, which are based on differential item functioning (DIF) analyses and field experts' opinion. In this study, field experts were asked to estimate the DIF levels of the items to compare the estimations…
Descriptors: Test Bias, Comparative Analysis, Predictor Variables, Statistical Analysis
McLoud, Rachael – ProQuest LLC, 2019
An increasing number of parents are opting-out their children from high-stakes. Accountability systems in education have used students' test scores to measure student learning, teacher effectiveness, and school district performance. Students who are opted-out of high-stakes tests are not being evaluated by the state tests, making their level of…
Descriptors: Evaluation, High Stakes Tests, Parent Attitudes, Decision Making
Cheng, Ying; Shao, Can; Lathrop, Quinn N. – Educational and Psychological Measurement, 2016
Due to its flexibility, the multiple-indicator, multiple-causes (MIMIC) model has become an increasingly popular method for the detection of differential item functioning (DIF). In this article, we propose the mediated MIMIC model method to uncover the underlying mechanism of DIF. This method extends the usual MIMIC model by including one variable…
Descriptors: Test Bias, Models, Simulation, Sample Size
Ayva Yörü, Fatma Gökçen; Atar, Hakan Yavuz – Journal of Pedagogical Research, 2019
The aim of this study is to examine whether the items in the mathematics subtest of the Centralized High School Entrance Placement Test [HSEPT] administered in 2012 by the Ministry of National Education in Turkey show DIF according to gender and type of school. For this purpose, SIBTEST, Breslow-Day, Lord's [chi-squared] and Raju's area…
Descriptors: Test Bias, Mathematics Tests, Test Items, Gender Differences
Shanmugam, S. Kanageswari Suppiah – Malaysian Journal of Learning and Instruction, 2018
Purpose:In an attempt to explore item characteristics that behave differently between boys and girls, this comparative study examines gender Differential Item Functioning in a school culture that is noted to be 'thriving' mathematically. Methodology: Some 24 grade eight mathematics items from TIMSS 2003 and TIMSS 2007 released items, with equal…
Descriptors: Gender Differences, Test Bias, Coeducation, Foreign Countries
He, Jia; Barrera-Pedemonte, Fabián; Buchholz, Janine – Assessment in Education: Principles, Policy & Practice, 2019
Noncognitive assessments in Programme for International Student Assessment (PISA) and Trends in International Mathematics and Science Study share certain similarities and provide complementary information, yet their comparability is seldom checked and convergence not sought. We made use of student self-report data of Instrumental Motivation,…
Descriptors: Foreign Countries, Secondary School Students, International Assessment, Elementary Secondary Education
Ozdemir, Burhanettin – International Journal of Progressive Education, 2017
The purpose of this study is to equate Trends in International Mathematics and Science Study (TIMSS) mathematics subtest scores obtained from TIMSS 2011 to scores obtained from TIMSS 2007 form with different nonlinear observed score equating methods under Non-Equivalent Anchor Test (NEAT) design where common items are used to link two or more test…
Descriptors: Achievement Tests, Elementary Secondary Education, Foreign Countries, International Assessment
Turkan, Azmi; Cetin, Bayram – Journal of Education and Practice, 2017
Validity and reliability are among the most crucial characteristics of a test. One of the steps to make sure that a test is valid and reliable is to examine the bias in test items. The purpose of this study was to examine the bias in 2012 Placement Test items in terms of gender variable using Rasch Model in Turkey. The sample of this study was…
Descriptors: Item Response Theory, Gender Differences, Test Bias, Test Items