Showing 1 to 15 of 65 results
Peer reviewed
Direct link
Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024
Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…
Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales
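Background note (general context, not drawn from this abstract): one common family of IRT models for ERS augments a nominal- or partial-credit-type model with a second, response-style trait, e.g. P(X = k | θ, η) ∝ exp(a_k θ + b_k η + c_k), where θ is the content trait, η is the ERS trait, and the weights b_k are largest for the extreme categories; the specific models compared in this study may be parameterized differently.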
Peer reviewed
PDF on ERIC Download full text
Tim Jacobbe; Bob delMas; Brad Hartlaub; Jeff Haberstroh; Catherine Case; Steven Foti; Douglas Whitaker – Numeracy, 2023
The development of assessments as part of the funded LOCUS project is described. The assessments measure students' conceptual understanding of statistics as outlined in the GAISE PreK-12 Framework. Results are reported from a large-scale administration to 3,430 students in grades 6 through 12 in the United States. Items were designed to assess…
Descriptors: Statistics Education, Common Core State Standards, Student Evaluation, Elementary School Students
Peer reviewed
Direct link
Qi Huang; Daniel M. Bolt; Weicong Lyu – Large-scale Assessments in Education, 2024
Large-scale international assessments depend on invariance of measurement across countries. An important consideration when observing cross-national differential item functioning (DIF) is whether the DIF actually reflects a source of bias, or might instead be a methodological artifact reflecting item response theory (IRT) model misspecification.…
Descriptors: Test Items, Item Response Theory, Test Bias, Test Validity
Peer reviewed
Direct link
Luo, Yong; Liang, Xinya – Measurement: Interdisciplinary Research and Perspectives, 2019
Current methods that simultaneously model differential testlet functioning (DTLF) and differential item functioning (DIF) constrain the variances of latent ability and testlet effects to be equal between the focal and the reference groups. Such a constraint can be stringent and unrealistic with real data. In this study, we propose a multigroup…
Descriptors: Test Items, Item Response Theory, Test Bias, Models
Peer reviewed
Direct link
El Masri, Yasmine H.; Andrich, David – Applied Measurement in Education, 2020
In large-scale educational assessments, it is generally required that tests are composed of items that function invariantly across the groups to be compared. Despite efforts to ensure invariance in the item construction phase, for a range of reasons (including the security of items) it is often necessary to account for differential item…
Descriptors: Models, Goodness of Fit, Test Validity, Achievement Tests
Fager, Meghan L. – ProQuest LLC, 2019
Recent research in multidimensional item response theory has introduced within-item interaction effects between latent dimensions in the prediction of item responses. The objective of this study was to extend this research to bifactor models to include an interaction effect between the general and specific latent variables measured by an item.…
Descriptors: Test Items, Item Response Theory, Factor Analysis, Simulation
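Background note (illustrative notation, not taken from the dissertation): in a standard bifactor 2PL model, logit P(X_j = 1) = a_jG θ_G + a_jS θ_S − b_j, with a general trait θ_G and a specific trait θ_S; the within-item interaction described above would add a product term, giving logit P(X_j = 1) = a_jG θ_G + a_jS θ_S + a_jGS (θ_G θ_S) − b_j.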
Ayodele, Alicia Nicole – ProQuest LLC, 2017
Within polytomous items, differential item functioning (DIF) can take on various forms due to the number of response categories. The lack of invariance at this level is referred to as differential step functioning (DSF). The most common DSF methods in the literature are the adjacent category log odds ratio (AC-LOR) estimator and cumulative…
Descriptors: Statistical Analysis, Test Bias, Test Items, Scores
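Background note (standard definition, not from the abstract): for the step between adjacent categories k − 1 and k, the AC-LOR approach compares the reference (R) and focal (F) groups on the log odds of responding in category k rather than k − 1, typically aggregated over strata matched on total score: λ_k = ln[(N_{R,k} N_{F,k−1}) / (N_{R,k−1} N_{F,k})], with λ_k near zero indicating no DSF at that step.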
Peer reviewed
Direct link
Andrich, David; Marais, Ida – Journal of Educational Measurement, 2018
Even though guessing biases difficulty estimates as a function of item difficulty in the dichotomous Rasch model, assessment programs with tests which include multiple-choice items often construct scales using this model. Research has shown that when all items are multiple-choice, this bias can largely be eliminated. However, many assessments have…
Descriptors: Multiple Choice Tests, Test Items, Guessing (Tests), Test Bias
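Background note (not from the abstract): the bias mechanism can be seen by contrasting the Rasch model, P(X = 1 | θ) = exp(θ − b) / (1 + exp(θ − b)), with a three-parameter model that adds a guessing parameter c, P(X = 1 | θ) = c + (1 − c) exp(a(θ − b)) / (1 + exp(a(θ − b))). When responses involve guessing (c > 0) but the Rasch model is fitted, low-ability examinees succeed on hard items more often than the model predicts, so the difficulty of the hardest items tends to be underestimated.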
Peer reviewed
Direct link
Lee, Soo; Bulut, Okan; Suh, Youngsuk – Educational and Psychological Measurement, 2017
A number of studies have found multiple indicators multiple causes (MIMIC) models to be an effective tool in detecting uniform differential item functioning (DIF) for individual items and item bundles. A recently developed MIMIC-interaction model is capable of detecting both uniform and nonuniform DIF in the unidimensional item response theory…
Descriptors: Test Bias, Test Items, Models, Item Response Theory
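Background note (a standard formulation, not quoted from the article): in a MIMIC DIF model the latent trait θ is regressed on the grouping covariate z, θ = γz + ζ, and the response model for item j includes a direct group effect, y*_j = λ_j θ + β_j z + ε_j, where a nonzero β_j indicates uniform DIF; the interaction extension adds a product term, y*_j = λ_j θ + β_j z + ω_j (θ z) + ε_j, where a nonzero ω_j indicates nonuniform DIF.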
Peer reviewed
Direct link
Zaidi, Nikki L.; Swoboda, Christopher M.; Kelcey, Benjamin M.; Manuel, R. Stephen – Advances in Health Sciences Education, 2017
The extant literature has largely ignored a potentially significant source of variance in multiple mini-interview (MMI) scores by "hiding" the variance attributable to the sample of attributes used on an evaluation form. This potential source of hidden variance can be defined as rating items, which typically comprise an MMI evaluation…
Descriptors: Interviews, Scores, Generalizability Theory, Monte Carlo Methods
Peer reviewed
Direct link
Kopf, Julia; Zeileis, Achim; Strobl, Carolin – Educational and Psychological Measurement, 2015
Differential item functioning (DIF) indicates the violation of the invariance assumption, for instance, in models based on item response theory (IRT). For item-wise DIF analysis using IRT, a common metric for the item parameters of the groups that are to be compared (e.g., for the reference and the focal group) is necessary. In the Rasch model,…
Descriptors: Test Items, Equated Scores, Test Bias, Item Response Theory
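Background note (standard Rasch DIF practice, not taken from the abstract): item difficulties estimated separately in the reference and focal groups become comparable only after the two scales are linked, typically by constraining a set of anchor items A, e.g. Σ_{i ∈ A} (b_i^ref − b_i^foc) = 0; DIF for a studied item j is then judged from the aligned difference b_j^ref − b_j^foc, so the choice of anchor items directly affects which items appear to show DIF.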
Peer reviewed
Direct link
Frick, Hannah; Strobl, Carolin; Zeileis, Achim – Educational and Psychological Measurement, 2015
Rasch mixture models can be a useful tool when checking the assumption of measurement invariance for a single Rasch model. They provide advantages compared to manifest differential item functioning (DIF) tests when the DIF groups are only weakly correlated with the manifest covariates available. Unlike in single Rasch models, estimation of Rasch…
Descriptors: Item Response Theory, Test Bias, Comparative Analysis, Scores
Peer reviewed
Direct link
Zumbo, Bruno D.; Liu, Yan; Wu, Amery D.; Shear, Benjamin R.; Olvera Astivia, Oscar L.; Ark, Tavinder K. – Language Assessment Quarterly, 2015
Methods for detecting differential item functioning (DIF) and item bias are typically used in the process of item analysis when developing new measures; adapting existing measures for different populations, languages, or cultures; or more generally validating test score inferences. In 2007 in "Language Assessment Quarterly," Zumbo…
Descriptors: Test Bias, Test Items, Holistic Approach, Models
Peer reviewed
PDF on ERIC Download full text
Baghaei, Purya; Kubinger, Klaus D. – Practical Assessment, Research & Evaluation, 2015
The present paper gives a general introduction to the linear logistic test model (Fischer, 1973), an extension of the Rasch model with linear constraints on item parameters, along with eRm (an R package to estimate different types of Rasch models; Mair, Hatzinger, & Mair, 2014) functions to estimate the model and interpret its parameters. The…
Descriptors: Item Response Theory, Models, Test Validity, Hypothesis Testing
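Background note (the standard Fischer formulation, not quoted from the article): the LLTM keeps the Rasch response function, P(X_vi = 1) = exp(θ_v − β_i) / (1 + exp(θ_v − β_i)), but constrains each item difficulty to a weighted sum of basic parameters, β_i = Σ_k q_ik η_k (plus a normalization constant), where the weights q_ik come from a known design matrix Q (e.g., coding the cognitive operations an item requires) and the η_k are estimated.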
Peer reviewed
PDF on ERIC Download full text
Steiner, Peter M.; Kim, Yongnam – Society for Research on Educational Effectiveness, 2014
In contrast to randomized experiments, the estimation of unbiased treatment effects from observational data requires an analysis that conditions on all confounding covariates. Conditioning on covariates can be done via standard parametric regression techniques or nonparametric matching like propensity score (PS) matching. The regression or…
Descriptors: Observation, Research Methodology, Test Bias, Regression (Statistics)
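Background note, with a minimal illustrative sketch (simulated data and hypothetical variable names; not the authors' implementation): the two conditioning strategies mentioned above are often combined by estimating propensity scores with a parametric model and then matching units nonparametrically on those scores.

# Illustrative propensity-score matching sketch (simulated data).
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(0)
n = 500
X = rng.normal(size=(n, 3))                                         # observed confounders
z = rng.binomial(1, 1 / (1 + np.exp(-(X[:, 0] - 0.5 * X[:, 1]))))   # treatment indicator
y = 2.0 * z + X @ np.array([1.0, -1.0, 0.5]) + rng.normal(size=n)   # outcome

# 1) Parametric step: estimate propensity scores by logistic regression.
ps = LogisticRegression().fit(X, z).predict_proba(X)[:, 1]

# 2) Nonparametric step: match each treated unit to the nearest control
#    on the estimated propensity score.
treated = np.where(z == 1)[0]
control = np.where(z == 0)[0]
nn = NearestNeighbors(n_neighbors=1).fit(ps[control].reshape(-1, 1))
_, idx = nn.kneighbors(ps[treated].reshape(-1, 1))
matched = control[idx.ravel()]

# 3) Average treatment effect on the treated, from the matched pairs.
att = np.mean(y[treated] - y[matched])
print(f"Estimated ATT: {att:.2f}")   # the simulated true effect is 2.0

In practice one would also check covariate balance after matching; the sketch only illustrates the division of labor between the parametric propensity-score model and the nonparametric matching step.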