ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	4
Since 2007 (last 20 years)	9

Descriptor

Differences	9
Test Length	9
Ability	5
Sample Size	5
Test Bias	5
Error of Measurement	4
Item Response Theory	4
Test Items	4
Comparative Analysis	3
Statistical Analysis	3
Achievement Tests	2
Computer Assisted Testing	2
Correlation	2
Grade 2	2
Grade 3	2
Grade 4	2
Models	2
Reading Tests	2
Scores	2
Simulation	2
Test Reliability	2
True Scores	2
Accuracy	1
COVID-19	1
Computation	1
More ▼

Source

Educational and Psychological…	2
Behavioral Research and…	1
Educational Measurement:…	1
Educational Sciences: Theory…	1
International Journal of…	1
Journal of Educational and…	1
Montgomery County Public…	1
ProQuest LLC	1

Publication Type

Reports - Research	7
Journal Articles	6
Numerical/Quantitative Data	2
Dissertations/Theses -…	1
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Early Childhood Education	2
Elementary Education	2
Grade 2	2
Grade 3	2
Grade 4	2
Intermediate Grades	2
Primary Education	2
Grade 1	1
Grade 5	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
High Schools	1
Junior High Schools	1
Kindergarten	1
Middle Schools	1
Secondary Education	1
More ▼

Audience

Location

Maryland	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

Advanced Placement…	1
Measures of Academic Progress	1

What Works Clearinghouse Rating

Showing all 9 results Save | Export

What Are the Conditions Associated with Subscore Added Value Noninvariance? Implications for Improving Subscore Interpretation Fairness

Peer reviewed

Direct link

Rios, Joseph A.; Miranda, Alejandra A. – Educational Measurement: Issues and Practice, 2021

Subscore added value analyses assume invariance across test taking populations; however, this assumption may be untenable in practice as differential subdomain relationships may be present among subgroups. The purpose of this simulation study was to understand the conditions associated with subscore added value noninvariance when manipulating: (1)…

Descriptors: Scores, Test Length, Ability, Correlation

Multidimensional Extension of Multiple Indicators Multiple Causes Models to Detect DIF

Peer reviewed

Direct link

Lee, Soo; Bulut, Okan; Suh, Youngsuk – Educational and Psychological Measurement, 2017

A number of studies have found multiple indicators multiple causes (MIMIC) models to be an effective tool in detecting uniform differential item functioning (DIF) for individual items and item bundles. A recently developed MIMIC-interaction model is capable of detecting both uniform and nonuniform DIF in the unidimensional item response theory…

Descriptors: Test Bias, Test Items, Models, Item Response Theory

The Matching Criterion Purification for Differential Item Functioning Analyses in a Large-Scale Assessment

Peer reviewed

Direct link

Lee, HyeSun; Geisinger, Kurt F. – Educational and Psychological Measurement, 2016

The current study investigated the impact of matching criterion purification on the accuracy of differential item functioning (DIF) detection in large-scale assessments. The three matching approaches for DIF analyses (block-level matching, pooled booklet matching, and equated pooled booklet matching) were employed with the Mantel-Haenszel…

Descriptors: Test Bias, Measurement, Accuracy, Statistical Analysis

Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

Peer reviewed

Direct link

Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…

Descriptors: Test Bias, Test Reliability, Performance, Scores

Student Outcomes on MAP Growth: Comparison of Virtual and In-Person Administrations

Download full text

James, Syretta R.; Liu, Shihching Jessica; Maina, Nyambura; Wade, Julie; Wang, Helen; Wilson, Heather; Wolanin, Natalie – Montgomery County Public Schools, 2021

The impact of the COVID-19 pandemic continues to overwhelm the functioning and outcomes of educational systems throughout the nation. The public education system is under particular scrutiny given that students, families, and educators are under considerable stress to maintain academic progress. Since the beginning of the crisis, school-systems…

Descriptors: Achievement Tests, COVID-19, Pandemics, Public Schools

Teacher Survey of the Accessibility and Text Features of the Computerized Oral Reading Evaluation (CORE). Technical Report #1601

Download full text

Kahn, Josh; Nese, Joseph T.; Alonzo, Julie – Behavioral Research and Teaching, 2016

There is strong theoretical support for oral reading fluency (ORF) as an essential building block of reading proficiency. The current and standard ORF assessment procedure requires that students read aloud a grade-level passage (˜ 250 words) in a one-to-one administration, with the number of words read correctly in 60 seconds constituting their…

Descriptors: Teacher Surveys, Oral Reading, Reading Tests, Computer Assisted Testing

Comparing Performances (Type I Error and Power) of IRT Likelihood Ratio SIBTEST and Mantel-Haenszel Methods in the Determination of Differential Item Functioning

Peer reviewed
PDF on ERIC

Download full text

Atalay Kabasakal, Kübra; Arsan, Nihan; Gök, Bilge; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2014

This simulation study compared the performances (Type I error and power) of Mantel-Haenszel (MH), SIBTEST, and item response theory-likelihood ratio (IRT-LR) methods under certain conditions. Manipulated factors were sample size, ability differences between groups, test length, the percentage of differential item functioning (DIF), and underlying…

Descriptors: Comparative Analysis, Item Response Theory, Statistical Analysis, Test Bias

Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

Direct link

Wang, Wei – ProQuest LLC, 2013

Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…

Descriptors: Equated Scores, Test Format, Test Items, Test Length

Modification of the Mantel-Haenszel and Logistic Regression DIF Procedures to Incorporate the SIBTEST Regression Correction

Peer reviewed

Direct link

DeMars, Christine E. – Journal of Educational and Behavioral Statistics, 2009

The Mantel-Haenszel (MH) and logistic regression (LR) differential item functioning (DIF) procedures have inflated Type I error rates when there are large mean group differences, short tests, and large sample sizes.When there are large group differences in mean score, groups matched on the observed number-correct score differ on true score,…

Descriptors: Regression (Statistics), Test Bias, Error of Measurement, True Scores

Alonzo, Julie	1
Arsan, Nihan	1
Atalay Kabasakal, Kübra	1
Bulut, Okan	1
DeMars, Christine E.	1
Geisinger, Kurt F.	1
Gök, Bilge	1
James, Syretta R.	1
Kahn, Josh	1
Kelecioglu, Hülya	1
Lee, HyeSun	1
Lee, Soo	1
Lee, Yi-Hsuan	1
Liu, Shihching Jessica	1
Maina, Nyambura	1
Miranda, Alejandra A.	1
Nese, Joseph T.	1
Rios, Joseph A.	1
Suh, Youngsuk	1
Wade, Julie	1
Wang, Helen	1
Wang, Wei	1
Wilson, Heather	1
Wolanin, Natalie	1
Zhang, Jinming	1
More ▼