Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 17 |
Descriptor
Test Bias | 17 |
Testing Programs | 17 |
Test Items | 7 |
Item Response Theory | 6 |
Mathematics Tests | 5 |
Models | 5 |
Scoring | 5 |
Test Construction | 5 |
Academic Achievement | 4 |
English | 4 |
Equated Scores | 4 |
Author
Dorans, Neil J. | 3 |
Akour, Mutasem | 1 |
Albano, Anthony D. | 1 |
Ariel, Adelaide | 1 |
Benítez, Isabel | 1 |
Bilir, Mustafa Kuzey | 1 |
Bolt, Sara E. | 1 |
Doorey, Nancy A. | 1 |
Eastwell, Peter | 1 |
Finch, W. Holmes | 1 |
French, Brian F. | 1 |
Publication Type
Journal Articles | 11 |
Reports - Research | 7 |
Reports - Descriptive | 5 |
Reports - Evaluative | 4 |
Numerical/Quantitative Data | 3 |
Dissertations/Theses -… | 1 |
Opinion Papers | 1 |
Education Level
Early Childhood Education | 3 |
Elementary Education | 3 |
Grade 3 | 3 |
Grade 4 | 3 |
Grade 5 | 3 |
Grade 6 | 3 |
Grade 7 | 3 |
Grade 8 | 3 |
Intermediate Grades | 3 |
Junior High Schools | 3 |
Middle Schools | 3 |
Assessments and Surveys
Program for International Student Assessment | 3 |
Graduate Record Examinations | 1 |
Law School Admission Test | 1 |
SAT (College Admission Test) | 1 |
Akour, Mutasem; Sabah, Saed; Hammouri, Hind – Journal of Psychoeducational Assessment, 2015
The purpose of this study was to apply two types of Differential Item Functioning (DIF) analysis, net and global DIF, as well as the Differential Step Functioning (DSF) framework, to real testing data to investigate measurement invariance related to test language. Data from the Program for International Student Assessment (PISA)-2006 polytomously scored…
Descriptors: Test Bias, Science Tests, Test Items, Scoring
Benítez, Isabel; Padilla, José-Luis – Journal of Mixed Methods Research, 2014
Differential item functioning (DIF) can undermine the validity of cross-lingual comparisons. While many efficient statistics for detecting DIF are available, few general findings explain DIF results. The objective of the article was to study DIF sources using a mixed methods design. The design involves a quantitative phase…
Descriptors: Foreign Countries, Mixed Methods Research, Test Bias, Cross Cultural Studies
Albano, Anthony D. – Journal of Educational Measurement, 2013
In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…
Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques
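For readers less familiar with the assumption this entry describes, here is a minimal sketch (in Python) of how an item-position effect can be written into a Rasch-type response function; it is an illustrative formulation, not the author's model, and the drift parameter is assumed:

    import numpy as np

    def prob_correct(theta, b, position, drift):
        """P(correct) under a Rasch-type model whose effective difficulty
        shifts linearly with the position at which the item appears."""
        return 1.0 / (1.0 + np.exp(-(theta - (b + drift * position))))

    # If drift is nonzero but ignored, item and person parameter estimates absorb the bias.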
Paek, Insu; Guo, Hongwen – Applied Psychological Measurement, 2011
This study examined how much improvement was attainable in the accuracy of differential item functioning (DIF) measures and in DIF detection rates for the Mantel-Haenszel procedure when the focal and reference groups have notably unbalanced sample sizes and the focal group has a fixed small sample that does not satisfy the minimum…
Descriptors: Test Bias, Accuracy, Reference Groups, Investigations
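As background for this entry, a minimal sketch of the Mantel-Haenszel DIF statistic on the ETS delta scale, built from one 2x2 table per matching-score level; the data layout and variable names are assumptions:

    import numpy as np

    def mantel_haenszel_dif(item, total, group):
        """item: 0/1 responses; total: matching scores; group: 'ref' or 'focal'."""
        item, total, group = np.asarray(item), np.asarray(total), np.asarray(group)
        num = den = 0.0
        for k in np.unique(total):                              # one 2x2 table per score level
            m = total == k
            A = np.sum((group[m] == "ref") & (item[m] == 1))    # reference correct
            B = np.sum((group[m] == "ref") & (item[m] == 0))    # reference incorrect
            C = np.sum((group[m] == "focal") & (item[m] == 1))  # focal correct
            D = np.sum((group[m] == "focal") & (item[m] == 0))  # focal incorrect
            N = A + B + C + D
            num += A * D / N
            den += B * C / N
        alpha_mh = num / den                                    # common odds ratio
        return -2.35 * np.log(alpha_mh)                         # MH D-DIF on the delta scale

With a very small focal group, many of these strata contain few or no focal examinees, which is exactly the accuracy problem the study investigates.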
New York State Education Department, 2016
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2016 Operational Tests. This report includes information about test content and test development, item (i.e.,…
Descriptors: Testing Programs, English, Language Arts, Mathematics Tests
French, Brian F.; Finch, W. Holmes – Journal of Educational Measurement, 2010
The purpose of this study was to examine the performance of differential item functioning (DIF) assessment in the presence of a multilevel structure that often underlies data from large-scale testing programs. Analyses were conducted using logistic regression (LR), a popular, flexible, and effective tool for DIF detection. Data were simulated…
Descriptors: Test Bias, Testing Programs, Evaluation, Measurement
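A compact sketch of the logistic-regression DIF test referred to here: compare a matching-score-only model with one that adds group and score-by-group terms, so a 2-df likelihood-ratio test covers both uniform and nonuniform DIF. Variable names and the use of statsmodels are assumptions:

    import numpy as np
    import statsmodels.api as sm
    from scipy.stats import chi2

    def lr_dif_test(item, total, focal):
        """item: 0/1 responses; total: matching score; focal: 1 = focal group, 0 = reference."""
        item, total, focal = (np.asarray(a, dtype=float) for a in (item, total, focal))
        z = (total - total.mean()) / total.std()           # standardize the matching variable
        X0 = sm.add_constant(z)                            # base model: matching score only
        X1 = sm.add_constant(np.column_stack([z, focal, z * focal]))
        m0 = sm.Logit(item, X0).fit(disp=0)
        m1 = sm.Logit(item, X1).fit(disp=0)
        lr_stat = 2.0 * (m1.llf - m0.llf)                  # likelihood-ratio statistic
        return lr_stat, chi2.sf(lr_stat, df=2)             # joint test of uniform + nonuniform DIF

Note this sketch ignores the multilevel (students-within-schools) structure that is the focus of the study; it is the standard single-level LR DIF procedure.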
New York State Education Department, 2015
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2015 Operational Tests. This report includes information about test content and test development, item (i.e.,…
Descriptors: Testing Programs, English, Language Arts, Mathematics Tests
Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011
Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…
Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency
New York State Education Department, 2014
This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2014 Operational Tests. This report includes information about test content and test development, item (i.e.,…
Descriptors: Testing Programs, English, Language Arts, Mathematics Tests
Doorey, Nancy A. – Council of Chief State School Officers, 2011
The work reported in this paper reflects a collaborative effort of many individuals representing multiple organizations. It began during a session at the October 2008 meeting of TILSA when a representative of a member state asked the group if any of their programs had experienced unexpected fluctuations in the annual state assessment scores, and…
Descriptors: Testing, Sampling, Expertise, Testing Programs
Bilir, Mustafa Kuzey – ProQuest LLC, 2009
This study uses a new psychometric model (mixture item response theory-MIMIC model) that simultaneously estimates differential item functioning (DIF) across manifest groups and latent classes. Current DIF detection methods investigate DIF from only one side: either across manifest groups (e.g., gender, ethnicity) or across latent classes…
Descriptors: Test Items, Testing Programs, Markov Processes, Psychometrics
Puhan, Gautam; Moses, Timothy P.; Yu, Lei; Dorans, Neil J. – Journal of Educational Measurement, 2009
This study examined the extent to which log-linear smoothing could improve the accuracy of differential item functioning (DIF) estimates in small samples of examinees. Examinee responses from a certification test were analyzed using White examinees in the reference group and African American examinees in the focal group. Using a simulation…
Descriptors: Test Items, Reference Groups, Testing Programs, Raw Scores
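A hedged sketch of the kind of log-linear presmoothing described here: the log of the expected raw-score frequency is fit as a low-degree polynomial of the score, which smooths the distribution while matching its first few moments. Function and argument names are placeholders:

    import numpy as np
    import statsmodels.api as sm

    def loglinear_smooth(counts, degree=3):
        """Fit log(expected frequency) as a degree-d polynomial of the score points."""
        scores = np.arange(len(counts), dtype=float)
        X = sm.add_constant(np.column_stack([scores ** d for d in range(1, degree + 1)]))
        fit = sm.GLM(counts, X, family=sm.families.Poisson()).fit()
        return fit.fittedvalues    # smoothed frequencies; the first `degree` moments are preserved

Smoothing each group's score distribution before computing the DIF statistic is the step the study evaluates for stabilizing small-sample estimates.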
Wyse, Adam E.; Mapuranga, Raymond – International Journal of Testing, 2009
Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when the data fit the Rasch model. Through simulations and an international…
Descriptors: Test Bias, Evaluation Methods, Test Items, Educational Assessment
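The ISI itself is defined in the article; as a rough illustration of its main ingredient, here is the Rasch item information function together with one simple overlap-style comparison of the curves estimated in two groups. The overlap summary below is an assumption for illustration, not the ISI formula:

    import numpy as np

    def rasch_info(theta, b):
        """Item information under the Rasch model: p(theta) * (1 - p(theta))."""
        p = 1.0 / (1.0 + np.exp(-(theta - b)))
        return p * (1.0 - p)

    theta = np.linspace(-4, 4, 81)
    info_ref = rasch_info(theta, b=0.2)      # difficulty estimated in the reference group
    info_focal = rasch_info(theta, b=0.7)    # difficulty estimated in the focal group
    # Shared area relative to the reference curve; values near 1 mean similar information.
    overlap = np.trapz(np.minimum(info_ref, info_focal), theta) / np.trapz(info_ref, theta)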
Dorans, Neil J.; Liu, Jinghua – Educational Testing Service, 2009
The equating process links scores from different editions of the same test. For testing programs that build nearly parallel forms to the same explicit content and statistical specifications and administer forms under the same conditions, the linkings between the forms are expected to be equatings. Score equity assessment (SEA) provides a useful…
Descriptors: Testing Programs, Mathematics Tests, Quality Control, Psychometrics
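For orientation, a minimal sketch of one simple linking method (linear equating under randomly equivalent groups); it is illustrative only and not necessarily the equating used in the programs the paper discusses:

    import numpy as np

    def linear_equate(x_new, scores_x, scores_y):
        """Map a new-form score onto the old-form scale by matching means and SDs."""
        mx, sx = np.mean(scores_x), np.std(scores_x)
        my, sy = np.mean(scores_y), np.std(scores_y)
        return my + (sy / sx) * (np.asarray(x_new) - mx)

Score equity assessment then examines whether such a linking is invariant across subgroups of examinees, which is the fairness question the paper raises.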
Eastwell, Peter – Science Education Review, 2006
Unsupervised summative assessment has become a feature of the educational landscape in various educational jurisdictions around the world, including the state of Queensland in Australia. However, I suggest that it is an invalid and unnecessary practice that can negatively impact student affect, call for a reconsideration of its use, and…
Descriptors: Summative Evaluation, Science Education, Educational Practices, Learning Activities