ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	25

Descriptor

Testing Programs	25
Test Bias	17
Test Items	8
Academic Achievement	7
Item Response Theory	7
Equated Scores	6
Racial Bias	6
Mathematics Tests	5
Models	5
Sample Size	5
Scoring	5
Test Construction	5
Testing	5
Educational Improvement	4
Elementary Secondary Education	4
English	4
Error of Measurement	4
Ethnicity	4
Grade 7	4
Psychometrics	4
Scaling	4
State Standards	4
Test Reliability	4
Access to Computers	3
Achievement Tests	3
More ▼

Source

Department of Defense…	3
Journal of Educational…	3
New York State Education…	3
ProQuest LLC	2
Applied Measurement in…	1
Applied Psychological…	1
Communique	1
Council of Chief State School…	1
ETS Research Report Series	1
Educational Measurement:…	1
Educational Researcher	1
Educational Testing Service	1
Educational and Psychological…	1
International Journal of…	1
Journal of Educational and…	1
Journal of Mixed Methods…	1
Journal of Psychoeducational…	1
Science Education Review	1
More ▼

Publication Type

Journal Articles	15
Reports - Research	13
Numerical/Quantitative Data	6
Reports - Descriptive	6
Reports - Evaluative	4
Dissertations/Theses -…	2
Opinion Papers	1

Education Level

Grade 7	7
Grade 4	6
Grade 5	6
Grade 6	6
Grade 8	6
Elementary Secondary Education	5
Early Childhood Education	3
Elementary Education	3
Grade 10	3
Grade 11	3
Grade 12	3
Grade 3	3
Grade 9	3
Higher Education	3
Intermediate Grades	3
Junior High Schools	3
Middle Schools	3
Primary Education	3
Secondary Education	3
Postsecondary Education	1
More ▼

Audience

Location

New York	3
United States	3
Australia	1
California	1
Denmark	1
France	1
Slovakia	1
Spain	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	3
California Achievement Tests	1
Graduate Record Examinations	1
Law School Admission Test	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 25 results Save | Export

Reducing Racism-Based Stress in Black Youth during the Assessment Process

Direct link

Biddanda, Haley C. – Communique, 2022

Race-based traumatic stress, also called racial trauma, refers to "mental and emotional injury caused by encounters with racial bias and ethnic discrimination, racism, and hate crimes" (Mental Health America, n.d.). While much research on racism-based stress in schools focuses on teachers, school psychologists can just as easily cause…

Descriptors: Racial Discrimination, Anxiety, African American Students, Testing Programs

Net and Global Differential Item Functioning in PISA Polytomously Scored Science Items: Application of the Differential Step Functioning Framework

Peer reviewed

Direct link

Akour, Mutasem; Sabah, Saed; Hammouri, Hind – Journal of Psychoeducational Assessment, 2015

The purpose of this study was to apply two types of Differential Item Functioning (DIF), net and global DIF, as well as the framework of Differential Step Functioning (DSF) to real testing data to investigate measurement invariance related to test language. Data from the Program for International Student Assessment (PISA)-2006 polytomously scored…

Descriptors: Test Bias, Science Tests, Test Items, Scoring

Analysis of Nonequivalent Assessments across Different Linguistic Groups Using a Mixed Methods Approach: Understanding the Causes of Differential Item Functioning by Cognitive Interviewing

Peer reviewed

Direct link

Benítez, Isabel; Padilla, José-Luis – Journal of Mixed Methods Research, 2014

Differential item functioning (DIF) can undermine the validity of cross-lingual comparisons. While a lot of efficient statistics for detecting DIF are available, few general findings have been found to explain DIF results. The objective of the article was to study DIF sources by using a mixed method design. The design involves a quantitative phase…

Descriptors: Foreign Countries, Mixed Methods Research, Test Bias, Cross Cultural Studies

The Long-Term Sustainability of Different Item Response Theory Scaling Methods

Peer reviewed

Direct link

Keller, Lisa A.; Keller, Robert R. – Educational and Psychological Measurement, 2011

This article investigates the accuracy of examinee classification into performance categories and the estimation of the theta parameter for several item response theory (IRT) scaling techniques when applied to six administrations of a test. Previous research has investigated only two administrations; however, many testing programs equate tests…

Descriptors: Item Response Theory, Scaling, Sustainability, Classification

Multilevel Modeling of Item Position Effects

Peer reviewed

Direct link

Albano, Anthony D. – Journal of Educational Measurement, 2013

In many testing programs it is assumed that the context or position in which an item is administered does not have a differential effect on examinee responses to the item. Violations of this assumption may bias item response theory estimates of item and person parameters. This study examines the potentially biasing effects of item position. A…

Descriptors: Test Items, Item Response Theory, Test Format, Questioning Techniques

Accuracy of DIF Estimates and Power in Unbalanced Designs Using the Mantel-Haenszel DIF Detection Procedure

Peer reviewed

Direct link

Paek, Insu; Guo, Hongwen – Applied Psychological Measurement, 2011

This study examined how much improvement was attainable with respect to accuracy of differential item functioning (DIF) measures and DIF detection rates in the Mantel-Haenszel procedure when employing focal and reference groups with notably unbalanced sample sizes where the focal group has a fixed small sample which does not satisfy the minimum…

Descriptors: Test Bias, Accuracy, Reference Groups, Investigations

New York State Testing Program 2016: English Language Arts and Mathematics Grades 3-8. Technical Report

Download full text

New York State Education Department, 2016

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2016 Operational Tests. This report includes information about test content and test development, item (i.e.,…

Descriptors: Testing Programs, English, Language Arts, Mathematics Tests

Hierarchical Logistic Regression: Accounting for Multilevel Data in DIF Detection

Peer reviewed

Direct link

French, Brian F.; Finch, W. Holmes – Journal of Educational Measurement, 2010

The purpose of this study was to examine the performance of differential item functioning (DIF) assessment in the presence of a multilevel structure that often underlies data from large-scale testing programs. Analyses were conducted using logistic regression (LR), a popular, flexible, and effective tool for DIF detection. Data were simulated…

Descriptors: Test Bias, Testing Programs, Evaluation, Measurement

New York State Testing Program 2015: English Language Arts and Mathematics Grades 3-8. Technical Report

Download full text

New York State Education Department, 2015

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2015 Operational Tests. This report includes information about test content and test development, item (i.e.,…

Descriptors: Testing Programs, English, Language Arts, Mathematics Tests

First Language of Test Takers and Fairness Assessment Procedures

Peer reviewed

Direct link

Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011

Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…

Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency

Asian American and Pacific Islander Students: Equity and the Achievement Gap

Peer reviewed

Direct link

Pang, Valerie Ooka; Han, Peggy P.; Pang, Jennifer M. – Educational Researcher, 2011

The authors studied more than 1 million Asian American and Pacific Islander (AAPI) and White seventh graders in a statewide California testing program between 2003 and 2008, examining their reading and math achievement. AAPI student performance is often reported as an aggregate in discussions of the success of schoolchildren and issues of racial…

Descriptors: Achievement Gap, Testing Programs, Pacific Islanders, Grade 7

New York State Testing Program 2014: English Language Arts and Mathematics Grades 3-8. Technical Report

Download full text

New York State Education Department, 2014

This technical report provides detailed information regarding the technical, statistical, and measurement attributes of the New York State Testing Program (NYSTP) for the Grades 3-8 Common Core English Language Arts (ELA) and Mathematics 2014 Operational Tests. This report includes information about test content and test development, item (i.e.,…

Descriptors: Testing Programs, English, Language Arts, Mathematics Tests

Addressing Two Commonly Unrecognized Sources of Score Instability in Annual State Assessments

Download full text

Doorey, Nancy A. – Council of Chief State School Officers, 2011

The work reported in this paper reflects a collaborative effort of many individuals representing multiple organizations. It began during a session at the October 2008 meeting of TILSA when a representative of a member state asked the group if any of their programs had experienced unexpected fluctuations in the annual state assessment scores, and…

Descriptors: Testing, Sampling, Expertise, Testing Programs

Mixture Item Response Theory-MIMIC Model: Simultaneous Estimation of Differential Item Functioning for Manifest Groups and Latent Classes

Direct link

Bilir, Mustafa Kuzey – ProQuest LLC, 2009

This study uses a new psychometric model (mixture item response theory-MIMIC model) that simultaneously estimates differential item functioning (DIF) across manifest groups and latent classes. Current DIF detection methods investigate DIF from only one side, either across manifest groups (e.g., gender, ethnicity, etc.), or across latent classes…

Descriptors: Test Items, Testing Programs, Markov Processes, Psychometrics

Using Log-Linear Smoothing to Improve Small-Sample DIF Estimation

Peer reviewed

Direct link

Puhan, Gautam; Moses, Timothy P.; Yu, Lei; Dorans, Neil J. – Journal of Educational Measurement, 2009

This study examined the extent to which log-linear smoothing could improve the accuracy of differential item functioning (DIF) estimates in small samples of examinees. Examinee responses from a certification test were analyzed using White examinees in the reference group and African American examinees in the focal group. Using a simulation…

Descriptors: Test Items, Reference Groups, Testing Programs, Raw Scores

Previous Page | Next Page »

Pages: 1 | 2

Dorans, Neil J.	3
Akour, Mutasem	1
Albano, Anthony D.	1
Ariel, Adelaide	1
Benítez, Isabel	1
Biddanda, Haley C.	1
Bilir, Mustafa Kuzey	1
Blatt, Jessica	1
Bolt, Sara E.	1
Doorey, Nancy A.	1
Eastwell, Peter	1
Finch, W. Holmes	1
French, Brian F.	1
Guo, Hongwen	1
Haberman, Shelby	1
Hammouri, Hind	1
Han, Peggy P.	1
Keller, Lisa A.	1
Keller, Robert R.	1
Kim, Sooyeon	1
Liang, Longjuan	1
Liu, Jinghua	1
Mapuranga, Raymond	1
Moses, Timothy P.	1
Padilla, José-Luis	1
More ▼