Publication Date
In 2025: 0
Since 2024: 2
Since 2021 (last 5 years): 6
Since 2016 (last 10 years): 10
Since 2006 (last 20 years): 23
Descriptor
Test Bias: 561
Testing Problems: 561
Test Validity: 180
Elementary Secondary Education: 161
Standardized Tests: 135
Test Interpretation: 94
Intelligence Tests: 93
Achievement Tests: 90
Test Construction: 90
Culture Fair Tests: 82
Minority Groups: 81
Education Level
Elementary Secondary Education: 7
Postsecondary Education: 4
Higher Education: 3
Secondary Education: 2
Audience
Practitioners: 31
Researchers: 26
Teachers: 8
Administrators: 5
Counselors: 5
Policymakers: 2
Parents: 1
Support Staff: 1
Location
California: 11
Canada: 8
Florida: 5
Illinois: 3
Netherlands: 3
South Africa: 3
United States: 3
Arizona: 2
Australia: 2
China: 2
Japan: 2
James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024
In this study, a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that is appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…
Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size
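The beta-uni statistic the abstract mentions can be sketched as a weighted sum, over matching-score levels, of reference-minus-focal differences in expected studied-item scores. This is an illustrative sketch only, not the authors' code: the function name, the toy numbers, and the assumption that the means are already regression-corrected are all assumptions for demonstration.

```python
def beta_uni(prop_focal, mean_ref, mean_foc):
    """Illustrative SIBTEST-style beta-uni: a weighted difference in mean
    studied-item scores between reference and focal groups.

    prop_focal[k]: proportion of focal-group examinees at matching level k
                   (used as weights, summing to 1)
    mean_ref[k]:   (regression-corrected) mean item score, reference group
    mean_foc[k]:   (regression-corrected) mean item score, focal group
    """
    return sum(w * (r - f) for w, r, f in zip(prop_focal, mean_ref, mean_foc))

# Toy data: three matching-score levels; a small, consistent
# reference-group advantage on the studied item
b = beta_uni([0.2, 0.5, 0.3], [0.4, 0.6, 0.8], [0.3, 0.5, 0.75])
```

A positive value indicates DIF favoring the reference group; the study's contribution is rescaling this raw value so a single set of flagging heuristics works across item response models.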
Kim, Sooyeon; Walker, Michael E. – Educational Measurement: Issues and Practice, 2022
Test equating requires collecting data to link the scores from different forms of a test. Problems arise when equating samples are not equivalent and the test forms to be linked share no common items by which to measure or adjust for the group nonequivalence. Using data from five operational test forms, we created five pairs of research forms for…
Descriptors: Ability, Tests, Equated Scores, Testing Problems
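For context on what "linking the scores from different forms" involves, a minimal mean-sigma linear equating sketch is shown below. This is the textbook single-group case, not the paper's method (the paper's difficulty is precisely that its groups are nonequivalent and the forms share no common items); all names and numbers here are illustrative.

```python
import statistics

def linear_equate(x, scores_x, scores_y):
    """Map a score x on form X to the form-Y scale by matching the means
    and standard deviations of the two observed score distributions."""
    mx, my = statistics.mean(scores_x), statistics.mean(scores_y)
    sx, sy = statistics.pstdev(scores_x), statistics.pstdev(scores_y)
    return my + (sy / sx) * (x - mx)

# Toy data: form Y scores are shifted and spread relative to form X
form_x = [10, 20, 30, 40]
form_y = [15, 35, 55, 75]
equated = linear_equate(30, form_x, form_y)
```

When the equating samples differ in ability, this simple transformation confounds form difficulty with group differences, which is the problem the authors investigate.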
Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024
Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…
Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement
Salmani Nodoushan, Mohammad Ali – Online Submission, 2021
This paper follows a line of logical argumentation to claim that what Samuel Messick conceptualized about construct validation has probably been misunderstood by some educational policy makers, practicing educators, and classroom teachers. It argues that, while Messick's unified theory of test validation aimed at (a) warning educational…
Descriptors: Construct Validity, Test Theory, Test Use, Affordances
Daniel Katz; Anne Corinne Huggins-Manley; Walter Leite – Grantee Submission, 2022
According to the Standards for Educational and Psychological Testing (2014), one aspect of test fairness concerns examinees having comparable opportunities to learn prior to taking tests. Meanwhile, many researchers are developing platforms enhanced by artificial intelligence (AI) that can personalize curriculum to individual student needs. This…
Descriptors: High Stakes Tests, Test Bias, Testing Problems, Prior Learning
Daniel Katz; Anne Corinne Huggins-Manley; Walter Leite – Applied Measurement in Education, 2022
According to the "Standards for Educational and Psychological Testing" (2014), one aspect of test fairness concerns examinees having comparable opportunities to learn prior to taking tests. Meanwhile, many researchers are developing platforms enhanced by artificial intelligence (AI) that can personalize curriculum to individual student…
Descriptors: High Stakes Tests, Test Bias, Testing Problems, Prior Learning
Luo, Yong; Liang, Xinya – Measurement: Interdisciplinary Research and Perspectives, 2019
Current methods that simultaneously model differential testlet functioning (DTLF) and differential item functioning (DIF) constrain the variances of latent ability and testlet effects to be equal between the focal and the reference groups. Such a constraint can be stringent and unrealistic with real data. In this study, we propose a multigroup…
Descriptors: Test Items, Item Response Theory, Test Bias, Models
Zhao, Cecilia Guanfang; Liu, Carina Jiayu – Language Testing, 2019
Celpe-Bras is the exam for the certification of proficiency in Portuguese as a foreign language, and it is the only Portuguese proficiency test recognized by the Brazilian government (Ministério da Educação, 2013). Given the recent growth of interest and its unique design as a large-scale proficiency test, this article provides a general…
Descriptors: Portuguese, Second Language Learning, Language Proficiency, Language Tests
Banks, Kathleen – Practical Assessment, Research & Evaluation, 2015
This article introduces practitioners and researchers to the topic of missing data in the context of differential item functioning (DIF), reviews the current literature on the issue, discusses implications of the review, and offers suggestions for future research. A total of nine studies were reviewed. All of these studies determined what effect…
Descriptors: Test Bias, Data, Literature Reviews, Evaluation Research
Longford, Nicholas T. – Journal of Educational and Behavioral Statistics, 2014
A method for medical screening is adapted to differential item functioning (DIF). Its essential elements are explicit declarations of the level of DIF that is acceptable and of the loss function that quantifies the consequences of the two kinds of inappropriate classification of an item. Instead of a single level and a single function, sets of…
Descriptors: Test Items, Test Bias, Simulation, Hypothesis Testing
Safari, Parvin – Interchange: A Quarterly Review of Education, 2016
Recently, there has been a shift from traditional language testing approaches, with their focus on psychometric properties, toward critical language testing (CLT), with its social-practice nature. CLT treats tests not as neutral devices but as instruments of power and control, tied to authorities' policy agendas to shape individuals' and…
Descriptors: Foreign Countries, Language Tests, Educational Practices, High Stakes Tests
El Masri, Yasmine H.; Baird, Jo-Anne; Graesser, Art – Assessment in Education: Principles, Policy & Practice, 2016
We investigate the extent to which language versions (English, French and Arabic) of the same science test are comparable in terms of item difficulty and demands. We argue that language is an inextricable part of the scientific literacy construct, be it intended or not by the examiner. This argument has considerable implications for methodologies…
Descriptors: International Assessment, Difficulty Level, Test Items, Language Variation
Popham, W. James – Phi Delta Kappan, 2014
The tests we use to evaluate student achievement may well be sound measures of what students know, but they are faulty indicators at best of how well they have been taught. A remedy to this situation of judging teachers by the performance of their students on high-stakes tests may be in hand already. We should look to the methods successfully…
Descriptors: High Stakes Tests, Academic Achievement, Teacher Evaluation, Evaluation Methods
Emenogu, Barnabas C.; Falenchuk, Olesya; Childs, Ruth A. – Alberta Journal of Educational Research, 2010
Most implementations of the Mantel-Haenszel differential item functioning procedure delete records with missing responses or replace missing responses with scores of 0. These treatments of missing data make strong assumptions about the causes of the missing data. Such assumptions may be particularly problematic when groups differ in their patterns…
Descriptors: Foreign Countries, Test Bias, Test Items, Educational Testing
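The Mantel-Haenszel procedure this entry discusses reduces, at its core, to a common odds ratio pooled over matching-score levels. A minimal sketch is below; the counts are invented toy data, and real implementations add continuity corrections, standard errors, and the missing-data handling that is the entry's actual concern.

```python
import math

def mantel_haenszel_dif(tables):
    """Mantel-Haenszel common odds ratio across matching-score levels.

    tables: list of (A, B, C, D) tuples, one per score level k:
      A = reference correct, B = reference incorrect,
      C = focal correct,     D = focal incorrect.
    Returns (alpha_MH, delta_MH); delta is on the ETS delta scale,
    where |delta| >= 1.5 conventionally flags large ("C") DIF.
    """
    num = sum(A * D / (A + B + C + D) for A, B, C, D in tables if A + B + C + D)
    den = sum(B * C / (A + B + C + D) for A, B, C, D in tables if A + B + C + D)
    alpha = num / den
    delta = -2.35 * math.log(alpha)
    return alpha, delta

# Toy data: three score levels with a mild reference-group advantage
tables = [(40, 10, 30, 20), (30, 20, 25, 25), (20, 30, 15, 35)]
alpha, delta = mantel_haenszel_dif(tables)
```

Deleting records with missing responses, or scoring them 0, changes these per-level counts directly, which is why the treatment of missing data matters for the resulting DIF classification.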
Penfield, Randall D.; Gattamorta, Karina; Childs, Ruth A. – Educational Measurement: Issues and Practice, 2009
Traditional methods for examining differential item functioning (DIF) in polytomously scored test items yield a single item-level index of DIF and thus provide no information concerning which score levels are implicated in the DIF effect. To address this limitation of DIF methodology, the framework of differential step functioning (DSF) has…
Descriptors: Test Bias, Test Items, Evaluation Methods, Scores