ERIC - Search Results

Showing 1 to 15 of 106 results

Defining Test-Score Interpretation, Use, and Claims: Delphi Study for the Validity Argument

Peer reviewed

Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023

Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…

Descriptors: Test Interpretation, Scores, Test Use, Test Validity

Instruction-Tuned Large-Language Models for Quality Control in Automatic Item Generation: A Feasibility Study

Peer reviewed

Guher Gorgun; Okan Bulut – Educational Measurement: Issues and Practice, 2025

Automatic item generation may supply many items instantly and efficiently to assessment and learning environments. Yet, the evaluation of item quality remains a bottleneck for deploying generated items in learning and assessment settings. In this study, we investigated the utility of using large-language models, specifically Llama 3-8B, for…

Descriptors: Artificial Intelligence, Quality Control, Technology Uses in Education, Automation

Supporting the Interpretive Validity of Student-Level Claims in Science Assessment with Tiered Claim Structures

Peer reviewed

Student, Sanford R.; Gong, Brian – Educational Measurement: Issues and Practice, 2022

We address two persistent challenges in large-scale assessments of the Next Generation Science Standards: (a) the validity of score interpretations that target the standards broadly and (b) how to structure claims for assessments of this complex domain. The NGSS pose a particular challenge for specifying claims about students that evidence from…

Descriptors: Science Tests, Test Validity, Test Items, Test Construction

Disrupted Data: Using Longitudinal Assessment Systems to Monitor Test Score Quality

Peer reviewed

An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022

Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…

Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies

Reframing Research and Assessment Practices: Advancing an Antiracist and Anti-Ableist Research Agenda

Peer reviewed

Angela Johnson; Elizabeth Barker; Marcos Viveros Cespedes – Educational Measurement: Issues and Practice, 2024

Educators and researchers strive to build policies and practices on data and evidence, especially on academic achievement scores. When assessment scores are inaccurate for specific student populations or when scores are inappropriately used, even data-driven decisions will be misinformed. To maximize the impact of the research-practice-policy…

Descriptors: Equal Education, Inclusion, Evaluation Methods, Error of Measurement

Digital Module 30: Validity and Educational Testing--Purposes and Uses of Educational Tests

Peer reviewed

Lewis, Jennifer; Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2022

This module is designed for educators, educational researchers, and psychometricians who would like to develop an understanding of the basic concepts of validity theory, test validation, and documenting a "validity argument." It also describes how an in-depth understanding of the purposes and uses of educational tests sets the foundation…

Descriptors: Test Validity, Tests, Testing Problems, Faculty Development

When Should I Use a Measure to Support Instructional Improvement at Scale? The Importance of Considering Both Intended and Actual Use in Validity Arguments

Peer reviewed

Ing, Marsha; Chinen, Starlie; Jackson, Kara; Smith, Thomas M. – Educational Measurement: Issues and Practice, 2021

Despite the ease of accessing a wide range of measures, little attention is given to validity arguments when considering whether to use the measure for a new purpose or in a different context. Making a validity argument has historically focused on the intended interpretation and use. There has been a press to consider both the intended and actual…

Descriptors: Instructional Improvement, Measures (Individuals), Test Validity, Test Interpretation

The Effect of Drag-and-Drop Item Features on Test-Taker Performance and Response Strategies

Peer reviewed

Arslan, Burcu; Jiang, Yang; Keehner, Madeleine; Gong, Tao; Katz, Irvin R.; Yan, Fred – Educational Measurement: Issues and Practice, 2020

Computer-based educational assessments often include items that involve drag-and-drop responses. There are different ways that drag-and-drop items can be laid out and different choices that test developers can make when designing these items. Currently, these decisions are based on experts' professional judgments and design constraints, rather…

Descriptors: Test Items, Computer Assisted Testing, Test Format, Decision Making

Growth across Grades and Common Item Grade Alignment in Vertical Scaling Using the Rasch Model

Peer reviewed

Sanford R. Student; Derek C. Briggs; Laurie Davis – Educational Measurement: Issues and Practice, 2025

Vertical scales are frequently developed using common item nonequivalent group linking. In this design, one can use upper-grade, lower-grade, or mixed-grade common items to estimate the linking constants that underlie the absolute measurement of growth. Using the Rasch model and a dataset from Curriculum Associates' i-Ready Diagnostic in math in…

Descriptors: Elementary School Mathematics, Elementary School Students, Middle School Mathematics, Middle School Students

The Impact of the COVID-19 Pandemic on American Board of Surgery's Oral Certifying Exams

Peer reviewed

Barry, Carol L.; Jones, Andrew T.; Ibáñez, Beatriz; Grambau, Marni; Buyske, Jo – Educational Measurement: Issues and Practice, 2022

In response to the COVID-19 pandemic, the American Board of Surgery (ABS) shifted from in-person to remote administrations of the oral certifying exam (CE). Although the overall exam architecture remains the same, there are a number of differences in administration and staffing costs, exam content, security concerns, and the tools used to give the…

Descriptors: COVID-19, Pandemics, Computer Assisted Testing, Verbal Tests

Using the "Joint Standards" to Design Postsecondary Assessments with Evidence of Validity and Reliability: An Approach to CAEP Accreditation

Peer reviewed

Wilkerson, Judy R. – Educational Measurement: Issues and Practice, 2020

Validity and reliability are a major focus in teacher education accreditation by the Council for Accreditation of Educator Preparation (CAEP). CAEP requires the use of "accepted research standards," but many faculty and administrators are unsure how to meet this requirement. The Standards for Educational and Psychological Testing…

Descriptors: Test Construction, Test Validity, Test Reliability, Teacher Education Programs

Clarifying the Terminology of Validity and the Investigative Stages of Validation

Peer reviewed

Russell, Michael – Educational Measurement: Issues and Practice, 2022

Despite agreement about the central importance of validity for educational and psychological testing, consensus regarding the definition of validity remains elusive. Differences in the definition of validity are examined, revealing that a potential cause of disagreement stems from differences in word use and meanings given to key terms commonly…

Descriptors: Test Validity, Psychological Testing, Educational Testing, Vocabulary

Examining Effectiveness and Validity of Accommodations for English Language Learners in Mathematics: An Evidence-Based Computer Accommodation Decision System

Peer reviewed

Abedi, Jamal; Zhang, Yu; Rowe, Susan E.; Lee, Hansol – Educational Measurement: Issues and Practice, 2020

Research indicates that the performance-gap between English Language Learners (ELLs) and their non-ELL peers is partly due to ELLs' difficulty in understanding assessment language. Accommodations have been shown to narrow this performance-gap, but many accommodations studies have not used a randomized design and are based on relatively small…

Descriptors: English Language Learners, Achievement Gap, Mathematics Tests, Standards

An Evaluative Framework for Reviewing Fairness Standards and Practices in Educational Tests

Peer reviewed

Jonson, Jessica L.; Trantham, Pamela; Usher-Tate, Betty Jean – Educational Measurement: Issues and Practice, 2019

One of the substantive changes in the 2014 Standards for Educational and Psychological Testing was the elevation of fairness in testing as a foundational element of practice in addition to validity and reliability. Previous research indicates that testing practices often do not align with professional standards and guidelines. Therefore, to raise…

Descriptors: Culture Fair Tests, Test Validity, Test Reliability, Intelligence Tests

Digital Module 09: Sociocognitive Assessment for Diverse Populations

Peer reviewed

Mislevy, Robert J.; Oliveri, Maria Elena – Educational Measurement: Issues and Practice, 2019

In this digital ITEMS module, Dr. Robert [Bob] Mislevy and Dr. Maria Elena Oliveri introduce and illustrate a sociocognitive perspective on educational measurement, which focuses on a variety of design and implementation considerations for creating fair and valid assessments for learners from diverse populations with diverse sociocultural…

Descriptors: Educational Testing, Reliability, Test Validity, Test Reliability
