ERIC - Search Results

Showing 1 to 15 of 203 results

Defining Test-Score Interpretation, Use, and Claims: Delphi Study for the Validity Argument

Peer reviewed

Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023

Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…

Descriptors: Test Interpretation, Scores, Test Use, Test Validity

Building a Validity Argument for the TOEFL Junior® Tests. TOEFL® Research Report. RR-102. ETS RR-24-05

Peer reviewed

Ching-Ni Hsieh – ETS Research Report Series, 2024

The TOEFL Junior® tests are designed to evaluate young language students' English reading, listening, speaking, and writing skills in an English-medium secondary instructional context. This paper articulates a validity argument constructed to support the use and interpretation of the TOEFL Junior test scores for the purpose of placement, progress…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

A Historic Review and Empirical Revitalization of the Stages of Concern Questionnaire

Peer reviewed

Kent Anderson Seidel – School Leadership Review, 2025

This paper examines one of three central diagnostic tools of the Concerns Based Adoption Model, the Stages of Concern Questionnaire (SoCQ). The SoCQ was developed with a focus on K12 education. It has been used widely since it was developed in 1973, in early childhood, higher education, medical, business, community, and military settings. The SoCQ…

Descriptors: Questionnaires, Educational Change, Educational Innovation, Intervention

Assessing the Validity of Test Scores Using Response Process Data from an Eye-Tracking Study: A New Approach

Peer reviewed

Yaneva, Victoria; Clauser, Brian E.; Morales, Amy; Paniagua, Miguel – Advances in Health Sciences Education, 2022

Understanding the response process used by test takers when responding to multiple-choice questions (MCQs) is particularly important in evaluating the validity of score interpretations. Previous authors have recommended eye-tracking technology as a useful approach for collecting data on the processes test takers use to respond to test questions.…

Descriptors: Eye Movements, Artificial Intelligence, Scores, Test Interpretation

Investigating the Process of Consequential Validity with the Ambassador Questionnaire

Kuhn, Melissa Gayle – ProQuest LLC, 2022

Validity in psychometrics refers to the degree to which evidence and theory support the interpretations drawn from a test, and Messick's Contemporary Validity Theory (1994) includes several facets with well-established evidence collection methods. However, there is a lack of consensus on appropriate methods of evaluating the facet of…

Descriptors: Test Validity, Psychometrics, Test Interpretation, Scores

Does Timed Testing Affect the Interpretation of Efficiency Scores?--A GLMM Analysis of Reading Components

Peer reviewed

Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024

The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…

Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation

A Note on the Use of Categorical Subscores

Peer reviewed

Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025

Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…

Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment

Using Item Scores and Distractors to Detect Aberrant Behavior

Gorney, Kylie – ProQuest LLC, 2023

Aberrant behavior refers to any type of unusual behavior that would not be expected under normal circumstances. In educational and psychological testing, such behaviors have the potential to severely bias the aberrant examinee's test score while also jeopardizing the test scores of countless others. It is therefore crucial that aberrant examinees…

Descriptors: Behavior Problems, Educational Testing, Psychological Testing, Test Bias

The Broad Autism Phenotype--International Test (BAP-IT): A Two-Domain-Based Test for the Assessment of the Broad Autism Phenotype

Peer reviewed

Marta Godoy-Giménez; Ángel García-Pérez; Fernando Cañadas; Angeles F. Estévez; Pablo Sayans-Jiménez – Autism: The International Journal of Research and Practice, 2024

The broad autism phenotype is the phenotypic expression of the primary characteristics of autism. However, currently available tests do not agree with the two-domain operationalization of broad autism phenotype or autism, and their internal structure has shown instability across applications. This study presents the Broad Autism…

Descriptors: Autism Spectrum Disorders, Genetics, Diagnostic Tests, Foreign Countries

Development of the Quantitative Modelling Observation Protocol (QMOP) for Undergraduate Biology Courses: Validity Evidence for Score Interpretation and Uses

Peer reviewed

Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025

Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…

Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques

Content Validity of Creativity Self-Report Questionnaires from PISA 2022

Peer reviewed

B. Goecke; S. Weiss; B. Barbot – Journal of Creative Behavior, 2025

The present paper questions the content validity of the eight creativity-related self-report scales available in PISA 2022's context questionnaire and provides a set of considerations for researchers interested in using these indexes. Specifically, we point out some threats to the content validity of these scales (e.g., "creative thinking…

Descriptors: Creativity, Creativity Tests, Questionnaires, Content Validity

Re-Examining Measurement Invariance of School Climate Surveys across Race/Ethnicity

Peer reviewed

Stephen M. Leach; Jason C. Immekus; Jeffrey C. Valentine; Prathiba Batley; Dena Dossett; Tamara Lewis; Thomas Reece – Assessment for Effective Intervention, 2025

Educators commonly use school climate survey scores to inform and evaluate interventions for equitably improving learning and reducing educational disparities. Unfortunately, validity evidence to support these (and other) score uses often falls short. In response, Whitehouse et al. proposed a collaborative, two-part validity testing framework for…

Descriptors: School Surveys, Measurement, Hierarchical Linear Modeling, Educational Environment

Disrupted Data: Using Longitudinal Assessment Systems to Monitor Test Score Quality

Peer reviewed

An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022

Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…

Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies

Making Sense of Spring 2021 Assessment Results

Dadey, Nathan; Keng, Leslie; Boyer, Michelle; Marion, Scott – National Center for the Improvement of Educational Assessment, 2021

State summative educational assessment is about to begin in earnest. Rightfully, many are raising questions about the quality, meaning, and appropriate use of the assessment results. This document was written to support state educational agencies (SEAs) and their assessment providers in devising effective and efficient analysis plans. This…

Descriptors: Educational Assessment, Summative Evaluation, Student Evaluation, Test Use

Comparison of Two Approaches to Interpretive Use Arguments

Peer reviewed

Carney, Michele; Crawford, Angela; Siebert, Carl; Osguthorpe, Rich; Thiede, Keith – Applied Measurement in Education, 2019

The "Standards for Educational and Psychological Testing" recommend an argument-based approach to validation that involves a clear statement of the intended interpretation and use of test scores, the identification of the underlying assumptions and inferences in that statement--termed the interpretation/use argument, and gathering of…

Descriptors: Inquiry, Test Interpretation, Validity, Scores

