ERIC - Search Results

Publication Date

In 2025	3
Since 2024	6
Since 2021 (last 5 years)	34
Since 2016 (last 10 years)	87
Since 2006 (last 20 years)	159

Descriptor

Scores	650
Test Interpretation	650
Test Validity	156
Test Results	149
Elementary Secondary Education	141
Achievement Tests	135
Standardized Tests	121
Test Use	98
Test Construction	97
Test Reliability	91
Academic Achievement	87
Educational Assessment	83
Testing Problems	80
Testing	73
Statistical Analysis	65
Student Evaluation	64
Higher Education	63
Testing Programs	61
Norm Referenced Tests	60
State Programs	55
Educational Testing	51
Intelligence Tests	50
Evaluation Methods	48
Comparative Analysis	47
Correlation	47

Education Level

Higher Education	27
Postsecondary Education	22
Secondary Education	21
Elementary Secondary Education	16
Elementary Education	12
Junior High Schools	6
Middle Schools	6
High Schools	5
Grade 7	3
Grade 8	3
Early Childhood Education	2
Grade 1	2
Grade 4	2
Grade 5	2
Grade 6	2
Grade 9	2
Intermediate Grades	2
Primary Education	2
Grade 2	1
Grade 3	1
Kindergarten	1

Audience

Practitioners	48
Researchers	23
Teachers	16
Administrators	15
Parents	8
Policymakers	6
Community	4
Students	4
Counselors	2

Location

Canada	8
Pennsylvania	7
Alaska	6
Japan	6
New Jersey	6
United Kingdom	5
California	4
Delaware	4
Michigan	4
United States	4
Australia	3
Oregon	3
China	2
Italy	2
Massachusetts	2
Netherlands	2
South Carolina	2
Taiwan	2
Texas (Austin)	2
Vermont	2
Alabama	1
California (Los Angeles)	1
China (Shanghai)	1
Colombia	1
Connecticut	1

Laws, Policies, & Programs

Elementary and Secondary…	8
No Child Left Behind Act 2001	6
Americans with Disabilities…	1
Every Student Succeeds Act…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1

Showing 1 to 15 of 650 results

Defining Test-Score Interpretation, Use, and Claims: Delphi Study for the Validity Argument

Peer reviewed

Folger, Timothy D.; Bostic, Jonathan; Krupa, Erin E. – Educational Measurement: Issues and Practice, 2023

Validity is a fundamental consideration of test development and test evaluation. The purpose of this study is to define and reify three key aspects of validity and validation, namely test-score interpretation, test-score use, and the claims supporting interpretation and use. This study employed a Delphi methodology to explore how experts in…

Descriptors: Test Interpretation, Scores, Test Use, Test Validity

Building a Validity Argument for the TOEFL Junior® Tests. TOEFL® Research Report. RR-102. ETS RR-24-05

Peer reviewed

Ching-Ni Hsieh – ETS Research Report Series, 2024

The TOEFL Junior® tests are designed to evaluate young language students' English reading, listening, speaking, and writing skills in an English-medium secondary instructional context. This paper articulates a validity argument constructed to support the use and interpretation of the TOEFL Junior test scores for the purpose of placement, progress…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

Integration of Prediction Scores from Various Automated Essay Scoring Models Using Item Response Theory

Peer reviewed

Uto, Masaki; Aomi, Itsuki; Tsutsumi, Emiko; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2023

In automated essay scoring (AES), essays are automatically graded without human raters. Many AES models based on various manually designed features or various architectures of deep neural networks (DNNs) have been proposed over the past few decades. Each AES model has unique advantages and characteristics. Therefore, rather than using a single-AES…

Descriptors: Prediction, Scores, Computer Assisted Testing, Scoring

Assessing the Validity of Test Scores Using Response Process Data from an Eye-Tracking Study: A New Approach

Peer reviewed

Yaneva, Victoria; Clauser, Brian E.; Morales, Amy; Paniagua, Miguel – Advances in Health Sciences Education, 2022

Understanding the response process used by test takers when responding to multiple-choice questions (MCQs) is particularly important in evaluating the validity of score interpretations. Previous authors have recommended eye-tracking technology as a useful approach for collecting data on the processes test takers use to respond to test questions.…

Descriptors: Eye Movements, Artificial Intelligence, Scores, Test Interpretation

What Are the Conditions Associated with Subscore Added Value Noninvariance? Implications for Improving Subscore Interpretation Fairness

Peer reviewed

Rios, Joseph A.; Miranda, Alejandra A. – Educational Measurement: Issues and Practice, 2021

Subscore added value analyses assume invariance across test taking populations; however, this assumption may be untenable in practice as differential subdomain relationships may be present among subgroups. The purpose of this simulation study was to understand the conditions associated with subscore added value noninvariance when manipulating: (1)…

Descriptors: Scores, Test Length, Ability, Correlation

Investigating the Process of Consequential Validity with the Ambassador Questionnaire

Kuhn, Melissa Gayle – ProQuest LLC, 2022

Validity in psychometrics refers to the degree to which evidence and theory support the interpretations drawn from a test, and Messick's Contemporary Validity Theory (1994) includes several facets with well-established evidence collection methods. However, there is a lack of consensus on appropriate methods of evaluating the facet of…

Descriptors: Test Validity, Psychometrics, Test Interpretation, Scores

Does Timed Testing Affect the Interpretation of Efficiency Scores?--A GLMM Analysis of Reading Components

Peer reviewed

Frank Goldhammer; Ulf Kroehne; Carolin Hahnel; Johannes Naumann; Paul De Boeck – Journal of Educational Measurement, 2024

The efficiency of cognitive component skills is typically assessed with speeded performance tests. Interpreting only effective ability or effective speed as efficiency may be challenging because of the within-person dependency between both variables (speed-ability tradeoff, SAT). The present study measures efficiency as effective ability…

Descriptors: Timed Tests, Efficiency, Scores, Test Interpretation

Wechsler Intelligence Scale for Children--Fifth Edition Ancillary and Complementary Index Critical Values and Base Rates for the Normative Sample

Peer reviewed

Puttaswamy, Ash; Barone, Anjelica; Viezel, Kathleen D.; Willis, John O.; Dumont, Ron – Journal of Psychoeducational Assessment, 2020

An area of particular importance when examining index scores on the Wechsler Intelligence Scale for Children--Fifth Edition (WISC-V) is the utilization and interpretation of critical values and base rates associated with differences between an individual's subtest scaled score and the individual's mean scaled score for an index. For the WISC-V,…

Descriptors: Children, Intelligence Tests, Scores, Differences

A Note on the Use of Categorical Subscores

Peer reviewed

Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025

Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…

Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment

Difference Score Reliabilities within the RIAS-2 and WISC-V

Peer reviewed

Farmer, Ryan L.; Kim, Samuel Y. – Psychology in the Schools, 2020

Many prominent intelligence tests (e.g., Wechsler Intelligence Scale for Children, Fifth Edition [WISC-V] and Reynolds Intellectual Abilities Scale, Second Edition [RIAS-2]) offer methods for computing subtest- and composite-level difference scores. This study uses data provided in the technical manual of the WISC-V and RIAS-2 to calculate…

Descriptors: Children, Intelligence Tests, Scores, Test Reliability

Using Item Scores and Distractors to Detect Aberrant Behavior

Gorney, Kylie – ProQuest LLC, 2023

Aberrant behavior refers to any type of unusual behavior that would not be expected under normal circumstances. In educational and psychological testing, such behaviors have the potential to severely bias the aberrant examinee's test score while also jeopardizing the test scores of countless others. It is therefore crucial that aberrant examinees…

Descriptors: Behavior Problems, Educational Testing, Psychological Testing, Test Bias

Establishing Time-Continuous Normative Scores for Teaching Strategies Gold® Using a Multilevel Growth Curve Modeling Approach

Hannah E. Luce – ProQuest LLC, 2023

Young children are assessed to meet federal mandates and inform policy decisions, provide teachers with useful information to make instructional decisions and set reasonable learning goals, and facilitate communication with families. While young children are frequently assessed using whole-child assessments which often yield criterion-referenced…

Descriptors: Scores, Norm Referenced Tests, Test Interpretation, Student Evaluation

The Broad Autism Phenotype--International Test (BAP-IT): A Two-Domain-Based Test for the Assessment of the Broad Autism Phenotype

Peer reviewed

Marta Godoy-Giménez; Ángel García-Pérez; Fernando Cañadas; Angeles F. Estévez; Pablo Sayans-Jiménez – Autism: The International Journal of Research and Practice, 2024

The broad autism phenotype is the phenotypic expression of the primary characteristics of autism. However, currently available tests do not agree with the two-domain operationalization of broad autism phenotype or autism, and their internal structure has shown instability across applications. This study presents the Broad Autism…

Descriptors: Autism Spectrum Disorders, Genetics, Diagnostic Tests, Foreign Countries

Development of the Quantitative Modelling Observation Protocol (QMOP) for Undergraduate Biology Courses: Validity Evidence for Score Interpretation and Uses

Peer reviewed

Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025

Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…

Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

Source

Educational Measurement:…	27
Journal of Educational…	23
Educational and Psychological…	17
Applied Measurement in…	12
Psychology in the Schools	12
ETS Research Report Series	11
ProQuest LLC	9
Diagnostique	7
International Journal of…	7
Language Assessment Quarterly	6
Test Service Bulletin	6
Assessment in Education:…	5
College and University	5
Journal of Clinical Psychology	5
Journal of Psychoeducational…	5
Journal of School Psychology	5
Measurement:…	5
New Directions for Testing…	5
Applied Psychological…	4
Journal of Educational…	4
Journal of Learning…	4
Language Testing	4
Psychological Assessment	4
School Psychology Review	4
Chronicle of Higher Education	3

Author

Messick, Samuel	7
Linn, Robert L.	6
Hills, John R.	5
Kane, Michael T.	5
Plake, Barbara S.	5
Reynolds, Cecil R.	5
Beaujean, A. Alexander	4
Benson, Nicholas F.	4
Hambleton, Ronald K.	4
Mislevy, Robert J.	4
Myerberg, N. James	4
Naglieri, Jack A.	4
Beck, Michael D.	3
Brennan, Robert L.	3
Canivez, Gary L.	3
Echternacht, Gary	3
Gallas, Edwin J.	3
Green, Donald Ross	3
Haertel, Edward H.	3
Hertzog, James F., Comp.	3
La Marca, Paul M.	3
Ligon, Glynn	3
Livingston, Samuel A.	3
Lunneborg, Patricia W.	3

Publication Type

Journal Articles	303
Reports - Research	260
Reports - Evaluative	107
Guides - Non-Classroom	74
Reports - Descriptive	71
Speeches/Meeting Papers	69
Opinion Papers	53
Information Analyses	26
Tests/Questionnaires	25
Numerical/Quantitative Data	23
Books	11
ERIC Publications	10
Dissertations/Theses -…	9
Guides - General	9
ERIC Digests in Full Text	7
Collected Works - Serials	6
Reports - General	6
Reference Materials - General	3
Book/Product Reviews	2
Collected Works - Proceedings	2
Reference Materials -…	2
Collected Works - General	1
Guides - Classroom - Learner	1
Non-Print Media	1
Reference Materials -…	1

Assessments and Surveys

SAT (College Admission Test)	29
Wechsler Intelligence Scale…	26
Iowa Tests of Basic Skills	16
National Assessment of…	12
Metropolitan Achievement Tests	11
Test of English as a Foreign…	10
Comprehensive Tests of Basic…	9
Wechsler Adult Intelligence…	9
Stanford Binet Intelligence…	8
Minnesota Multiphasic…	7
ACT Assessment	6
California Achievement Tests	6
Delaware Student Testing…	6
Program for International…	6
Woodcock Johnson Psycho…	6
Graduate Record Examinations	5
Pennsylvania Educational…	5
Stanford Achievement Tests	5
Test of English for…	5
Nelson Denny Reading Tests	4
Strong Campbell Interest…	4
Armed Services Vocational…	3
College Board Achievement…	3
General Educational…	3
Kaufman Assessment Battery…	3