ERIC - Search Results

Publication Date

In 2026	0
Since 2025	17
Since 2022 (last 5 years)	74
Since 2017 (last 10 years)	189
Since 2007 (last 20 years)	384

Descriptor

Test Interpretation	3982
Test Validity	963
Test Construction	690
Elementary Secondary Education	678
Scores	654
Test Reliability	625
Test Results	623
Testing	550
Achievement Tests	511
Standardized Tests	491
Testing Problems	488
Test Use	383
Academic Achievement	370
Higher Education	366
Evaluation Methods	362
Intelligence Tests	360
Scoring	352
Student Evaluation	351
Educational Assessment	298
Statistical Analysis	295
Educational Testing	291
Testing Programs	281
Elementary Education	263
Comparative Analysis	260
Criterion Referenced Tests	255

Education Level

Elementary Secondary Education	64
Higher Education	64
Postsecondary Education	50
Elementary Education	46
Secondary Education	44
Middle Schools	17
High Schools	13
Junior High Schools	13
Early Childhood Education	12
Grade 4	12
Grade 8	10
Intermediate Grades	8
Primary Education	7
Grade 7	6
Kindergarten	6
Adult Education	5
Grade 1	4
Grade 3	4
Grade 5	4
Grade 6	4
Preschool Education	3
Grade 12	2
Grade 2	2
Grade 9	2
Adult Basic Education	1

Audience

Practitioners	274
Researchers	122
Teachers	102
Administrators	63
Counselors	28
Parents	21
Policymakers	21
Students	15
Community	8

Location

Canada	45
Australia	33
California	33
United Kingdom	23
United States	20
Pennsylvania	18
United Kingdom (England)	17
New York	15
Japan	14
Michigan	14
New Jersey	12
Massachusetts	10
United Kingdom (Great Britain)	10
Illinois	9
Israel	9
Netherlands	9
Texas	9
United Kingdom (Wales)	9
Alaska	8
Florida	8
Germany	8
Indiana	8
West Germany	8
Delaware	7
Kentucky	7

Showing 31 to 45 of 3,982 results

Which Assessment Is Harder? Some Limits of Statistical Linking

Benton, Tom; Williamson, Joanna – Research Matters, 2022

Equating methods are designed to adjust between alternate versions of assessments targeting the same content at the same level, with the aim that scores from the different versions can be used interchangeably. The statistical processes used in equating have, however, been extended to statistically "link" assessments that differ, such as…

Descriptors: Statistical Analysis, Equated Scores, Definitions, Alternative Assessment

Using Item Scores and Distractors to Detect Aberrant Behavior

Gorney, Kylie – ProQuest LLC, 2023

Aberrant behavior refers to any type of unusual behavior that would not be expected under normal circumstances. In educational and psychological testing, such behaviors have the potential to severely bias the aberrant examinee's test score while also jeopardizing the test scores of countless others. It is therefore crucial that aberrant examinees…

Descriptors: Behavior Problems, Educational Testing, Psychological Testing, Test Bias

Establishing Time-Continuous Normative Scores for Teaching Strategies Gold® Using a Multilevel Growth Curve Modeling Approach

Hannah E. Luce – ProQuest LLC, 2023

Young children are assessed to meet federal mandates and inform policy decisions, provide teachers with useful information to make instructional decisions and set reasonable learning goals, and facilitate communication with families. While young children are frequently assessed using whole-child assessments which often yield criterion-referenced…

Descriptors: Scores, Norm Referenced Tests, Test Interpretation, Student Evaluation

Can Leadership Assessments Do Harm? Considerations for the Ethical Use of Assessments in Leadership Development

Peer reviewed

Barnes, Amy C. – New Directions for Student Leadership, 2021

This article explores the ethical use of assessments in leadership training, education, and development. From the importance of having well-trained facilitators to the consideration of power and social identity in the interpretation of individual results, this article advocates for approaching the use of leadership assessments and inventories with…

Descriptors: Leadership, Measures (Individuals), Ethics, Test Use

When Should I Use a Measure to Support Instructional Improvement at Scale? The Importance of Considering Both Intended and Actual Use in Validity Arguments

Peer reviewed

Ing, Marsha; Chinen, Starlie; Jackson, Kara; Smith, Thomas M. – Educational Measurement: Issues and Practice, 2021

Despite the ease of accessing a wide range of measures, little attention is given to validity arguments when considering whether to use the measure for a new purpose or in a different context. Making a validity argument has historically focused on the intended interpretation and use. There has been a press to consider both the intended and actual…

Descriptors: Instructional Improvement, Measures (Individuals), Test Validity, Test Interpretation

The Broad Autism Phenotype--International Test (BAP-IT): A Two-Domain-Based Test for the Assessment of the Broad Autism Phenotype

Peer reviewed

Marta Godoy-Giménez; Ángel García-Pérez; Fernando Cañadas; Angeles F. Estévez; Pablo Sayans-Jiménez – Autism: The International Journal of Research and Practice, 2024

The broad autism phenotype is the phenotypic expression of the primary characteristics of autism. However, currently available tests do not agree with the two-domain operationalization of broad autism phenotype or autism, and their internal structure has shown instability across applications. This study presents the Broad Autism…

Descriptors: Autism Spectrum Disorders, Genetics, Diagnostic Tests, Foreign Countries

Gender Differences in Item Nonresponse in the PISA 2018 Student Questionnaire

Peer reviewed

Kseniia Marcq; Johan Braeken – Educational Assessment, Evaluation and Accountability, 2024

Gender differences in item nonresponse are well-documented in high-stakes achievement tests, where female students are shown to omit more items than male students. These gender differences in item nonresponse are often linked to differential risk-taking strategies, with females being risk-averse and unwilling to guess on an item, even if it could…

Descriptors: Secondary School Students, International Assessment, Gender Differences, Response Rates (Questionnaires)

Leveraging ChatGPT for Scoring Students' Subjective Tests

Peer reviewed

Tri Sedya Febrianti; Siti Fatimah; Yuni Fitriyah; Hanifah Nurhayati – International Journal of Education in Mathematics, Science and Technology, 2024

Assessing students' understanding of circle-related material through subjective tests is effective, though grading these tests can be challenging and often requires technological support. ChatGPT has shown promise in providing reliable and objective evaluations. Many teachers in Indonesia, however, continue to face difficulties integrating…

Descriptors: Artificial Intelligence, Computer Assisted Testing, Scoring, Tests

Development of the Quantitative Modelling Observation Protocol (QMOP) for Undergraduate Biology Courses: Validity Evidence for Score Interpretation and Uses

Peer reviewed

Lyrica Lucas; Anum Khushal; Robert Mayes; Brian A. Couch; Joseph Dauer – International Journal of Science Education, 2025

Educational reform priorities such as emphasis on quantitative modelling (QM) have positioned undergraduate biology instructors as designers of QM experiences to engage students in authentic science practices that support the development of data-driven and evidence-based reasoning. Yet, little is known about how biology instructors adapt to the…

Descriptors: Undergraduate Students, College Science, Biology, Classroom Observation Techniques

Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing

Peer reviewed

Baldwin, Peter; Clauser, Brian E. – Journal of Educational Measurement, 2022

While score comparability across test forms typically relies on common (or randomly equivalent) examinees or items, innovations in item formats, test delivery, and efforts to extend the range of score interpretation may require a special data collection before examinees or items can be used in this way--or may be incompatible with common examinee…

Descriptors: Scoring, Testing, Test Items, Test Format

Evaluating Emergent Bilinguals for Specific Learning Disabilities: Considering Second Language Development and Culture

Peer reviewed

Edward Karl Schultz; Emily Smith; Stephanie Zamora-Robles – Journal of the American Academy of Special Education Professionals, 2024

Evaluating students from culturally and linguistically diverse backgrounds (i.e., emergent bilinguals) presents challenges to evaluation teams, as distinguishing between a language disorder and typical second language development is more complex. The skills and knowledge required to do this task often exceed the level of training that evaluators…

Descriptors: Emergent Literacy, Bilingualism, Bilingual Students, Learning Disabilities

From Byproduct to Design Factor: On Validating the Interpretation of Process Indicators Based on Log Data

Peer reviewed

Goldhammer, Frank; Hahnel, Carolin; Kroehne, Ulf; Zehner, Fabian – Large-scale Assessments in Education, 2021

International large-scale assessments such as PISA or PIAAC have started to provide public or scientific use files for log data; that is, events, event-related attributes and timestamps of test-takers' interactions with the assessment system. Log data and the process indicators derived from it can be used for many purposes. However, the intended…

Descriptors: International Assessment, Data, Computer Assisted Testing, Validity

Comparing Interpretations of the Rosenberg Self-Esteem Scale with 4-, 5-, and 101-Point Scales

Peer reviewed

Colvin, Kimberly F.; Gorgun, Guher; Zhang, Sijun – Journal of Psychoeducational Assessment, 2020

The Rosenberg Self-Esteem Scale was administered with a 1-4, 1-5, or 0-100 scale to 819 participants, to compare score interpretations across the different versions. A rating scale utility analysis revealed that the categories in the 101-point scale were used inconsistently; based on the analysis, adjacent categories were collapsed resulting in a…

Descriptors: Self Concept Measures, Self Esteem, Test Interpretation, Scores

Content Validity of Creativity Self-Report Questionnaires from PISA 2022

Peer reviewed

B. Goecke; S. Weiss; B. Barbot – Journal of Creative Behavior, 2025

The present paper questions the content validity of the eight creativity-related self-report scales available in PISA 2022's context questionnaire and provides a set of considerations for researchers interested in using these indexes. Specifically, we point out some threats to the content validity of these scales (e.g., "creative thinking…

Descriptors: Creativity, Creativity Tests, Questionnaires, Content Validity

Relations among Multiple Dimensions of Self-Reported Listening Effort in Response to an Auditory Psychomotor Vigilance Task

Peer reviewed

Edward J. Golob; Ricardo C. Olayo; Denver M. Y. Brown; Jeffrey R. Mock – Journal of Speech, Language, and Hearing Research, 2024

Purpose: Listening effort is a broad construct, and there is no consensus on how to subdivide listening effort into dimensions. This project focuses on the subjective experience of effortful listening and tests if cognitive workload, mental fatigue, and mood are interrelated dimensions. Method: Two online studies tested young adults (n = 74 and n…

Descriptors: Adults, Psychomotor Skills, Psychomotor Objectives, Listening Comprehension Tests

Source

Educational and Psychological…	94
Journal of Educational…	79
Educational Measurement:…	76
Psychology in the Schools	68
Journal of Consulting and…	51
Journal of Clinical Psychology	44
Diagnostique	40
Perceptual and Motor Skills	40
Measurement and Evaluation in…	32
Journal of Learning…	30
Journal of School Psychology	30
Psychological Reports	30
Psychometrika	28
Journal of Psychoeducational…	26
Applied Measurement in…	25
Journal of Counseling…	24
School Psychology Review	24
Applied Psychological…	19
Journal of Personality…	19
Journal of Special Education	19
Psychol Rep	18
American Psychologist	16
Assessment	16
Journal of Reading	16
ProQuest LLC	16

Author

Linn, Robert L.	18
Hambleton, Ronald K.	16
Plake, Barbara S.	13
Prediger, Dale J.	13
Reynolds, Cecil R.	13
Thompson, Bruce	13
Messick, Samuel	12
Green, Donald Ross	11
Brennan, Robert L.	10
Ebel, Robert L.	10
Kaufman, Alan S.	10
Naglieri, Jack A.	10
Hills, John R.	9
Mehrens, William A.	9
Echternacht, Gary	8
Frary, Robert B.	8
Kane, Michael T.	8
Canivez, Gary L.	7
Farr, Roger	7
Livingston, Samuel A.	7
Mislevy, Robert J.	7
Popham, W. James	7
Tatsuoka, Kikumi K.	7

Publication Type

Journal Articles	1399
Reports - Research	1255
Speeches/Meeting Papers	437
Reports - Evaluative	412
Reports - Descriptive	327
Guides - Non-Classroom	309
Opinion Papers	283
Information Analyses	165
Tests/Questionnaires	147
Guides - General	73
Numerical/Quantitative Data	65
Books	56
Reports - General	53
ERIC Publications	40
Guides - Classroom - Teacher	34
ERIC Digests in Full Text	33
Collected Works - Serials	21
Collected Works - Proceedings	18
Dissertations/Theses -…	17
Reference Materials -…	14
Legal/Legislative/Regulatory…	13
Book/Product Reviews	11
Guides - Classroom - Learner	11
Collected Works - General	7
Dissertations/Theses	7

Laws, Policies, & Programs

Elementary and Secondary…	31
No Child Left Behind Act 2001	14
Individuals with Disabilities…	6
Education for All Handicapped…	4
Elementary and Secondary…	4
Americans with Disabilities…	3
Education Consolidation…	2
Elementary and Secondary…	2
National Defense Education Act	2
Race to the Top	2
Bakke v Regents of University…	1
Bilingual Education Act 1968	1
Education Amendments 1972	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Emergency School Aid Act 1972	1
Every Student Succeeds Act…	1
Improving Americas Schools…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Larry P v Riles	1
Proposition 227 (California…	1
Title IX Education Amendments…	1

Assessments and Surveys

Wechsler Intelligence Scale…	144
National Assessment of…	72
SAT (College Admission Test)	52
Minnesota Multiphasic…	49
Stanford Binet Intelligence…	42
Wechsler Adult Intelligence…	40
Iowa Tests of Basic Skills	36
Stanford Achievement Tests	30
Comprehensive Tests of Basic…	24
Metropolitan Achievement Tests	22
Peabody Picture Vocabulary…	22
Test of English as a Foreign…	22
Kaufman Assessment Battery…	19
California Achievement Tests	18
Strong Campbell Interest…	18
ACT Assessment	17
Graduate Record Examinations	16
Sequential Tests of…	16
General Aptitude Test Battery	15
Program for International…	14
Illinois Test of…	13
Armed Services Vocational…	12
College Board Achievement…	11
McCarthy Scales of Childrens…	11
Strong Vocational Interest…	11