ERIC - Search Results

Showing 1 to 15 of 566 results

TOEFL iBT® Technical Manual. TOEFL® Research Series. RR-106. ETS Research Report. RR-25-12

Peer reviewed

Venessa F. Manna; Shuhong Li; Spiros Papageorgiou; Lixiong Gu – ETS Research Report Series, 2025

This technical manual describes the purpose and intended uses of the TOEFL iBT test, its target test-taker population, and relevant language use domains. The test design and scoring procedures are presented first, followed by a research agenda intended to support the interpretation and use of test scores. Given the updates to the test starting…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Test Construction

Investigation of Response Aggregation Methods in Divergent Thinking Assessments

Peer reviewed

Janika Saretzki; Rosalie Andrae; Boris Forthmann; Mathias Benedek – Journal of Creative Behavior, 2025

Divergent thinking (DT) ability is widely regarded as a central cognitive capacity underlying creativity, but its assessment is challenged by the fact that DT tasks yield a variable number of responses. Various approaches for the scoring of DT tasks have been proposed, which differ in how responses are evaluated and aggregated within a task. The…

Descriptors: Creative Thinking, Creativity Tests, Scoring, Metacognition

Validity, Reliability, and Fairness Evidence for the JD-Next Exam. Research Report. ETS RR-24-04

Peer reviewed

Steven Holtzman; Jonathan Steinberg; Jonathan Weeks; Christopher Robertson; Jessica Findley; David Klieger – ETS Research Report Series, 2024

At a time when institutions of higher education are exploring alternatives to traditional admissions testing, institutions are also seeking to better support students and prepare them for academic success. Under such an engaged model, one may seek to measure not just the accumulated knowledge and skills that students would bring to a new academic…

Descriptors: Law Schools, College Applicants, Legal Education (Professions), College Entrance Examinations

NIET Aspiring Teacher Rubric: A Valid and Reliable Tool to Measure Aspiring Teacher Instruction. Research Brief

National Institute for Excellence in Teaching, 2023

Aspiring teachers must develop an in-depth understanding of high-quality instructional practices. In order to prepare, instruct, and coach aspiring teachers, the National Institute for Excellence in Teaching (NIET) has developed the NIET Aspiring Teacher Rubric (ATR) based on principles of excellence in instruction. This research brief…

Descriptors: Scoring Rubrics, Preservice Teachers, Test Construction, Test Validity

Computational Concepts and Their Assessment in Preschool Students: An Empirical Study

Peer reviewed

Marcos Jiménez; María Zapata-Cáceres; Marcos Román-González; Gregorio Robles; Jesús Moreno-León; Estefanía Martín-Barroso – Journal of Science Education and Technology, 2024

Computational thinking (CT) is a multidimensional term that encompasses a wide variety of problem-solving skills related to the field of computer science. Unfortunately, standardized, valid, and reliable methods to assess CT skills in preschool children are lacking, compromising the reliability of the results reported in CT interventions. To…

Descriptors: Computation, Thinking Skills, Student Evaluation, Preschool Children

A Systematic Review of Early Writing Assessment Tools

Peer reviewed

Katherine L. Buchanan; Milena Keller-Margulis; Amanda Hut; Weihua Fan; Sarah S. Mire; G. Thomas Schanding Jr. – Early Childhood Education Journal, 2025

There is considerable research regarding measures of early reading but much less in early writing. Nevertheless, writing is a critical skill for success in school and early difficulties in writing are likely to persist without intervention. A necessary step toward identifying those students who need additional support is the use of screening…

Descriptors: Writing Evaluation, Evaluation Methods, Emergent Literacy, Beginning Writing

Selecting Technically Adequate Tests

Peer reviewed

Susan K. Johnsen – Gifted Child Today, 2024

The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…

Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity

Reliability and Validity of an Automated Model for Assessing the Learning of Machine Learning in Middle and High School: Experiences from the "ML for All!" Course

Peer reviewed

Marcelo Fernando Rauber; Christiane Gresse von Wangenheim; Pedro Alberto Barbetta; Adriano Ferreti Borgatto; Ramon Mayor Martins; Jean Carlo Rossa Hauck – Informatics in Education, 2024

The insertion of Machine Learning (ML) in everyday life demonstrates the importance of popularizing an understanding of ML already in school. Accompanying this trend arises the need to assess the students' learning. Yet, so far, few assessments have been proposed, most lacking an evaluation. Therefore, we evaluate the reliability and validity of…

Descriptors: Artificial Intelligence, Measures (Individuals), Test Reliability, Test Validity

Preservice Teachers' Knowledge of Math Modeling: Initial Scale Development and Validation

Peer reviewed

Reuben S. Asempapa; Doris Lee – Discover Education, 2025

Across the world, standards and practices for preparing teachers of mathematics emphasize the importance of math modeling (MM) in developing students' mathematical thinking. The aim of this research study was to develop the Mathematical Modeling Knowledge Scale (MAMKS), capable of determining preservice teachers' (PSTs') knowledge of MM. The study…

Descriptors: Preservice Teachers, Preservice Teacher Education, Mathematics Education, Mathematics Curriculum

A Note on the Use of Categorical Subscores

Peer reviewed

Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025

Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…

Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment

2023-2024 NSCAS Growth: English Language Arts, Mathematics, and Science Technical Report

Nebraska Department of Education, 2024

The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…

Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students

A Review of Test Use: The Test Anxiety Inventory

Peer reviewed

Betül Alatli – International Journal of Curriculum and Instruction, 2022

This study was conducted to review the use of tests. For this purpose, 45 articles in which the Turkish form of the "Test Anxiety Inventory (TAI)," which is one of the tests frequently used in the field of education, was employed and that were published between 2000 and 2020 were examined in terms of factors that should be considered in…

Descriptors: Anxiety, Likert Scales, Test Anxiety, Test Reliability

A Rubric for Assessing Mathematical Modelling Problems in a Scientific-Engineering Context

Peer reviewed

Zehavit Kohen; Yasmin Gharra-Badran – Teaching Mathematics and Its Applications, 2023

Mathematics modelling is a vital competency for students of all ages. In this study, we aim to fill the research gap about valid and reliable tools for assessing and grading mathematical modelling problems, particularly those reflecting multiple steps of the modelling cycle. We present in this paper the design of a reliable and valid assessment…

Descriptors: Scoring Rubrics, Mathematical Models, Test Construction, Test Validity

Measuring Student and Educator Digital Competence beyond Self-Assessment: Developing and Validating Two Rubric-Based Frameworks

Peer reviewed

Flor de Lis González-Mujico – Education and Information Technologies, 2024

Over the past decade, self-assessment tools have garnered significant attention in the interest of measuring the skillset required by educators and students to function productively and ethically in digitally mediated environments, particularly in relation to education policy implementation. Since stated beliefs do not always align with actual…

Descriptors: Technological Literacy, Evaluation Methods, Test Validity, Test Construction

Handicapping in Squash

Peer reviewed

John Wagaman; Michael Fletcher – Teaching Statistics: An International Journal for Teachers, 2018

This article considers how a handicapping system should be devised for squash. It looks at the American scoring system, and whether it is possible to have a fair system of handicapping. We consider "fair" from a perspective of expected number of rallies won and probability of winning.

Descriptors: Probability, Athletes, Athletics, Inhibition
