ERIC - Search Results

Publication Date

In 2025	9
Since 2024	26
Since 2021 (last 5 years)	94
Since 2016 (last 10 years)	255
Since 2006 (last 20 years)	447

Descriptor

Test Validity	953
Scoring	675
Test Reliability	560
Test Construction	316
Scoring Rubrics	166
Testing	152
Test Items	122
Psychometrics	113
Evaluation Methods	112
Scores	111
Higher Education	110
Test Interpretation	108
Foreign Countries	107
Scoring Formulas	105
Language Tests	104
Student Evaluation	102
Measurement Techniques	90
Elementary Secondary Education	89
Item Analysis	83
Computer Assisted Testing	80
Correlation	79
Interrater Reliability	77
English (Second Language)	73
Multiple Choice Tests	73
Rating Scales	65
More ▼

Education Level

Higher Education	114
Postsecondary Education	86
Secondary Education	85
Elementary Education	76
Elementary Secondary Education	47
Middle Schools	44
High Schools	39
Junior High Schools	39
Early Childhood Education	36
Primary Education	27
Intermediate Grades	25
Grade 8	22
Grade 4	20
Grade 6	20
Grade 7	20
Grade 3	19
Grade 5	19
Kindergarten	15
Preschool Education	11
Grade 1	9
Grade 11	7
Grade 2	7
Grade 9	7
Grade 10	5
Adult Education	4
More ▼

Audience

Practitioners	30
Researchers	21
Administrators	11
Teachers	10
Policymakers	9
Students	3
Counselors	1
Parents	1

Location

New York	16
Turkey	12
United States	12
California	11
Australia	9
Canada	8
Nebraska	8
United Kingdom	7
China	6
Florida	6
Pennsylvania	6
Tennessee	6
Colorado	4
Idaho	4
Indonesia	4
Iran	4
Israel	4
Japan	4
Netherlands	4
New Jersey	4
New Mexico	4
Texas	4
United Kingdom (England)	4
Utah	4
Vermont	4
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	7
Individuals with Disabilities…	6
Comprehensive Education…	3
Education Consolidation…	1
Elementary and Secondary…	1
Elementary and Secondary…	1
Family Educational Rights and…	1
Health Insurance Portability…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Lau v Nichols	1
Race to the Top	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 953 results Save | Export

Validity, Reliability, and Fairness Evidence for the JD-Next Exam. Research Report. ETS RR-24-04

Peer reviewed
PDF on ERIC

Download full text

Steven Holtzman; Jonathan Steinberg; Jonathan Weeks; Christopher Robertson; Jessica Findley; David Klieger – ETS Research Report Series, 2024

At a time when institutions of higher education are exploring alternatives to traditional admissions testing, institutions are also seeking to better support students and prepare them for academic success. Under such an engaged model, one may seek to measure not just the accumulated knowledge and skills that students would bring to a new academic…

Descriptors: Law Schools, College Applicants, Legal Education (Professions), College Entrance Examinations

Application of Concept Maps as an Assessment Tool in Engineering Education: Systematic Literature Review

Peer reviewed

Direct link

Alexandra Jackson; Elise Barrella; Cheryl Bodnar – Journal of Engineering Education, 2024

Background: Concept maps are a valid assessment tool to explore student understanding of diverse topics. Many types of academic programs have integrated concept mapping into their courses, resulting in various activities and scoring methods to understand student perceptions. Purpose: Few prior reviews of concept mapping have addressed their use…

Descriptors: Engineering Education, Concept Mapping, Scoring Rubrics, Evaluation Methods

Confirmatory Factor Analysis of the KeyMath-3 Diagnostic Assessment

Peer reviewed

Direct link

Michael D. Wray; Matthew R. Reynolds – Journal of Psychoeducational Assessment, 2025

The KeyMath-3 Diagnostic Assessment (KM-3) is an individually-administered math assessment used in educational placement and diagnostic decisions. It includes 10 subtests making up Basic Concepts, Operations, and Applications indexes and a "Total Test" composite that measures overall math ability. Here, covariances among subtests from…

Descriptors: Diagnostic Tests, Mathematics Tests, Arithmetic, Factor Analysis

NIET Aspiring Teacher Rubric: A Valid and Reliable Tool to Measure Aspiring Teacher Instruction. Research Brief

Download full text

National Institute for Excellence in Teaching, 2023

Aspiring teachers must develop an in-depth understanding of high-quality instructional practices. In order to prepare, instruct, and coach aspiring teachers, the National Institute for Excellence in Teaching (NIET) has developed a the NIET Aspiring Teacher Rubric (ATR) based on principles of excellence in instruction. This research brief…

Descriptors: Scoring Rubrics, Preservice Teachers, Test Construction, Test Validity

Assessing the Quality of Science Teachers' Lesson Plans: Evaluation and Application of a Novel Instrument

Peer reviewed

Direct link

Großmann, Leroy; Krüger, Dirk – Science Education, 2024

Lesson planning is a core part of teachers' professional competence. Written lesson plans play a significant role in science teacher education as a preparation for demonstration lessons during the final teacher certification exam. However, the few existing scoring rubrics on lesson plans are not particularly theoretically sound and are barely…

Descriptors: Science Instruction, Lesson Plans, Planning, Scoring Rubrics

The Future of Standardised Assessment: Validity and Trust in Algorithms for Assessment and Scoring

Peer reviewed

Direct link

Aloisi, Cesare – European Journal of Education, 2023

This article considers the challenges of using artificial intelligence (AI) and machine learning (ML) to assist high-stakes standardised assessment. It focuses on the detrimental effect that even state-of-the-art AI and ML systems could have on the validity of national exams of secondary education, and how lower validity would negatively affect…

Descriptors: Standardized Tests, Test Validity, Credibility, Algorithms

Computational Concepts and Their Assessment in Preschool Students: An Empirical Study

Peer reviewed

Direct link

Marcos Jiménez; María Zapata-Cáceres; Marcos Román-González; Gregorio Robles; Jesús Moreno-León; Estefanía Martín-Barroso – Journal of Science Education and Technology, 2024

Computational thinking (CT) is a multidimensional term that encompasses a wide variety of problem-solving skills related to the field of computer science. Unfortunately, standardized, valid, and reliable methods to assess CT skills in preschool children are lacking, compromising the reliability of the results reported in CT interventions. To…

Descriptors: Computation, Thinking Skills, Student Evaluation, Preschool Children

A Systematic Review of Automated Writing Evaluation Systems

Peer reviewed

Direct link

Huawei, Shi; Aryadoust, Vahid – Education and Information Technologies, 2023

Automated writing evaluation (AWE) systems are developed based on interdisciplinary research and technological advances such as natural language processing, computer sciences, and latent semantic analysis. Despite a steady increase in research publications in this area, the results of AWE investigations are often mixed, and their validity may be…

Descriptors: Writing Evaluation, Writing Tests, Computer Assisted Testing, Automation

One Score to Rule Them All? Comparing the Predictive and Concurrent Validity of 30 Hearts and Flowers Scoring Approaches

Peer reviewed

Direct link

Tiffany Wu; Christina Weiland; Meghan McCormick; JoAnn Hsueh; Catherine Snow; Jason Sachs – Grantee Submission, 2024

The Hearts and Flowers (H&F) task is a computerized executive functioning (EF) assessment that has been used to measure EF from early childhood to adulthood. It provides data on accuracy and reaction time (RT) across three different task blocks (hearts, flowers, and mixed). However, there is a lack of consensus in the field on how to score the…

Descriptors: Scoring, Executive Function, Kindergarten, Young Children

Selecting Technically Adequate Tests

Peer reviewed

Direct link

Susan K. Johnsen – Gifted Child Today, 2024

The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…

Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity

Reliability and Validity of an Automated Model for Assessing the Learning of Machine Learning in Middle and High School: Experiences from the "ML for All!" Course

Peer reviewed
PDF on ERIC

Download full text

Marcelo Fernando Rauber; Christiane Gresse von Wangenheim; Pedro Alberto Barbetta; Adriano Ferreti Borgatto; Ramon Mayor Martins; Jean Carlo Rossa Hauck – Informatics in Education, 2024

The insertion of Machine Learning (ML) in everyday life demonstrates the importance of popularizing an understanding of ML already in school. Accompanying this trend arises the need to assess the students' learning. Yet, so far, few assessments have been proposed, most lacking an evaluation. Therefore, we evaluate the reliability and validity of…

Descriptors: Artificial Intelligence, Measures (Individuals), Test Reliability, Test Validity

Interpreting Testing and Assessment: A State-of-the-Art Review

Peer reviewed

Direct link

Han, Chao – Language Testing, 2022

Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…

Descriptors: Translation, Language Tests, Testing, Evaluation Methods

Preservice Teachers' Knowledge of Math Modeling: Initial Scale Development and Validation

Peer reviewed

Direct link

Reuben S. Asempapa; Doris Lee – Discover Education, 2025

Across the world, standards and practices for preparing teachers of mathematics emphasize the importance of math modeling (MM) in developing students' mathematical thinking. The aim of this research study was to develop the Mathematical Modeling Knowledge Scale (MAMKS), capable of determining preservice teachers' (PSTs') knowledge of MM. The study…

Descriptors: Preservice Teachers, Preservice Teacher Education, Mathematics Education, Mathematics Curriculum

A Note on the Use of Categorical Subscores

Peer reviewed

Direct link

Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025

Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…

Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment

Development and Validation of a Measure of Authentic Online Work

Peer reviewed

Direct link

Darling-Aduana, Jennifer – Educational Technology Research and Development, 2021

Researchers tout digital learning as a tool that can increase the authenticity of student learning and assessment tasks but lack a psychometrically valid instrument to test this hypothesis. Further, there are several complementary definitions of authentic work, versus a single agreed upon definition, presented in academic literature. I synthesized…

Descriptors: Test Construction, Test Validity, Authentic Learning, Online Courses

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 64

Journal of Psychoeducational…	45
Educational and Psychological…	30
Journal of Educational…	23
ProQuest LLC	22
Grantee Submission	18
ETS Research Report Series	14
Language Testing	14
Educational Measurement:…	12
New York State Education…	12
Online Submission	10
Educational Assessment	8
Applied Measurement in…	7
Canadian Journal of School…	7
Evaluation and the Health…	7
Language Assessment Quarterly	6
Nebraska Department of…	6
Psychology in the Schools	6
Applied Psychological…	5
Educational Testing Service	5
Physical Review Physics…	5
Assessment & Evaluation in…	4
Assessment for Effective…	4
College Board	4
Education and Information…	4
International Journal of…	4
More ▼

Johnson, Evelyn S.	11
Moylan, Laura A.	11
Zheng, Yuzhu	11
Hambleton, Ronald K.	7
Bowman, Harry L.	6
Crawford, Angela R.	6
Frary, Robert B.	6
McCrimmon, Adam W.	6
Stansfield, Charles W.	6
Crawford, Angela	5
Baker, Eva L.	4
Bejar, Isaac I.	4
Lembke, Erica S.	4
Reilly, Richard R.	4
Wainer, Howard	4
Weiss, David J.	4
Allen, Abigail A.	3
Bennett, Randy Elliot	3
Breland, Hunter M.	3
Bridgeman, Brent	3
Chasteen, Stephanie V.	3
Crocker, Linda	3
Echternacht, Gary	3
Guthrie, P. D.	3
More ▼

Journal Articles	455
Reports - Research	431
Reports - Evaluative	195
Speeches/Meeting Papers	113
Tests/Questionnaires	78
Reports - Descriptive	76
Guides - Non-Classroom	33
Information Analyses	32
Numerical/Quantitative Data	32
Opinion Papers	29
Dissertations/Theses -…	22
Guides - General	14
Book/Product Reviews	10
Guides - Classroom - Teacher	9
Books	8
Reports - General	7
Collected Works - General	6
Reference Materials -…	5
Collected Works - Proceedings	2
Collected Works - Serials	2
Guides - Classroom - Learner	2
Multilingual/Bilingual…	2
Reports -…	2
Collected Works - Serial	1
Creative Works	1
More ▼

Test of English as a Foreign…	17
Wechsler Intelligence Scale…	12
SAT (College Admission Test)	11
Graduate Record Examinations	8
National Assessment of…	8
National Teacher Examinations	8
Wechsler Individual…	6
ACT Assessment	5
Program for International…	5
Wechsler Adult Intelligence…	5
Bender Gestalt Test	4
Kaufman Test of Educational…	4
Torrance Tests of Creative…	4
Autism Diagnostic Observation…	3
College Level Examination…	3
Peabody Picture Vocabulary…	3
Strong Vocational Interest…	3
Thematic Apperception Test	3
Trends in International…	3
Wechsler Preschool and…	3
Woodcock Johnson Tests of…	3
ACT Interest Inventory	2
Beery Developmental Test of…	2
California Achievement Tests	2
Clinical Evaluation of…	2
More ▼