ERIC - Search Results

Publication Date

In 2025	6
Since 2024	15
Since 2021 (last 5 years)	53
Since 2016 (last 10 years)	152
Since 2006 (last 20 years)	272

Descriptor

Test Reliability	560
Test Validity	560
Scoring	403
Test Construction	198
Testing	113
Scoring Rubrics	97
Psychometrics	80
Test Items	79
Test Interpretation	73
Foreign Countries	69
Scoring Formulas	66
Student Evaluation	63
Item Analysis	61
Measurement Techniques	58
Evaluation Methods	55
Higher Education	54
Scores	53
Language Tests	50
Multiple Choice Tests	50
Item Response Theory	48
Elementary Secondary Education	47
Correlation	44
Rating Scales	44
Interrater Reliability	43
Test Bias	41

Education Level

Secondary Education	60
Higher Education	59
Elementary Education	58
Postsecondary Education	49
Middle Schools	33
Junior High Schools	31
Elementary Secondary Education	29
Early Childhood Education	26
High Schools	26
Intermediate Grades	20
Primary Education	20
Grade 3	18
Grade 4	18
Grade 5	18
Grade 6	18
Grade 8	18
Grade 7	17
Kindergarten	10
Preschool Education	8
Grade 1	7
Grade 2	5
Grade 9	5
Grade 11	4
Grade 10	3
Adult Education	2

Audience

Practitioners	26
Researchers	12
Administrators	9
Teachers	9
Policymakers	6
Students	3
Counselors	1
Parents	1

Location

New York	13
Turkey	10
Canada	8
Nebraska	8
Australia	6
Florida	6
Pennsylvania	6
California	5
United Kingdom	5
United States	4
Idaho	3
New Mexico	3
Texas	3
United Kingdom (England)	3
Brazil	2
Colorado (Denver)	2
Indonesia	2
Japan	2
Malaysia	2
Mississippi	2
Missouri	2
Netherlands	2
North Carolina (Charlotte)	2
Oregon	2
Spain	2

Laws, Policies, & Programs

Individuals with Disabilities…	5
No Child Left Behind Act 2001	3
Education Consolidation…	1
Elementary and Secondary…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1

Showing 1 to 15 of 560 results

Validity, Reliability, and Fairness Evidence for the JD-Next Exam. Research Report. ETS RR-24-04

Peer reviewed

Steven Holtzman; Jonathan Steinberg; Jonathan Weeks; Christopher Robertson; Jessica Findley; David Klieger – ETS Research Report Series, 2024

At a time when institutions of higher education are exploring alternatives to traditional admissions testing, institutions are also seeking to better support students and prepare them for academic success. Under such an engaged model, one may seek to measure not just the accumulated knowledge and skills that students would bring to a new academic…

Descriptors: Law Schools, College Applicants, Legal Education (Professions), College Entrance Examinations

NIET Aspiring Teacher Rubric: A Valid and Reliable Tool to Measure Aspiring Teacher Instruction. Research Brief

National Institute for Excellence in Teaching, 2023

Aspiring teachers must develop an in-depth understanding of high-quality instructional practices. In order to prepare, instruct, and coach aspiring teachers, the National Institute for Excellence in Teaching (NIET) has developed the NIET Aspiring Teacher Rubric (ATR) based on principles of excellence in instruction. This research brief…

Descriptors: Scoring Rubrics, Preservice Teachers, Test Construction, Test Validity

Computational Concepts and Their Assessment in Preschool Students: An Empirical Study

Peer reviewed

Marcos Jiménez; María Zapata-Cáceres; Marcos Román-González; Gregorio Robles; Jesús Moreno-León; Estefanía Martín-Barroso – Journal of Science Education and Technology, 2024

Computational thinking (CT) is a multidimensional term that encompasses a wide variety of problem-solving skills related to the field of computer science. Unfortunately, standardized, valid, and reliable methods to assess CT skills in preschool children are lacking, compromising the reliability of the results reported in CT interventions. To…

Descriptors: Computation, Thinking Skills, Student Evaluation, Preschool Children

Selecting Technically Adequate Tests

Peer reviewed

Susan K. Johnsen – Gifted Child Today, 2024

The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…

Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity

Reliability and Validity of an Automated Model for Assessing the Learning of Machine Learning in Middle and High School: Experiences from the "ML for All!" Course

Peer reviewed

Marcelo Fernando Rauber; Christiane Gresse von Wangenheim; Pedro Alberto Barbetta; Adriano Ferreti Borgatto; Ramon Mayor Martins; Jean Carlo Rossa Hauck – Informatics in Education, 2024

The insertion of Machine Learning (ML) in everyday life demonstrates the importance of popularizing an understanding of ML already in school. Accompanying this trend arises the need to assess the students' learning. Yet, so far, few assessments have been proposed, most lacking an evaluation. Therefore, we evaluate the reliability and validity of…

Descriptors: Artificial Intelligence, Measures (Individuals), Test Reliability, Test Validity

Preservice Teachers' Knowledge of Math Modeling: Initial Scale Development and Validation

Peer reviewed

Reuben S. Asempapa; Doris Lee – Discover Education, 2025

Across the world, standards and practices for preparing teachers of mathematics emphasize the importance of math modeling (MM) in developing students' mathematical thinking. The aim of this research study was to develop the Mathematical Modeling Knowledge Scale (MAMKS), capable of determining preservice teachers' (PSTs') knowledge of MM. The study…

Descriptors: Preservice Teachers, Preservice Teacher Education, Mathematics Education, Mathematics Curriculum

A Note on the Use of Categorical Subscores

Peer reviewed

Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025

Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…

Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment

A Review of Test Use: The Test Anxiety Inventory

Peer reviewed

Alatli, Betül – International Journal of Curriculum and Instruction, 2022

This study was conducted to review the use of tests. For this purpose, 45 articles in which the Turkish form of the "Test Anxiety Inventory (TAI)," which is one of the tests frequently used in the field of education, was employed and that were published between 2000 and 2020 were examined in terms of factors that should be considered in…

Descriptors: Anxiety, Likert Scales, Test Anxiety, Test Reliability

Handicapping in Squash

Peer reviewed

Wagaman, John; Fletcher, Michael – Teaching Statistics: An International Journal for Teachers, 2018

This article considers how a handicapping system should be devised for squash. It looks at the American scoring system, and whether it is possible to have a fair system of handicapping. We consider "fair" from a perspective of expected number of rallies won and probability of winning.

Descriptors: Probability, Athletes, Athletics, Inhibition

A Rubric for Assessing Mathematical Modelling Problems in a Scientific-Engineering Context

Peer reviewed

Kohen, Zehavit; Gharra-Badran, Yasmin – Teaching Mathematics and Its Applications, 2023

Mathematics modelling is a vital competency for students of all ages. In this study, we aim to fill the research gap about valid and reliable tools for assessing and grading mathematical modeling problems, particularly those reflecting multiple steps of the modelling cycle. We present in this paper the design of a reliable and valid assessment…

Descriptors: Scoring Rubrics, Mathematical Models, Test Construction, Test Validity

Measuring Student and Educator Digital Competence beyond Self-Assessment: Developing and Validating Two Rubric-Based Frameworks

Peer reviewed

Flor de Lis González-Mujico – Education and Information Technologies, 2024

Over the past decade, self-assessment tools have garnered significant attention in the interest of measuring the skillset required by educators and students to function productively and ethically in digitally mediated environments, particularly in relation to education policy implementation. Since stated beliefs do not always align with actual…

Descriptors: Technological Literacy, Evaluation Methods, Test Validity, Test Construction

Item Response Theory Modeling of the Verb Naming Test

Peer reviewed

Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023

Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…

Descriptors: Item Response Theory, Psychometrics, Verbs, Naming

Is It Actually Reliable? Examining Statistical Methods for Inter-Rater Reliability of a Rubric in Graduate Education

Peer reviewed

Brent J. Goertzen; Kaley Klaus – Research & Practice in Assessment, 2023

When evaluating student learning, educators often employ scoring rubrics, for which quality can be determined through evaluating validity and reliability. This article discusses the norming process utilized in a graduate organizational leadership program for a capstone scoring rubric. Concepts of validity and reliability are discussed, as is the…

Descriptors: Graduate Students, Graduate Study, Graduate School Faculty, Scoring Rubrics

Design of a Simple Rubric to Peer-Evaluate the Teamwork Skills of Engineering Students

Peer reviewed

Swapneel Thite; Jayashri Ravishankar; Inmaculada Tomeo-Reyes; Araceli Martinez Ortiz – European Journal of Engineering Education, 2024

Effectively working in an engineering workplace requires strong teamwork skills, yet the existing literature within various disciplines reveals discrepancies in evaluating these skills. This complicates the design of a generic teamwork peer evaluation tool for engineering students. This study aims to address this gap by introducing the DRIVE…

Descriptors: Scoring Rubrics, Evaluation Methods, Peer Evaluation, Teamwork

The Development and Test of the Public Speaking Competency Rubric+

Peer reviewed

Maria Blevins; Bryce Hughes; Jennifer Green; Leila Sterman; Shannon Willoughby – Journal of College Science Teaching, 2025

In this work, the authors document an expansion of the Public Speaking Competency Rubric (PSCR). First developed in 2012 by Schreiber, et al., the original rubric has only one item related to non-verbal communication. The authors of this work expanded the rubric to include 10 items related to the non-verbal aspects of public speaking and had it…

Descriptors: Test Construction, Public Speaking, Competence, Scoring Rubrics

Source

Journal of Psychoeducational…	42
Educational and Psychological…	16
Grantee Submission	12
Journal of Educational…	11
New York State Education…	11
ProQuest LLC	8
Canadian Journal of School…	7
ETS Research Report Series	7
Online Submission	7
Nebraska Department of…	6
Applied Psychological…	4
Educational Assessment	4
Educational Measurement:…	4
Journal of Autism and…	4
Language Testing	4
Partnership for Assessment of…	4
Psychology in the Schools	4
Assessment & Evaluation in…	3
Bill & Melinda Gates…	3
Education and Information…	3
Evaluation and the Health…	3
Journal of Chemical Education	3
Journal of Language and…	3
Journal of Learning…	3
Language Assessment Quarterly	3

Author

Johnson, Evelyn S.	9
Moylan, Laura A.	9
Zheng, Yuzhu	9
Crawford, Angela R.	6
Hambleton, Ronald K.	6
McCrimmon, Adam W.	6
Frary, Robert B.	4
Reilly, Richard R.	4
Stansfield, Charles W.	4
Breland, Hunter M.	3
Crawford, Angela	3
Echternacht, Gary	3
Guthrie, P. D.	3
Paek, Insu	3
Rippey, Robert M.	3
Schoen, Robert C.	3
Yang, Xiaotong	3
Anderson, Frances E.	2
Anna-Maria Fall	2
Bae, Yunhee	2
Balkin, Richard S.	2
Bergin, Christi	2
Beula M. Magimairaj	2
Brennan, Robert L.	2

Publication Type

Journal Articles	258
Reports - Research	232
Reports - Evaluative	132
Speeches/Meeting Papers	59
Tests/Questionnaires	45
Reports - Descriptive	39
Guides - Non-Classroom	30
Numerical/Quantitative Data	28
Information Analyses	18
Opinion Papers	14
Guides - General	11
Book/Product Reviews	8
Dissertations/Theses -…	8
Books	7
Guides - Classroom - Teacher	7
Reports - General	5
Reference Materials -…	4
Collected Works - General	3
Guides - Classroom - Learner	2
Collected Works - Proceedings	1
Collected Works - Serial	1
Historical Materials	1
Multilingual/Bilingual…	1
Reference Materials -…	1

Assessments and Surveys

Wechsler Intelligence Scale…	8
Graduate Record Examinations	6
SAT (College Admission Test)	6
Test of English as a Foreign…	5
ACT Assessment	4
Kaufman Test of Educational…	3
Program for International…	3
Wechsler Adult Intelligence…	3
Wechsler Individual…	3
Wechsler Preschool and…	3
Woodcock Johnson Tests of…	3
ACT Interest Inventory	2
Autism Diagnostic Observation…	2
Beery Developmental Test of…	2
Bender Gestalt Test	2
California Achievement Tests	2
Clinical Evaluation of…	2
Dynamic Indicators of Basic…	2
Florida Comprehensive…	2
General Educational…	2
Graduate Management Admission…	2
Group Embedded Figures Test	2
Learning Style Inventory	2
Myers Briggs Type Indicator	2
National Assessment of…	2