Publication Date
In 2025 | 12
Since 2024 | 27
Since 2021 (last 5 years) | 94
Since 2016 (last 10 years) | 242
Since 2006 (last 20 years) | 409
Descriptor
Test Reliability | 964
Scoring | 675
Test Validity | 560
Test Construction | 293
Scoring Rubrics | 154
Testing | 153
Scoring Formulas | 146
Test Items | 145
Test Interpretation | 126
Multiple Choice Tests | 109
Psychometrics | 107
Audience
Practitioners | 37
Researchers | 22
Teachers | 16
Administrators | 10
Policymakers | 7
Students | 4
Counselors | 1
Parents | 1
Location
New York | 16
California | 15
Turkey | 15
Florida | 11
Canada | 10
Nebraska | 8
Australia | 6
Pennsylvania | 6
United Kingdom | 6
United Kingdom (England) | 6
United States | 5
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1
Meets WWC Standards with or without Reservations | 1
Jiayi Deng – ProQuest LLC, 2024
Test score comparability in international large-scale assessments (LSA) is of utmost importance in measuring the effectiveness of education systems and understanding the impact of education on economic growth. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic…
Descriptors: Item Response Theory, Scoring Rubrics, Scoring, Error of Measurement
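The score linking this abstract refers to is, in its simplest linear form, a matter of placing one form's raw scores on another form's scale by matching means and standard deviations. A minimal sketch of that idea (illustrative data only; `linear_link` is a hypothetical helper, not a method from the dissertation):

```python
# Minimal sketch of linear (mean-sigma) score linking: raw scores from
# form X are mapped onto the scale of form Y so that the linked scores
# reproduce form Y's mean and standard deviation. Data are invented.

def linear_link(x_scores, y_scores):
    """Return a function mapping form-X raw scores onto the form-Y scale."""
    mx = sum(x_scores) / len(x_scores)
    my = sum(y_scores) / len(y_scores)
    sx = (sum((v - mx) ** 2 for v in x_scores) / len(x_scores)) ** 0.5
    sy = (sum((v - my) ** 2 for v in y_scores) / len(y_scores)) ** 0.5
    return lambda x: my + (sy / sx) * (x - mx)

link = linear_link([40, 50, 60], [45, 55, 65])
print(link(50))  # a score at form X's mean maps to form Y's mean: 55.0
```

Operational linking in large-scale assessments uses IRT-based methods rather than this two-moment version, but the goal is the same: comparable scores across forms and languages.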
Sophie Litschwartz – Society for Research on Educational Effectiveness, 2021
Background/Context: Pass/fail standardized exams frequently selectively rescore failing exams and retest failing examinees. This practice distorts the test score distribution and can confuse those who do analysis on these distributions. In 2011, the Wall Street Journal showed large discontinuities in the New York City Regents test score…
Descriptors: Standardized Tests, Pass Fail Grading, Scoring Rubrics, Scoring Formulas
Steven Holtzman; Jonathan Steinberg; Jonathan Weeks; Christopher Robertson; Jessica Findley; David Klieger – ETS Research Report Series, 2024
At a time when institutions of higher education are exploring alternatives to traditional admissions testing, institutions are also seeking to better support students and prepare them for academic success. Under such an engaged model, one may seek to measure not just the accumulated knowledge and skills that students would bring to a new academic…
Descriptors: Law Schools, College Applicants, Legal Education (Professions), College Entrance Examinations
M. Arda Atakaya; Ugur Sak; M. Bahadir Ayas – Creativity Research Journal, 2024
Scoring in creativity research has been a central problem since creativity became an important issue in psychology and education in the 1950s. The current study examined the psychometric properties of 27 creativity indices derived from summed and averaged scores using 15 scoring methods. Participants included 2802 middle-school students. Data…
Descriptors: Psychometrics, Creativity, Creativity Tests, Scoring
National Institute for Excellence in Teaching, 2023
Aspiring teachers must develop an in-depth understanding of high-quality instructional practices. In order to prepare, instruct, and coach aspiring teachers, the National Institute for Excellence in Teaching (NIET) has developed the NIET Aspiring Teacher Rubric (ATR) based on principles of excellence in instruction. This research brief…
Descriptors: Scoring Rubrics, Preservice Teachers, Test Construction, Test Validity
Joshua B. Gilbert; James G. Soland; Benjamin W. Domingue – Annenberg Institute for School Reform at Brown University, 2025
Value-Added Models (VAMs) are both common and controversial in education policy and accountability research. While the sensitivity of VAMs to model specification and covariate selection is well documented, the extent to which test scoring methods (e.g., mean scores vs. IRT-based scores) may affect VA estimates is less studied. We examine the…
Descriptors: Value Added Models, Tests, Testing, Scoring
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025
To mitigate the potential damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…
Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory
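The effort-moderated (EM) scoring this abstract examines rests on a simple rule: responses whose response time falls below an item's rapid-guessing threshold are treated as noneffortful and excluded before scoring. A hedged sketch of that rule (thresholds and responses are invented for illustration; `em_score` is not the study's implementation):

```python
# Sketch of effort-moderated (EM) scoring: drop responses flagged as
# rapid guesses (response time below the item threshold), then score
# proportion correct over the remaining, effortful responses only.

def em_score(correct, rts, thresholds):
    """Proportion correct over effortful responses; None if none remain."""
    effortful = [c for c, rt, th in zip(correct, rts, thresholds) if rt >= th]
    if not effortful:
        return None  # every response was a rapid guess; nothing to score
    return sum(effortful) / len(effortful)

# 5 items, 3-second thresholds: responses 2 and 5 are rapid guesses
print(em_score([1, 1, 0, 1, 0], [12.0, 1.5, 9.0, 20.0, 0.8], [3.0] * 5))
# 2 of the 3 effortful responses are correct
```

The robustness question the study raises is what happens to this unidimensional rule when rapid guessing itself varies along more than one dimension.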
Kroc, Edward; Olvera Astivia, Oscar L. – Educational and Psychological Measurement, 2022
Setting cutoff scores is one of the most common practices when using scales to aid in classification purposes. This process is usually done univariately where each optimal cutoff value is decided sequentially, subscale by subscale. While it is widely known that this process necessarily reduces the probability of "passing" such a test,…
Descriptors: Multivariate Analysis, Cutting Scores, Classification, Measurement
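The point about sequential univariate cutoffs lowering the probability of passing is easy to see with a back-of-the-envelope calculation. Assuming independent subscales purely for the arithmetic (real subscales are correlated, which is exactly why a multivariate treatment matters):

```python
# Illustration of why subscale-by-subscale cutoffs shrink pass rates:
# even if each cutoff passes 80% of examinees, requiring ALL subscales
# to clear their cutoffs passes far fewer. Independence is assumed here
# only to make the arithmetic transparent.

per_subscale_pass = 0.80
n_subscales = 4
joint_pass = per_subscale_pass ** n_subscales
print(f"pass each: {per_subscale_pass:.2f}, pass all {n_subscales}: {joint_pass:.4f}")
# 0.80 ** 4 = 0.4096 -- under half the examinees pass overall
```

With correlated subscales the joint pass rate sits somewhere between this product and the single-subscale rate, which is the multivariate problem the article takes up.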
Herwin, Herwin; Pristiwaluyo, Triyanto; Ruslan, Ruslan; Dahalan, Shakila Che – Cypriot Journal of Educational Sciences, 2022
The application of multiple-choice tests often does not consider the scoring technique and the number of choices. The study aims at describing the effect of the scoring technique and the number of options on the reliability of multiple-choice objective tests on social subjects in elementary school. The study is quantitative research with…
Descriptors: Scoring, Multiple Choice Tests, Test Reliability, Elementary School Students
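The scoring-technique comparison in this abstract typically contrasts number-right scoring with formula scoring. The classical correction-for-guessing formula is R − W/(k − 1), where R is the number right, W the number wrong (omits excluded), and k the number of options per item. A sketch of the textbook formula (not necessarily the exact technique the study evaluated):

```python
# Classical formula scoring (correction for guessing) for multiple-choice
# tests: S = R - W / (k - 1). Omitted items count neither for nor against.

def formula_score(rights, wrongs, k):
    """Number-right score corrected for guessing on k-option items."""
    return rights - wrongs / (k - 1)

# 40-item test with 4 options: 28 right, 8 wrong, 4 omitted
print(formula_score(28, 8, 4))  # 28 - 8/3
```

Note how the number of options k enters the formula directly: with more options per item, wrong answers are penalized less, which is one mechanism by which option count and scoring technique interact in reliability.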
Sievers, Matt; Reemts, Connor; Dickinson, Katherine J.; Mukerji, Joya; Beltran, Ismael Barreras; Theobald, Elli J.; Velasco, Vicente; Freeman, Scott – Biochemistry and Molecular Biology Education, 2023
Researchers have called for undergraduate courses to update teaching frameworks based on the Modern Synthesis with insights from molecular biology, by stressing the molecular underpinnings of variation and adaptation. To support this goal, we developed a modified version of the widely used Assessing Conceptual Reasoning of Natural Selection…
Descriptors: Student Evaluation, Knowledge Level, Molecular Biology, Evolution
Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025
In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…
Descriptors: Automation, Grading, Computer Assisted Testing, Scoring
Marcos Jiménez; María Zapata-Cáceres; Marcos Román-González; Gregorio Robles; Jesús Moreno-León; Estefanía Martín-Barroso – Journal of Science Education and Technology, 2024
Computational thinking (CT) is a multidimensional term that encompasses a wide variety of problem-solving skills related to the field of computer science. Unfortunately, standardized, valid, and reliable methods to assess CT skills in preschool children are lacking, compromising the reliability of the results reported in CT interventions. To…
Descriptors: Computation, Thinking Skills, Student Evaluation, Preschool Children
Susan K. Johnsen – Gifted Child Today, 2024
The author provides a checklist for educators who are selecting technically adequate tests for identifying and referring students for gifted education services and programs. The checklist includes questions related to how the test was normed, reliability and validity studies as well as questions related to types of scores, administration, and…
Descriptors: Test Selection, Academically Gifted, Gifted Education, Test Validity
Marcelo Fernando Rauber; Christiane Gresse von Wangenheim; Pedro Alberto Barbetta; Adriano Ferreti Borgatto; Ramon Mayor Martins; Jean Carlo Rossa Hauck – Informatics in Education, 2024
The insertion of Machine Learning (ML) in everyday life demonstrates the importance of popularizing an understanding of ML already in school. Accompanying this trend arises the need to assess the students' learning. Yet, so far, few assessments have been proposed, most lacking an evaluation. Therefore, we evaluate the reliability and validity of…
Descriptors: Artificial Intelligence, Measures (Individuals), Test Reliability, Test Validity
Mohammad Hmoud; Hadeel Swaity; Eman Anjass; Eva María Aguaded-Ramírez – Electronic Journal of e-Learning, 2024
This research aimed to develop and validate a rubric to assess Artificial Intelligence (AI) chatbots' effectiveness in accomplishing tasks, particularly within educational contexts. Given the rapidly growing integration of AI in various sectors, including education, a systematic and robust tool for evaluating AI chatbot performance is essential.…
Descriptors: Artificial Intelligence, Man Machine Systems, Natural Language Processing, Test Construction