ERIC - Search Results

Publication Date

In 2026	0
Since 2025	4
Since 2022 (last 5 years)	15
Since 2017 (last 10 years)	36
Since 2007 (last 20 years)	89

Descriptor

Evaluation Methods	172
Scoring	172
Computer Assisted Testing	77
Student Evaluation	58
Testing	39
Test Construction	31
Educational Assessment	30
Educational Testing	30
Elementary Secondary Education	27
Test Validity	25
Testing Problems	25
Writing Evaluation	25
Foreign Countries	23
Scores	23
Test Items	22
Interrater Reliability	21
Testing Programs	21
Test Reliability	20
Comparative Analysis	19
Higher Education	19
Educational Technology	18
Standardized Tests	18
Essays	17
Item Response Theory	15
Measurement Techniques	15
More ▼

Publication Type

Journal Articles	102
Reports - Research	73
Reports - Evaluative	34
Reports - Descriptive	29
Speeches/Meeting Papers	16
Guides - Non-Classroom	12
Tests/Questionnaires	8
Guides - Classroom - Teacher	7
Opinion Papers	7
Information Analyses	6
Dissertations/Theses -…	5
Numerical/Quantitative Data	4
Books	3
Guides - General	3
Multilingual/Bilingual…	2
Reference Materials -…	2
Reports - General	2
Collected Works - Proceedings	1
More ▼

Education Level

Higher Education	27
Postsecondary Education	18
Elementary Secondary Education	15
Secondary Education	13
Elementary Education	8
High Schools	5
Middle Schools	5
Early Childhood Education	4
Junior High Schools	4
Grade 6	3
Grade 4	2
Grade 5	2
Grade 8	2
Grade 3	1
Grade 9	1
Intermediate Grades	1
Kindergarten	1
Preschool Education	1
Primary Education	1
More ▼

Audience

Practitioners	10
Teachers	9
Researchers	6
Administrators	3
Counselors	1
Policymakers	1

Location

Australia	8
China	4
Vermont	4
Canada	3
United States	3
Connecticut	2
Florida	2
Hong Kong	2
Idaho	2
Kentucky	2
New Hampshire	2
New York	2
Rhode Island	2
Taiwan	2
Texas	2
United Kingdom	2
United Kingdom (England)	2
California	1
India	1
Iran	1
Japan	1
Mexico	1
Nebraska	1
New Jersey	1
North Carolina	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	3
No Child Left Behind Act 2001	3
Every Student Succeeds Act…	2
Comprehensive Education…	1
Individuals with Disabilities…	1

Assessments and Surveys

National Assessment of…	6
Advanced Placement…	3
Test of English as a Foreign…	3
Graduate Record Examinations	2
New York State Regents…	2
ACT Assessment	1
National Teacher Examinations	1
Program for International…	1
SAT (College Admission Test)	1
Strengths and Difficulties…	1
Systematic Screening for…	1
Wechsler Individual…	1
Wechsler Intelligence Scale…	1
Wechsler Memory Scale	1
Woodcock Johnson Tests of…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 172 results Save | Export

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Direct link

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

A Comparison of Final Scoring Methods under the Multistage Adaptive Testing Framework

Direct link

Hacer Karamese – ProQuest LLC, 2022

Multistage adaptive testing (MST) has become popular in the testing industry because the research has shown that it combines the advantages of both linear tests and item-level computer adaptive testing (CAT). The previous research efforts primarily focused on MST design issues such as panel design, module length, test length, distribution of test…

Descriptors: Adaptive Testing, Scoring, Computer Assisted Testing, Design

Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System

Peer reviewed

Direct link

Wallace N. Pinto Jr.; Jinnie Shin – Journal of Educational Measurement, 2025

In recent years, the application of explainability techniques to automated essay scoring and automated short-answer grading (ASAG) models, particularly those based on transformer architectures, has gained significant attention. However, the reliability and consistency of these techniques remain underexplored. This study systematically investigates…

Descriptors: Automation, Grading, Computer Assisted Testing, Scoring

Benefits of Alternative Evaluation Methods for Automated Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Andersen, Øistein E.; Yuan, Zheng; Watson, Rebecca; Cheung, Kevin Yet Fong – International Educational Data Mining Society, 2021

Automated essay scoring (AES), where natural language processing is applied to score written text, can underpin educational resources in blended and distance learning. AES performance has typically been reported in terms of correlation coefficients or agreement statistics calculated between a system and an expert human examiner. We describe the…

Descriptors: Evaluation Methods, Scoring, Essays, Computer Assisted Testing

Utilizing Deep Learning AI to Analyze Scientific Models: Overcoming Challenges

Peer reviewed

Direct link

Tingting Li; Kevin Haudek; Joseph Krajcik – Journal of Science Education and Technology, 2025

Scientific modeling is a vital educational practice that helps students apply scientific knowledge to real-world phenomena. Despite advances in AI, challenges in accurately assessing such models persist, primarily due to the complexity of cognitive constructs and data imbalances in educational settings. This study addresses these challenges by…

Descriptors: Artificial Intelligence, Scientific Concepts, Models, Automation

Adapting Paper-Based Tests for Computer Administration: Lessons Learned from 30 Years of Mode Effects Studies in Education

Peer reviewed
PDF on ERIC

Download full text

Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022

In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…

Descriptors: Computer Assisted Testing, Tests, Scores, Scoring

Best Practices for Administering Attitudes and Beliefs Surveys in Physics

Peer reviewed

Direct link

Madsen, Adrian; McKagan, Sarah B.; Sayre, Eleanor C. – Physics Teacher, 2020

Physics faculty care about their students learning physics content. In addition, they usually hope that their students will learn some deeper lessons about thinking critically and scientifically. They hope that as a result of taking a physics class, students will come to appreciate physics as a coherent and logical method of understanding the…

Descriptors: Science Instruction, Physics, Student Surveys, Student Attitudes

Interpreting Testing and Assessment: A State-of-the-Art Review

Peer reviewed

Direct link

Han, Chao – Language Testing, 2022

Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…

Descriptors: Translation, Language Tests, Testing, Evaluation Methods

2023-2024 NSCAS Growth: English Language Arts, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2024

The Nebraska Student-Centered Assessment System (NSCAS) is a statewide assessment system that embodies Nebraska's holistic view of students and helps them prepare for success in postsecondary education, career, and civic life. It uses multiple measures throughout the year to provide educators and decision-makers at all levels with the insights…

Descriptors: Student Evaluation, Evaluation Methods, Elementary School Students, Middle School Students

Digital Games for Creativity Assessment: Strengths, Weaknesses and Opportunities

Peer reviewed

Direct link

Rafner, Janet; Biskjaer, Michael Mose; Zana, Blanka; Langsford, Steven; Bergenholtz, Carsten; Rahimi, Seyedahmad; Carugati, Andrea; Noy, Lior; Sherson, Jacob – Creativity Research Journal, 2022

Creativity assessments should be valid, reliable, and scalable to support various stakeholders (e.g., policy-makers, educators, corporations, and the general public) in their decision-making processes. Established initiatives toward scalable creativity assessments have relied on well-studied standardized tests. Although robust in many ways, most…

Descriptors: Creativity, Evaluation Methods, Video Games, Computer Assisted Testing

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Direct link

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

AI-Automated Assignment Scoring to Scale a Professional Development Micro-Credential Program

Peer reviewed

Direct link

Cathy Cavanaugh; Bryn Humphrey; Paige Pullen – International Journal on E-Learning, 2024

To address needs in one US state to provide a professional development micro-credential for tens of thousands of educators, we automated an assignment scoring workflow in an online course by developing and refining an AI model to scan submitted assignments and score them against a rubric. This article outlines the AI model development process and…

Descriptors: Artificial Intelligence, Automation, Scoring, Microcredentials

Assessing Creativity across Multi-Step Intervention Using Generative AI Models

Peer reviewed
PDF on ERIC

Download full text

Eran Hadas; Arnon Hershkovitz – Journal of Learning Analytics, 2025

Creativity is an imperative skill for today's learners, one that has important contributions to issues of inclusion and equity in education. Therefore, assessing creativity is of major importance in educational contexts. However, scoring creativity based on traditional tools suffers from subjectivity and is heavily time- and labour-consuming. This…

Descriptors: Creativity, Evaluation Methods, Computer Assisted Testing, Artificial Intelligence

Scoring Methods of Innovative Items

Direct link

Bradley J. Ungurait – ProQuest LLC, 2021

Advancements in technology and computer-based testing has allowed for greater flexibility in assessing examinee knowledge on large-scale, high-stakes assessments. Through computer-based delivery, cognitive ability and skills can be effectively assessed cost-efficiently and measure domains that are difficult or even impossible to measure with…

Descriptors: Computer Assisted Testing, Evaluation Methods, Scoring, Student Evaluation

Validity Arguments Meet Artificial Intelligence in Innovative Educational Assessment

Peer reviewed

Direct link

Dorsey, David W.; Michaels, Hillary R. – Journal of Educational Measurement, 2022

We have dramatically advanced our ability to create rich, complex, and effective assessments across a range of uses through technology advancement. Artificial Intelligence (AI) enabled assessments represent one such area of advancement--one that has captured our collective interest and imagination. Scientists and practitioners within the domains…

Descriptors: Validity, Ethics, Artificial Intelligence, Evaluation Methods

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12

Computers & Education	5
ETS Research Report Series	5
Journal of Educational…	5
ProQuest LLC	5
Grantee Submission	4
Journal of Technology,…	4
Educational Technology &…	3
International Educational…	3
International Journal of…	3
Applied Measurement in…	2
Assessing Writing	2
College Teaching	2
Educational Measurement:…	2
Educational and Psychological…	2
Journal of Educational…	2
Journal of Science Education…	2
Language Testing	2
Measurement and Evaluation in…	2
Yearbook of the National…	2
ACT, Inc.	1
ASCD	1
American Educational Research…	1
Anatomical Sciences Education	1
Annenberg Institute for…	1
Applied Psychological…	1
More ▼

Williamson, David M.	4
Bridgeman, Brent	3
Newhouse, C. Paul	3
Schafer, William D.	3
Brown, Michelle Stallone	2
Clariana, Roy B.	2
Darling-Hammond, Linda	2
Davey, Tim	2
Han, Chao	2
Hwang, Gwo-Jen	2
Jaeger, Richard M.	2
Koretz, Daniel	2
Quenemoen, Rachel	2
Ramineni, Chaitanya	2
Stergiopoulos, Charalampos	2
Thurlow, Martha	2
Triantis, Dimos	2
Tsiakas, Panagiotis	2
Ventouras, Errikos	2
Wang, Jinhao	2
Zechner, Klaus	2
Agruso, Susan A.	1
Allen, Laura K.	1
More ▼