ERIC - Search Results

Publication Date

In 2025	3
Since 2024	16
Since 2021 (last 5 years)	36
Since 2016 (last 10 years)	44
Since 2006 (last 20 years)	45

Descriptor

Artificial Intelligence	45
Computer Assisted Testing	45
Scoring	45
Automation	18
Computer Software	14
Natural Language Processing	13
Accuracy	12
Essays	12
Feedback (Response)	10
Foreign Countries	9
Computational Linguistics	7
Algorithms	6
Evaluation Methods	6
Grading	6
Language Tests	6
Models	6
Prediction	6
Semantics	6
Comparative Analysis	5
English (Second Language)	5
Ethics	5
Evaluators	5
Language Proficiency	5
Science Tests	5
Scores	5
More ▼

Publication Type

Journal Articles	36
Reports - Research	30
Reports - Evaluative	6
Information Analyses	4
Speeches/Meeting Papers	3
Collected Works - Proceedings	2
Reports - Descriptive	2
Books	1
Collected Works - General	1
Dissertations/Theses -…	1

Education Level

Higher Education	6
Postsecondary Education	6
Secondary Education	5
Elementary Education	4
Junior High Schools	3
Middle Schools	3
Early Childhood Education	2
Elementary Secondary Education	2
Grade 9	2
High Schools	2
Grade 10	1
Grade 11	1
Kindergarten	1
Primary Education	1
More ▼

Audience

Location

China	2
Canada	1
Europe	1
Florida	1
Indonesia	1
Singapore	1
Turkey	1
United Kingdom	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	2
Torrance Tests of Creative…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 45 results Save | Export

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Direct link

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Direct link

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

The Language of Creativity: Evidence from Humans and Large Language Models

Peer reviewed

Direct link

William Orwig; Emma R. Edenbaum; Joshua D. Greene; Daniel L. Schacter – Journal of Creative Behavior, 2024

Recent developments in computerized scoring via semantic distance have provided automated assessments of verbal creativity. Here, we extend past work, applying computational linguistic approaches to characterize salient features of creative text. We hypothesize that, in addition to semantic diversity, the degree to which a story includes…

Descriptors: Computer Assisted Testing, Scoring, Creativity, Computational Linguistics

Automated Pipeline for Multi-Lingual Automated Essay Scoring with ReaderBench

Peer reviewed

Direct link

Stefan Ruseti; Ionut Paraschiv; Mihai Dascalu; Danielle S. McNamara – Grantee Submission, 2024

Automated Essay Scoring (AES) is a well-studied problem in Natural Language Processing applied in education. Solutions vary from handcrafted linguistic features to large Transformer-based models, implying a significant effort in feature extraction and model implementation. We introduce a novel Automated Machine Learning (AutoML) pipeline…

Descriptors: Computer Assisted Testing, Scoring, Automation, Essays

Automated Pipeline for Multi-Lingual Automated Essay Scoring with ReaderBench

Peer reviewed

Direct link

Stefan Ruseti; Ionut Paraschiv; Mihai Dascalu; Danielle S. McNamara – International Journal of Artificial Intelligence in Education, 2024

Descriptors: Computer Assisted Testing, Scoring, Automation, Essays

The Machines Take Over: A Comparison of Various Supervised Learning Approaches for Automated Scoring of Divergent Thinking Tasks

Peer reviewed

Direct link

Buczak, Philip; Huang, He; Forthmann, Boris; Doebler, Philipp – Journal of Creative Behavior, 2023

Traditionally, researchers employ human raters for scoring responses to creative thinking tasks. Apart from the associated costs this approach entails two potential risks. First, human raters can be subjective in their scoring behavior (inter-rater-variance). Second, individual raters are prone to inconsistent scoring patterns…

Descriptors: Computer Assisted Testing, Scoring, Automation, Creative Thinking

Evaluating Coherence in Writing: Comparing the Capacity of Automated Essay Scoring Technologies

Peer reviewed

Direct link

Shin, Jinnie; Gierl, Mark J. – Journal of Applied Testing Technology, 2022

Automated Essay Scoring (AES) technologies provide innovative solutions to score the written essays with a much shorter time span and at a fraction of the current cost. Traditionally, AES emphasized the importance of capturing the "coherence" of writing because abundant evidence indicated the connection between coherence and the overall…

Descriptors: Computer Assisted Testing, Scoring, Essays, Automation

Reducing Workload in Short Answer Grading Using Machine Learning

Peer reviewed

Direct link

Rebecka Weegar; Peter Idestam-Almquist – International Journal of Artificial Intelligence in Education, 2024

Machine learning methods can be used to reduce the manual workload in exam grading, making it possible for teachers to spend more time on other tasks. However, when it comes to grading exams, fully eliminating manual work is not yet possible even with very accurate automated grading, as any grading mistakes could have significant consequences for…

Descriptors: Grading, Computer Assisted Testing, Introductory Courses, Computer Science Education

Best Practices for Constructed-Response Scoring. Research Report. ETS RR-22-17

Peer reviewed
PDF on ERIC

Download full text

McCaffrey, Daniel F.; Casabianca, Jodi M.; Ricker-Pedley, Kathryn L.; Lawless, René R.; Wendler, Cathy – ETS Research Report Series, 2022

This document describes a set of best practices for developing, implementing, and maintaining the critical process of scoring constructed-response tasks. These practices address both the use of human raters and automated scoring systems as part of the scoring process and cover the scoring of written, spoken, performance, or multimodal responses.…

Descriptors: Best Practices, Scoring, Test Format, Computer Assisted Testing

The Effect of Fine-Tuned Word Embedding Techniques on the Accuracy of Automated Essay Scoring Systems Using Neural Networks

Peer reviewed

Direct link

Firoozi, Tahereh; Bulut, Okan; Epp, Carrie Demmans; Naeimabadi, Ali; Barbosa, Denilson – Journal of Applied Testing Technology, 2022

Automated Essay Scoring (AES) using neural networks has helped increase the accuracy and efficiency of scoring students' written tasks. Generally, the improved accuracy of neural network approaches has been attributed to the use of modern word embedding techniques. However, which word embedding techniques produce higher accuracy in AES systems…

Descriptors: Computer Assisted Testing, Scoring, Essays, Artificial Intelligence

Automated Short Answer Scoring Using an Ensemble of Neural Networks and Latent Semantic Analysis Classifiers

Peer reviewed

Direct link

Ormerod, Christopher; Lottridge, Susan; Harris, Amy E.; Patel, Milan; van Wamelen, Paul; Kodeswaran, Balaji; Woolf, Sharon; Young, Mackenzie – International Journal of Artificial Intelligence in Education, 2023

We introduce a short answer scoring engine made up of an ensemble of deep neural networks and a Latent Semantic Analysis-based model to score short constructed responses for a large suite of questions from a national assessment program. We evaluate the performance of the engine and show that the engine achieves above-human-level performance on a…

Descriptors: Computer Assisted Testing, Scoring, Artificial Intelligence, Semantics

AI-Automated Assignment Scoring to Scale a Professional Development Micro-Credential Program

Peer reviewed

Direct link

Cathy Cavanaugh; Bryn Humphrey; Paige Pullen – International Journal on E-Learning, 2024

To address needs in one US state to provide a professional development micro-credential for tens of thousands of educators, we automated an assignment scoring workflow in an online course by developing and refining an AI model to scan submitted assignments and score them against a rubric. This article outlines the AI model development process and…

Descriptors: Artificial Intelligence, Automation, Scoring, Microcredentials

Leveraging ChatGPT for Scoring Students' Subjective Tests

Peer reviewed
PDF on ERIC

Download full text

Tri Sedya Febrianti; Siti Fatimah; Yuni Fitriyah; Hanifah Nurhayati – International Journal of Education in Mathematics, Science and Technology, 2024

Assessing students' understanding of circle-related material through subjective tests is effective, though grading these tests can be challenging and often requires technological support. ChatGPT has shown promise in providing reliable and objective evaluations. Many teachers in Indonesia, however, continue to face difficulties integrating…

Descriptors: Artificial Intelligence, Computer Assisted Testing, Scoring, Tests

Assessing Creativity across Multi-Step Intervention Using Generative AI Models

Peer reviewed
PDF on ERIC

Download full text

Eran Hadas; Arnon Hershkovitz – Journal of Learning Analytics, 2025

Creativity is an imperative skill for today's learners, one that has important contributions to issues of inclusion and equity in education. Therefore, assessing creativity is of major importance in educational contexts. However, scoring creativity based on traditional tools suffers from subjectivity and is heavily time- and labour-consuming. This…

Descriptors: Creativity, Evaluation Methods, Computer Assisted Testing, Artificial Intelligence

Assessing the Ethical Capabilities of Chat GPT in Healthcare: A Study on Its Proficiency in Situational Judgement Test

Peer reviewed

Direct link

Kunal Sareen – Innovations in Education and Teaching International, 2024

This study examines the proficiency of Chat GPT, an AI language model, in answering questions on the Situational Judgement Test (SJT), a widely used assessment tool for evaluating the fundamental competencies of medical graduates in the UK. A total of 252 SJT questions from the "Oxford Assess and Progress: Situational Judgement" Test…

Descriptors: Ethics, Decision Making, Artificial Intelligence, Computer Software

Previous Page | Next Page »

Pages: 1 | 2 | 3

International Journal of…	4
ETS Research Report Series	3
Grantee Submission	3
International Educational…	3
Journal of Applied Testing…	2
Journal of Computer Assisted…	2
Journal of Creative Behavior	2
Journal of Educational…	2
Advances in Physiology…	1
British Educational Research…	1
British Journal of…	1
Canadian Journal of Learning…	1
Contemporary Educational…	1
EURASIA Journal of…	1
Education and Information…	1
Educational Technology &…	1
IAP - Information Age…	1
IEEE Transactions on Learning…	1
Innovations in Education and…	1
International Journal of…	1
International Journal on…	1
Journal of Chemical Education	1
Journal of Learning Analytics	1
Journal of Science Education…	1
Language Assessment Quarterly	1
More ▼

Danielle S. McNamara	2
Evanini, Keelan	2
Ionut Paraschiv	2
Mihai Dascalu	2
Shi, Lehong	2
Stefan Ruseti	2
Zhai, Xiaoming	2
Alex J. Mechaber	1
Allen, Laura K.	1
Amanda Huee-Ping Wong	1
Andrew B. Wolf	1
Arnon Hershkovitz	1
Aslam, Muhammad	1
Attali, Yigal	1
Baig, Basim	1
Baral, Sami	1
Barbosa, Denilson	1
Barnes, Tiffany, Ed.	1
Botarleanu, Robert-Mihai	1
Botelho, Anthony	1
Brandon J. Yik	1
Brian E. Clauser	1
Bryn Humphrey	1
Buczak, Philip	1
Bulut, Okan	1
More ▼