ERIC - Search Results

Publication Date

In 2026	0
Since 2025	16
Since 2022 (last 5 years)	89
Since 2017 (last 10 years)	194
Since 2007 (last 20 years)	327

Descriptor

Computer Assisted Testing	417
Scoring	333
Evaluation Methods	88
Foreign Countries	88
Automation	77
Scoring Rubrics	77
Second Language Learning	72
Writing Evaluation	72
Language Tests	67
English (Second Language)	66
Essays	66
Correlation	65
Student Evaluation	64
Computer Software	63
Scores	62
Comparative Analysis	58
Test Items	55
Test Construction	48
Test Validity	48
Artificial Intelligence	45
Feedback (Response)	42
Test Reliability	42
Grading	41
Educational Technology	40
Test Scoring Machines	40

Publication Type

Journal Articles	417
Reports - Research	259
Reports - Evaluative	73
Reports - Descriptive	67
Tests/Questionnaires	21
Information Analyses	13
Opinion Papers	9
Book/Product Reviews	4
Guides - Non-Classroom	2
Reports - General	2
Speeches/Meeting Papers	2
Collected Works - General	1
Guides - Classroom - Teacher	1

Education Level

Higher Education	103
Postsecondary Education	78
Secondary Education	36
Elementary Education	24
Elementary Secondary Education	22
High Schools	16
Middle Schools	13
Junior High Schools	9
Early Childhood Education	6
Adult Education	5
Grade 8	5
Grade 5	4
Grade 6	4
Preschool Education	4
Grade 4	3
Intermediate Grades	3
Primary Education	3
Grade 7	2
Kindergarten	2
Grade 11	1
Grade 3	1
Grade 9	1

Audience

Practitioners	5
Researchers	5
Teachers	3
Administrators	1
Counselors	1

Location

Australia	9
China	9
Iran	6
Netherlands	6
Taiwan	6
Japan	5
United Kingdom	5
Germany	4
Spain	4
California	3
Canada	3
Europe	3
Indonesia	3
Malaysia	3
South Korea	3
Arizona	2
France	2
Hong Kong	2
Israel	2
New York (New York)	2
Switzerland	2
Texas	2
Turkey	2
United Kingdom (England)	2
United States	2

Laws, Policies, & Programs

No Child Left Behind Act 2001	2
Family Educational Rights and…	1
Health Insurance Portability…	1
Individuals with Disabilities…	1

Showing 1 to 15 of 417 results

Automatic Prompt Engineering for Automatic Scoring

Peer reviewed

Mingfeng Xue; Yunting Liu; Xingyao Xiao; Mark Wilson – Journal of Educational Measurement, 2025

Prompts play a crucial role in eliciting accurate outputs from large language models (LLMs). This study examines the effectiveness of an automatic prompt engineering (APE) framework for automatic scoring in educational measurement. We collected constructed-response data from 930 students across 11 items and used human scores as the true labels. A…

Descriptors: Computer Assisted Testing, Prompting, Educational Assessment, Automation

Identifying Features Contributing to Differential Prediction Bias of Automated Scoring Systems

Peer reviewed

Ikkyu Choi; Matthew S. Johnson – Journal of Educational Measurement, 2025

Automated scoring systems provide multiple benefits but also pose challenges, notably potential bias. Various methods exist to evaluate these algorithms and their outputs for bias. Upon detecting bias, the next logical step is to investigate its cause, often by examining feature distributions. Recently, Johnson and McCaffrey proposed an…

Descriptors: Prediction, Bias, Automation, Scoring

Using Linkage Sets to Improve Connectedness in Rater Response Model Estimation

Peer reviewed

Casabianca, Jodi M.; Donoghue, John R.; Shin, Hyo Jeong; Chao, Szu-Fu; Choi, Ikkyu – Journal of Educational Measurement, 2023

Using item-response theory to model rater effects provides an alternative solution for rater monitoring and diagnosis, compared to using standard performance metrics. In order to fit such models, the ratings data must be sufficiently connected in order to estimate rater effects. Due to popular rating designs used in large-scale testing scenarios,…

Descriptors: Item Response Theory, Alternative Assessment, Evaluators, Research Problems

Automated Short Answer Scoring Using an Ensemble of Neural Networks and Latent Semantic Analysis Classifiers

Peer reviewed

Ormerod, Christopher; Lottridge, Susan; Harris, Amy E.; Patel, Milan; van Wamelen, Paul; Kodeswaran, Balaji; Woolf, Sharon; Young, Mackenzie – International Journal of Artificial Intelligence in Education, 2023

We introduce a short answer scoring engine made up of an ensemble of deep neural networks and a Latent Semantic Analysis-based model to score short constructed responses for a large suite of questions from a national assessment program. We evaluate the performance of the engine and show that the engine achieves above-human-level performance on a…

Descriptors: Computer Assisted Testing, Scoring, Artificial Intelligence, Semantics

Automatic Essay Scoring for Discussion Forum in Online Learning Based on Semantic and Keyword Similarities

Peer reviewed

Dhini, Bachriah Fatwa; Girsang, Abba Suganda; Sufandi, Unggul Utan; Kurniawati, Heny – Asian Association of Open Universities Journal, 2023

Purpose: The authors constructed an automatic essay scoring (AES) model in a discussion forum where the result was compared with scores given by human evaluators. This research proposes essay scoring, which is conducted through two parameters, semantic and keyword similarities, using a SentenceTransformers pre-trained model that can construct the…

Descriptors: Computer Assisted Testing, Scoring, Writing Evaluation, Essays

Using GPT-4 to Augment Imbalanced Data for Automatic Scoring

Peer reviewed

Luyang Fang; Gyeonggeon Lee; Xiaoming Zhai – Journal of Educational Measurement, 2025

Machine learning-based automatic scoring faces challenges with imbalanced student responses across scoring categories. To address this, we introduce a novel text data augmentation framework that leverages GPT-4, a generative large language model specifically tailored for imbalanced datasets in automatic scoring. Our experimental dataset consisted…

Descriptors: Computer Assisted Testing, Artificial Intelligence, Automation, Scoring

Application of an Automated Essay Scoring Engine to English Writing Assessment Using Many-Facet Rasch Measurement

Peer reviewed

Chan, Kinnie Kin Yee; Bond, Trevor; Yan, Zi – Language Testing, 2023

We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing by instigating two procedures novel to written-language assessment: the logistic transformation of AES raw scores into…

Descriptors: Computer Assisted Testing, Essays, Scoring, Scores

Peer reviewed

Ramnarain-Seetohul, Vidasha; Bassoo, Vandana; Rosunally, Yasmine – Education and Information Technologies, 2022

In automated essay scoring (AES) systems, similarity techniques are used to compute the score for student answers. Several methods to compute similarity have emerged over the years. However, only a few of them have been widely used in the AES domain. This work shows the findings of a ten-year review on similarity techniques applied in AES systems…

Descriptors: Computer Assisted Testing, Essays, Scoring, Automation

Automated Scoring of Figural Tests of Creativity with Computer Vision

Peer reviewed

Selcuk Acar; Peter Organisciak; Denis Dumas – Journal of Creative Behavior, 2025

In this three-study investigation, we applied various approaches to score drawings created in response to both Form A and Form B of the Torrance Tests of Creative Thinking-Figural (broadly TTCT-F) as well as the Multi-Trial Creative Ideation task (MTCI). We focused on TTCT-F in Study 1, and utilizing a random forest classifier, we achieved 79% and…

Descriptors: Scoring, Computer Assisted Testing, Models, Correlation

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Comparing the Effect of Contextualized versus Generic Automated Feedback on Students' Scientific Argumentation. Research Report. ETS RR-22-03

Peer reviewed

Olivera-Aguilar, Margarita; Lee, Hee-Sun; Pallant, Amy; Belur, Vinetha; Mulholland, Matthew; Liu, Ou Lydia – ETS Research Report Series, 2022

This study uses a computerized formative assessment system that provides automated scoring and feedback to help students write scientific arguments in a climate change curriculum. We compared the effect of contextualized versus generic automated feedback on students' explanations of scientific claims and attributions of uncertainty to those…

Descriptors: Computer Assisted Testing, Formative Evaluation, Automation, Scoring

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Automated Scoring of Constructed Response Items in Math Assessment Using Large Language Models

Peer reviewed

Wesley Morris; Langdon Holmes; Joon Suh Choi; Scott Crossley – International Journal of Artificial Intelligence in Education, 2025

Recent developments in the field of artificial intelligence allow for improved performance in the automated assessment of extended response items in mathematics, potentially allowing for the scoring of these items cheaply and at scale. This study details the grand prize-winning approach to developing large language models (LLMs) to automatically…

Descriptors: Automation, Computer Assisted Testing, Mathematics Tests, Scoring

Grading Exams Using Large Language Models: A Comparison between Human and AI Grading of Exams in Higher Education Using ChatGPT

Peer reviewed

Jonas Flodén – British Educational Research Journal, 2025

This study compares how the generative AI (GenAI) large language model (LLM) ChatGPT performs in grading university exams compared to human teachers. Aspects investigated include consistency, large discrepancies and length of answer. Implications for higher education, including the role of teachers and ethics, are also discussed. Three…

Descriptors: College Faculty, Artificial Intelligence, Comparative Testing, Scoring

Integration of Prediction Scores from Various Automated Essay Scoring Models Using Item Response Theory

Peer reviewed

Uto, Masaki; Aomi, Itsuki; Tsutsumi, Emiko; Ueno, Maomi – IEEE Transactions on Learning Technologies, 2023

In automated essay scoring (AES), essays are automatically graded without human raters. Many AES models based on various manually designed features or various architectures of deep neural networks (DNNs) have been proposed over the past few decades. Each AES model has unique advantages and characteristics. Therefore, rather than using a single-AES…

Descriptors: Prediction, Scores, Computer Assisted Testing, Scoring

Source

ETS Research Report Series	34
Journal of Educational…	18
Language Testing	16
Educational Measurement:…	10
International Journal of…	10
Journal of Applied Testing…	10
Journal of Technology,…	10
Applied Measurement in…	9
Assessing Writing	9
Language Assessment Quarterly	9
Educational and Psychological…	8
Journal of Educational…	8
Applied Psychological…	6
Computers & Education	6
Educational Technology &…	6
Grantee Submission	6
Education and Information…	5
International Journal of…	5
Journal of Speech, Language,…	5
Educational Assessment	4
Educational Technology…	4
IEEE Transactions on Learning…	4
Journal of Computer Assisted…	4
Journal of Creative Behavior	4
Practical Assessment,…	4

Author

Attali, Yigal	9
Williamson, David M.	6
Bejar, Isaac I.	5
Bridgeman, Brent	5
Ramineni, Chaitanya	5
Xi, Xiaoming	5
Zechner, Klaus	5
Bennett, Randy Elliot	4
Evanini, Keelan	4
Mulholland, Matthew	4
Newhouse, C. Paul	4
Rupp, André A.	4
Wilson, Joshua	4
Breyer, F. Jay	3
Casabianca, Jodi M.	3
Clariana, Roy B.	3
Clauser, Brian E.	3
Clyman, Stephen G.	3
Davey, Tim	3
Higgins, Derrick	3
Lee, Hee-Sun	3
Linn, Marcia C.	3
Liu, Ou Lydia	3
Pallant, Amy	3

Assessments and Surveys

Test of English as a Foreign…	32
Graduate Record Examinations	10
National Assessment of…	4
Wechsler Intelligence Scale…	3
Advanced Placement…	2
International English…	2
Praxis Series	2
Program for International…	2
Wechsler Individual…	2
ACTFL Oral Proficiency…	1
Behavior Assessment System…	1
California Achievement Tests	1
Center for Epidemiologic…	1
Computer Attitude Scale	1
Conners Rating Scales	1
Dynamic Indicators of Basic…	1
Expressive One Word Picture…	1
Foreign Language Classroom…	1
Graduate Management Admission…	1
Kaufman Test of Educational…	1
Mean Length of Utterance	1
Minnesota Multiphasic…	1
NEO Personality Inventory	1
Oral and Written Language…	1
Peabody Picture Vocabulary…	1