Publication Date

| Date range | Count |
| --- | --- |
| In 2026 | 0 |
| Since 2025 | 17 |
| Since 2022 (last 5 years) | 115 |
| Since 2017 (last 10 years) | 257 |
| Since 2007 (last 20 years) | 426 |
Descriptor

| Descriptor | Count |
| --- | --- |
| Computer Assisted Testing | 635 |
| Scoring | 514 |
| Test Construction | 120 |
| Test Items | 120 |
| Foreign Countries | 115 |
| Evaluation Methods | 106 |
| Automation | 100 |
| Scoring Rubrics | 97 |
| Essays | 90 |
| Student Evaluation | 90 |
| Scores | 89 |
Location

| Location | Count |
| --- | --- |
| Australia | 13 |
| China | 12 |
| New York | 9 |
| Japan | 8 |
| Canada | 7 |
| Netherlands | 7 |
| Germany | 6 |
| Iran | 6 |
| Taiwan | 6 |
| United Kingdom | 6 |
| Spain | 5 |
Mingfeng Xue; Yunting Liu; Xingyao Xiao; Mark Wilson – Journal of Educational Measurement, 2025
Prompts play a crucial role in eliciting accurate outputs from large language models (LLMs). This study examines the effectiveness of an automatic prompt engineering (APE) framework for automatic scoring in educational measurement. We collected constructed-response data from 930 students across 11 items and used human scores as the true labels. A…
Descriptors: Computer Assisted Testing, Prompting, Educational Assessment, Automation
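The abstract stops short of implementation detail; a minimal sketch of prompt-based constructed-response scoring, assuming a rubric-in-prompt design, might look like the following. `call_llm`, the rubric text, and the 0-3 score range are illustrative placeholders, not the paper's APE framework.

```python
# Minimal sketch of prompt-based automatic scoring with an LLM.
# `call_llm` is a hypothetical placeholder for a real model API.
SCORING_PROMPT = """You are grading a constructed response.
Rubric: {rubric}
Student response: {response}
Reply with a single integer score from 0 to {max_score}."""

def call_llm(prompt: str) -> str:
    """Placeholder: send `prompt` to an LLM and return its text reply."""
    raise NotImplementedError("wire up a model API here")

def score_response(response: str, rubric: str, max_score: int = 3) -> int:
    prompt = SCORING_PROMPT.format(rubric=rubric, response=response,
                                   max_score=max_score)
    return int(call_llm(prompt).strip())  # assumes the model follows the format
```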
Ikkyu Choi; Matthew S. Johnson – Journal of Educational Measurement, 2025
Automated scoring systems provide multiple benefits but also pose challenges, notably potential bias. Various methods exist to evaluate these algorithms and their outputs for bias. Upon detecting bias, the next logical step is to investigate its cause, often by examining feature distributions. Recently, Johnson and McCaffrey proposed an…
Descriptors: Prediction, Bias, Automation, Scoring
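When bias is suspected in an automated scorer, a common first step is to compare machine-human score gaps across examinee subgroups before digging into feature distributions. A minimal pandas sketch, with illustrative column names:

```python
# Sketch: machine-minus-human score gaps per subgroup. Column names
# ("group", "human", "machine") are assumptions for illustration.
import pandas as pd

def score_gap_by_group(df: pd.DataFrame) -> pd.DataFrame:
    """A subgroup whose mean gap stands apart from the rest is a
    candidate for closer inspection of its feature distributions."""
    df = df.assign(gap=df["machine"] - df["human"])
    return df.groupby("group")["gap"].agg(["mean", "std", "count"])
```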
Casabianca, Jodi M.; Donoghue, John R.; Shin, Hyo Jeong; Chao, Szu-Fu; Choi, Ikkyu – Journal of Educational Measurement, 2023
Using item-response theory to model rater effects provides an alternative solution for rater monitoring and diagnosis, compared to using standard performance metrics. To fit such models, the ratings data must be sufficiently connected for rater effects to be estimable. Due to popular rating designs used in large-scale testing scenarios,…
Descriptors: Item Response Theory, Alternative Assessment, Evaluators, Research Problems
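The connectivity requirement mentioned here can be checked directly: treat raters and responses as the two sides of a bipartite graph and test whether it forms a single connected component. A minimal sketch in plain Python (the data layout is an assumption):

```python
# Sketch: connectivity check for a rating design. Rater effects are only
# comparable across raters that are linked, directly or indirectly, by
# shared responses.
from collections import defaultdict, deque

def is_connected(ratings):
    """`ratings`: iterable of (rater_id, response_id) pairs."""
    adj = defaultdict(set)
    for rater, resp in ratings:
        adj[("rater", rater)].add(("resp", resp))
        adj[("resp", resp)].add(("rater", rater))
    if not adj:
        return True
    start = next(iter(adj))
    seen, queue = {start}, deque([start])
    while queue:
        for nbr in adj[queue.popleft()] - seen:
            seen.add(nbr)
            queue.append(nbr)
    return len(seen) == len(adj)
```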
Ormerod, Christopher; Lottridge, Susan; Harris, Amy E.; Patel, Milan; van Wamelen, Paul; Kodeswaran, Balaji; Woolf, Sharon; Young, Mackenzie – International Journal of Artificial Intelligence in Education, 2023
We introduce a short answer scoring engine made up of an ensemble of deep neural networks and a Latent Semantic Analysis-based model to score short constructed responses for a large suite of questions from a national assessment program. We evaluate the engine and show that it achieves above-human-level performance on a…
Descriptors: Computer Assisted Testing, Scoring, Artificial Intelligence, Semantics
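As a rough illustration of the LSA side of such an ensemble (the paper's engine is not public, so this is a scikit-learn sketch under assumed inputs), one can project TF-IDF vectors into a latent space and score responses by similarity to reference answers:

```python
# Sketch: LSA-style similarity scoring of short answers with scikit-learn.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.decomposition import TruncatedSVD
from sklearn.metrics.pairwise import cosine_similarity

def lsa_similarity(reference_answers, student_responses, dim=100):
    """Best-match cosine similarity of each response to the references."""
    texts = list(reference_answers) + list(student_responses)
    tfidf = TfidfVectorizer().fit_transform(texts)
    lsa = TruncatedSVD(n_components=min(dim, tfidf.shape[1] - 1)).fit_transform(tfidf)
    refs, resps = lsa[: len(reference_answers)], lsa[len(reference_answers):]
    return cosine_similarity(resps, refs).max(axis=1)
```

In an ensemble, these similarities would be combined (for example, averaged or stacked) with the neural models' predictions; the combination rule is our assumption, not the paper's.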
Dhini, Bachriah Fatwa; Girsang, Abba Suganda; Sufandi, Unggul Utan; Kurniawati, Heny – Asian Association of Open Universities Journal, 2023
Purpose: The authors constructed an automatic essay scoring (AES) model for a discussion forum, and the results were compared with scores given by human evaluators. This research proposes an essay-scoring approach based on two parameters, semantic and keyword similarity, using a pre-trained SentenceTransformers model that can construct the…
Descriptors: Computer Assisted Testing, Scoring, Writing Evaluation, Essays
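A minimal sketch of the two-parameter idea, combining embedding-based semantic similarity with keyword coverage. The model name and the equal weighting are assumptions, not the authors' configuration:

```python
# Sketch: semantic + keyword similarity scoring with sentence-transformers.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # assumed model choice

def essay_score(student_text, reference_text, keywords, w_sem=0.5, w_key=0.5):
    emb = model.encode([student_text, reference_text], convert_to_tensor=True)
    semantic = float(util.cos_sim(emb[0], emb[1]))        # embedding similarity
    text = student_text.lower()
    keyword = sum(k.lower() in text for k in keywords) / len(keywords)
    return w_sem * semantic + w_key * keyword             # weighted blend
```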
Doewes, Afrizal; Pechenizkiy, Mykola – International Educational Data Mining Society, 2021
Scoring essays is generally an exhausting and time-consuming task for teachers. Automated Essay Scoring (AES) makes the scoring process faster and more consistent. The most logical way to assess the performance of an automated scorer is by measuring its score agreement with human raters. However, we provide empirical evidence that…
Descriptors: Man Machine Systems, Automation, Computer Assisted Testing, Scoring
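Human-machine score agreement is conventionally summarized with quadratic weighted kappa (QWK), which penalizes large disagreements more heavily than adjacent ones; a one-call sketch with scikit-learn on toy data:

```python
# Sketch: quadratic weighted kappa, the standard agreement statistic in AES.
from sklearn.metrics import cohen_kappa_score

human   = [0, 1, 2, 2, 3, 1]   # toy human ratings
machine = [0, 1, 2, 3, 3, 0]   # toy machine scores
print(f"QWK = {cohen_kappa_score(human, machine, weights='quadratic'):.3f}")
```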
Luyang Fang; Gyeonggeon Lee; Xiaoming Zhai – Journal of Educational Measurement, 2025
Machine learning-based automatic scoring faces challenges with imbalanced student responses across scoring categories. To address this, we introduce a novel text data augmentation framework, tailored to imbalanced datasets in automatic scoring, that leverages GPT-4, a generative large language model. Our experimental dataset consisted…
Descriptors: Computer Assisted Testing, Artificial Intelligence, Automation, Scoring
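The core move, oversampling minority score categories with LLM-generated text, can be sketched as follows. `generate_paraphrases` is a hypothetical stand-in for a GPT-4 call; the paper's framework is more elaborate than this:

```python
# Sketch: balance score categories by paraphrasing minority-class responses.
from collections import Counter

def generate_paraphrases(text, n):
    """Placeholder: ask an LLM (e.g., GPT-4) for n paraphrases of text."""
    raise NotImplementedError

def augment_to_balance(responses, labels):
    counts = Counter(labels)
    target = max(counts.values())          # match the largest category
    out_x, out_y = list(responses), list(labels)
    for label, count in counts.items():
        seeds = [r for r, l in zip(responses, labels) if l == label]
        for i in range(target - count):
            out_x.append(generate_paraphrases(seeds[i % len(seeds)], 1)[0])
            out_y.append(label)
    return out_x, out_y
```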
Chan, Kinnie Kin Yee; Bond, Trevor; Yan, Zi – Language Testing, 2023
We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing, by introducing two procedures novel to written-language assessment: the logistic transformation of AES raw scores into…
Descriptors: Computer Assisted Testing, Essays, Scoring, Scores
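The snippet cuts off mid-sentence, but a logistic (logit) transformation of raw scores is typically a rescale-then-log-odds step; a sketch under that assumption, with epsilon clipping added to keep the scale endpoints finite:

```python
# Sketch: logit transform of AES raw scores (assumed form of the procedure).
import numpy as np

def raw_to_logit(raw, min_score, max_score, eps=1e-3):
    p = (np.asarray(raw, dtype=float) - min_score) / (max_score - min_score)
    p = np.clip(p, eps, 1 - eps)     # avoid infinities at the scale endpoints
    return np.log(p / (1 - p))
```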
Ramnarain-Seetohul, Vidasha; Bassoo, Vandana; Rosunally, Yasmine – Education and Information Technologies, 2022
In automated essay scoring (AES) systems, similarity techniques are used to compute the score for student answers. Several methods to compute similarity have emerged over the years. However, only a few of them have been widely used in the AES domain. This work presents the findings of a ten-year review of similarity techniques applied in AES systems…
Descriptors: Computer Assisted Testing, Essays, Scoring, Automation
Selcuk Acar; Peter Organisciak; Denis Dumas – Journal of Creative Behavior, 2025
In this three-study investigation, we applied various approaches to score drawings created in response to both Form A and Form B of the Torrance Tests of Creative Thinking-Figural (broadly TTCT-F) as well as the Multi-Trial Creative Ideation task (MTCI). We focused on TTCT-F in Study 1, and utilizing a random forest classifier, we achieved 79% and…
Descriptors: Scoring, Computer Assisted Testing, Models, Correlation
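A minimal sketch of the random-forest classification step reported for Study 1. The feature matrix `X` (for instance, embeddings extracted from the drawings) and human ratings `y` are assumed inputs, and the hyperparameters are not the authors':

```python
# Sketch: random-forest scoring of figural/drawing responses.
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

def evaluate_rf(X, y):
    """Mean cross-validated accuracy of a random-forest scorer."""
    clf = RandomForestClassifier(n_estimators=500, random_state=0)
    return cross_val_score(clf, X, y, cv=5, scoring="accuracy").mean()
```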
Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025
Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…
Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy
Olivera-Aguilar, Margarita; Lee, Hee-Sun; Pallant, Amy; Belur, Vinetha; Mulholland, Matthew; Liu, Ou Lydia – ETS Research Report Series, 2022
This study uses a computerized formative assessment system that provides automated scoring and feedback to help students write scientific arguments in a climate change curriculum. We compared the effect of contextualized versus generic automated feedback on students' explanations of scientific claims and attributions of uncertainty to those…
Descriptors: Computer Assisted Testing, Formative Evaluation, Automation, Scoring
Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024
The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…
Descriptors: Accuracy, Reliability, Computational Linguistics, Standards
Wesley Morris; Langdon Holmes; Joon Suh Choi; Scott Crossley – International Journal of Artificial Intelligence in Education, 2025
Recent developments in the field of artificial intelligence allow for improved performance in the automated assessment of extended response items in mathematics, potentially allowing for the scoring of these items cheaply and at scale. This study details the grand prize-winning approach to developing large language models (LLMs) to automatically…
Descriptors: Automation, Computer Assisted Testing, Mathematics Tests, Scoring
Zhang, Haoran; Litman, Diane – Grantee Submission, 2020
While automated essay scoring (AES) can reliably grade essays at scale, automated writing evaluation (AWE) additionally provides formative feedback to guide essay revision. However, a neural AES typically does not provide useful feature representations for supporting AWE. This paper presents a method for linking AWE and neural AES, by extracting…
Descriptors: Computer Assisted Testing, Scoring, Essay Tests, Writing Evaluation
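One way to extract intermediate representations from a neural AES model, so they can feed an AWE feedback component, is a forward hook; a PyTorch sketch with a toy two-layer scorer standing in for a real AES network:

```python
# Sketch: capturing a hidden representation from a neural scorer via a hook.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(300, 64), nn.ReLU(), nn.Linear(64, 1))
captured = {}

def save_hidden(module, inputs, output):
    captured["hidden"] = output.detach()

model[1].register_forward_hook(save_hidden)   # hook on the ReLU output

essay_vec = torch.randn(1, 300)   # stand-in essay embedding
score = model(essay_vec)          # AES score
features = captured["hidden"]     # representation available for AWE feedback
```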
