ERIC - Search Results

Publication Date

In 2026	0
Since 2025	6
Since 2022 (last 5 years)	16
Since 2017 (last 10 years)	30
Since 2007 (last 20 years)	57

Descriptor

Computer Assisted Testing	61
Evaluators	61
Scoring	32
Second Language Learning	32
English (Second Language)	30
Language Tests	27
Foreign Countries	23
Comparative Analysis	21
Correlation	19
Essays	19
Scores	19
Interrater Reliability	16
Computer Software	15
Language Proficiency	14
Writing Evaluation	14
Oral Language	12
Rating Scales	11
Statistical Analysis	11
Accuracy	10
Evaluation Methods	10
Second Language Instruction	9
Undergraduate Students	9
Writing Tests	9
Artificial Intelligence	8
College Students	8
More ▼

Publication Type

Journal Articles	61
Reports - Research	54
Tests/Questionnaires	8
Reports - Descriptive	4
Reports - Evaluative	2
Information Analyses	1

Education Level

Higher Education	21
Postsecondary Education	20
Secondary Education	5
Elementary Education	2
High Schools	2
Elementary Secondary Education	1
Grade 11	1
Grade 5	1
Intermediate Grades	1
Middle Schools	1

Audience

Location

China	6
Hong Kong	4
Germany	2
Taiwan	2
United Kingdom	2
Australia	1
California	1
China (Beijing)	1
Cyprus	1
Europe	1
Greece	1
Iran	1
Japan	1
Singapore	1
Switzerland	1
Texas	1
Turkey	1
United States	1
Vietnam	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	16
International English…	2
ACTFL Oral Proficiency…	1
Foreign Language Classroom…	1
Graduate Record Examinations	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 61 results Save | Export

Assessing Penmanship of Chinese Handwriting: A Deep Learning-Based Approach

Peer reviewed

Direct link

Zebo Xu; Prerit S. Mittal; Mohd. Mohsin Ahmed; Chandranath Adak; Zhenguang G. Cai – Reading and Writing: An Interdisciplinary Journal, 2025

The rise of the digital era has led to a decline in handwriting as the primary mode of communication, resulting in negative effects on handwriting literacy, particularly in complex writing systems such as Chinese. The marginalization of handwriting has contributed to the deterioration of penmanship, defined as the ability to write aesthetically…

Descriptors: Handwriting, Writing Skills, Chinese, Ideography

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Direct link

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Using Linkage Sets to Improve Connectedness in Rater Response Model Estimation

Peer reviewed

Direct link

Casabianca, Jodi M.; Donoghue, John R.; Shin, Hyo Jeong; Chao, Szu-Fu; Choi, Ikkyu – Journal of Educational Measurement, 2023

Using item-response theory to model rater effects provides an alternative solution for rater monitoring and diagnosis, compared to using standard performance metrics. In order to fit such models, the ratings data must be sufficiently connected in order to estimate rater effects. Due to popular rating designs used in large-scale testing scenarios,…

Descriptors: Item Response Theory, Alternative Assessment, Evaluators, Research Problems

Automated Essay Scoring and Revising Based on Open-Source Large Language Models

Peer reviewed

Direct link

Yishen Song; Qianta Zhu; Huaibo Wang; Qinhua Zheng – IEEE Transactions on Learning Technologies, 2024

Manually scoring and revising student essays has long been a time-consuming task for educators. With the rise of natural language processing techniques, automated essay scoring (AES) and automated essay revising (AER) have emerged to alleviate this burden. However, current AES and AER models require large amounts of training data and lack…

Descriptors: Scoring, Essays, Writing Evaluation, Computer Software

Generative AI vs. Instructor vs. Peer Assessments: A Comparison of Grading and Feedback in Higher Education

Peer reviewed

Direct link

Maya Usher – Assessment & Evaluation in Higher Education, 2025

The integration of Generative Artificial Intelligence (GenAI) in education has introduced innovative approaches to assessment. One such approach is AI chatbot-based assessment, which utilizes large language models to provide students with timely and consistent feedback. However, the effectiveness of AI chatbots in generating assessments comparable…

Descriptors: Artificial Intelligence, Computer Assisted Testing, Student Evaluation, Peer Evaluation

Can AI Grade Like a Human? Validity, Reliability, and Fairness in University Coursework Assessment

Peer reviewed
PDF on ERIC

Download full text

Georgios Zacharis; Stamatios Papadakis – Educational Process: International Journal, 2025

Background/purpose: Generative artificial intelligence (GenAI) is often promoted as a transformative tool for assessment, yet evidence of its validity compared to human raters remains limited. This study examined whether an AI-based rater could be used interchangeably with trained faculty in scoring complex coursework. Materials/methods:…

Descriptors: Artificial Intelligence, Technology Uses in Education, Computer Assisted Testing, Grading

Application of an Automated Essay Scoring Engine to English Writing Assessment Using Many-Facet Rasch Measurement

Peer reviewed

Direct link

Chan, Kinnie Kin Yee; Bond, Trevor; Yan, Zi – Language Testing, 2023

We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing by instigating two procedures novel to written-language assessment: the logistic transformation of AES raw scores into…

Descriptors: Computer Assisted Testing, Essays, Scoring, Scores

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

Artificial Intelligence as an Automated Essay Scoring Tool: A Focus on ChatGPT

Peer reviewed
PDF on ERIC

Download full text

Ahmet Can Uyar; Dilek Büyükahiska – International Journal of Assessment Tools in Education, 2025

This study explores the effectiveness of using ChatGPT, an Artificial Intelligence (AI) language model, as an Automated Essay Scoring (AES) tool for grading English as a Foreign Language (EFL) learners' essays. The corpus consists of 50 essays representing various types including analysis, compare and contrast, descriptive, narrative, and opinion…

Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, Teaching Methods

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Direct link

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Automated Speech Scoring of Dialogue Response by Japanese Learners of English as a Foreign Language

Peer reviewed

Direct link

Yuko Hayashi; Yusuke Kondo; Yutaka Ishii – Innovation in Language Learning and Teaching, 2024

Purpose: This study builds a new system for automatically assessing learners' speech elicited from an oral discourse completion task (DCT), and evaluates the prediction capability of the system with a view to better understanding factors deemed influential in predicting speaking proficiency scores and the pedagogical implications of the system.…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Japanese

A Hybrid Approach for Automatic Generation of Named Entity Distractors for Multiple Choice Questions

Peer reviewed

Direct link

Patra, Rakesh; Saha, Sujan Kumar – Education and Information Technologies, 2019

Assessment plays an important role in learning and Multiple Choice Questions (MCQs) are quite popular in large-scale evaluations. Technology-enabled learning necessitates a smart assessment. Therefore, automatic MCQ generation became increasingly popular in the last two decades. Despite a large amount of research effort, system generated MCQs are…

Descriptors: Multiple Choice Tests, High Stakes Tests, Semantics, Evaluation Methods

Using Latent Semantic Analysis to Score Short Answer Constructed Responses: Automated Scoring of the Consequences Test

Peer reviewed

Direct link

LaVoie, Noelle; Parker, James; Legree, Peter J.; Ardison, Sharon; Kilcullen, Robert N. – Educational and Psychological Measurement, 2020

Automated scoring based on Latent Semantic Analysis (LSA) has been successfully used to score essays and constrained short answer responses. Scoring tests that capture open-ended, short answer responses poses some challenges for machine learning approaches. We used LSA techniques to score short answer responses to the Consequences Test, a measure…

Descriptors: Semantics, Evaluators, Essays, Scoring

Human versus Computer Partner in the Paired Oral Discussion Test

Peer reviewed

Direct link

Ockey, Gary J.; Chukharev-Hudilainen, Evgeny – Applied Linguistics, 2021

A challenge of large-scale oral communication assessments is to feasibly assess a broad construct that includes interactional competence. One possible approach in addressing this challenge is to use a spoken dialog system (SDS), with the computer acting as a peer to elicit a ratable speech sample. With this aim, an SDS was built and four trained…

Descriptors: Oral Language, Grammar, Language Fluency, Language Tests

Temporal Fluency and Floor/Ceiling Scoring of Intermediate and Advanced Speech on the ACTFL Spanish Oral Proficiency Interview--Computer

Peer reviewed

Direct link

Cox, Troy L.; Brown, Alan V.; Thompson, Gregory L. – Language Testing, 2023

The rating of proficiency tests that use the Inter-agency Roundtable (ILR) and American Council on the Teaching of Foreign Languages (ACTFL) guidelines claims that each major level is based on hierarchal linguistic functions that require mastery of multidimensional traits in such a way that each level subsumes the levels beneath it. These…

Descriptors: Oral Language, Language Fluency, Scoring, Cues

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5

ETS Research Report Series	9
Language Testing	8
Language Assessment Quarterly	5
Applied Measurement in…	3
Assessment in Education:…	2
British Journal of…	2
English Language Teaching	2
International Journal of…	2
Journal of Educational…	2
Advances in Physiology…	1
Applied Linguistics	1
Assessment & Evaluation in…	1
CALICO Journal	1
Contemporary Educational…	1
Contemporary Issues in…	1
Education Journal	1
Education and Information…	1
Educational Process:…	1
Educational Research and…	1
Educational and Psychological…	1
English Teaching	1
European Journal of Open,…	1
Evaluation and Program…	1
IEEE Transactions on Learning…	1
Innovation in Language…	1
More ▼

Coniam, David	4
Attali, Yigal	2
Bridgeman, Brent	2
Casabianca, Jodi M.	2
Davis, Larry	2
Kunnan, Antony John	2
Mollaun, Pamela	2
Xi, Xiaoming	2
Yan, Zi	2
Zechner, Klaus	2
Ahmet Can Uyar	1
Alegre, Analucia	1
Alex J. Mechaber	1
Amanda Huee-Ping Wong	1
Amrane-Cooper, Linda	1
Apple, Kristen	1
Ardison, Sharon	1
Bejar, Isaac I.	1
Bell, John F.	1
Bennett, Randy Elliot	1
Blanchard, Daniel	1
Bond, Trevor	1
Breyer, F. Jay	1
Brian E. Clauser	1
Brown, Alan V.	1
More ▼