ERIC - Search Results

Showing 1 to 15 of 61 results

Assessing Penmanship of Chinese Handwriting: A Deep Learning-Based Approach

Peer reviewed

Zebo Xu; Prerit S. Mittal; Mohd. Mohsin Ahmed; Chandranath Adak; Zhenguang G. Cai – Reading and Writing: An Interdisciplinary Journal, 2025

The rise of the digital era has led to a decline in handwriting as the primary mode of communication, resulting in negative effects on handwriting literacy, particularly in complex writing systems such as Chinese. The marginalization of handwriting has contributed to the deterioration of penmanship, defined as the ability to write aesthetically…

Descriptors: Handwriting, Writing Skills, Chinese, Ideography

The Vulnerability of AI-Based Scoring Systems to Gaming Strategies: A Case Study

Peer reviewed

Peter Baldwin; Victoria Yaneva; Kai North; Le An Ha; Yiyun Zhou; Alex J. Mechaber; Brian E. Clauser – Journal of Educational Measurement, 2025

Recent developments in the use of large-language models have led to substantial improvements in the accuracy of content-based automated scoring of free-text responses. The reported accuracy levels suggest that automated systems could have widespread applicability in assessment. However, before they are used in operational testing, other aspects of…

Descriptors: Artificial Intelligence, Scoring, Computational Linguistics, Accuracy

Using Linkage Sets to Improve Connectedness in Rater Response Model Estimation

Peer reviewed

Casabianca, Jodi M.; Donoghue, John R.; Shin, Hyo Jeong; Chao, Szu-Fu; Choi, Ikkyu – Journal of Educational Measurement, 2023

Using item-response theory to model rater effects provides an alternative solution for rater monitoring and diagnosis, compared to using standard performance metrics. In order to fit such models, the ratings data must be sufficiently connected in order to estimate rater effects. Due to popular rating designs used in large-scale testing scenarios,…

Descriptors: Item Response Theory, Alternative Assessment, Evaluators, Research Problems

Automated Essay Scoring and Revising Based on Open-Source Large Language Models

Peer reviewed

Yishen Song; Qianta Zhu; Huaibo Wang; Qinhua Zheng – IEEE Transactions on Learning Technologies, 2024

Manually scoring and revising student essays has long been a time-consuming task for educators. With the rise of natural language processing techniques, automated essay scoring (AES) and automated essay revising (AER) have emerged to alleviate this burden. However, current AES and AER models require large amounts of training data and lack…

Descriptors: Scoring, Essays, Writing Evaluation, Computer Software

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed
PDF on ERIC

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Generative AI vs. Instructor vs. Peer Assessments: A Comparison of Grading and Feedback in Higher Education

Peer reviewed

Maya Usher – Assessment & Evaluation in Higher Education, 2025

The integration of Generative Artificial Intelligence (GenAI) in education has introduced innovative approaches to assessment. One such approach is AI chatbot-based assessment, which utilizes large language models to provide students with timely and consistent feedback. However, the effectiveness of AI chatbots in generating assessments comparable…

Descriptors: Artificial Intelligence, Computer Assisted Testing, Student Evaluation, Peer Evaluation

Can AI Grade Like a Human? Validity, Reliability, and Fairness in University Coursework Assessment

Peer reviewed
PDF on ERIC

Georgios Zacharis; Stamatios Papadakis – Educational Process: International Journal, 2025

Background/purpose: Generative artificial intelligence (GenAI) is often promoted as a transformative tool for assessment, yet evidence of its validity compared to human raters remains limited. This study examined whether an AI-based rater could be used interchangeably with trained faculty in scoring complex coursework. Materials/methods:…

Descriptors: Artificial Intelligence, Technology Uses in Education, Computer Assisted Testing, Grading

Application of an Automated Essay Scoring Engine to English Writing Assessment Using Many-Facet Rasch Measurement

Peer reviewed

Chan, Kinnie Kin Yee; Bond, Trevor; Yan, Zi – Language Testing, 2023

We investigated the relationship between the scores assigned by an Automated Essay Scoring (AES) system, the Intelligent Essay Assessor (IEA), and grades allocated by trained, professional human raters to English essay writing by instigating two procedures novel to written-language assessment: the logistic transformation of AES raw scores into…

Descriptors: Computer Assisted Testing, Essays, Scoring, Scores

Artificial Intelligence as an Automated Essay Scoring Tool: A Focus on ChatGPT

Peer reviewed
PDF on ERIC

Ahmet Can Uyar; Dilek Büyükahiska – International Journal of Assessment Tools in Education, 2025

This study explores the effectiveness of using ChatGPT, an Artificial Intelligence (AI) language model, as an Automated Essay Scoring (AES) tool for grading English as a Foreign Language (EFL) learners' essays. The corpus consists of 50 essays representing various types including analysis, compare and contrast, descriptive, narrative, and opinion…

Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, Teaching Methods

Measuring Original Thinking in Elementary School: Development and Validation of a Computational Psychometric Approach

Peer reviewed

Selcuk Acar; Denis Dumas; Peter Organisciak; Kelly Berthiaume – Grantee Submission, 2024

Creativity is highly valued in both education and the workforce, but assessing and developing creativity can be difficult without psychometrically robust and affordable tools. The open-ended nature of creativity assessments has made them difficult to score, expensive, often imprecise, and therefore impractical for school- or district-wide use. To…

Descriptors: Thinking Skills, Elementary School Students, Artificial Intelligence, Measurement Techniques

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Automated Speech Scoring of Dialogue Response by Japanese Learners of English as a Foreign Language

Peer reviewed

Yuko Hayashi; Yusuke Kondo; Yutaka Ishii – Innovation in Language Learning and Teaching, 2024

Purpose: This study builds a new system for automatically assessing learners' speech elicited from an oral discourse completion task (DCT), and evaluates the prediction capability of the system with a view to better understanding factors deemed influential in predicting speaking proficiency scores and the pedagogical implications of the system.…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Japanese

Using Latent Semantic Analysis to Score Short Answer Constructed Responses: Automated Scoring of the Consequences Test

Peer reviewed

LaVoie, Noelle; Parker, James; Legree, Peter J.; Ardison, Sharon; Kilcullen, Robert N. – Educational and Psychological Measurement, 2020

Automated scoring based on Latent Semantic Analysis (LSA) has been successfully used to score essays and constrained short answer responses. Scoring tests that capture open-ended, short answer responses poses some challenges for machine learning approaches. We used LSA techniques to score short answer responses to the Consequences Test, a measure…

Descriptors: Semantics, Evaluators, Essays, Scoring

Human versus Computer Partner in the Paired Oral Discussion Test

Peer reviewed

Ockey, Gary J.; Chukharev-Hudilainen, Evgeny – Applied Linguistics, 2021

A challenge of large-scale oral communication assessments is to feasibly assess a broad construct that includes interactional competence. One possible approach in addressing this challenge is to use a spoken dialog system (SDS), with the computer acting as a peer to elicit a ratable speech sample. With this aim, an SDS was built and four trained…

Descriptors: Oral Language, Grammar, Language Fluency, Language Tests

Temporal Fluency and Floor/Ceiling Scoring of Intermediate and Advanced Speech on the ACTFL Spanish Oral Proficiency Interview--Computer

Peer reviewed

Cox, Troy L.; Brown, Alan V.; Thompson, Gregory L. – Language Testing, 2023

The rating of proficiency tests that use the Inter-agency Roundtable (ILR) and American Council on the Teaching of Foreign Languages (ACTFL) guidelines claims that each major level is based on hierarchal linguistic functions that require mastery of multidimensional traits in such a way that each level subsumes the levels beneath it. These…

Descriptors: Oral Language, Language Fluency, Scoring, Cues
