ERIC - Search Results

Showing 1 to 15 of 23 results

Automated Essay Scoring and Revising Based on Open-Source Large Language Models

Peer reviewed

Yishen Song; Qianta Zhu; Huaibo Wang; Qinhua Zheng – IEEE Transactions on Learning Technologies, 2024

Manually scoring and revising student essays has long been a time-consuming task for educators. With the rise of natural language processing techniques, automated essay scoring (AES) and automated essay revising (AER) have emerged to alleviate this burden. However, current AES and AER models require large amounts of training data and lack…

Descriptors: Scoring, Essays, Writing Evaluation, Computer Software

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Authenticity of Academic Lecture Passages in High-Stakes Tests: A Temporal Fluency Perspective

Peer reviewed

Hitoshi Nishizawa – Language Testing, 2024

Corpus-based studies have offered the domain definition inference for test developers. Yet, corpus-based studies on temporal fluency measures (e.g., speech rate) have been limited, especially in the context of academic lecture settings. This made it difficult for test developers to sample representative fluency features to create authentic…

Descriptors: High Stakes Tests, Language Tests, Second Language Learning, Computer Assisted Testing

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

Individual Fairness Evaluation for Automated Essay Scoring System

Peer reviewed

Doewes, Afrizal; Saxena, Akrati; Pei, Yulong; Pechenizkiy, Mykola – International Educational Data Mining Society, 2022

In Automated Essay Scoring (AES) systems, many previous works have studied group fairness using the demographic features of essay writers. However, individual fairness also plays an important role in fair evaluation and has not been yet explored. Initialized by Dwork et al., the fundamental concept of individual fairness is "similar people…

Descriptors: Scoring, Essays, Writing Evaluation, Comparative Analysis

Developing a Generic Scorer for Practice Writing Tests of Statewide Assessment Essays with Natural Language Processing Transfer Learning Techniques

Yi Gui – ProQuest LLC, 2024

This study explores using transfer learning in machine learning for natural language processing (NLP) to create generic automated essay scoring (AES) models, providing instant online scoring for statewide writing assessments in K-12 education. The goal is to develop an instant online scorer that is generalizable to any prompt, addressing the…

Descriptors: Writing Tests, Natural Language Processing, Writing Evaluation, Scoring

Artificial Intelligence as an Automated Essay Scoring Tool: A Focus on ChatGPT

Peer reviewed

Ahmet Can Uyar; Dilek Büyükahiska – International Journal of Assessment Tools in Education, 2025

This study explores the effectiveness of using ChatGPT, an Artificial Intelligence (AI) language model, as an Automated Essay Scoring (AES) tool for grading English as a Foreign Language (EFL) learners' essays. The corpus consists of 50 essays representing various types including analysis, compare and contrast, descriptive, narrative, and opinion…

Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, Teaching Methods

Mitigating Gender and L1 Biases in Automated English Speaking Assessment

Alexander James Kwako – ProQuest LLC, 2023

Automated assessment using Natural Language Processing (NLP) has the potential to make English speaking assessments more reliable, authentic, and accessible. Yet without careful examination, NLP may exacerbate social prejudices based on gender or native language (L1). Current NLP-based assessments are prone to such biases, yet research and…

Descriptors: Gender Bias, Natural Language Processing, Native Language, Computational Linguistics

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

The Development of Lexical Inhibition in Spoken Word Recognition

Peer reviewed

Blomquist, Christina; McMurray, Bob – Developmental Psychology, 2023

As a spoken word unfolds over time, similar sounding words ("cap" and "cat") compete until one word "wins". Lexical competition becomes more efficient from infancy through adolescence. We examined one potential mechanism underlying this development: lexical inhibition, by which activated candidates suppress…

Descriptors: Speech Communication, Language Acquisition, Age Differences, Word Recognition

Evaluation of Automated Vocabulary Quiz Generation with VocQGen

Peer reviewed

Qiao Wang; Ralph L. Rose; Ayaka Sugawara; Naho Orita – Vocabulary Learning and Instruction, 2025

VocQGen is an automated tool designed to generate multiple-choice cloze (MCC) questions for vocabulary assessment in second language learning contexts. It leverages several natural language processing (NLP) tools and OpenAI's GPT-4 model to produce MCC items quickly from user-specified word lists. To evaluate its effectiveness, we used the first…

Descriptors: Vocabulary Skills, Artificial Intelligence, Computer Software, Multiple Choice Tests

Proficiency at the Lexis-Grammar Interface: Comparing Oral versus Written French Exam Tasks

Peer reviewed

Vandeweerd, Nathan; Housen, Alex; Paquot, Magali – Language Testing, 2023

This study investigates whether re-thinking the separation of lexis and grammar in language testing could lead to more valid inferences about proficiency across modes. As argued by Römer, typical scoring rubrics ignore important information about proficiency encoded at the lexis-grammar interface, in particular how the co-selection of lexical and…

Descriptors: French, Language Tests, Grammar, Second Language Learning

TOEFL iBT Iranian Test-Takers' Oral Language Performance: A Comparison between Independent and Integrated Speaking Tasks

Peer reviewed

Ariamanesh, Ali A.; Barati, Hossein; Youhanaee, Manijeh – International TESOL Journal, 2022

The present study investigated the speaking module of TOEFL iBT with an emphasis on the dichotomy of independent and integrated tasks. The potential differences between the two speaking conditions were intended to be explored based on the oral performance elicited from a group of Iranian test takers. To collect the required data, a simulated…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Computer Assisted Testing

Assessing Open-Book-Open-Web Exam in High Schools: The Case of a Developing Country

Peer reviewed

El Rassi, Mary Ann Barbour – International Association for Development of the Information Society, 2019

It has long been debated whether the Open-Book-Open-Web exam was useful and efficient as the traditional closed book exams. Some scholars and practitioners have doubted the efficiency and the possibility of cheating in the OBOW as it is not directly monitored. This paper tends to investigate the effectiveness of OBOW exams by comparing them with…

Descriptors: Developing Nations, Test Format, Tests, Cheating

A Comparison of Spoken and Written Language Use in Traditional and Technology-Mediated Learning Environments. TOEFL® Research Report. RR-94. ETS RR-21-16

Peer reviewed

Kyle, Kristopher; Choe, Ann Tai; Eguchi, Masaki; LaFlair, Geoff; Ziegler, Nicole – ETS Research Report Series, 2021

A key piece of a validity argument for a language assessment tool is clear overlap between assessment tasks and the target language use (TLU) domain (i.e., the domain description inference). The TOEFL 2000 Spoken and Written Academic Language (T2K-SWAL) corpus, which represents a variety of academic registers and disciplines in traditional…

Descriptors: Comparative Analysis, Second Language Learning, English (Second Language), Language Tests
