ERIC - Search Results

Publication Date

In 2026	0
Since 2025	6
Since 2022 (last 5 years)	26
Since 2017 (last 10 years)	59
Since 2007 (last 20 years)	87

Descriptor

Comparative Analysis	88
Evaluators	88
Second Language Learning	88
English (Second Language)	77
Second Language Instruction	49
Foreign Countries	45
Language Tests	35
Writing Evaluation	25
Scores	24
Teaching Methods	22
Pronunciation	21
Speech Communication	21
Correlation	20
Language Proficiency	20
Oral Language	19
Essays	18
Native Language	18
Native Speakers	18
Scoring	18
Accuracy	16
Language Teachers	15
Computer Software	14
Interrater Reliability	14
Statistical Analysis	14
Grammar	13
More ▼

Publication Type

Journal Articles	79
Reports - Research	73
Tests/Questionnaires	12
Dissertations/Theses -…	6
Reports - Evaluative	5
Information Analyses	2
Reports - Descriptive	2
Speeches/Meeting Papers	2
Guides - Non-Classroom	1

Education Level

Higher Education	39
Postsecondary Education	33
Secondary Education	7
High Schools	3
Adult Education	2
Elementary Education	1
Elementary Secondary Education	1
Grade 1	1
Grade 11	1
Grade 12	1
Grade 2	1
Grade 3	1
Kindergarten	1
More ▼

Audience

Practitioners

Location

China	7
Iran	4
Turkey	4
Canada	3
Europe	3
Japan	3
United States	3
Australia	2
Hong Kong	2
Thailand	2
Afghanistan	1
Argentina	1
Brazil	1
Canada (Montreal)	1
India	1
Iran (Tehran)	1
Israel	1
Massachusetts	1
Mexico	1
Mexico (Oaxaca)	1
Nebraska	1
Netherlands	1
North America	1
Oman	1
Pakistan	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Test of English as a Foreign…	6
International English…	5
Program for International…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 88 results Save | Export

Comparing L2 Intelligibility for Learners of French: Automatic Speech Recognition versus Human Listeners

Peer reviewed

Direct link

Elena Shimanskaya – Foreign Language Annals, 2025

In this study, I compare the accuracy of automatic speech recognition (ASR) transcription against two measures of intelligibility provided by human listeners. The data came from readings of five texts recorded by 15 language learners of French. Human understanding was gauged by (i) asking a group of 36 naïve first language (L1) speakers of French…

Descriptors: Comparative Analysis, French, Second Language Learning, Second Language Instruction

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed
PDF on ERIC

Download full text

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

Utilizing Large Language Models for EFL Essay Grading: An Examination of Reliability and Validity in Rubric-Based Assessments

Peer reviewed

Direct link

Fatih Yavuz; Özgür Çelik; Gamze Yavas Çelik – British Journal of Educational Technology, 2025

This study investigates the validity and reliability of generative large language models (LLMs), specifically ChatGPT and Google's Bard, in grading student essays in higher education based on an analytical grading rubric. A total of 15 experienced English as a foreign language (EFL) instructors and two LLMs were asked to evaluate three student…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Computational Linguistics

The Visual Signature of Non-Understanding: A Systematic Replication of McDonough, Trofimovich, Lu, and Abashidze (2019)

Peer reviewed

Direct link

McDonough, Kim; Lindberg, Rachael; Trofimovich, Pavel; Tekin, Oguzhan – Language Teaching, 2023

This replication study seeks to extend the generalizability of an exploratory study (McDonough et al., 2019) that identified holds (i.e., temporary cessation of dynamic movement by the listener) as a reliable visual cue of non-understanding. Conversations between second language (L2) English speakers in the Corpus of English as a Lingua Franca…

Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computational Linguistics

The Intersection of AI and Language Assessment: A Study on the Reliability of ChatGPT in Grading IELTS Writing Task 2

Peer reviewed
PDF on ERIC

Download full text

Osama Koraishi – Language Teaching Research Quarterly, 2024

This study conducts a comprehensive quantitative evaluation of OpenAI's language model, ChatGPT 4, for grading Task 2 writing of the IELTS exam. The objective is to assess the alignment between ChatGPT's grading and that of official human raters. The analysis encompassed a multifaceted approach, including a comparison of means and reliability…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Artificial Intelligence

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

Depth-Perception-Based Representation in Holistic Rating on ESL Essay Writing

Peer reviewed

Direct link

Lian Li; Jiehui Hu; Yu Dai; Ping Zhou; Wanhong Zhang – Reading & Writing Quarterly, 2024

This paper proposes to use depth perception to represent raters' decision in holistic evaluation of ESL essays, as an alternative medium to conventional form of numerical scores. The researchers verified the new method's accuracy and inter/intra-rater reliability by inviting 24 ESL teachers to perform different representations when rating 60…

Descriptors: Essays, Holistic Approach, Writing Evaluation, Accuracy

Comprehensible to Whom? Examining Rater, Speaker, and Interlocutor Perspectives on Comprehensibility in an Interactive Context

Peer reviewed

Direct link

Nagle, Charlie L.; Trofimovich, Pavel; O'Brien, Mary Grantham; Kennedy, Sara – Modern Language Journal, 2022

Comprehensibility has emerged as a useful and intuitive means of globally evaluating second language (L2) speakers in many research and instructional contexts. In most cases, L2 speakers' comprehensibility is assessed by external listeners who do not engage in extensive communication with the speakers, even though the degree to which a speaker is…

Descriptors: Evaluators, Intelligibility, Pronunciation, Task Analysis

Artificial Intelligence as an Automated Essay Scoring Tool: A Focus on ChatGPT

Peer reviewed
PDF on ERIC

Download full text

Ahmet Can Uyar; Dilek Büyükahiska – International Journal of Assessment Tools in Education, 2025

This study explores the effectiveness of using ChatGPT, an Artificial Intelligence (AI) language model, as an Automated Essay Scoring (AES) tool for grading English as a Foreign Language (EFL) learners' essays. The corpus consists of 50 essays representing various types including analysis, compare and contrast, descriptive, narrative, and opinion…

Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, Teaching Methods

Comparative Analysis of Human Graders and AI in Assessing Secondary School EFL Journal Writing

Peer reviewed
PDF on ERIC

Download full text

Seval Kemal; Aysegül Liman-Kaban – Asian Journal of Distance Education, 2025

This study conducts a comprehensive analysis of the assessment of journal writing in English as a Foreign Language (EFL) at the secondary school level, comparing the performance of a Generative Artificial Intelligence (GenAI) platform with two human graders. Employing a convergent parallel mixed methods design, quantitative data were collected…

Descriptors: Artificial Intelligence, Secondary School Students, Feedback (Response), Writing Assignments

Interlanguage Phonology and Accentedness: An Experimental Study of Thai Final Nasal Consonants in Chinese Students Learning Thai

Peer reviewed
PDF on ERIC

Download full text

Hou, Peng; Kraisame, Sarawut – rEFLections, 2023

This paper provides an experimental study of interlanguage phonological characteristics of Chinese students learning Thai as a foreign language and the accentedness perceived by native Thai speakers. Both production and perception experiments were designed to see how Chinese students acoustically produced Thai final nasal consonants and how Thai…

Descriptors: Phonology, Pronunciation, Thai, Second Language Learning

Automated Assessment of Second Language Comprehensibility: Review, Training, Validation, and Generalization Studies

Peer reviewed

Direct link

Saito, Kazuya; Macmillan, Konstantinos; Kachlicka, Magdalena; Kunihara, Takuya; Minematsu, Nobuaki – Studies in Second Language Acquisition, 2023

Whereas many scholars have emphasized the relative importance of "comprehensibility" as an ecologically valid goal for L2 speech training, testing, and development, eliciting listeners' judgments is time-consuming. Following calls for research on more efficient L2 speech rating methods in applied linguistics, and growing attention toward…

Descriptors: Second Language Learning, Second Language Instruction, Interrater Reliability, Speech Communication

Assessing Second-Language Academic Writing: AI vs. Human Raters

Peer reviewed
PDF on ERIC

Download full text

Vasfiye Geçkin; Ebru Kiziltas; Çagatay Çinar – Journal of Educational Technology and Online Learning, 2023

The quality of writing in a second language (L2) is one of the indicators of the level of proficiency for many college students to be eligible for departmental studies. Although certain software programs, such as Intelligent Essay Assessor or IntelliMetric, have been introduced to evaluate second-language writing quality, an overall assessment of…

Descriptors: Writing Evaluation, Second Language Learning, Second Language Instruction, Language Proficiency

Automated Speech Scoring of Dialogue Response by Japanese Learners of English as a Foreign Language

Peer reviewed

Direct link

Yuko Hayashi; Yusuke Kondo; Yutaka Ishii – Innovation in Language Learning and Teaching, 2024

Purpose: This study builds a new system for automatically assessing learners' speech elicited from an oral discourse completion task (DCT), and evaluates the prediction capability of the system with a view to better understanding factors deemed influential in predicting speaking proficiency scores and the pedagogical implications of the system.…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Japanese

The Dual Personality of 'Topic' in the IELTS Speaking Test

Peer reviewed

Direct link

Seedhouse, Paul – ELT Journal, 2019

This article investigates the central role of topic in the IELTS Speaking Test (IST). Topic has developed a dual personality in this interactional setting: topic-as-script is the scripted statement of topic on the examiner's cards prior to the interaction, whereas topic-as-action is how topic is developed by the candidate during the course of the…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Personality Traits

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Language Assessment Quarterly	9
Language Testing	9
ProQuest LLC	6
Language Testing in Asia	4
Studies in Second Language…	4
TESOL Quarterly: A Journal…	3
Computer Assisted Language…	2
ETS Research Report Series	2
English Language Teaching	2
International Journal of…	2
Language Learning	2
Language Teaching Research	2
Language and Education	2
TESL Canada Journal	2
rEFLections	2
Advances in Language and…	1
Asian Journal of Distance…	1
Assessment in Education:…	1
British Journal of…	1
CALICO Journal	1
CATESOL Journal	1
ELT Journal	1
Educational Research and…	1
English Teaching	1
Foreign Language Annals	1
More ▼

Trofimovich, Pavel	6
McDonough, Kim	3
Saito, Kazuya	3
Kennedy, Sara	2
Lindberg, Rachael	2
O'Brien, Mary Grantham	2
Sanders, Ted	2
Abashidze, Dato	1
Aggarwal, Varun	1
Ahmadi, Alireza	1
Ahmet Can Uyar	1
Allen, Laura K.	1
Amory, Michael	1
Attali, Yigal	1
Aysegül Liman-Kaban	1
Barkaoui, Khaled	1
Bosker, Hans Rutger	1
Brannen, Kathleen	1
Breyer, F. Jay	1
Briggs, Sarah L.	1
Brooks, Rachel Lunde	1
Brown, Anne	1
Buckingham, Louisa	1
Burke, Rachel	1
Cardoso, Walcir	1
More ▼