ERIC - Search Results

Publication Date

In 2025	9
Since 2024	17

Descriptor

Evaluators	17
Second Language Learning	17
English (Second Language)	15
Language Tests	10
Foreign Countries	9
Second Language Instruction	9
College Students	8
Writing Evaluation	8
Computer Software	7
Comparative Analysis	6
Language Proficiency	6
Scoring Rubrics	6
Artificial Intelligence	5
Essays	5
Scores	5
Accuracy	4
Language Teachers	4
Speech Communication	4
Writing Tests	4
Computational Linguistics	3
Computer Assisted Testing	3
Correlation	3
Decision Making	3
Evaluation Criteria	3
Native Language	3
More ▼

Source

Language Testing	5
International Journal of…	2
British Journal of…	1
Eurasian Journal of Applied…	1
Innovation in Language…	1
International Journal of…	1
International Journal of…	1
Journal of Education and…	1
Journal of Multilingual and…	1
Language Teaching Research…	1
Language Testing in Asia	1
Reading & Writing Quarterly	1
More ▼

Publication Type

Journal Articles	17
Reports - Research	17
Tests/Questionnaires	5

Education Level

Higher Education	11
Postsecondary Education	11
Secondary Education	1

Audience

Location

Japan	3
China	2
Iran	2
Australia	1
Hawaii	1
Illinois (Urbana)	1
New Zealand	1
Turkey	1

Laws, Policies, & Programs

Assessments and Surveys

International English…	4
Test of English as a Foreign…	2
ACTFL Oral Proficiency…	1
Foreign Language Classroom…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 17 results Save | Export

Do Source Use Features Impact Raters' Judgment of Argumentation? An Experimental Study

Peer reviewed

Direct link

Ping-Lin Chuang – Language Testing, 2025

This experimental study explores how source use features impact raters' judgment of argumentation in a second language (L2) integrated writing test. One hundred four experienced and novice raters were recruited to complete a rating task that simulated the scoring assignment of a local English Placement Test (EPT). Sixty written responses were…

Descriptors: Interrater Reliability, Evaluators, Information Sources, Primary Sources

Revisiting Raters' Accent Familiarity in Speaking Tests: Evidence That Presentation Mode Interacts with Accent Familiarity to Variably Affect Comprehensibility Ratings

Peer reviewed

Direct link

Michael D. Carey; Stefan Szocs – Language Testing, 2024

This controlled experimental study investigated the interaction of variables associated with rating the pronunciation component of high-stakes English-language-speaking tests such as IELTS and TOEFL iBT. One hundred experienced raters who were all either familiar or unfamiliar with Brazilian-accented English or Papua New Guinean Tok Pisin-accented…

Descriptors: Dialects, Pronunciation, Suprasegmentals, Familiarity

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed
PDF on ERIC

Download full text

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

Scoring Difficulty in Summary Writing Assessment: Toward the Reconstruction of Analytic Rubric

Peer reviewed
PDF on ERIC

Download full text

Makiko Kato – Journal of Education and Learning, 2025

This study aims to examine whether differences exist in the factors influencing the difficulty of scoring English summaries and determining scores based on the raters' attributes, and to collect candid opinions, considerations, and tentative suggestions for future improvements to the analytic rubric of summary writing for English learners. In this…

Descriptors: Writing Evaluation, Scoring, Writing Skills, English (Second Language)

Assessing the Content Quality of Essays in Content and Language Integrated Learning: Exploring the Construct from Subject Specialists' Perspectives

Peer reviewed

Direct link

Takanori Sato – Language Testing, 2024

Assessing the content of learners' compositions is a common practice in second language (L2) writing assessment. However, the construct definition of content in L2 writing assessment potentially underrepresents the target competence in content and language integrated learning (CLIL), which aims to foster not only L2 proficiency but also critical…

Descriptors: Language Tests, Content and Language Integrated Learning, Writing Evaluation, Writing Tests

Utilizing Large Language Models for EFL Essay Grading: An Examination of Reliability and Validity in Rubric-Based Assessments

Peer reviewed

Direct link

Fatih Yavuz; Özgür Çelik; Gamze Yavas Çelik – British Journal of Educational Technology, 2025

This study investigates the validity and reliability of generative large language models (LLMs), specifically ChatGPT and Google's Bard, in grading student essays in higher education based on an analytical grading rubric. A total of 15 experienced English as a foreign language (EFL) instructors and two LLMs were asked to evaluate three student…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Computational Linguistics

Making Each Point Count: Revising a Local Adaptation of the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE Rubric

Peer reviewed

Direct link

Yu-Tzu Chang; Ann Tai Choe; Daniel Holden; Daniel R. Isbell – Language Testing, 2024

In this Brief Report, we describe an evaluation of and revisions to a rubric adapted from the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE, with four rubric categories and 20-point rating scales, in the context of an intensive English program writing placement test. Analysis of 4 years of rating data (2016-2021, including 434 essays) using…

Descriptors: Language Tests, Rating Scales, Second Language Learning, English (Second Language)

Revisiting the Effectiveness of a Performance Decision Tree-Style Rubric Compared to a Grid-Style Rubric

Peer reviewed

Direct link

Yuichiro Yokouchi – Language Testing in Asia, 2025

The performance decision tree (PDT; Fulcher et al., 2011) is a rubric style that is applicable to performance assessment, with origins in Upshur and Turner's (1995) empirically derived binary-choice, boundary-definition (EBB) scale. It is easier for raters to assess performance by evaluating multiple binary-choice descriptors. Additionally,…

Descriptors: Scoring Rubrics, Second Language Learning, Second Language Instruction, Language Teachers

The Intersection of AI and Language Assessment: A Study on the Reliability of ChatGPT in Grading IELTS Writing Task 2

Peer reviewed
PDF on ERIC

Download full text

Osama Koraishi – Language Teaching Research Quarterly, 2024

This study conducts a comprehensive quantitative evaluation of OpenAI's language model, ChatGPT 4, for grading Task 2 writing of the IELTS exam. The objective is to assess the alignment between ChatGPT's grading and that of official human raters. The analysis encompassed a multifaceted approach, including a comparison of means and reliability…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Artificial Intelligence

Depth-Perception-Based Representation in Holistic Rating on ESL Essay Writing

Peer reviewed

Direct link

Lian Li; Jiehui Hu; Yu Dai; Ping Zhou; Wanhong Zhang – Reading & Writing Quarterly, 2024

This paper proposes to use depth perception to represent raters' decision in holistic evaluation of ESL essays, as an alternative medium to conventional form of numerical scores. The researchers verified the new method's accuracy and inter/intra-rater reliability by inviting 24 ESL teachers to perform different representations when rating 60…

Descriptors: Essays, Holistic Approach, Writing Evaluation, Accuracy

Artificial Intelligence as an Automated Essay Scoring Tool: A Focus on ChatGPT

Peer reviewed
PDF on ERIC

Download full text

Ahmet Can Uyar; Dilek Büyükahiska – International Journal of Assessment Tools in Education, 2025

This study explores the effectiveness of using ChatGPT, an Artificial Intelligence (AI) language model, as an Automated Essay Scoring (AES) tool for grading English as a Foreign Language (EFL) learners' essays. The corpus consists of 50 essays representing various types including analysis, compare and contrast, descriptive, narrative, and opinion…

Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, Teaching Methods

Automated Speech Scoring of Dialogue Response by Japanese Learners of English as a Foreign Language

Peer reviewed

Direct link

Yuko Hayashi; Yusuke Kondo; Yutaka Ishii – Innovation in Language Learning and Teaching, 2024

Purpose: This study builds a new system for automatically assessing learners' speech elicited from an oral discourse completion task (DCT), and evaluates the prediction capability of the system with a view to better understanding factors deemed influential in predicting speaking proficiency scores and the pedagogical implications of the system.…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Japanese

Accentedness and Personality Evaluation of Asian and Caucasian Second Language Speakers of English by Asian Second Language English Listeners

Peer reviewed

Direct link

Yao Lu; Ksenia Gnevsheva – Journal of Multilingual and Multicultural Development, 2024

Previous research that explores the effect of ethnicity in the perception of speaker accentedness and personality traits often finds that Asian appearance contributes to a more accented and less competent impression. Importantly, most of the work done to date employed only Caucasian first language-speaking listeners; moreover, ethnicity and gender…

Descriptors: Pronunciation, Gender Differences, Personality Traits, Korean

Optimizing Oral Proficiency Assessment in Chinese as a Second Language: Challenges and Improvement Strategies of the OPI Test

Peer reviewed
PDF on ERIC

Download full text

Na Liu; Jeferd Saong – International Journal of Education and Literacy Studies, 2025

The study examined the use of the Oral Proficiency Interview (OPI) in assessing Chinese as a second language and the challenges faced by teachers in the Chinese Language Scholarship (CLS) program. The Concurrent Triangulation Mixed Method Research was used in the study in which qualitative and quantitative data are collected simultaneously,…

Descriptors: Chinese, Second Language Learning, Oral Language, Language Proficiency

Human-AI Collaborative Feedback in Improving EFL Writing Performance: An Analysis Based on Natural Language Processing Technology

Peer reviewed
PDF on ERIC

Download full text

Xiaoling Bai; Nur Rasyidah Mohd Nordin – Eurasian Journal of Applied Linguistics, 2025

A perfect writing skill has been deemed instrumental to achieving competence in EFL, yet it is considered one of the most impressive learning domains. This study investigates the impact of human-AI collaborative feedback on the writing proficiency of EFL students. It examines key teaching domains, including the teaching environment, teacher…

Descriptors: Artificial Intelligence, Feedback (Response), Evaluators, Writing Skills

Previous Page | Next Page »

Pages: 1 | 2

Ahmet Can Uyar	1
Ann Tai Choe	1
Daniel Holden	1
Daniel R. Isbell	1
Dilek Büyükahiska	1
Fatih Yavuz	1
Gamze Yavas Çelik	1
Golam Reza Rohani	1
Hamdollah Ravand	1
J. Dylan Burton	1
Jeferd Saong	1
Jiehui Hu	1
Karim Sadeghi	1
Ksenia Gnevsheva	1
Lian Li	1
Makiko Kato	1
Michael D. Carey	1
Na Liu	1
Neda Bakhshi	1
Nur Rasyidah Mohd Nordin	1
Osama Koraishi	1
Ping Zhou	1
Ping-Lin Chuang	1
Reza Shahi	1
Stefan Szocs	1
More ▼