ERIC - Search Results

Publication Date

In 2025	2
Since 2024	4
Since 2021 (last 5 years)	13
Since 2016 (last 10 years)	29
Since 2006 (last 20 years)	42

Descriptor

Comparative Analysis	45
Evaluators	45
Language Tests	45
Second Language Learning	35
English (Second Language)	34
Foreign Countries	20
Language Proficiency	17
Scores	17
Oral Language	13
Second Language Instruction	13
Scoring	11
Computer Assisted Testing	9
Computer Software	9
Native Language	9
Speech Communication	9
Essays	8
Interrater Reliability	8
Testing	8
Correlation	7
Teaching Methods	7
Test Items	7
Undergraduate Students	7
Writing Evaluation	7
Evaluation Methods	6
Pronunciation	6
More ▼

Publication Type

Journal Articles	39
Reports - Research	37
Tests/Questionnaires	9
Reports - Evaluative	4
Dissertations/Theses -…	3
Speeches/Meeting Papers	3
Guides - Non-Classroom	1
Reports - Descriptive	1

Education Level

Higher Education	16
Postsecondary Education	13
Secondary Education	4
High Schools	2
Adult Education	1
Elementary Secondary Education	1
Grade 12	1

Audience

Practitioners

Location

Iran	3
China	2
Europe	2
Turkey	2
Australia	1
Canada	1
Hong Kong	1
India	1
Israel	1
Italy	1
Japan	1
Mexico (Oaxaca)	1
Netherlands	1
Pakistan	1
Spain	1
United States	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Test of English as a Foreign…	6
International English…	5
Program for International…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 45 results Save | Export

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed
PDF on ERIC

Download full text

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

The Intersection of AI and Language Assessment: A Study on the Reliability of ChatGPT in Grading IELTS Writing Task 2

Peer reviewed
PDF on ERIC

Download full text

Osama Koraishi – Language Teaching Research Quarterly, 2024

This study conducts a comprehensive quantitative evaluation of OpenAI's language model, ChatGPT 4, for grading Task 2 writing of the IELTS exam. The objective is to assess the alignment between ChatGPT's grading and that of official human raters. The analysis encompassed a multifaceted approach, including a comparison of means and reliability…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Artificial Intelligence

Crowdsourced Adaptive Comparative Judgment: A Community-Based Solution for Proficiency Rating

Peer reviewed

Direct link

Paquot, Magali; Rubin, Rachel; Vandeweerd, Nathan – Language Learning, 2022

The main objective of this Methods Showcase Article is to show how the technique of adaptive comparative judgment, coupled with a crowdsourcing approach, can offer practical solutions to reliability issues as well as to address the time and cost difficulties associated with a text-based approach to proficiency assessment in L2 research. We…

Descriptors: Comparative Analysis, Decision Making, Language Proficiency, Reliability

Artificial Intelligence as an Automated Essay Scoring Tool: A Focus on ChatGPT

Peer reviewed
PDF on ERIC

Download full text

Ahmet Can Uyar; Dilek Büyükahiska – International Journal of Assessment Tools in Education, 2025

This study explores the effectiveness of using ChatGPT, an Artificial Intelligence (AI) language model, as an Automated Essay Scoring (AES) tool for grading English as a Foreign Language (EFL) learners' essays. The corpus consists of 50 essays representing various types including analysis, compare and contrast, descriptive, narrative, and opinion…

Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, Teaching Methods

Mitigating Gender and L1 Biases in Automated English Speaking Assessment

Direct link

Alexander James Kwako – ProQuest LLC, 2023

Automated assessment using Natural Language Processing (NLP) has the potential to make English speaking assessments more reliable, authentic, and accessible. Yet without careful examination, NLP may exacerbate social prejudices based on gender or native language (L1). Current NLP-based assessments are prone to such biases, yet research and…

Descriptors: Gender Bias, Natural Language Processing, Native Language, Computational Linguistics

Automated Speech Scoring of Dialogue Response by Japanese Learners of English as a Foreign Language

Peer reviewed

Direct link

Yuko Hayashi; Yusuke Kondo; Yutaka Ishii – Innovation in Language Learning and Teaching, 2024

Purpose: This study builds a new system for automatically assessing learners' speech elicited from an oral discourse completion task (DCT), and evaluates the prediction capability of the system with a view to better understanding factors deemed influential in predicting speaking proficiency scores and the pedagogical implications of the system.…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Japanese

The Dual Personality of 'Topic' in the IELTS Speaking Test

Peer reviewed

Direct link

Seedhouse, Paul – ELT Journal, 2019

This article investigates the central role of topic in the IELTS Speaking Test (IST). Topic has developed a dual personality in this interactional setting: topic-as-script is the scripted statement of topic on the examiner's cards prior to the interaction, whereas topic-as-action is how topic is developed by the candidate during the course of the…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Personality Traits

Comparing Rating Modes: Analysing Live, Audio, and Video Ratings of IELTS Speaking Test Performances

Peer reviewed

Direct link

Nakatsuhara, Fumiyo; Inoue, Chihiro; Taylor, Lynda – Language Assessment Quarterly, 2021

This mixed methods study compared IELTS examiners' scores when assessing spoken performances under live and two 'non-live' testing conditions using audio and video recordings. Six IELTS examiners assessed 36 test-takers' performances under the live, audio, and video rating conditions. Scores in the three rating modes were calibrated using the…

Descriptors: Video Technology, Audio Equipment, English (Second Language), Language Tests

Investigating the Impact of Rater Training on Rater Errors in the Process of Assessing Writing Skill

Peer reviewed
PDF on ERIC

Download full text

Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022

In the process of measuring and assessing high-level cognitive skills, interference of rater errors in measurements brings about a constant concern and low objectivity. The main purpose of this study was to investigate the impact of rater training on rater errors in the process of assessing individual performance. The study was conducted with a…

Descriptors: Evaluators, Training, Comparative Analysis, Academic Language

Calibrated Parsing Items Evaluation: A Step towards Objectifying the Translation Assessment

Peer reviewed

Direct link

Akbari, Alireza; Shahnazari, Mohammadtaghi – Language Testing in Asia, 2019

The present research paper introduces a translation evaluation method called Calibrated Parsing Items Evaluation (CPIE hereafter). This evaluation method maximizes translators' performance through identifying the parsing items with an optimal p-docimology and d-index (item discrimination). This method checks all the possible parses (annotations)…

Descriptors: Test Items, Translation, Computer Software, Evaluators

Human versus Computer Partner in the Paired Oral Discussion Test

Peer reviewed

Direct link

Ockey, Gary J.; Chukharev-Hudilainen, Evgeny – Applied Linguistics, 2021

A challenge of large-scale oral communication assessments is to feasibly assess a broad construct that includes interactional competence. One possible approach in addressing this challenge is to use a spoken dialog system (SDS), with the computer acting as a peer to elicit a ratable speech sample. With this aim, an SDS was built and four trained…

Descriptors: Oral Language, Grammar, Language Fluency, Language Tests

Investigating Test Delivery Modes within Video-Conferenced English Speaking Proficiency Assessment

Direct link

Jin Soo Choi – ProQuest LLC, 2022

Nonverbal behavior is essential in human interaction (Gullberg, de Bot, & Volterra, 2008; McNeill, 1992, 2005). For second language speakers, nonverbal features can be helpful for successful and efficient communication (e.g., Dahl & Ludvigsen, 2014). However, due to the complexity of nonverbal features, language testing institutions have…

Descriptors: Language Tests, Language Proficiency, Videoconferencing, Second Language Learning

Using Google Voice Typing to Automatically Assess Pronunciation

Peer reviewed
PDF on ERIC

Download full text

Johnson, Carol; Cardoso, Walcir; Zuercher, Beau; Brannen, Kathleen; Springer, Suzanne – Research-publishing.net, 2022

This study examined the use of a popular Automatic Speech Recognition (ASR), Google Voice Typing (GVT), to automatically assess English as second language pronunciation. It aimed to answer the following question: What is the relationship between GVT-rated scores and human-rated scores? To answer this question, we compared audio recordings of 56…

Descriptors: Teaching Methods, Computer Software, Pronunciation, Second Language Learning

A Comparison of Holistic, Analytic, and Part Marking Models in Speaking Assessment

Peer reviewed

Direct link

Khabbazbashi, Nahal; Galaczi, Evelina D. – Language Testing, 2020

This mixed methods study examined holistic, analytic, and part marking models (MMs) in terms of their measurement properties and impact on candidate CEFR classifications in a semi-direct online speaking test. Speaking performances of 240 candidates were first marked holistically and by part (phase 1). On the basis of phase 1 findings--which…

Descriptors: Holistic Approach, Classification, Grading, Language Tests

Assessing L2 English Speaking Using Automated Scoring Technology: Examining Automarker Reliability

Peer reviewed

Direct link

Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021

Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…

Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software

Previous Page | Next Page »

Pages: 1 | 2 | 3

Language Assessment Quarterly	7
Language Testing	7
ProQuest LLC	3
ETS Research Report Series	2
International Journal of…	2
Language Learning	2
Language Testing in Asia	2
Applied Linguistics	1
Assessment in Education:…	1
Canadian Modern Language…	1
ELT Journal	1
Educational and Psychological…	1
English Language Teaching	1
Grantee Submission	1
Innovation in Language…	1
International Educational…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Interpreter and Translator…	1
JALT CALL Journal	1
Language Teaching Research…	1
Language and Education	1
Language and Intercultural…	1
Research-publishing.net	1
More ▼

Lamprianou, Iasonas	2
Ahmadi, Alireza	1
Ahmet Can Uyar	1
Akbari, Alireza	1
Alexander James Kwako	1
Allen, Laura K.	1
Amory, Michael	1
Attali, Yigal	1
Brannen, Kathleen	1
Breyer, F. Jay	1
Briggs, Sarah L.	1
Brooks, Rachel Lunde	1
Brown, Anne	1
Cardoso, Walcir	1
Chukharev-Hudilainen, Evgeny	1
Cots, Josep M.	1
Crossley, Scott A.	1
Dai, David Wei	1
Davis, James R.	1
Dey, Prasenjit	1
Dilek Büyükahiska	1
Dwivedi, Utkarsh	1
Fernandez, Miguel	1
Galaczi, Evelina	1
Galaczi, Evelina D.	1
More ▼