Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 9 |
| Since 2017 (last 10 years) | 13 |
| Since 2007 (last 20 years) | 23 |
Descriptor
| Computer Software | 23 |
| Evaluators | 23 |
| Language Tests | 23 |
| Second Language Learning | 21 |
| English (Second Language) | 18 |
| Language Proficiency | 13 |
| Computer Assisted Testing | 11 |
| Foreign Countries | 10 |
| Scores | 10 |
| Scoring | 10 |
| Comparative Analysis | 9 |
Author
| Bridgeman, Brent | 2 |
| Zechner, Klaus | 2 |
| Ahmet Can Uyar | 1 |
| Ahn, Soojin | 1 |
| Akbari, Alireza | 1 |
| Bagheri, Mohammad Sadegh | 1 |
| Bejar, Isaac I. | 1 |
| Brannen, Kathleen | 1 |
| Breyer, F. Jay | 1 |
| Cardoso, Walcir | 1 |
| Chung, Eun Seon | 1 |
Publication Type
| Journal Articles | 21 |
| Reports - Research | 21 |
| Tests/Questionnaires | 5 |
| Dissertations/Theses -… | 1 |
| Numerical/Quantitative Data | 1 |
| Reports - Descriptive | 1 |
| Speeches/Meeting Papers | 1 |
Education Level
| Higher Education | 10 |
| Postsecondary Education | 9 |
| Secondary Education | 2 |
| High Schools | 1 |
Assessments and Surveys
| Test of English as a Foreign… | 6 |
| International English… | 4 |
| ACTFL Oral Proficiency… | 1 |
| Test of English for… | 1 |
Osama Koraishi – Language Teaching Research Quarterly, 2024
This study conducts a comprehensive quantitative evaluation of OpenAI's language model, ChatGPT 4, for grading Task 2 writing of the IELTS exam. The objective is to assess the alignment between ChatGPT's grading and that of official human raters. The analysis encompassed a multifaceted approach, including a comparison of means and reliability…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Artificial Intelligence
Paquot, Magali; Rubin, Rachel; Vandeweerd, Nathan – Language Learning, 2022
The main objective of this Methods Showcase Article is to show how the technique of adaptive comparative judgment, coupled with a crowdsourcing approach, can offer practical solutions to reliability issues as well as to address the time and cost difficulties associated with a text-based approach to proficiency assessment in L2 research. We…
Descriptors: Comparative Analysis, Decision Making, Language Proficiency, Reliability
Ahmet Can Uyar; Dilek Büyükahiska – International Journal of Assessment Tools in Education, 2025
This study explores the effectiveness of using ChatGPT, an Artificial Intelligence (AI) language model, as an Automated Essay Scoring (AES) tool for grading English as a Foreign Language (EFL) learners' essays. The corpus consists of 50 essays representing various types including analysis, compare and contrast, descriptive, narrative, and opinion…
Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, Teaching Methods
Yuko Hayashi; Yusuke Kondo; Yutaka Ishii – Innovation in Language Learning and Teaching, 2024
Purpose: This study builds a new system for automatically assessing learners' speech elicited from an oral discourse completion task (DCT), and evaluates the prediction capability of the system with a view to better understanding factors deemed influential in predicting speaking proficiency scores and the pedagogical implications of the system.…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Japanese
Na Liu; Jeferd Saong – International Journal of Education and Literacy Studies, 2025
The study examined the use of the Oral Proficiency Interview (OPI) in assessing Chinese as a second language and the challenges faced by teachers in the Chinese Language Scholarship (CLS) program. The Concurrent Triangulation Mixed Method Research was used in the study in which qualitative and quantitative data are collected simultaneously,…
Descriptors: Chinese, Second Language Learning, Oral Language, Language Proficiency
Akbari, Alireza; Shahnazari, Mohammadtaghi – Language Testing in Asia, 2019
The present research paper introduces a translation evaluation method called Calibrated Parsing Items Evaluation (CPIE hereafter). This evaluation method maximizes translators' performance through identifying the parsing items with an optimal p-docimology and d-index (item discrimination). This method checks all the possible parses (annotations)…
Descriptors: Test Items, Translation, Computer Software, Evaluators
J. Dylan Burton – Language Testing, 2024
Nonverbal behavior can impact language proficiency scores in speaking tests, but there is little empirical information of the size or consistency of its effects or whether language proficiency may be a moderating variable. In this study, 100 novice raters watched and scored 30 recordings of test takers taking an international, high stakes…
Descriptors: Nonverbal Ability, Language Fluency, Second Language Learning, Language Proficiency
Johnson, Carol; Cardoso, Walcir; Zuercher, Beau; Brannen, Kathleen; Springer, Suzanne – Research-publishing.net, 2022
This study examined the use of a popular Automatic Speech Recognition (ASR), Google Voice Typing (GVT), to automatically assess English as second language pronunciation. It aimed to answer the following question: What is the relationship between GVT-rated scores and human-rated scores? To answer this question, we compared audio recordings of 56…
Descriptors: Teaching Methods, Computer Software, Pronunciation, Second Language Learning
Han, Chao; Lu, Xiaolei – Computer Assisted Language Learning, 2023
The use of translation and interpreting (T&I) in the language learning classroom is commonplace, serving various pedagogical and assessment purposes. Previous utilization of T&I exercises is driven largely by their potential to enhance language learning, whereas the latest trend has begun to underscore T&I as a crucial skill to be…
Descriptors: Translation, Computational Linguistics, Correlation, Language Processing
Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021
Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software
Chung, Eun Seon; Ahn, Soojin – Computer Assisted Language Learning, 2022
Many studies that have investigated the educational value of online machine translation (MT) in second language (L2) writing generally report significant improvements after MT use, but no study as of yet has comprehensively analyzed the effectiveness of MT use in terms of various measures in syntactic complexity, accuracy, lexical complexity, and…
Descriptors: Translation, Computational Linguistics, English (Second Language), Second Language Learning
Gu, Lin; Davis, Larry; Tao, Jacob; Zechner, Klaus – Assessment in Education: Principles, Policy & Practice, 2021
Recent technology advancements have increased the prospects for automated spoken language technology to provide feedback on speaking performance. In this study we examined user perceptions of using an automated feedback system for preparing for the TOEFL iBT® test. Test takers and language teachers evaluated three types of machine-generated…
Descriptors: Audio Equipment, Test Preparation, Feedback (Response), Scores
Linlin, Cao – English Language Teaching, 2020
Through Many-Facet Rasch analysis, this study explores the rating differences between 1 computer automatic rater and 5 expert teacher raters on scoring 119 students in a computerized English listening-speaking test. Results indicate that both automatic and the teacher raters demonstrate good inter-rater reliability, though the automatic rater…
Descriptors: Language Tests, Computer Assisted Testing, English (Second Language), Second Language Learning
Ockey, Gary J.; Papageorgiou, Spiros; French, Robert – International Journal of Listening, 2016
This article reports on a study which aimed to determine the effect of strength of accent on listening comprehension of interactive lectures. Test takers (N = 21,726) listened to an interactive lecture given by one of nine speakers and responded to six comprehension items. The test taker responses were analyzed with the Rasch computer program…
Descriptors: Pronunciation, Listening Comprehension, Lecture Method, Computer Software
Cox, Troy L. – ProQuest LLC, 2013
Speaking assessments for second language learners have traditionally been expensive to administer because of the cost of rating the speech samples. To reduce the cost, many researchers are investigating the potential of using automatic speech recognition (ASR) as a means to score examinee responses to open-ended prompts. This study examined the…
Descriptors: Cues, Second Language Learning, English Language Learners, Language Tests
