Osama Koraishi – Language Teaching Research Quarterly, 2024
This study conducts a comprehensive quantitative evaluation of OpenAI's language model, ChatGPT 4, for grading Task 2 writing of the IELTS exam. The objective is to assess the alignment between ChatGPT's grading and that of official human raters. The analysis encompassed a multifaceted approach, including a comparison of means and reliability…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Artificial Intelligence
Paquot, Magali; Rubin, Rachel; Vandeweerd, Nathan – Language Learning, 2022
The main objective of this Methods Showcase Article is to show how the technique of adaptive comparative judgment, coupled with a crowdsourcing approach, can offer practical solutions to reliability issues as well as to address the time and cost difficulties associated with a text-based approach to proficiency assessment in L2 research. We…
Descriptors: Comparative Analysis, Decision Making, Language Proficiency, Reliability
Ahmet Can Uyar; Dilek Büyükahiska – International Journal of Assessment Tools in Education, 2025
This study explores the effectiveness of using ChatGPT, an Artificial Intelligence (AI) language model, as an Automated Essay Scoring (AES) tool for grading English as a Foreign Language (EFL) learners' essays. The corpus consists of 50 essays representing various types including analysis, compare and contrast, descriptive, narrative, and opinion…
Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, Teaching Methods
Vasfiye Geçkin; Ebru Kiziltas; Çagatay Çinar – Journal of Educational Technology and Online Learning, 2023
The quality of writing in a second language (L2) is one of the indicators of the level of proficiency for many college students to be eligible for departmental studies. Although certain software programs, such as Intelligent Essay Assessor or IntelliMetric, have been introduced to evaluate second-language writing quality, an overall assessment of…
Descriptors: Writing Evaluation, Second Language Learning, Second Language Instruction, Language Proficiency
Yuko Hayashi; Yusuke Kondo; Yutaka Ishii – Innovation in Language Learning and Teaching, 2024
Purpose: This study builds a new system for automatically assessing learners' speech elicited from an oral discourse completion task (DCT), and evaluates the prediction capability of the system with a view to better understanding factors deemed influential in predicting speaking proficiency scores and the pedagogical implications of the system.…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Japanese
Comparing Rating Modes: Analysing Live, Audio, and Video Ratings of IELTS Speaking Test Performances
Nakatsuhara, Fumiyo; Inoue, Chihiro; Taylor, Lynda – Language Assessment Quarterly, 2021
This mixed methods study compared IELTS examiners' scores when assessing spoken performances under live and two 'non-live' testing conditions using audio and video recordings. Six IELTS examiners assessed 36 test-takers' performances under the live, audio, and video rating conditions. Scores in the three rating modes were calibrated using the…
Descriptors: Video Technology, Audio Equipment, English (Second Language), Language Tests
Jin Soo Choi – ProQuest LLC, 2022
Nonverbal behavior is essential in human interaction (Gullberg, de Bot, & Volterra, 2008; McNeill, 1992, 2005). For second language speakers, nonverbal features can be helpful for successful and efficient communication (e.g., Dahl & Ludvigsen, 2014). However, due to the complexity of nonverbal features, language testing institutions have…
Descriptors: Language Tests, Language Proficiency, Videoconferencing, Second Language Learning
Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021
Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software
Susan Rowe – ProQuest LLC, 2023
This dissertation explored whether unnecessary linguistic complexity (LC) in mathematics and biology assessment items changes the direction and significance of differential item functioning (DIF) between two subgroups, emergent bilinguals (EBs) and English proficient students (EPs). Due to inconsistencies in measuring LC in items, Study One adapted a…
Descriptors: Difficulty Level, English for Academic Purposes, Second Language Learning, Second Language Instruction
Park, Mi Sun – Language Assessment Quarterly, 2020
In the present study, I examined the effects of rater characteristics, in particular, raters' familiarity with a foreign accent, on the assessment of second language (L2) pronunciation. Forty-three native English-speaking teachers were divided into three groups according to their reported types of familiarity with Korean accents: heritage,…
Descriptors: Evaluators, Familiarity, Second Language Learning, English (Second Language)
Nagao, Akiko – English Language Teaching, 2020
This study applied a Systemic Functional Linguistics (SFL) model to explore how 27 first-year university students in two different English proficiency groups improved their lexicogrammatical choices and metafunctions for writing analytical exposition essays during a 15-week course. To explore how "the teaching learning cycle" influences…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Teaching Methods
Llanes, Àngels; Cots, Josep M. – International Journal of Multilingualism, 2022
This study compares the language proficiency gains of two groups of students taking a business English course module in a bilingual university in Catalonia (Spain). Whereas one of these groups followed a 'translanguaging' or 'plurilingual' pedagogy, the other followed a strictly monolingual approach. Participants were 54 mostly Catalan/Spanish…
Descriptors: Language Proficiency, English (Second Language), Second Language Learning, Second Language Instruction
Huang, Lan-fen; Kubelec, Simon; Keng, Nicole; Hsu, Lung-hsun – Language Testing in Asia, 2018
Background: Although teachers of English are required to assess students' speaking proficiency in the Common European Framework of Reference for Languages (CEFR), their ability to rate is seldom evaluated. The application of descriptors in the assessment of English speaking on CEFR in the context of English as a foreign language has not often been…
Descriptors: Evaluators, Second Language Learning, Second Language Instruction, English (Second Language)
Kang, Okim; Rubin, Don; Kermad, Alyssa – Language Testing, 2019
Because judgments of non-native speech are closely tied to social biases, oral proficiency ratings are susceptible to error stemming from rater background and social attitudes. In the present study we seek first to estimate the variance attributable to rater background and attitudinal variables on novice raters' assessments of L2…
Descriptors: Evaluators, Second Language Learning, Language Tests, English (Second Language)
Li, Jiuliang – Language Assessment Quarterly, 2018
In language testing programs, different test forms are often used to administer the same test. Demonstrating the comparability of these forms is essential to avoid criticisms of potential test unfairness. However, studies with this objective are scarce. This study aims to investigate the extent to which the picture-prompt writing tasks of three…
Descriptors: Writing Tests, Language Tests, Check Lists, Culture Fair Tests