Publication Date
In 2025 | 1 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 20 |
Since 2016 (last 10 years) | 48 |
Since 2006 (last 20 years) | 76 |
Descriptor
Correlation | 76 |
Evaluators | 76 |
Second Language Learning | 76 |
English (Second Language) | 59 |
Language Tests | 39 |
Foreign Countries | 38 |
Second Language Instruction | 36 |
Scores | 26 |
Writing Evaluation | 22 |
Language Proficiency | 21 |
Speech Communication | 21 |
More ▼ |
Source
Author
Trofimovich, Pavel | 6 |
Saito, Kazuya | 4 |
Bridgeman, Brent | 2 |
Coniam, David | 2 |
Han, Chao | 2 |
Isaacs, Talia | 2 |
Kuiken, Folkert | 2 |
Kunnan, Antony John | 2 |
McDonough, Kim | 2 |
Vedder, Ineke | 2 |
Xi, Xiaoming | 2 |
More ▼ |
Publication Type
Journal Articles | 69 |
Reports - Research | 66 |
Tests/Questionnaires | 9 |
Dissertations/Theses -… | 5 |
Reports - Evaluative | 4 |
Information Analyses | 2 |
Speeches/Meeting Papers | 2 |
Education Level
Audience
Location
China | 6 |
Canada | 4 |
Hong Kong | 4 |
Iran | 4 |
Europe | 3 |
Japan | 3 |
Australia | 2 |
Singapore | 2 |
Turkey | 2 |
United Kingdom | 2 |
United States | 2 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 9 |
International English… | 7 |
Flesch Kincaid Grade Level… | 1 |
Foreign Language Classroom… | 1 |
What Works Clearinghouse Rating
Osama Koraishi – Language Teaching Research Quarterly, 2024
This study conducts a comprehensive quantitative evaluation of OpenAI's language model, ChatGPT 4, for grading Task 2 writing of the IELTS exam. The objective is to assess the alignment between ChatGPT's grading and that of official human raters. The analysis encompassed a multifaceted approach, including a comparison of means and reliability…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Artificial Intelligence
Jiyeo Yun – English Teaching, 2023
Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…
Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring
Ogawa, Chie – Language Testing in Asia, 2022
This study explored two assessment approaches to oral performances: analytical complexity, accuracy, and fluency (CAF) indices and human raters' evaluations. CAF indices are frequently used in second-language speaking (L2) research; however, because tasks are communicative and goal-oriented, the degree to which students achieve such communicative…
Descriptors: Oral Language, Evaluators, Audio Equipment, Accuracy
Saito, Kazuya; Macmillan, Konstantinos; Kachlicka, Magdalena; Kunihara, Takuya; Minematsu, Nobuaki – Studies in Second Language Acquisition, 2023
Whereas many scholars have emphasized the relative importance of "comprehensibility" as an ecologically valid goal for L2 speech training, testing, and development, eliciting listeners' judgments is time-consuming. Following calls for research on more efficient L2 speech rating methods in applied linguistics, and growing attention toward…
Descriptors: Second Language Learning, Second Language Instruction, Interrater Reliability, Speech Communication
Vasfiye Geçkin; Ebru Kiziltas; Çagatay Çinar – Journal of Educational Technology and Online Learning, 2023
The quality of writing in a second language (L2) is one of the indicators of the level of proficiency for many college students to be eligible for departmental studies. Although certain software programs, such as Intelligent Essay Assessor or IntelliMetric, have been introduced to evaluate second-language writing quality, an overall assessment of…
Descriptors: Writing Evaluation, Second Language Learning, Second Language Instruction, Language Proficiency
Yao Lu; Ksenia Gnevsheva – Journal of Multilingual and Multicultural Development, 2024
Previous research that explores the effect of ethnicity in the perception of speaker accentedness and personality traits often finds that Asian appearance contributes to a more accented and less competent impression. Importantly, most of the work done to date employed only Caucasian first language-speaking listeners; moreover, ethnicity and gender…
Descriptors: Pronunciation, Gender Differences, Personality Traits, Korean
Seedhouse, Paul – ELT Journal, 2019
This article investigates the central role of topic in the IELTS Speaking Test (IST). Topic has developed a dual personality in this interactional setting: topic-as-script is the scripted statement of topic on the examiner's cards prior to the interaction, whereas topic-as-action is how topic is developed by the candidate during the course of the…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Personality Traits
Ritz, Catherine; Sherf, Nicole – Foreign Language Annals, 2022
This large-scale study used a survey to collect data on K-12 world language program leadership and instructional practices in Massachusetts public schools, investigating associations between the presence of a program leader or primary evaluator who is a world language specialist and instructional practices, curriculum, and assessment. The study…
Descriptors: Second Language Programs, Second Language Learning, Second Language Instruction, Kindergarten
Dimova, Slobodanka – Language Teaching Research Quarterly, 2022
Drawing on Glenn Fulcher's extensive work in performance-based language assessment of speaking, this paper explores the assessment of L2 speaking ability in local language testing contexts. For that purpose, I review Fulcher's influential work that highlights the relationship between the speaking construct, the task, the performance, and the…
Descriptors: Language Tests, Speech Communication, Performance Based Assessment, Second Language Learning
Ren, Rong – ProQuest LLC, 2022
This study examined how L2 English speakers interpreted the notion of native English speakers (NESs) and nonnative English speakers (NNESs) and whether nativeness would influence their self-perception and speech production. It aimed at filling the following research gaps. First, limited studies have explored how L2 English speakers view the other…
Descriptors: Foreign Students, English (Second Language), Second Language Learning, Native Speakers
Youn, Soo Jung – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2020
This study explicates the nature of second language (L2) pragmatic interaction focusing on the quantitative function of interactional features. A relationship between the fine-grained interactional features elicited from learners' role-play performances at varying levels and trained raters' scores was investigated. The corpus of 102 learners'…
Descriptors: Role Playing, Pragmatics, Speech Communication, Computational Linguistics
Li, Shuai; Wen, Ting; Li, Xian; Feng, Yali; Lin, Chuan – Language Testing, 2023
This study compared holistic and analytic marking methods for their effects on parameter estimation (of examinees, raters, and items) and rater cognition in assessing speech act production in L2 Chinese. Seventy American learners of Chinese completed an oral Discourse Completion Test assessing requests and refusals. Four first-language (L1)…
Descriptors: Speech Acts, Second Language Learning, Second Language Instruction, Chinese
Johnson, Carol; Cardoso, Walcir; Zuercher, Beau; Brannen, Kathleen; Springer, Suzanne – Research-publishing.net, 2022
This study examined the use of a popular Automatic Speech Recognition (ASR), Google Voice Typing (GVT), to automatically assess English as second language pronunciation. It aimed to answer the following question: What is the relationship between GVT-rated scores and human-rated scores? To answer this question, we compared audio recordings of 56…
Descriptors: Teaching Methods, Computer Software, Pronunciation, Second Language Learning
Khabbazbashi, Nahal; Galaczi, Evelina D. – Language Testing, 2020
This mixed methods study examined holistic, analytic, and part marking models (MMs) in terms of their measurement properties and impact on candidate CEFR classifications in a semi-direct online speaking test. Speaking performances of 240 candidates were first marked holistically and by part (phase 1). On the basis of phase 1 findings--which…
Descriptors: Holistic Approach, Classification, Grading, Language Tests
Ma, Wenyue; Winke, Paula – Language Assessment Quarterly, 2022
The factors that influence rater scoring have been a subject of great interest to researchers in second language assessment. However, the research on the impact of test-takers' speech profiles (e.g., a jagged or a flat profile reflecting analytic subscores) on raters' scoring behaviors remains to be seen. To investigate the role of speech profiles…
Descriptors: Language Tests, Second Language Learning, Speech Communication, Profiles