ERIC - Search Results

Publication Date

In 2025	11
Since 2024	19
Since 2021 (last 5 years)	70
Since 2016 (last 10 years)	182
Since 2006 (last 20 years)	281

Descriptor

English (Second Language)	285
Evaluators	285
Second Language Learning	285
Foreign Countries	161
Second Language Instruction	148
Language Tests	140
Language Proficiency	96
Comparative Analysis	77
Scores	71
Writing Evaluation	71
Oral Language	67
Pronunciation	62
Correlation	59
Speech Communication	58
Native Speakers	57
Native Language	50
Teaching Methods	49
Scoring	47
Essays	46
Rating Scales	46
Undergraduate Students	46
Language Teachers	44
Statistical Analysis	43
Interrater Reliability	42
College Students	41
More ▼

Publication Type

Journal Articles	263
Reports - Research	244
Tests/Questionnaires	40
Reports - Evaluative	15
Dissertations/Theses -…	14
Reports - Descriptive	8
Information Analyses	3
Speeches/Meeting Papers	3
Books	1
Collected Works - General	1
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Opinion Papers	1
More ▼

Education Level

Higher Education	133
Postsecondary Education	104
Secondary Education	18
High Schools	6
Adult Education	5
Elementary Education	5
Grade 2	2
Early Childhood Education	1
Elementary Secondary Education	1
Grade 1	1
Grade 11	1
Grade 12	1
Grade 3	1
Junior High Schools	1
Kindergarten	1
Middle Schools	1
Primary Education	1
More ▼

Audience

Practitioners	1
Teachers	1

Location

China	24
Japan	20
Iran	17
Australia	12
Turkey	12
Europe	10
Hong Kong	8
Canada	5
India	5
South Korea	5
Taiwan	5
United Kingdom	5
Indonesia	4
Singapore	4
Spain	4
Thailand	4
California	3
Canada (Montreal)	3
New Zealand	3
United States	3
Vietnam	3
Germany	2
Iran (Tehran)	2
Japan (Tokyo)	2
New York (New York)	2
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

Test of English as a Foreign…	35
International English…	21
Test of English for…	7
Flesch Kincaid Grade Level…	2
Foreign Language Classroom…	1
Graduate Record Examinations	1
Modern Language Aptitude Test	1
Program for International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 285 results Save | Export

Exploring Individual Differences in Rating Second Language Speech: Rater's Language Aptitude, Major, Accent Familiarity, and Attitudes

Peer reviewed

Direct link

Kahng, Jimin – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2023

This study is the first attempt to explore the relationship between rater variables focusing on raters' language aptitude and their judgments of second language (L2) speech. Thirty-four English listeners rated 65 spontaneous native and nonnative speech samples for comprehensibility, accentedness, and fluency. They also completed the LLAMA language…

Descriptors: Evaluators, Second Language Learning, Language Tests, Language Fluency

Revisiting Raters' Accent Familiarity in Speaking Tests: Evidence That Presentation Mode Interacts with Accent Familiarity to Variably Affect Comprehensibility Ratings

Peer reviewed

Direct link

Michael D. Carey; Stefan Szocs – Language Testing, 2024

This controlled experimental study investigated the interaction of variables associated with rating the pronunciation component of high-stakes English-language-speaking tests such as IELTS and TOEFL iBT. One hundred experienced raters who were all either familiar or unfamiliar with Brazilian-accented English or Papua New Guinean Tok Pisin-accented…

Descriptors: Dialects, Pronunciation, Suprasegmentals, Familiarity

Examining the Effect of Item Difficulty and Rater Leniency on Iranian Test Takers' Performance on WDCT and DSAT: A Comparative Study

Peer reviewed
PDF on ERIC

Download full text

Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025

The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…

Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction

Scoring Difficulty in Summary Writing Assessment: Toward the Reconstruction of Analytic Rubric

Peer reviewed
PDF on ERIC

Download full text

Makiko Kato – Journal of Education and Learning, 2025

This study aims to examine whether differences exist in the factors influencing the difficulty of scoring English summaries and determining scores based on the raters' attributes, and to collect candid opinions, considerations, and tentative suggestions for future improvements to the analytic rubric of summary writing for English learners. In this…

Descriptors: Writing Evaluation, Scoring, Writing Skills, English (Second Language)

Assessing the Content Quality of Essays in Content and Language Integrated Learning: Exploring the Construct from Subject Specialists' Perspectives

Peer reviewed

Direct link

Takanori Sato – Language Testing, 2024

Assessing the content of learners' compositions is a common practice in second language (L2) writing assessment. However, the construct definition of content in L2 writing assessment potentially underrepresents the target competence in content and language integrated learning (CLIL), which aims to foster not only L2 proficiency but also critical…

Descriptors: Language Tests, Content and Language Integrated Learning, Writing Evaluation, Writing Tests

Utilizing Large Language Models for EFL Essay Grading: An Examination of Reliability and Validity in Rubric-Based Assessments

Peer reviewed

Direct link

Fatih Yavuz; Özgür Çelik; Gamze Yavas Çelik – British Journal of Educational Technology, 2025

This study investigates the validity and reliability of generative large language models (LLMs), specifically ChatGPT and Google's Bard, in grading student essays in higher education based on an analytical grading rubric. A total of 15 experienced English as a foreign language (EFL) instructors and two LLMs were asked to evaluate three student…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Computational Linguistics

Making Each Point Count: Revising a Local Adaptation of the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE Rubric

Peer reviewed

Direct link

Yu-Tzu Chang; Ann Tai Choe; Daniel Holden; Daniel R. Isbell – Language Testing, 2024

In this Brief Report, we describe an evaluation of and revisions to a rubric adapted from the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE, with four rubric categories and 20-point rating scales, in the context of an intensive English program writing placement test. Analysis of 4 years of rating data (2016-2021, including 434 essays) using…

Descriptors: Language Tests, Rating Scales, Second Language Learning, English (Second Language)

Rater Cognitive Processes in Integrated Writing Tasks: From the Perspective of Problem-Solving

Peer reviewed

Direct link

Jia, Wenfeng; Zhang, Peixin – Language Testing in Asia, 2023

It is widely believed that raters' cognition is an important aspect of writing assessment, as it has both logical and temporal priority over scores. Based on a critical review of previous research in this area, it is found that raters' cognition can be boiled to two fundamental issues: building text images and strategies for articulating scores.…

Descriptors: Problem Solving, Cognitive Processes, Writing Evaluation, Evaluators

The Visual Signature of Non-Understanding: A Systematic Replication of McDonough, Trofimovich, Lu, and Abashidze (2019)

Peer reviewed

Direct link

McDonough, Kim; Lindberg, Rachael; Trofimovich, Pavel; Tekin, Oguzhan – Language Teaching, 2023

This replication study seeks to extend the generalizability of an exploratory study (McDonough et al., 2019) that identified holds (i.e., temporary cessation of dynamic movement by the listener) as a reliable visual cue of non-understanding. Conversations between second language (L2) English speakers in the Corpus of English as a Lingua Franca…

Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computational Linguistics

Impact of Self-Construal on Rater Severity in Peer Assessments of Oral Presentations

Peer reviewed

Direct link

Tanaka, Mitsuko; Ross, Steven J. – Assessment in Education: Principles, Policy & Practice, 2023

Raters vary from each other in their severity and leniency in rating performance. This study examined the factors affecting rater severity in peer assessments of oral presentations in English as a Foreign Language (EFL), focusing on peer raters' self-construal and presentation abilities. Japanese university students enrolled in EFL classes…

Descriptors: Evaluators, Interrater Reliability, Item Response Theory, Peer Evaluation

Revisiting the Effectiveness of a Performance Decision Tree-Style Rubric Compared to a Grid-Style Rubric

Peer reviewed

Direct link

Yuichiro Yokouchi – Language Testing in Asia, 2025

The performance decision tree (PDT; Fulcher et al., 2011) is a rubric style that is applicable to performance assessment, with origins in Upshur and Turner's (1995) empirically derived binary-choice, boundary-definition (EBB) scale. It is easier for raters to assess performance by evaluating multiple binary-choice descriptors. Additionally,…

Descriptors: Scoring Rubrics, Second Language Learning, Second Language Instruction, Language Teachers

"How Do Raters Learn to Rate?" Many-Facet Rasch Modeling of Rater Performance over the Course of a Rater Certification Program

Peer reviewed

Direct link

Yan, Xun; Chuang, Ping-Lin – Language Testing, 2023

This study employed a mixed-methods approach to examine how rater performance develops during a semester-long rater certification program for an English as a Second Language (ESL) writing placement test at a large US university. From 2016 to 2018, we tracked three groups of novice raters (n = 30) across four rounds in the certification program.…

Descriptors: Evaluators, Interrater Reliability, Item Response Theory, Certification

Exploring Potential Biases in GPT-4o's Ratings of English Language Learners' Essays

Peer reviewed

Direct link

Taichi Yamashita – Language Testing, 2025

With the rapid development of generative artificial intelligence (AI) frameworks (e.g., the generative pre-trained transformer [GPT]), a growing number of researchers have started to explore its potential as an automated essay scoring (AES) system. While previous studies have investigated the alignment between human ratings and GPT ratings, few…

Descriptors: Artificial Intelligence, English (Second Language), Second Language Learning, Second Language Instruction

The Intersection of AI and Language Assessment: A Study on the Reliability of ChatGPT in Grading IELTS Writing Task 2

Peer reviewed
PDF on ERIC

Download full text

Osama Koraishi – Language Teaching Research Quarterly, 2024

This study conducts a comprehensive quantitative evaluation of OpenAI's language model, ChatGPT 4, for grading Task 2 writing of the IELTS exam. The objective is to assess the alignment between ChatGPT's grading and that of official human raters. The analysis encompassed a multifaceted approach, including a comparison of means and reliability…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Artificial Intelligence

Meta-Analysis of Inter-Rater Agreement and Discrepancy Between Human and Automated English Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Direct link

Jiyeo Yun – English Teaching, 2023

Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…

Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 19

Language Testing	39
Language Assessment Quarterly	22
ProQuest LLC	14
Language Testing in Asia	13
ETS Research Report Series	11
English Language Teaching	11
TESOL Quarterly: A Journal…	9
Studies in Second Language…	8
Computer Assisted Language…	5
Journal of Pan-Pacific…	5
Online Submission	5
Advances in Language and…	4
Assessment in Education:…	4
Reading Matrix: An…	4
TESL Canada Journal	4
Australian Review of Applied…	3
Journal of English as an…	3
Journal of Multilingual and…	3
Language Awareness	3
Language Teaching Research	3
Modern Language Journal	3
System: An International…	3
Taiwan Journal of TESOL	3
Applied Linguistics	2
Bilingualism: Language and…	2
More ▼

Trofimovich, Pavel	10
Saito, Kazuya	8
McDonough, Kim	5
Pill, John	5
Isaacs, Talia	4
Xi, Xiaoming	4
Alemi, Minoo	3
Barkaoui, Khaled	3
Kang, Okim	3
McNamara, Tim	3
Shintani, Natsuko	3
Winke, Paula	3
Abashidze, Dato	2
Ahmadi, Alireza	2
Bridgeman, Brent	2
Coniam, David	2
Cots, Josep M.	2
Crossley, Scott A.	2
Davis, Larry	2
Elder, Catherine	2
Galaczi, Evelina	2
Gass, Susan	2
Han, Chao	2
Han, Turgay	2
Harding, Luke	2
More ▼