ERIC - Search Results

Showing 1 to 15 of 25 results

Do Source Use Features Impact Raters' Judgment of Argumentation? An Experimental Study

Peer reviewed

Ping-Lin Chuang – Language Testing, 2025

This experimental study explores how source use features impact raters' judgment of argumentation in a second language (L2) integrated writing test. One hundred four experienced and novice raters were recruited to complete a rating task that simulated the scoring assignment of a local English Placement Test (EPT). Sixty written responses were…

Descriptors: Interrater Reliability, Evaluators, Information Sources, Primary Sources

Rater Cognitive Processes in Integrated Writing Tasks: From the Perspective of Problem-Solving

Peer reviewed

Jia, Wenfeng; Zhang, Peixin – Language Testing in Asia, 2023

It is widely believed that raters' cognition is an important aspect of writing assessment, as it has both logical and temporal priority over scores. Based on a critical review of previous research in this area, it is found that raters' cognition can be boiled down to two fundamental issues: building text images and strategies for articulating scores.…

Descriptors: Problem Solving, Cognitive Processes, Writing Evaluation, Evaluators

How Many Raters Can Be Enough: G Theory Applied to Assessment and Measurement of L2 Speech Perception

Peer reviewed
PDF on ERIC

Kevin Hirschi; Okim Kang – Language Teaching Research Quarterly, 2023

This paper extends the use of Generalizability Theory to the measurement of extemporaneous L2 speech through the lens of speech perception. Using six datasets of previous studies, it reports on "G studies"--a method of breaking down measurement variance--and "D studies"--a predictive study of the impact on reliability when…

Descriptors: Evaluators, Generalization, Evaluation Methods, Speech Communication

Comprehensible to Whom? Examining Rater, Speaker, and Interlocutor Perspectives on Comprehensibility in an Interactive Context

Peer reviewed

Nagle, Charlie L.; Trofimovich, Pavel; O'Brien, Mary Grantham; Kennedy, Sara – Modern Language Journal, 2022

Comprehensibility has emerged as a useful and intuitive means of globally evaluating second language (L2) speakers in many research and instructional contexts. In most cases, L2 speakers' comprehensibility is assessed by external listeners who do not engage in extensive communication with the speakers, even though the degree to which a speaker is…

Descriptors: Evaluators, Intelligibility, Pronunciation, Task Analysis

Automated Assessment of Second Language Comprehensibility: Review, Training, Validation, and Generalization Studies

Peer reviewed

Saito, Kazuya; Macmillan, Konstantinos; Kachlicka, Magdalena; Kunihara, Takuya; Minematsu, Nobuaki – Studies in Second Language Acquisition, 2023

Whereas many scholars have emphasized the relative importance of "comprehensibility" as an ecologically valid goal for L2 speech training, testing, and development, eliciting listeners' judgments is time-consuming. Following calls for research on more efficient L2 speech rating methods in applied linguistics, and growing attention toward…

Descriptors: Second Language Learning, Second Language Instruction, Interrater Reliability, Speech Communication

Performance-Based Speaking Tests: Possibilities in Local Language Testing

Peer reviewed
PDF on ERIC

Dimova, Slobodanka – Language Teaching Research Quarterly, 2022

Drawing on Glenn Fulcher's extensive work in performance-based language assessment of speaking, this paper explores the assessment of L2 speaking ability in local language testing contexts. For that purpose, I review Fulcher's influential work that highlights the relationship between the speaking construct, the task, the performance, and the…

Descriptors: Language Tests, Speech Communication, Performance Based Assessment, Second Language Learning

Can Automated Machine Translation Evaluation Metrics Be Used to Assess Students' Interpretation in the Language Learning Classroom?

Peer reviewed

Han, Chao; Lu, Xiaolei – Computer Assisted Language Learning, 2023

The use of translation and interpreting (T&I) in the language learning classroom is commonplace, serving various pedagogical and assessment purposes. Previous utilization of T&I exercises is driven largely by their potential to enhance language learning, whereas the latest trend has begun to underscore T&I as a crucial skill to be…

Descriptors: Translation, Computational Linguistics, Correlation, Language Processing

Effects of Second Language Pronunciation Teaching Revisited: A Proposed Measurement Framework and Meta-Analysis

Peer reviewed

Saito, Kazuya; Plonsky, Luke – Language Learning, 2019

We propose a new framework for conceptualizing measures of instructed second language (L2) pronunciation performance according to three sets of parameters: (a) the constructs (focused on global vs. specific aspects of pronunciation), (b) the scoring method (human raters vs. acoustic analyses), and (c) the type of knowledge elicited (controlled vs.…

Descriptors: Second Language Learning, Second Language Instruction, Scoring, Pronunciation Instruction

The Processes of Rating L2 Speaking Performance Using an Analytic Rating Scale -- A Qualitative Exploration

Peer reviewed
PDF on ERIC

Thai, Thuy; Sheehan, Susan – Language Education & Assessment, 2022

In language performance tests, raters are important as their scoring decisions determine which aspects of performance the scores represent; however, raters are considered as one of the potential sources contributing to unwanted variability in scores (Davis, 2012). Although a great number of studies have been conducted to unpack how rater…

Descriptors: Rating Scales, Speech Communication, Second Language Learning, Second Language Instruction

Building an Initial Validity Argument for Binary and Analytic Rating Scales for an EFL Classroom Writing Assessment: Evidence from Many-Facets Rasch Measurement

Peer reviewed
PDF on ERIC

Khamboonruang, Apichat – rEFLections, 2022

Although much research has compared the functioning between analytic and holistic rating scales, little research has compared the functioning of binary rating scales with other types of rating scales. This quantitative study set out to preliminarily and comparatively validate binary and analytic rating scales intended for use in formative…

Descriptors: Writing Evaluation, Evaluation Methods, Second Language Learning, Second Language Instruction

A Comparative Judgment Approach to Assessing Chinese Sign Language Interpreting

Peer reviewed

Han, Chao; Xiao, Xiaoyan – Language Testing, 2022

The quality of sign language interpreting (SLI) is a gripping construct among practitioners, educators and researchers, calling for reliable and valid assessment. There has been a diverse array of methods in the extant literature to measure SLI quality, ranging from traditional error analysis to recent rubric scoring. In this study, we want to…

Descriptors: Comparative Analysis, Sign Language, Deaf Interpreting, Evaluators

Using Native-Speaker Psycholinguistic Norms to Predict Lexical Proficiency and Development in Second-Language Production

Peer reviewed

Berger, Cynthia M.; Crossley, Scott A.; Kyle, Kristopher – Applied Linguistics, 2019

A large data set of L1 psycholinguistic norms (Balota et al. 2007) was used to assess spoken L2 English lexical proficiency in cross-sectional and longitudinal learner corpora. Behavioral norms included lexical decision and word naming latencies (i.e. reaction times) and accuracies for 40,481 English words. A frequency measure was…

Descriptors: Psycholinguistics, Native Language, Second Language Learning, Case Studies

Comparison of Automatic and Expert Teachers' Rating of Computerized English Listening-Speaking Test

Peer reviewed
PDF on ERIC

Linlin, Cao – English Language Teaching, 2020

Through Many-Facet Rasch analysis, this study explores the rating differences between one automatic computer rater and five expert teacher raters in scoring 119 students on a computerized English listening-speaking test. Results indicate that both the automatic rater and the teacher raters demonstrate good inter-rater reliability, though the automatic rater…

Descriptors: Language Tests, Computer Assisted Testing, English (Second Language), Second Language Learning

Rater Reliability and Score Discrepancy under Holistic and Analytic Scoring of Second Language Writing

Peer reviewed

Zhang, Bo; Xiao, Yunnan; Luo, Juan – Language Testing in Asia, 2015

Previous studies comparing holistic scoring to analytic scoring of second language writing have given mixed results. Some of them suffer from methodological drawbacks, such as limited writing sample size, limited number of raters, and lack of direct comparison of the two methods. Based on 300 writing samples graded by 14 raters, this research…

Descriptors: Evaluators, Reliability, Scores, Holistic Approach

Functional Adequacy in L2 Writing: Towards a New Rating Scale

Peer reviewed

Kuiken, Folkert; Vedder, Ineke – Language Testing, 2017

The importance of functional adequacy as an essential component of L2 proficiency has been observed by several authors (Pallotti, 2009; De Jong, Steinel, Florijn, Schoonen, & Hulstijn, 2012a, b). The rationale underlying the present study is that the assessment of writing proficiency in L2 is not fully possible without taking into account the…

Descriptors: Second Language Learning, Rating Scales, Computational Linguistics, Persuasive Discourse
