ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	13

Descriptor

Correlation	13
Evaluators	13
Second Language Learning	13
Language Tests	9
English (Second Language)	8
Language Proficiency	6
Scoring	6
Foreign Countries	5
Oral Language	5
Scores	5
Statistical Analysis	5
Native Language	4
Rating Scales	4
Second Language Instruction	4
Task Analysis	4
Writing Evaluation	4
Evaluation Criteria	3
Guidelines	3
Holistic Approach	3
Interrater Reliability	3
Language Fluency	3
Speech Communication	3
Undergraduate Students	3
Chinese	2
Classification	2
More ▼

Source

Language Testing

Publication Type

Journal Articles	13
Reports - Research	12
Tests/Questionnaires	3
Information Analyses	1
Reports - Evaluative	1

Education Level

Higher Education	5
Postsecondary Education	2

Audience

Location

China	2
Canada	1
Europe	1
Japan	1
Ohio	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Comparing Holistic and Analytic Marking Methods in Assessing Speech Act Production in L2 Chinese

Peer reviewed

Direct link

Li, Shuai; Wen, Ting; Li, Xian; Feng, Yali; Lin, Chuan – Language Testing, 2023

This study compared holistic and analytic marking methods for their effects on parameter estimation (of examinees, raters, and items) and rater cognition in assessing speech act production in L2 Chinese. Seventy American learners of Chinese completed an oral Discourse Completion Test assessing requests and refusals. Four first-language (L1)…

Descriptors: Speech Acts, Second Language Learning, Second Language Instruction, Chinese

A Comparison of Holistic, Analytic, and Part Marking Models in Speaking Assessment

Peer reviewed

Direct link

Khabbazbashi, Nahal; Galaczi, Evelina D. – Language Testing, 2020

This mixed methods study examined holistic, analytic, and part marking models (MMs) in terms of their measurement properties and impact on candidate CEFR classifications in a semi-direct online speaking test. Speaking performances of 240 candidates were first marked holistically and by part (phase 1). On the basis of phase 1 findings--which…

Descriptors: Holistic Approach, Classification, Grading, Language Tests

A Generalizability Theory Study of Optimal Measurement Design for a Summative Assessment of English/Chinese Consecutive Interpreting

Peer reviewed

Direct link

Han, Chao – Language Testing, 2019

Summative assessment of interpretation is widely conducted in interpreting courses/programs to inform high-stakes decision making, such as the selection, certification, and conferral of academic degrees. Yet there has been very limited empirical research to investigate the score dependability of summative interpretation assessment. The present…

Descriptors: Generalization, Decision Making, Summative Evaluation, Evaluators

The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

Peer reviewed

Direct link

Davis, Larry – Language Testing, 2016

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…

Descriptors: Evaluators, Oral Language, Scores, Language Tests

Functional Adequacy in L2 Writing: Towards a New Rating Scale

Peer reviewed

Direct link

Kuiken, Folkert; Vedder, Ineke – Language Testing, 2017

The importance of functional adequacy as an essential component of L2 proficiency has been observed by several authors (Pallotti, 2009; De Jong, Steinel, Florijn, Schoonen, & Hulstijn, 2012a, b). The rationale underlying the present study is that the assessment of writing proficiency in L2 is not fully possible without taking into account the…

Descriptors: Second Language Learning, Rating Scales, Computational Linguistics, Persuasive Discourse

Task and Rater Effects in L2 Speaking and Writing: A Synthesis of Generalizability Studies

Peer reviewed

Direct link

In'nami, Yo; Koizumi, Rie – Language Testing, 2016

We addressed Deville and Chalhoub-Deville's (2006), Schoonen's (2012), and Xi and Mollaun's (2006) call for research into the contextual features that are considered related to person-by-task interactions in the framework of generalizability theory in two ways. First, we quantitatively synthesized the generalizability studies to determine the…

Descriptors: Evaluators, Second Language Learning, Writing Skills, Oral Language

How Do Utterance Measures Predict Raters' Perceptions of Fluency in French as a Second Language?

Peer reviewed

Direct link

Préfontaine, Yvonne; Kormos, Judit; Johnson, Daniel Ezra – Language Testing, 2016

While the research literature on second language (L2) fluency is replete with descriptions of fluency and its influence with regard to English as an additional language, little is known about what fluency features influence judgments of fluency in L2 French. This study reports the results of an investigation that analyzed the relationship between…

Descriptors: Prediction, French, Second Language Learning, Evaluators

Grounding Lexical Diversity in Human Judgments

Peer reviewed

Direct link

Jarvis, Scott – Language Testing, 2017

The present study discusses the relevance of measures of lexical diversity (LD) to the assessment of learner corpora. It also argues that existing measures of LD, many of which have become specialized for use with language corpora, are fundamentally measures of lexical repetition, are based on an etic perspective of language, and lack construct…

Descriptors: Computational Linguistics, English (Second Language), Second Language Learning, Native Speakers

Rating Written Performance: What Do Raters Do and Why?

Peer reviewed

Direct link

Kuiken, Folkert; Vedder, Ineke – Language Testing, 2014

This study investigates the relationship in L2 writing between raters' judgments of communicative adequacy and linguistic complexity by means of six-point Likert scales, and general measures of linguistic performance. The participants were 39 learners of Italian and 32 of Dutch, who wrote two short argumentative essays. The same writing tasks…

Descriptors: Writing Evaluation, Second Language Learning, Evaluators, Native Language

TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

Peer reviewed

Direct link

Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela – Language Testing, 2012

Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…

Descriptors: Undergraduate Students, Speech Communication, Rating Scales, Scoring

Explaining ESL Essay Holistic Scores: A Multilevel Modeling Approach

Peer reviewed

Direct link

Barkaoui, Khaled – Language Testing, 2010

This study adopted a multilevel modeling (MLM) approach to examine the contribution of rater and essay factors to variability in ESL essay holistic scores. Previous research aiming to explain variability in essay holistic scores has focused on either rater or essay factors. The few studies that have examined the contribution of more than one…

Descriptors: Performance Based Assessment, English (Second Language), Second Language Learning, Holistic Approach

Evaluating Analytic Scoring for the TOEFL[R] Academic Speaking Test (TAST) for Operational Use

Peer reviewed

Direct link

Xi, Xiaoming – Language Testing, 2007

This study explores the utility of analytic scoring for TAST in providing useful and reliable diagnostic information for operational use in three aspects of candidates' performance: delivery, language use and topic development. One hundred and forty examinees' responses to six TAST tasks were scored analytically on these three aspects of speech. G…

Descriptors: Scoring, Profiles, Performance Based Assessment, Academic Discourse

EFL Classroom Peer Assessment: Training Effects on Rating and Commenting

Peer reviewed

Direct link

Saito, Hidetoshi – Language Testing, 2008

This study examined the effects of training on peer assessment and comments provided regarding oral presentations in EFL (English as a Foreign Language) classrooms. In Study 1, both the treatment and control groups received instruction on skill aspects, but only the treatment group was given an additional 40-minute training on how to rate…

Descriptors: Control Groups, Student Attitudes, Peer Evaluation, English (Second Language)

Kuiken, Folkert	2
Vedder, Ineke	2
Barkaoui, Khaled	1
Bridgeman, Brent	1
Davis, Larry	1
Feng, Yali	1
Galaczi, Evelina D.	1
Han, Chao	1
In'nami, Yo	1
Jarvis, Scott	1
Johnson, Daniel Ezra	1
Khabbazbashi, Nahal	1
Koizumi, Rie	1
Kormos, Judit	1
Li, Shuai	1
Li, Xian	1
Lin, Chuan	1
Mollaun, Pamela	1
Powers, Donald	1
Préfontaine, Yvonne	1
Saito, Hidetoshi	1
Stone, Elizabeth	1
Wen, Ting	1
Xi, Xiaoming	1
More ▼