ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	16
Since 2006 (last 20 years)	22

Descriptor

Evaluators	22
Foreign Countries	22
Second Language Learning	18
Language Tests	17
English (Second Language)	12
Second Language Instruction	11
Scores	10
Language Proficiency	8
Correlation	5
Decision Making	5
Comparative Analysis	4
Language Fluency	4
Language Teachers	4
Native Language	4
Oral Language	4
Scoring	4
Speech Communication	4
Student Evaluation	4
Training	4
Undergraduate Students	4
Indo European Languages	3
Interrater Reliability	3
Rating Scales	3
Task Analysis	3
Writing Evaluation	3

Source

Language Testing

Publication Type

Journal Articles	22
Reports - Research	20
Reports - Evaluative	2
Tests/Questionnaires	2

Education Level

Higher Education	10
Postsecondary Education	8
Secondary Education	4
Adult Education	1
Elementary Education	1
Elementary Secondary Education	1

Location

China	5
Australia	3
Europe	3
Netherlands	3
Turkey	2
California (San Francisco)	1
Canada	1
Colombia	1
Finland	1
India	1
Japan	1
New York (New York)	1
Switzerland	1
United States	1

Assessments and Surveys

International English…

Showing 1 to 15 of 22 results

Communal Factors in Rater Severity and Consistency over Time in High-Stakes Oral Assessment

Peer reviewed

Neittaanmäki, Reeta; Lamprianou, Iasonas – Language Testing, 2024

This article focuses on rater severity and consistency and their relation to major changes in the rating system in a high-stakes testing context. The study is based on longitudinal data collected from 2009 to 2019 from the second language (L2) Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. We investigated…

Descriptors: Foreign Countries, Interrater Reliability, Evaluators, Item Response Theory

Language Testers and Their Place in the Policy Web

Peer reviewed

Schildt, Laura; Deygers, Bart; Weideman, Albert – Language Testing, 2024

In the context of policy-driven language testing for citizenship, a growing body of research examines the political justifications and ethical implications of language requirements and test use. However, virtually no studies have looked at the role that language testers play in the evolution of language requirements. Critical gaps remain in our…

Descriptors: Language Tests, Citizenship, Educational Policy, Assessment Literacy

Challenges in Rating Signed Production: A Mixed-Methods Study of a Swiss German Sign Language Form-Recall Vocabulary Test

Peer reviewed

Batty, Aaron Olaf; Haug, Tobias; Ebling, Sarah; Tissi, Katja; Sidler-Miserez, Sandra – Language Testing, 2023

Sign languages present particular challenges to language assessors in relation to variation in signs, weakly defined citation forms, and a general lack of standard-setting work even in long-established measures of productive sign proficiency. The present article addresses and explores these issues via a mixed-methods study of a human-rated…

Descriptors: Sign Language, Language Tests, Standard Setting, Barriers

The Longitudinal Stability of Rating Characteristics in an EFL Examination: Methodological and Substantive Considerations

Peer reviewed

Lamprianou, Iasonas; Tsagari, Dina; Kyriakou, Nansia – Language Testing, 2021

This longitudinal study (2002-2014) investigates the stability of rating characteristics of a large group of raters over time in the context of the writing paper of a national high-stakes examination. The study uses one measure of rater severity and two measures of rater consistency. The results suggest that the rating characteristics of…

Descriptors: Longitudinal Studies, Evaluators, High Stakes Tests, Writing Evaluation

Comparing Holistic and Analytic Marking Methods in Assessing Speech Act Production in L2 Chinese

Peer reviewed

Li, Shuai; Wen, Ting; Li, Xian; Feng, Yali; Lin, Chuan – Language Testing, 2023

This study compared holistic and analytic marking methods for their effects on parameter estimation (of examinees, raters, and items) and rater cognition in assessing speech act production in L2 Chinese. Seventy American learners of Chinese completed an oral Discourse Completion Test assessing requests and refusals. Four first-language (L1)…

Descriptors: Speech Acts, Second Language Learning, Second Language Instruction, Chinese

A Comparison of Holistic, Analytic, and Part Marking Models in Speaking Assessment

Peer reviewed

Khabbazbashi, Nahal; Galaczi, Evelina D. – Language Testing, 2020

This mixed methods study examined holistic, analytic, and part marking models (MMs) in terms of their measurement properties and impact on candidate CEFR classifications in a semi-direct online speaking test. Speaking performances of 240 candidates were first marked holistically and by part (phase 1). On the basis of phase 1 findings--which…

Descriptors: Holistic Approach, Classification, Grading, Language Tests

A Comparative Judgment Approach to Assessing Chinese Sign Language Interpreting

Peer reviewed

Han, Chao; Xiao, Xiaoyan – Language Testing, 2022

The quality of sign language interpreting (SLI) is a gripping construct among practitioners, educators and researchers, calling for reliable and valid assessment. There has been a diverse array of methods in the extant literature to measure SLI quality, ranging from traditional error analysis to recent rubric scoring. In this study, we want to…

Descriptors: Comparative Analysis, Sign Language, Deaf Interpreting, Evaluators

Professional and Non-Professional Raters' Responsiveness to Fluency and Accuracy in L2 Speech: An Experimental Approach

Peer reviewed

Duijm, Klaartje; Schoonen, Rob; Hulstijn, Jan H. – Language Testing, 2018

It is general practice to use rater judgments in speaking proficiency testing. However, it has been shown that raters' knowledge and experience may influence their ratings, both in terms of leniency and varied focus on different aspects of speech. The purpose of this study is to identify raters' relative responsiveness to fluency and linguistic…

Descriptors: Language Fluency, Accuracy, Second Languages, Language Tests

"Am I Qualified to Be a Language Tester?": Understanding the Development of Language Assessment Literacy across Three Stakeholder Groups

Peer reviewed

Yan, Xun; Fan, Jason – Language Testing, 2021

Recent investigations into language assessment literacy (LAL) suggest that stakeholder groups might differ in interests, needs, and expectations in assessment practice, resulting in different LAL profiles. This qualitative study furthers this line of research by examining the effect of contextual and experiential factors on the LAL profiles and…

Descriptors: Evaluators, Language Tests, Language Teachers, Second Language Learning

Do Experience and Text Quality Matter for Raters' Decision-Making Behaviors?

Peer reviewed

Sahan, Özgür; Razi, Salim – Language Testing, 2020

This study examines the decision-making behaviors of raters with varying levels of experience while assessing EFL essays of distinct qualities. The data were collected from 28 raters with varying levels of rating experience and working at the English language departments of different universities in Turkey. Using a 10-point analytic rubric, each…

Descriptors: Decision Making, Essays, Writing Evaluation, Evaluators

A Generalizability Theory Study of Optimal Measurement Design for a Summative Assessment of English/Chinese Consecutive Interpreting

Peer reviewed

Han, Chao – Language Testing, 2019

Summative assessment of interpretation is widely conducted in interpreting courses/programs to inform high-stakes decision making, such as the selection, certification, and conferral of academic degrees. Yet there has been very limited empirical research to investigate the score dependability of summative interpretation assessment. The present…

Descriptors: Generalization, Decision Making, Summative Evaluation, Evaluators

The Impact of Pre-Task Planning on Speaking Test Performance for English-Medium University Admission

Peer reviewed

O'Grady, Stefan – Language Testing, 2019

This study investigated the impact of different lengths of pre-task planning time on performance in a test of second language speaking ability for university admission. In the study, 47 Turkish-speaking learners of English took a test of English language speaking ability. The participants were divided into two groups according to their language…

Descriptors: Language of Instruction, English (Second Language), Second Language Learning, Student Placement

Measuring the Impact of Rater Negotiation in Writing Performance Assessment

Peer reviewed

Trace, Jonathan; Janssen, Gerriet; Meier, Valerie – Language Testing, 2017

Previous research in second language writing has shown that when scoring performance assessments even trained raters can exhibit significant differences in severity. When raters disagree, using discussion to try to reach a consensus is one popular form of score resolution, particularly in contexts with limited resources, as it does not require…

Descriptors: Performance Based Assessment, Second Language Learning, Scoring, Evaluators

Effect of Genre on the Generalizability of Writing Scores

Peer reviewed

Bouwer, Renske; Béguin, Anton; Sanders, Ted; van den Bergh, Huub – Language Testing, 2015

In the present study, aspects of the measurement of writing are disentangled in order to investigate the validity of inferences made on the basis of writing performance and to describe implications for the assessment of writing. To include genre as a facet in the measurement, we obtained writing scores of 12 texts in four different genres for each…

Descriptors: Writing Tests, Generalization, Scores, Writing Instruction

How Do Utterance Measures Predict Raters' Perceptions of Fluency in French as a Second Language?

Peer reviewed

Préfontaine, Yvonne; Kormos, Judit; Johnson, Daniel Ezra – Language Testing, 2016

While the research literature on second language (L2) fluency is replete with descriptions of fluency and its influence with regard to English as an additional language, little is known about what fluency features influence judgments of fluency in L2 French. This study reports the results of an investigation that analyzed the relationship between…

Descriptors: Prediction, French, Second Language Learning, Evaluators

Author

Han, Chao	2
Pill, John	2
Sanders, Ted	2
Zhang, Ying	2
van den Bergh, Huub	2
Albert Weideman	1
Bart Deygers	1
Batty, Aaron Olaf	1
Bouwer, Renske	1
Béguin, Anton	1
Duijm, Klaartje	1
Ebling, Sarah	1
Elder, Catherine	1
Fan, Jason	1
Feng, Yali	1
Galaczi, Evelina D.	1
Harding, Luke	1
Haug, Tobias	1
Hsu, Tammy Huei-Lien	1
Hulstijn, Jan H.	1
Iasonas Lamprianou	1
Janssen, Gerriet	1
Johnson, Daniel Ezra	1
Khabbazbashi, Nahal	1
Kormos, Judit	1