ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	9
Since 2016 (last 10 years)	16
Since 2006 (last 20 years)	19

Descriptor

Decision Making	19
Evaluators	19
Language Tests	19
English (Second Language)	15
Second Language Learning	14
Foreign Countries	12
Scores	11
Second Language Instruction	11
Language Proficiency	8
Correlation	6
Oral Language	6
Rating Scales	6
Speech Communication	6
Evaluation Criteria	5
Training	5
Scoring	4
Statistical Analysis	4
Case Studies	3
English	3
High Stakes Tests	3
Interrater Reliability	3
Native Language	3
Performance Based Assessment	3
Recall (Psychology)	3
Reliability	3
More ▼

Source

Language Testing	4
Language Assessment Quarterly	3
Language Testing in Asia	3
English Language Teaching	2
Advances in Language and…	1
Language Education &…	1
Language Learning	1
ProQuest LLC	1
Second Language Research	1
Studies in Applied…	1
TESL-EJ	1
More ▼

Publication Type

Journal Articles	18
Reports - Research	18
Tests/Questionnaires	2
Dissertations/Theses -…	1

Education Level

Higher Education	6
Postsecondary Education	6
Secondary Education	2
High Schools	1

Audience

Location

China	3
Europe	3
Australia	1
India	1
Japan	1
Japan (Tokyo)	1
New York (New York)	1
Turkey (Istanbul)	1
United Kingdom	1
United States	1
Vietnam	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

International English…	4
Test of English as a Foreign…	2

What Works Clearinghouse Rating

Showing 1 to 15 of 19 results Save | Export

Language Testers and Their Place in the Policy Web

Peer reviewed

Direct link

Laura Schildt; Bart Deygers; Albert Weideman – Language Testing, 2024

In the context of policy-driven language testing for citizenship, a growing body of research examines the political justifications and ethical implications of language requirements and test use. However, virtually no studies have looked at the role that language testers play in the evolution of language requirements. Critical gaps remain in our…

Descriptors: Language Tests, Citizenship, Educational Policy, Assessment Literacy

Crowdsourced Adaptive Comparative Judgment: A Community-Based Solution for Proficiency Rating

Peer reviewed

Direct link

Paquot, Magali; Rubin, Rachel; Vandeweerd, Nathan – Language Learning, 2022

The main objective of this Methods Showcase Article is to show how the technique of adaptive comparative judgment, coupled with a crowdsourcing approach, can offer practical solutions to reliability issues as well as to address the time and cost difficulties associated with a text-based approach to proficiency assessment in L2 research. We…

Descriptors: Comparative Analysis, Decision Making, Language Proficiency, Reliability

A Sequential Approach to Detecting Differential Rater Functioning in Sparse Rater-Mediated Assessment Networks

Peer reviewed

Direct link

Wind, Stefanie A. – Language Testing, 2023

Researchers frequently evaluate rater judgments in performance assessments for evidence of differential rater functioning (DRF), which occurs when rater severity is systematically related to construct-irrelevant student characteristics after controlling for student achievement levels. However, researchers have observed that methods for detecting…

Descriptors: Evaluators, Decision Making, Student Characteristics, Performance Based Assessment

Raters' Perceptions of Rating Scales Criteria and Its Effect on the Process and Outcome of Their Rating

Peer reviewed

Direct link

Heidari, Nasim; Ghanbari, Nasim; Abbasi, Abbas – Language Testing in Asia, 2022

It is widely believed that human rating performance is influenced by an array of different factors. Among these, rater-related variables such as experience, language background, perceptions, and attitudes have been mentioned. One of the important rater-related factors is the way the raters interact with the rating scales. In particular, how raters…

Descriptors: Evaluators, Rating Scales, Language Tests, English (Second Language)

Generalizability of Writing Scores and Language Program Placement Decisions: Score Dependability, Task Variability, and Score Profiles on an ESL Placement Test

Peer reviewed
PDF on ERIC

Download full text

Eskin, Daniel – Studies in Applied Linguistics & TESOL, 2022

For agencies that deliver high-stakes Second Language (L2) proficiency exams, a research agenda has been undertaken for years to examine the role of rater, task, and rubric as sources of variability into their performance assessments (Lee, 2006; Sawaki & Sinharay, 2013; Xi, 2007; Xi & Mollaun, 2006). However, these challenges are more…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Student Placement

The Processes of Rating L2 Speaking Performance Using an Analytic Rating Scale -- A Qualitative Exploration

Peer reviewed
PDF on ERIC

Download full text

Thai, Thuy; Sheehan, Susan – Language Education & Assessment, 2022

In language performance tests, raters are important as their scoring decisions determine which aspects of performance the scores represent; however, raters are considered as one of the potential sources contributing to unwanted variability in scores (Davis, 2012). Although a great number of studies have been conducted to unpack how rater…

Descriptors: Rating Scales, Speech Communication, Second Language Learning, Second Language Instruction

Establishing an Operational Model of Rating Scale Construction for English Writing Assessment

Peer reviewed
PDF on ERIC

Download full text

Wu, Xuefeng – English Language Teaching, 2022

Rating scales for writing assessment are critical in that they determine directly the quality and fairness of such performance tests. However, in many EFL contexts, rating scales are made, to certain extent, based on the intuition of teachers who strongly need a feasible and scientific route to guide their construction of rating scales. This study…

Descriptors: Writing Evaluation, Rating Scales, Second Language Learning, Second Language Instruction

Linking the International English Language Competency Assessment Suite of Examinations to the Common European Framework of Reference

Peer reviewed

Direct link

Hidri, Sahbi – Language Testing in Asia, 2021

The study investigated the alignment process of the International English Language Competency Assessment (IELCA) suite examinations' four levels, B1, B2, C1 and C2, onto the Common European Framework of Reference (CEFR) by explaining and discussing the five linking stages (Council of Europe (CoE 2009). Unlike previous studies, this study used the…

Descriptors: Literacy, Second Language Learning, Second Language Instruction, English (Second Language)

Rater Attitude towards Emerging Varieties of English: A New Rater Effect?

Peer reviewed

Direct link

Hsu, Tammy Huei-Lien – Language Testing in Asia, 2019

Background: A strong interest in researching World Englishes (WE) in relation to language assessment has become an emerging theme in language assessment studies over the past two decades. While research on WE has highlighted the status, function, and legitimacy of varieties of English language, it remains unclear how raters respond to the results…

Descriptors: Language Attitudes, Language Variation, Language Tests, Second Language Learning

Roles of Collocation in L2 Oral Proficiency Revisited: Different Tasks, L1 vs. L2 Raters, and Cross-Sectional vs. Longitudinal Analyses

Peer reviewed

Direct link

Saito, Kazuya; Liu, Yuwei – Second Language Research, 2022

There is emerging evidence that collocation use plays a primary role in determining various dimensions of L2 oral proficiency assessment and development. The current study presents the results of three experiments which examined the relationship between the degree of association in collocation use (operationalized as t scores and mutual…

Descriptors: Phrase Structure, Case Studies, Second Language Learning, Second Language Instruction

A Generalizability Theory Study of Optimal Measurement Design for a Summative Assessment of English/Chinese Consecutive Interpreting

Peer reviewed

Direct link

Han, Chao – Language Testing, 2019

Summative assessment of interpretation is widely conducted in interpreting courses/programs to inform high-stakes decision making, such as the selection, certification, and conferral of academic degrees. Yet there has been very limited empirical research to investigate the score dependability of summative interpretation assessment. The present…

Descriptors: Generalization, Decision Making, Summative Evaluation, Evaluators

"How Scripted Is This Going to Be?" Raters' Views of Authenticity in Speaking-Performance Tests

Peer reviewed

Direct link

Burton, John Dylan – Language Assessment Quarterly, 2020

An assumption underlying speaking tests is that scores reflect the ability to produce online, non-rehearsed speech. Speech produced in testing situations may, however, be less spontaneous if extensive test preparation takes place, resulting in memorized or rehearsed responses. If raters detect these patterns, they may conceptualize speech as…

Descriptors: Language Tests, Oral Language, Scores, Speech Communication

Assessing Individual and Group Oral Exams: Scoring Criteria and Rater Interaction

Peer reviewed
PDF on ERIC

Download full text

Yalçin-Çolakoglu, Özlem; Selçuk, Merve – Advances in Language and Literary Studies, 2019

Criterion referenced tests of second language speaking performance are administered in different institutions using different procedures. The present study reports raters' practices of second language speaking tests, in particular the correspondence between test-takers' grades when assessed individually and in groups. Data derived from…

Descriptors: Oral Language, Language Tests, Test Validity, Inferences

How Do Raters Judge Spoken Vocabulary?

Peer reviewed
PDF on ERIC

Download full text

Li, Hui – English Language Teaching, 2016

The aim of the study was to investigate how raters come to their decisions when judging spoken vocabulary. Segmental rating was introduced to quantify raters' decision-making process. It is hoped that this simulated study brings fresh insight to future methodological considerations with spoken data. Twenty trainee raters assessed five Chinese…

Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Decision Making

Extending the Scope of Speaking Assessment Criteria in a Specific-Purpose Language Test: Operationalizing a Health Professional Perspective

Peer reviewed

Direct link

O'Hagan, Sally; Pill, John; Zhang, Ying – Language Testing, 2016

Criticism of specific-purpose language (LSP) tests is often directed at their limited ability to represent fully the demands of the target language use situation. Such criticisms extend to the criteria used to assess test performance, which may fail to capture what matters to participants in the domain of interest. This paper reports on the…

Descriptors: Health Personnel, Language Tests, English for Special Purposes, Criticism

Previous Page | Next Page »

Pages: 1 | 2

Pill, John	2
Abbasi, Abbas	1
Albert Weideman	1
Bart Deygers	1
Burton, John Dylan	1
Davis, Lawrence Edward	1
Eskin, Daniel	1
Ghanbari, Nasim	1
Han, Chao	1
Harding, Luke	1
Heidari, Nasim	1
Hidri, Sahbi	1
Hsu, Tammy Huei-Lien	1
Kang, Okim	1
Kozaki, Yoko	1
Laura Schildt	1
Li, Hui	1
Liu, Yuwei	1
Moran, Meghan Kerry	1
O'Hagan, Sally	1
Paquot, Magali	1
Rubin, Rachel	1
Ryan, Kerry	1
Saito, Kazuya	1
Selçuk, Merve	1
More ▼