Publication Date
In 2025 | 5 |
Since 2024 | 14 |
Since 2021 (last 5 years) | 59 |
Since 2016 (last 10 years) | 126 |
Since 2006 (last 20 years) | 200 |
Descriptor
Evaluators | 220 |
Language Tests | 220 |
Second Language Learning | 169 |
English (Second Language) | 153 |
Foreign Countries | 102 |
Language Proficiency | 88 |
Oral Language | 73 |
Second Language Instruction | 73 |
Scores | 72 |
Scoring | 49 |
Comparative Analysis | 45 |
More ▼ |
Source
Author
Pill, John | 5 |
Eckes, Thomas | 4 |
Xi, Xiaoming | 4 |
May, Lyn | 3 |
McNamara, Tim | 3 |
Winke, Paula | 3 |
Ahmadi, Alireza | 2 |
Bejar, Isaac I. | 2 |
Bridgeman, Brent | 2 |
Brown, Annie | 2 |
Cots, Josep M. | 2 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 1 |
Location
China | 18 |
Iran | 9 |
Japan | 9 |
Australia | 8 |
Europe | 8 |
Canada | 6 |
Hong Kong | 5 |
United Kingdom | 5 |
India | 4 |
Netherlands | 4 |
Turkey | 4 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Stefan O'Grady – TESOL Journal, 2025
Task-based language assessment represents a major component of task-based language teaching syllabi. Current perspectives emphasise the importance of tasks in the assessment process, suggesting that adherence to influential models of language production during task design yields predictable test outcomes. The current study contends that the…
Descriptors: Task Analysis, Language Tests, Evaluators, Rating Scales
Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024
This article focuses on rater severity and consistency and their relation to major changes in the rating system in a high-stakes testing context. The study is based on longitudinal data collected from 2009 to 2019 from the second language (L2) Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. We investigated…
Descriptors: Foreign Countries, Interrater Reliability, Evaluators, Item Response Theory
Ginther, April – Language Testing, 2023
Great opportunities for language testing practitioners are enabled through language program administration. Local language tests lend themselves to multiple purposes--for placement and diagnosis, as a means of tracking progress, and as a contribution to program evaluation and revision. Administrative choices, especially those involving a test, are…
Descriptors: Language Tests, Testing, Examiners, Placement Tests
Laura Schildt; Bart Deygers; Albert Weideman – Language Testing, 2024
In the context of policy-driven language testing for citizenship, a growing body of research examines the political justifications and ethical implications of language requirements and test use. However, virtually no studies have looked at the role that language testers play in the evolution of language requirements. Critical gaps remain in our…
Descriptors: Language Tests, Citizenship, Educational Policy, Assessment Literacy
Bijani, Houman; Hashempour, Bahareh; Ibrahim, Khaled Ahmed Abdel-Al; Orabah, Salim Said Bani; Heydarnejad, Tahereh – Language Testing in Asia, 2022
Due to subjectivity in oral assessment, much concentration has been put on obtaining a satisfactory measure of consistency among raters. However, the process for obtaining more consistency might not result in valid decisions. One matter that is at the core of both reliability and validity in oral assessment is rater training. Recently,…
Descriptors: Oral Language, Language Tests, Feedback (Response), Bias
Huiying Cai; Xun Yan – Language Testing, 2024
Rater comments tend to be qualitatively analyzed to indicate raters' application of rating scales. This study applied natural language processing (NLP) techniques to quantify meaningful, behavioral information from a corpus of rater comments and triangulated that information with a many-facet Rasch measurement (MFRM) analysis of rater scores. The…
Descriptors: Natural Language Processing, Item Response Theory, Rating Scales, Writing Evaluation
Kahng, Jimin – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2023
This study is the first attempt to explore the relationship between rater variables focusing on raters' language aptitude and their judgments of second language (L2) speech. Thirty-four English listeners rated 65 spontaneous native and nonnative speech samples for comprehensibility, accentedness, and fluency. They also completed the LLAMA language…
Descriptors: Evaluators, Second Language Learning, Language Tests, Language Fluency
Sarah Surrain; Michael P. Mesa; Mike A. Assel; Tricia A. Zucker – Grantee Submission, 2023
Purpose: The ongoing COVID-19 pandemic has prompted changes to child assessment procedures in schools such as the use of face masks by assessors. Research with adults suggests that face masks diminish performance on speech processing and comprehension tasks, yet little is known about how assessor masking affects child performance. Therefore, we…
Descriptors: Kindergarten, Young Children, Native Language, Language Tests
Sarah Surrain; Michael P. Mesa; Mike A. Assel; Tricia A. Zucker – Language, Speech, and Hearing Services in Schools, 2023
Purpose: The ongoing COVID-19 pandemic has prompted changes to child assessment procedures in schools such as the use of face masks by assessors. Research with adults suggests that face masks diminish performance on speech processing and comprehension tasks, yet little is known about how assessor masking affects child performance. Therefore, we…
Descriptors: Kindergarten, Young Children, Native Language, Language Tests
Michael D. Carey; Stefan Szocs – Language Testing, 2024
This controlled experimental study investigated the interaction of variables associated with rating the pronunciation component of high-stakes English-language-speaking tests such as IELTS and TOEFL iBT. One hundred experienced raters who were all either familiar or unfamiliar with Brazilian-accented English or Papua New Guinean Tok Pisin-accented…
Descriptors: Dialects, Pronunciation, Suprasegmentals, Familiarity
Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Jin, Kuan-Yu; Eckes, Thomas – Measurement: Interdisciplinary Research and Perspectives, 2022
Recent research on rater effects in performance assessments has increasingly focused on rater centrality, the tendency to assign scores clustering around the rating scale's middle categories. In the present paper, we adopted Jin and Wang's (2018) extended facets modeling approach and constructed a centrality continuum, ranging from raters…
Descriptors: Performance Based Assessment, Evaluators, Scoring, Sample Size
Takanori Sato – Language Testing, 2024
Assessing the content of learners' compositions is a common practice in second language (L2) writing assessment. However, the construct definition of content in L2 writing assessment potentially underrepresents the target competence in content and language integrated learning (CLIL), which aims to foster not only L2 proficiency but also critical…
Descriptors: Language Tests, Content and Language Integrated Learning, Writing Evaluation, Writing Tests
Yu-Tzu Chang; Ann Tai Choe; Daniel Holden; Daniel R. Isbell – Language Testing, 2024
In this Brief Report, we describe an evaluation of and revisions to a rubric adapted from the Jacobs et al.'s (1981) ESL COMPOSITION PROFILE, with four rubric categories and 20-point rating scales, in the context of an intensive English program writing placement test. Analysis of 4 years of rating data (2016-2021, including 434 essays) using…
Descriptors: Language Tests, Rating Scales, Second Language Learning, English (Second Language)
Jia, Wenfeng; Zhang, Peixin – Language Testing in Asia, 2023
It is widely believed that raters' cognition is an important aspect of writing assessment, as it has both logical and temporal priority over scores. Based on a critical review of previous research in this area, it is found that raters' cognition can be boiled to two fundamental issues: building text images and strategies for articulating scores.…
Descriptors: Problem Solving, Cognitive Processes, Writing Evaluation, Evaluators