Reza Shahi; Hamdollah Ravand; Golam Reza Rohani – International Journal of Language Testing, 2025
The current paper intends to exploit the Many-Facet Rasch Model to investigate and compare the impact of situations (items) and raters on test takers' performance on the Written Discourse Completion Test (WDCT) and Discourse Self-Assessment Tests (DSAT). In this study, the participants were 110 English as a Foreign Language (EFL) students at…
Descriptors: Comparative Analysis, English (Second Language), Second Language Learning, Second Language Instruction
Makiko Kato – Journal of Education and Learning, 2025
This study aims to examine whether differences exist in the factors influencing the difficulty of scoring English summaries and determining scores based on the raters' attributes, and to collect candid opinions, considerations, and tentative suggestions for future improvements to the analytic rubric of summary writing for English learners. In this…
Descriptors: Writing Evaluation, Scoring, Writing Skills, English (Second Language)
Jia, Wenfeng; Zhang, Peixin – Language Testing in Asia, 2023
It is widely believed that raters' cognition is an important aspect of writing assessment, as it has both logical and temporal priority over scores. Based on a critical review of previous research in this area, it is found that raters' cognition can be boiled down to two fundamental issues: building text images and strategies for articulating scores.…
Descriptors: Problem Solving, Cognitive Processes, Writing Evaluation, Evaluators
Tanaka, Mitsuko; Ross, Steven J. – Assessment in Education: Principles, Policy & Practice, 2023
Raters vary from each other in their severity and leniency in rating performance. This study examined the factors affecting rater severity in peer assessments of oral presentations in English as a Foreign Language (EFL), focusing on peer raters' self-construal and presentation abilities. Japanese university students enrolled in EFL classes…
Descriptors: Evaluators, Interrater Reliability, Item Response Theory, Peer Evaluation
Yuichiro Yokouchi – Language Testing in Asia, 2025
The performance decision tree (PDT; Fulcher et al., 2011) is a rubric style that is applicable to performance assessment, with origins in Upshur and Turner's (1995) empirically derived binary-choice, boundary-definition (EBB) scale. It is easier for raters to assess performance by evaluating multiple binary-choice descriptors. Additionally,…
Descriptors: Scoring Rubrics, Second Language Learning, Second Language Instruction, Language Teachers
Rachael Lindberg; Pavel Trofimovich – Canadian Journal of Applied Linguistics / Revue canadienne de linguistique appliquée, 2023
According to expectation violation theory, job applicants can be upgraded or downgraded during an interview when their accent does not match employers' speech expectations. Focusing on the employment of second language French job candidates in Québec, this study explored this issue dynamically in terms of how expectations may impact the trajectory…
Descriptors: French, Pronunciation, Second Language Learning, Service Occupations
Lian Li; Jiehui Hu; Yu Dai; Ping Zhou; Wanhong Zhang – Reading & Writing Quarterly, 2024
This paper proposes to use depth perception to represent raters' decisions in holistic evaluation of ESL essays, as an alternative medium to the conventional form of numerical scores. The researchers verified the new method's accuracy and inter/intra-rater reliability by inviting 24 ESL teachers to perform different representations when rating 60…
Descriptors: Essays, Holistic Approach, Writing Evaluation, Accuracy
Ahmet Can Uyar; Dilek Büyükahiska – International Journal of Assessment Tools in Education, 2025
This study explores the effectiveness of using ChatGPT, an Artificial Intelligence (AI) language model, as an Automated Essay Scoring (AES) tool for grading English as a Foreign Language (EFL) learners' essays. The corpus consists of 50 essays representing various types including analysis, compare and contrast, descriptive, narrative, and opinion…
Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, Teaching Methods
Ogawa, Chie – Language Testing in Asia, 2022
This study explored two assessment approaches to oral performances: analytical complexity, accuracy, and fluency (CAF) indices and human raters' evaluations. CAF indices are frequently used in second-language (L2) speaking research; however, because tasks are communicative and goal-oriented, the degree to which students achieve such communicative…
Descriptors: Oral Language, Evaluators, Audio Equipment, Accuracy
Lin, Rongchan – Language Assessment Quarterly, 2023
Communication in the real world often entails the interpretation, evaluation, and integration of content from different sources. However, it appears that the ability to integrate content into discourse has not been explicitly scored for in existing studies. This study operationalizes content integration in the analytic scoring of a…
Descriptors: Listening Comprehension Tests, Generalization, Chinese, Second Language Learning
Hou, Peng; Kraisame, Sarawut – rEFLections, 2023
This paper provides an experimental study of interlanguage phonological characteristics of Chinese students learning Thai as a foreign language and the accentedness perceived by native Thai speakers. Both production and perception experiments were designed to see how Chinese students acoustically produced Thai final nasal consonants and how Thai…
Descriptors: Phonology, Pronunciation, Thai, Second Language Learning
O'Grady, Stefan; Taskesen, Özgür – Language Learning in Higher Education, 2022
An important aspect of language assessment development is to create tasks that engage the competencies required in the target situation. For this reason, English-medium university entrance tests increasingly feature integrated reading-into-writing tasks as a way of enhancing target domain representation. Despite increased use of this task type,…
Descriptors: Writing Evaluation, Scoring Rubrics, Rating Scales, English (Second Language)
Vasfiye Geçkin; Ebru Kiziltas; Çagatay Çinar – Journal of Educational Technology and Online Learning, 2023
The quality of writing in a second language (L2) is one of the indicators of the level of proficiency for many college students to be eligible for departmental studies. Although certain software programs, such as Intelligent Essay Assessor or IntelliMetric, have been introduced to evaluate second-language writing quality, an overall assessment of…
Descriptors: Writing Evaluation, Second Language Learning, Second Language Instruction, Language Proficiency
Batty, Aaron Olaf; Haug, Tobias; Ebling, Sarah; Tissi, Katja; Sidler-Miserez, Sandra – Language Testing, 2023
Sign languages present particular challenges to language assessors in relation to variation in signs, weakly defined citation forms, and a general lack of standard-setting work even in long-established measures of productive sign proficiency. The present article addresses and explores these issues via a mixed-methods study of a human-rated…
Descriptors: Sign Language, Language Tests, Standard Setting, Barriers
Apichat Khamboonruang – PASAA: Journal of Language Teaching and Learning in Thailand, 2023
Differential rater severity (DRS), one prevalent case of differential rater functioning (aka rater bias or rater interaction) effects, manifests itself when a rater assigns unusually severe or lenient ratings, threatening the validity and fairness of rater-mediated assessment. Building on a many-facets Rasch measurement (MFRM) approach, this study…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Scoring Rubrics