Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 23 |
Descriptor
Evaluators | 23 |
Protocol Analysis | 23 |
English (Second Language) | 13 |
Second Language Learning | 11 |
Foreign Countries | 10 |
Scoring | 9 |
Writing Evaluation | 9 |
Rating Scales | 8 |
Decision Making | 7 |
Essays | 7 |
Language Tests | 7 |
More ▼ |
Source
Author
Barkaoui, Khaled | 2 |
Abbasi, Abbas | 1 |
Ang-Aw, Hui Teng | 1 |
Armengol, Lurdes | 1 |
Bell, Courtney A. | 1 |
Bogorevich, Valeriia | 1 |
Borowiec, Katrina | 1 |
Brooks, Val | 1 |
Cai, Hongwen | 1 |
Castle, Courtney | 1 |
Chambers, Lucy | 1 |
More ▼ |
Publication Type
Journal Articles | 21 |
Reports - Research | 18 |
Tests/Questionnaires | 3 |
Dissertations/Theses -… | 2 |
Reports - Descriptive | 1 |
Reports - Evaluative | 1 |
Education Level
Higher Education | 5 |
Elementary Secondary Education | 3 |
Postsecondary Education | 3 |
Secondary Education | 2 |
Adult Education | 1 |
Elementary Education | 1 |
Grade 4 | 1 |
Intermediate Grades | 1 |
Audience
Location
China | 2 |
Turkey | 2 |
California (Los Angeles) | 1 |
Europe | 1 |
Finland | 1 |
Indonesia | 1 |
Netherlands | 1 |
Singapore | 1 |
Spain | 1 |
Vietnam | 1 |
Laws, Policies, & Programs
Assessments and Surveys
International English… | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Takanori Sato – Language Testing, 2024
Assessing the content of learners' compositions is a common practice in second language (L2) writing assessment. However, the construct definition of content in L2 writing assessment potentially underrepresents the target competence in content and language integrated learning (CLIL), which aims to foster not only L2 proficiency but also critical…
Descriptors: Language Tests, Content and Language Integrated Learning, Writing Evaluation, Writing Tests
Leech, Tony; Chambers, Lucy – Research Matters, 2022
Two of the central issues in comparative judgement (CJ), which are perhaps underexplored compared to questions of the method's reliability and technical quality, are "what processes do judges use to make their decisions" and "what features do they focus on when making their decisions?" This article discusses both, in the…
Descriptors: Comparative Analysis, Decision Making, Evaluators, Reliability
Heidari, Nasim; Ghanbari, Nasim; Abbasi, Abbas – Language Testing in Asia, 2022
It is widely believed that human rating performance is influenced by an array of different factors. Among these, rater-related variables such as experience, language background, perceptions, and attitudes have been mentioned. One of the important rater-related factors is the way the raters interact with the rating scales. In particular, how raters…
Descriptors: Evaluators, Rating Scales, Language Tests, English (Second Language)
Borowiec, Katrina; Castle, Courtney – Practical Assessment, Research & Evaluation, 2019
Rater cognition or "think-aloud" studies have historically been used to enhance rater accuracy and consistency in writing and language assessments. As assessments are developed for new, complex constructs from the "Next Generation Science Standards (NGSS)," the present study illustrates the utility of extending…
Descriptors: Evaluators, Scoring, Scoring Rubrics, Protocol Analysis
Thai, Thuy; Sheehan, Susan – Language Education & Assessment, 2022
In language performance tests, raters are important as their scoring decisions determine which aspects of performance the scores represent; however, raters are considered as one of the potential sources contributing to unwanted variability in scores (Davis, 2012). Although a great number of studies have been conducted to unpack how rater…
Descriptors: Rating Scales, Speech Communication, Second Language Learning, Second Language Instruction
Cognitive Flexibility: Exploring Students' Problem-Solving in Elementary School Mathematics Learning
Rahayuningsih, Sri; Sirajuddin, Sirajuddin; Nasrun, Nasrun – Journal of Research and Advances in Mathematics Education, 2021
In classroom learning, students need mathematical cognitive flexibility to be able to solve mathematical problems with the various ideas they express. To solve the problems, they must be able to grasp the problem, see it from various points of view, and should not be rigid thinking with one solving method. In fact, the students still lack the…
Descriptors: Elementary School Students, Problem Solving, Mathematics Instruction, Creativity
Qi, Yi; Bell, Courtney A.; Jones, Nathan D.; Lewis, Jennifer M.; Witherspoon, Margaret W.; Redash, Amanda – ETS Research Report Series, 2018
Teacher observations are being used for high-stakes purposes in states across the country, and administrators often serve as raters in teacher evaluation systems. This paper examines how the cognitive aspects of administrators' use of an observation instrument, a modified version of Charlotte Danielson's Framework for Teaching, interact with the…
Descriptors: Teacher Evaluation, Classroom Observation Techniques, Observation, Evaluation Methods
Essers, Geurt; Dielissen, Patrick; van Weel, Chris; van der Vleuten, Cees; van Dulmen, Sandra; Kramer, Anneke – Advances in Health Sciences Education, 2015
Communication assessment in real-life consultations is a complex task. Generic assessment instruments help but may also have disadvantages. The generic nature of the skills being assessed does not provide indications for context-specific behaviour required in practice situations; context influences are mostly taken into account implicitly. Our…
Descriptors: Communication (Thought Transfer), Context Effect, Evaluators, Qualitative Research
Sahan, Özgür; Razi, Salim – Language Testing, 2020
This study examines the decision-making behaviors of raters with varying levels of experience while assessing EFL essays of distinct qualities. The data were collected from 28 raters with varying levels of rating experience and working at the English language departments of different universities in Turkey. Using a 10-point analytic rubric, each…
Descriptors: Decision Making, Essays, Writing Evaluation, Evaluators
Han, Turgay – International Journal of Progressive Education, 2017
The aim of this study is to examine the variability in and reliability of scores assigned to different quality EFL compositions by EFL instructors and their rating behaviors. Using a mixed research design, quantitative data were collected from EFL instructors' ratings of 30 compositions of three different qualities using a holistic scoring rubric.…
Descriptors: English (Second Language), Writing Evaluation, Scores, Expertise
Bogorevich, Valeriia – ProQuest LLC, 2018
Rater variation in performance assessment can impact test-takers' scores and compromise assessments' fairness and validity (Crooks, Kane, & Cohen, 1996). Rater variation can also undermine a test's validity and fairness; therefore, it is important to investigate raters' scoring patterns in order to inform rater training. Substantial work has…
Descriptors: Pronunciation, Familiarity, English (Second Language), Second Language Learning
Cai, Hongwen – Language Assessment Quarterly, 2015
This study is an attempt to classify raters according to their weighting patterns and explore systematic differences between rater types in the rating process. In the context of an EFL speaking test, 126 raters were classified into three types--form-oriented, balanced, and content-oriented--through cluster analyses of their weighting patterns…
Descriptors: Classification, Language Tests, English (Second Language), Second Language Learning
Shirazi, Masoumeh Ahmadi – Language Testing in Asia, 2012
The research reported here suggests that raters, when involved in writing assessment, are more concerned with their own criteria to set a basis for their judgment rather than the standards provided by scale descriptors. This study sampled think aloud of eight raters who scored 15 essays in accord with Test of Written English (TWE) holistic scoring…
Descriptors: Evaluators, Writing Evaluation, Evaluation Criteria, Standards
Govaerts, M. J. B.; Van de Wiel, M. W. J.; Schuwirth, L. W. T.; Van der Vleuten, C. P. M.; Muijtjens, A. M. M. – Advances in Health Sciences Education, 2013
Weaknesses in the nature of rater judgments are generally considered to compromise the utility of workplace-based assessment (WBA). In order to gain insight into the underpinnings of rater behaviours, we investigated how raters form impressions of and make judgments on trainee performance. Using theoretical frameworks of social cognition and…
Descriptors: Medical Education, Personnel Evaluation, Evaluators, Trainees
Li, Hang; He, Lianzhen – Language Assessment Quarterly, 2015
This study used think-aloud protocols to compare essay-rating processes across holistic and analytic rating scales in the context of China's College English Test Band 6 (CET-6). A group of 9 experienced CET-6 raters scored the same batch of 10 CET-6 essays produced in an operational CET-6 administration twice, using both the CET-6 holistic…
Descriptors: Protocol Analysis, English (Second Language), Second Language Learning, Classification
Previous Page | Next Page »
Pages: 1 | 2