ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	10
Since 2006 (last 20 years)	23

Descriptor

Evaluators	23
Protocol Analysis	23
English (Second Language)	13
Second Language Learning	11
Foreign Countries	10
Scoring	9
Writing Evaluation	9
Rating Scales	8
Decision Making	7
Essays	7
Language Tests	7
Evaluation Criteria	6
Scores	6
Evaluation Methods	5
Interrater Reliability	5
Second Language Instruction	5
Cognitive Processes	4
Comparative Analysis	4
Language Proficiency	4
Qualitative Research	4
Scoring Rubrics	4
Classification	3
Interviews	3
Native Language	3
Oral Language	3
More ▼

Source

Language Assessment Quarterly	4
Language Testing	3
Advances in Health Sciences…	2
Language Testing in Asia	2
ProQuest LLC	2
ETS Research Report Series	1
International Journal of…	1
Journal of Educational…	1
Journal of Research and…	1
Language Awareness	1
Language Education &…	1
Practical Assessment,…	1
RELC Journal: A Journal of…	1
Research Matters	1
Research Papers in Education	1
More ▼

Publication Type

Journal Articles	21
Reports - Research	18
Tests/Questionnaires	3
Dissertations/Theses -…	2
Reports - Descriptive	1
Reports - Evaluative	1

Education Level

Higher Education	5
Elementary Secondary Education	3
Postsecondary Education	3
Secondary Education	2
Adult Education	1
Elementary Education	1
Grade 4	1
Intermediate Grades	1

Audience

Location

China	2
Turkey	2
California (Los Angeles)	1
Europe	1
Finland	1
Indonesia	1
Netherlands	1
Singapore	1
Spain	1
Vietnam	1

Laws, Policies, & Programs

Assessments and Surveys

International English…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 23 results Save | Export

Assessing the Content Quality of Essays in Content and Language Integrated Learning: Exploring the Construct from Subject Specialists' Perspectives

Peer reviewed

Direct link

Takanori Sato – Language Testing, 2024

Assessing the content of learners' compositions is a common practice in second language (L2) writing assessment. However, the construct definition of content in L2 writing assessment potentially underrepresents the target competence in content and language integrated learning (CLIL), which aims to foster not only L2 proficiency but also critical…

Descriptors: Language Tests, Content and Language Integrated Learning, Writing Evaluation, Writing Tests

How Do Judges in Comparative Judgement Exercises Make Their Judgements?

Download full text

Leech, Tony; Chambers, Lucy – Research Matters, 2022

Two of the central issues in comparative judgement (CJ), which are perhaps underexplored compared to questions of the method's reliability and technical quality, are "what processes do judges use to make their decisions" and "what features do they focus on when making their decisions?" This article discusses both, in the…

Descriptors: Comparative Analysis, Decision Making, Evaluators, Reliability

Raters' Perceptions of Rating Scales Criteria and Its Effect on the Process and Outcome of Their Rating

Peer reviewed

Direct link

Heidari, Nasim; Ghanbari, Nasim; Abbasi, Abbas – Language Testing in Asia, 2022

It is widely believed that human rating performance is influenced by an array of different factors. Among these, rater-related variables such as experience, language background, perceptions, and attitudes have been mentioned. One of the important rater-related factors is the way the raters interact with the rating scales. In particular, how raters…

Descriptors: Evaluators, Rating Scales, Language Tests, English (Second Language)

Using Rater Cognition to Improve Generalizability of an Assessment of Scientific Argumentation

Peer reviewed
PDF on ERIC

Download full text

Borowiec, Katrina; Castle, Courtney – Practical Assessment, Research & Evaluation, 2019

Rater cognition or "think-aloud" studies have historically been used to enhance rater accuracy and consistency in writing and language assessments. As assessments are developed for new, complex constructs from the "Next Generation Science Standards (NGSS)," the present study illustrates the utility of extending…

Descriptors: Evaluators, Scoring, Scoring Rubrics, Protocol Analysis

The Processes of Rating L2 Speaking Performance Using an Analytic Rating Scale -- A Qualitative Exploration

Peer reviewed
PDF on ERIC

Download full text

Thai, Thuy; Sheehan, Susan – Language Education & Assessment, 2022

In language performance tests, raters are important as their scoring decisions determine which aspects of performance the scores represent; however, raters are considered as one of the potential sources contributing to unwanted variability in scores (Davis, 2012). Although a great number of studies have been conducted to unpack how rater…

Descriptors: Rating Scales, Speech Communication, Second Language Learning, Second Language Instruction

Cognitive Flexibility: Exploring Students' Problem-Solving in Elementary School Mathematics Learning

Peer reviewed
PDF on ERIC

Download full text

Rahayuningsih, Sri; Sirajuddin, Sirajuddin; Nasrun, Nasrun – Journal of Research and Advances in Mathematics Education, 2021

In classroom learning, students need mathematical cognitive flexibility to be able to solve mathematical problems with the various ideas they express. To solve the problems, they must be able to grasp the problem, see it from various points of view, and should not be rigid thinking with one solving method. In fact, the students still lack the…

Descriptors: Elementary School Students, Problem Solving, Mathematics Instruction, Creativity

Administrators' Uses of Teacher Observation Protocol in Different Rating Contexts. Research Report. ETS RR-18-18

Peer reviewed
PDF on ERIC

Download full text

Qi, Yi; Bell, Courtney A.; Jones, Nathan D.; Lewis, Jennifer M.; Witherspoon, Margaret W.; Redash, Amanda – ETS Research Report Series, 2018

Teacher observations are being used for high-stakes purposes in states across the country, and administrators often serve as raters in teacher evaluation systems. This paper examines how the cognitive aspects of administrators' use of an observation instrument, a modified version of Charlotte Danielson's Framework for Teaching, interact with the…

Descriptors: Teacher Evaluation, Classroom Observation Techniques, Observation, Evaluation Methods

How Do Trained Raters Take Context Factors into Account When Assessing GP Trainee Communication Performance? An Exploratory, Qualitative Study

Peer reviewed

Direct link

Essers, Geurt; Dielissen, Patrick; van Weel, Chris; van der Vleuten, Cees; van Dulmen, Sandra; Kramer, Anneke – Advances in Health Sciences Education, 2015

Communication assessment in real-life consultations is a complex task. Generic assessment instruments help but may also have disadvantages. The generic nature of the skills being assessed does not provide indications for context-specific behaviour required in practice situations; context influences are mostly taken into account implicitly. Our…

Descriptors: Communication (Thought Transfer), Context Effect, Evaluators, Qualitative Research

Do Experience and Text Quality Matter for Raters' Decision-Making Behaviors?

Peer reviewed

Direct link

Sahan, Özgür; Razi, Salim – Language Testing, 2020

This study examines the decision-making behaviors of raters with varying levels of experience while assessing EFL essays of distinct qualities. The data were collected from 28 raters with varying levels of rating experience and working at the English language departments of different universities in Turkey. Using a 10-point analytic rubric, each…

Descriptors: Decision Making, Essays, Writing Evaluation, Evaluators

Scores Assigned by Inexpert EFL Raters to Different Quality EFL Compositions, and the Raters' Decision-Making Behaviors

Peer reviewed
PDF on ERIC

Download full text

Han, Turgay – International Journal of Progressive Education, 2017

The aim of this study is to examine the variability in and reliability of scores assigned to different quality EFL compositions by EFL instructors and their rating behaviors. Using a mixed research design, quantitative data were collected from EFL instructors' ratings of 30 compositions of three different qualities using a holistic scoring rubric.…

Descriptors: English (Second Language), Writing Evaluation, Scores, Expertise

Native and Non-Native Raters of L2 Speaking Performance: Accent Familiarity and Cognitive Processes

Direct link

Bogorevich, Valeriia – ProQuest LLC, 2018

Rater variation in performance assessment can impact test-takers' scores and compromise assessments' fairness and validity (Crooks, Kane, & Cohen, 1996). Rater variation can also undermine a test's validity and fairness; therefore, it is important to investigate raters' scoring patterns in order to inform rater training. Substantial work has…

Descriptors: Pronunciation, Familiarity, English (Second Language), Second Language Learning

Weight-Based Classification of Raters and Rater Cognition in an EFL Speaking Test

Peer reviewed

Direct link

Cai, Hongwen – Language Assessment Quarterly, 2015

This study is an attempt to classify raters according to their weighting patterns and explore systematic differences between rater types in the rating process. In the context of an EFL speaking test, 126 raters were classified into three types--form-oriented, balanced, and content-oriented--through cluster analyses of their weighting patterns…

Descriptors: Classification, Language Tests, English (Second Language), Second Language Learning

When Raters Talk, Rubrics Fall Silent

Peer reviewed

Direct link

Shirazi, Masoumeh Ahmadi – Language Testing in Asia, 2012

The research reported here suggests that raters, when involved in writing assessment, are more concerned with their own criteria to set a basis for their judgment rather than the standards provided by scale descriptors. This study sampled think aloud of eight raters who scored 15 essays in accord with Test of Written English (TWE) holistic scoring…

Descriptors: Evaluators, Writing Evaluation, Evaluation Criteria, Standards

Workplace-Based Assessment: Raters' Performance Theories and Constructs

Peer reviewed

Direct link

Govaerts, M. J. B.; Van de Wiel, M. W. J.; Schuwirth, L. W. T.; Van der Vleuten, C. P. M.; Muijtjens, A. M. M. – Advances in Health Sciences Education, 2013

Weaknesses in the nature of rater judgments are generally considered to compromise the utility of workplace-based assessment (WBA). In order to gain insight into the underpinnings of rater behaviours, we investigated how raters form impressions of and make judgments on trainee performance. Using theoretical frameworks of social cognition and…

Descriptors: Medical Education, Personnel Evaluation, Evaluators, Trainees

A Comparison of EFL Raters' Essay-Rating Processes across Two Types of Rating Scales

Peer reviewed

Direct link

Li, Hang; He, Lianzhen – Language Assessment Quarterly, 2015

This study used think-aloud protocols to compare essay-rating processes across holistic and analytic rating scales in the context of China's College English Test Band 6 (CET-6). A group of 9 experienced CET-6 raters scored the same batch of 10 CET-6 essays produced in an operational CET-6 administration twice, using both the CET-6 holistic…

Descriptors: Protocol Analysis, English (Second Language), Second Language Learning, Classification

Previous Page | Next Page »

Pages: 1 | 2

Barkaoui, Khaled	2
Abbasi, Abbas	1
Ang-Aw, Hui Teng	1
Armengol, Lurdes	1
Bell, Courtney A.	1
Bogorevich, Valeriia	1
Borowiec, Katrina	1
Brooks, Val	1
Cai, Hongwen	1
Castle, Courtney	1
Chambers, Lucy	1
Cots, Josep M.	1
Dielissen, Patrick	1
Essers, Geurt	1
Ghanbari, Nasim	1
Goh, Christine Chuen Meng	1
Govaerts, M. J. B.	1
Han, Turgay	1
He, Lianzhen	1
Heidari, Nasim	1
Jones, Nathan D.	1
Kiili, Carita	1
Kramer, Anneke	1
Laurinen, Leena	1
Leech, Tony	1
More ▼