Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 10 |
Descriptor
Decision Making | 11 |
Evaluators | 11 |
Student Evaluation | 11 |
Evaluation Methods | 5 |
Foreign Countries | 5 |
College Faculty | 4 |
Comparative Analysis | 4 |
Reliability | 4 |
Scoring | 4 |
Second Language Learning | 4 |
English (Second Language) | 3 |
Author
Bartholomew, Scott Ronald | 1 |
Biancarosa, Gina | 1 |
Bortnick, Barrie D. | 1 |
Chambers, Lucy | 1 |
Cummings, Kelli D. | 1 |
Dancza, Karina M. | 1 |
Davis, Lawrence Edward | 1 |
Han, Chao | 1 |
Han, Qie | 1 |
Han, Turgay | 1 |
Hartell, Eva | 1 |
Publication Type
Journal Articles | 9 |
Reports - Research | 8 |
Dissertations/Theses -… | 1 |
Information Analyses | 1 |
Reports - Descriptive | 1 |
Education Level
Higher Education | 7 |
Postsecondary Education | 7 |
Elementary Education | 1 |
Grade 5 | 1 |
Grade 6 | 1 |
Intermediate Grades | 1 |
Middle Schools | 1 |
Location
China | 1 |
Japan | 1 |
Singapore | 1 |
Turkey | 1 |
United Kingdom (England) | 1 |
United Kingdom (Glasgow) | 1 |
United States | 1 |
Assessments and Surveys
Test of English as a Foreign… | 1 |
Yuichiro Yokouchi – Language Testing in Asia, 2025
The performance decision tree (PDT; Fulcher et al., 2011) is a rubric style that is applicable to performance assessment, with origins in Upshur and Turner's (1995) empirically derived binary-choice, boundary-definition (EBB) scale. It is easier for raters to assess performance by evaluating multiple binary-choice descriptors. Additionally,…
Descriptors: Scoring Rubrics, Second Language Learning, Second Language Instruction, Language Teachers
Bartholomew, Scott Ronald; Ruesch, Emily Yoshikawa; Hartell, Eva; Strimel, Greg J. – International Journal of Technology and Design Education, 2020
Adaptive comparative judgment (ACJ) has proven to be a valid, reliable, and feasible method for assessing student performance in open-ended design scenarios. In addition to the use of ACJ for purely assessment and evaluation, research has demonstrated an opportunity to identify the design values of judges involved with the ACJ process. The…
Descriptors: Design, Evaluators, International Cooperation, Cultural Influences
Vidal Rodeiro, Carmen; Chambers, Lucy – Research Matters, 2022
Many high-stakes qualifications include non-exam assessments that are marked by teachers. Awarding bodies then apply a moderation process to bring the marking of these assessments to an agreed standard. Comparative Judgement (CJ) is a technique where two (or more) pieces of work are compared at a time, allowing an overall rank order of work to be…
Descriptors: Evaluation Methods, Portfolios (Background Materials), Decision Making, Task Analysis
Han, Chao; Xiao, Xiaoyan – Language Testing, 2022
The quality of sign language interpreting (SLI) is a gripping construct among practitioners, educators and researchers, calling for reliable and valid assessment. There has been a diverse array of methods in the extant literature to measure SLI quality, ranging from traditional error analysis to recent rubric scoring. In this study, we want to…
Descriptors: Comparative Analysis, Sign Language, Deaf Interpreting, Evaluators
Tan, Chin Pei; Howes, Dora; Tan, Rendell K. W.; Dancza, Karina M. – Assessment & Evaluation in Higher Education, 2022
Interactive oral assessments demonstrate potential to develop graduate attributes such as critical thinking, professional communication and collaborative skills in students through authentic simulation of workplace scenarios. This study captured the design, delivery and evaluation of interactive oral assessments across three programmes --…
Descriptors: Oral Language, Interaction, Critical Thinking, Communication Skills
Han, Qie – Working Papers in TESOL & Applied Linguistics, 2016
This literature review attempts to survey representative studies within the context of L2 speaking assessment that have contributed to the conceptualization of rater cognition. Two types of studies are looked at: 1) studies that examine "how" raters differ (and sometimes agree) in their cognitive processes and rating behaviors, in terms…
Descriptors: Second Language Learning, Student Evaluation, Evaluators, Speech Tests
Reed, Deborah K.; Cummings, Kelli D.; Schaper, Andrew; Lynn, Devon; Biancarosa, Gina – Reading and Writing: An Interdisciplinary Journal, 2019
Informal reading inventories (IRI) and curriculum-based measures of reading (CBM-R) have continued importance in instructional planning, but raters have exhibited difficulty in accurately identifying students' miscues. To identify and tabulate scorers' mismarkings, this study employed examiners and raters who scored 15,051 words from 108 passage…
Descriptors: Accuracy, Miscue Analysis, Grade 5, Grade 6
Han, Turgay – International Journal of Progressive Education, 2017
The aim of this study is to examine the variability in and reliability of scores assigned to different quality EFL compositions by EFL instructors and their rating behaviors. Using a mixed research design, quantitative data were collected from EFL instructors' ratings of 30 compositions of three different qualities using a holistic scoring rubric.…
Descriptors: English (Second Language), Writing Evaluation, Scores, Expertise
Naumann, Fiona L.; Marshall, Stephen; Shulruf, Boaz; Jones, Philip D. – Advances in Health Sciences Education, 2016
Exercise physiology courses have transitioned to competency based, forcing Universities to rethink assessment to ensure students are competent to practice. This study built on earlier research to explore rater cognition, capturing factors that contribute to assessor decision making about students' competency. The aims were to determine the source…
Descriptors: Exercise Physiology, Evaluators, Competency Based Education, Evaluation Methods
Davis, Lawrence Edward – ProQuest LLC, 2012
Speaking performance tests typically employ raters to produce scores; accordingly, variability in raters' scoring decisions has important consequences for test reliability and validity. One such source of variability is the rater's level of expertise in scoring. Therefore, it is important to understand how raters' performance is influenced by…
Descriptors: Evaluators, Expertise, Scores, Second Language Learning
Bortnick, Barrie D.; And Others – 1976
Four experiential learning programs that are in the early stages of development are described. The report from the Consortium of the California State University and Colleges details the efforts of a traditional, decentralized system to introduce flexibility into the use of examinations for crediting prior learning. A description of a large,…
Descriptors: College Administration, College Credits, College Faculty, College Students