ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	14
Since 2016 (last 10 years)	19
Since 2006 (last 20 years)	20

Descriptor

Comparative Analysis	24
Decision Making	24
Evaluators	24
Evaluation Methods	11
Second Language Learning	7
Foreign Countries	6
Interrater Reliability	6
Computer Software	5
Scores	5
Scoring	5
Accuracy	4
English (Second Language)	4
Reliability	4
Second Language Instruction	4
Student Evaluation	4
Writing Evaluation	4
Artificial Intelligence	3
Correlation	3
Essays	3
Evaluation Criteria	3
Higher Education	3
Native Language	3
Protocol Analysis	3
Scoring Rubrics	3
Task Analysis	3
More ▼

Publication Type

Journal Articles	23
Reports - Research	22
Information Analyses	2
Opinion Papers	1
Reports - Evaluative	1
Speeches/Meeting Papers	1
Tests/Questionnaires	1

Education Level

Higher Education	6
Postsecondary Education	6
Adult Education	1
Elementary Education	1
Grade 4	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Location

China	2
Japan	1
Turkey	1
United Kingdom	1
United Kingdom (England)	1
United States	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	1
United States Medical…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

Implicit versus Explicit First Impressions in Performance-Based Assessment: Will Raters Overcome Their First Impressions When Learner Performance Changes?

Peer reviewed

Direct link

Timothy J. Wood; Vijay J. Daniels; Debra Pugh; Claire Touchie; Samantha Halman; Susan Humphrey-Murto – Advances in Health Sciences Education, 2024

First impressions can influence rater-based judgments but their contribution to rater bias is unclear. Research suggests raters can overcome first impressions in experimental exam contexts with explicit first impressions, but these findings may not generalize to a workplace context with implicit first impressions. The study had two aims. First, to…

Descriptors: Evaluators, Work Environment, Decision Making, Video Technology

Agreement between Visual Inspection and Objective Analysis Methods: A Replication and Extension

Peer reviewed

Direct link

Taylor, Tessa; Lanovaz, Marc J. – Journal of Applied Behavior Analysis, 2022

Behavior analysts typically rely on visual inspection of single-case experimental designs to make treatment decisions. However, visual inspection is subjective, which has led to the development of supplemental objective methods such as the conservative dual-criteria method. To replicate and extend a study conducted by Wolfe et al. (2018) on the…

Descriptors: Visual Perception, Artificial Intelligence, Decision Making, Evaluators

The Concurrent Validity of Comparative Judgement Outcomes Compared with Marks

Download full text

Gill, Tim – Research Matters, 2022

In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…

Descriptors: Comparative Analysis, Decision Making, Scripts, Standards

How Do Judges in Comparative Judgement Exercises Make Their Judgements?

Download full text

Leech, Tony; Chambers, Lucy – Research Matters, 2022

Two of the central issues in comparative judgement (CJ), which are perhaps underexplored compared to questions of the method's reliability and technical quality, are "what processes do judges use to make their decisions" and "what features do they focus on when making their decisions?" This article discusses both, in the…

Descriptors: Comparative Analysis, Decision Making, Evaluators, Reliability

Crowdsourced Adaptive Comparative Judgment: A Community-Based Solution for Proficiency Rating

Peer reviewed

Direct link

Paquot, Magali; Rubin, Rachel; Vandeweerd, Nathan – Language Learning, 2022

The main objective of this Methods Showcase Article is to show how the technique of adaptive comparative judgment, coupled with a crowdsourcing approach, can offer practical solutions to reliability issues as well as to address the time and cost difficulties associated with a text-based approach to proficiency assessment in L2 research. We…

Descriptors: Comparative Analysis, Decision Making, Language Proficiency, Reliability

Depth-Perception-Based Representation in Holistic Rating on ESL Essay Writing

Peer reviewed

Direct link

Lian Li; Jiehui Hu; Yu Dai; Ping Zhou; Wanhong Zhang – Reading & Writing Quarterly, 2024

This paper proposes to use depth perception to represent raters' decision in holistic evaluation of ESL essays, as an alternative medium to conventional form of numerical scores. The researchers verified the new method's accuracy and inter/intra-rater reliability by inviting 24 ESL teachers to perform different representations when rating 60…

Descriptors: Essays, Holistic Approach, Writing Evaluation, Accuracy

A Model-Data-Fit-Informed Approach to Score Resolution in Performance Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021

Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…

Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making

Intelligent Tutoring for Surgical Decision Making: A Planning-Based Approach

Peer reviewed

Direct link

Vannaprathip, Narumol; Haddawy, Peter; Schultheis, Holger; Suebnukarn, Siriwan – International Journal of Artificial Intelligence in Education, 2022

Virtual reality simulation has had a significant impact on training of psychomotor surgical skills, yet there is still a lack of work on its use to teach surgical decision making. This is particularly noteworthy given the recognized importance of decision making in achieving positive surgical outcomes. With the objective of filling this gap, we…

Descriptors: Intelligent Tutoring Systems, Decision Making, Surgery, Teaching Methods

Automated Assessment of Second Language Comprehensibility: Review, Training, Validation, and Generalization Studies

Peer reviewed

Direct link

Saito, Kazuya; Macmillan, Konstantinos; Kachlicka, Magdalena; Kunihara, Takuya; Minematsu, Nobuaki – Studies in Second Language Acquisition, 2023

Whereas many scholars have emphasized the relative importance of "comprehensibility" as an ecologically valid goal for L2 speech training, testing, and development, eliciting listeners' judgments is time-consuming. Following calls for research on more efficient L2 speech rating methods in applied linguistics, and growing attention toward…

Descriptors: Second Language Learning, Second Language Instruction, Interrater Reliability, Speech Communication

Comparing Machine and Human Reviewers to Evaluate the Risk of Bias in Randomized Controlled Trials

Peer reviewed

Direct link

Armijo-Olivo, Susan; Craig, Rodger; Campbell, Sandy – Research Synthesis Methods, 2020

Background: Evidence from new health technologies is growing, along with demands for evidence to inform policy decisions, creating challenges in completing health technology assessments (HTAs)/systematic reviews (SRs) in a timely manner. Software can decrease the time and burden by automating the process, but evidence validating such software is…

Descriptors: Comparative Analysis, Computer Software, Decision Making, Randomized Controlled Trials

Identifying Design Values across Countries through Adaptive Comparative Judgment

Peer reviewed

Direct link

Bartholomew, Scott Ronald; Ruesch, Emily Yoshikawa; Hartell, Eva; Strimel, Greg J. – International Journal of Technology and Design Education, 2020

Adaptive comparative judgment (ACJ) has proven to be a valid, reliable, and feasible method for assessing student performance in open-ended design scenarios. In addition to the use of ACJ for purely assessment and evaluation, research has demonstrated an opportunity to identify the design values of judges involved with the ACJ process. The…

Descriptors: Design, Evaluators, International Cooperation, Cultural Influences

Moderation of Non-Exam Assessments: Is Comparative Judgement a Practical Alternative?

Download full text

Vidal Rodeiro, Carmen; Chambers, Lucy – Research Matters, 2022

Many high-stakes qualifications include non-exam assessments that are marked by teachers. Awarding bodies then apply a moderation process to bring the marking of these assessments to an agreed standard. Comparative Judgement (CJ) is a technique where two (or more) pieces of work are compared at a time, allowing an overall rank order of work to be…

Descriptors: Evaluation Methods, Portfolios (Background Materials), Decision Making, Task Analysis

Assumptions of Speaker Ethnicity and the Effect on Ratings of Accentedness, Comprehensibility, and Intelligibility

Peer reviewed

Direct link

Lee, Bradford J.; Bailey, Justin L. – Language Awareness, 2023

While listeners tend to downgrade speakers' accent and comprehensibility when they perceive them to be from a different language community--a process known as reverse linguistic stereotyping (RLS)--research has generally relied solely on quantitative data such as Likert scale ratings. The current study sought to extend the analysis further by…

Descriptors: Likert Scales, Stereotypes, Ethnicity, Intelligibility

An Investigation of Machine Translation Output Quality and the Influencing Factors of Source Texts

Peer reviewed

Direct link

Lee, Sangmin-Michelle – ReCALL, 2022

The use of machine translation (MT) in the academic context has increased in recent years. Hence, language teachers have found it difficult to ignore MT, which has led to some concerns. Among the concerns, its accuracy has become a major factor that shapes language teachers' pedagogical decision to use MT in their language classrooms. Despite the…

Descriptors: Translation, Grammar, Second Language Learning, Second Language Instruction

AutoML Feature Engineering for Student Modeling Yields High Accuracy, but Limited Interpretability

Peer reviewed
PDF on ERIC

Download full text

Bosch, Nigel – Journal of Educational Data Mining, 2021

Automatic machine learning (AutoML) methods automate the time-consuming, feature-engineering process so that researchers produce accurate student models more quickly and easily. In this paper, we compare two AutoML feature engineering methods in the context of the National Assessment of Educational Progress (NAEP) data mining competition. The…

Descriptors: Accuracy, Learning Analytics, Models, National Competency Tests

Previous Page | Next Page »

Pages: 1 | 2

Research Matters	3
Educational and Psychological…	2
Language Testing	2
Advances in Health Sciences…	1
Educational Measurement:…	1
International Journal of…	1
International Journal of…	1
Journal of Applied Behavior…	1
Journal of Educational Data…	1
Journal of Educational…	1
Language Assessment Quarterly	1
Language Awareness	1
Language Learning	1
ReCALL	1
Reading & Writing Quarterly	1
Research Synthesis Methods	1
Studies in Educational…	1
Studies in Second Language…	1
Working Papers in TESOL &…	1
More ▼

Chambers, Lucy	2
Armijo-Olivo, Susan	1
Bailey, Justin L.	1
Baldwin, Peter	1
Barkaoui, Khaled	1
Bartholomew, Scott Ronald	1
Bosch, Nigel	1
Campbell, Sandy	1
Claire Touchie	1
Clauser, Jerome C.	1
Craig, Rodger	1
Debra Pugh	1
Gill, Tim	1
Haddawy, Peter	1
Hambleton, Ronald K.	1
Han, Chao	1
Han, Qie	1
Hartell, Eva	1
Hunsaker, Scott L.	1
Jiehui Hu	1
Kachlicka, Magdalena	1
King, Jean A.	1
Kunihara, Takuya	1
Lanovaz, Marc J.	1
More ▼