ERIC - Search Results

Publication Date

In 2025	2
Since 2024	6
Since 2021 (last 5 years)	18
Since 2016 (last 10 years)	28
Since 2006 (last 20 years)	67

Descriptor

Comparative Analysis	108
Evaluation Methods	108
Reliability	108
Validity	50
Foreign Countries	30
Higher Education	18
Student Evaluation	17
Correlation	16
Evaluators	13
Computer Software	12
Models	12
Program Effectiveness	12
College Students	11
Evaluation Criteria	10
Scoring Rubrics	10
Decision Making	9
Psychometrics	9
Rating Scales	9
Scores	9
Data Analysis	8
Measurement Techniques	8
Predictor Variables	8
Statistical Analysis	8
Task Analysis	8
College Faculty	7
More ▼

Publication Type

Journal Articles	78
Reports - Research	60
Reports - Evaluative	20
Reports - Descriptive	11
Speeches/Meeting Papers	8
Information Analyses	7
Dissertations/Theses -…	6
Opinion Papers	6
Tests/Questionnaires	5
Numerical/Quantitative Data	2
Collected Works - Proceedings	1
More ▼

Education Level

Higher Education	27
Postsecondary Education	20
Elementary Education	5
Elementary Secondary Education	5
Secondary Education	5
Early Childhood Education	3
Middle Schools	3
Adult Education	2
High Schools	2
Junior High Schools	2
Preschool Education	2
Grade 7	1
Kindergarten	1
Primary Education	1
More ▼

Audience

Practitioners	3
Administrators	2
Policymakers	2
Researchers	2
Teachers	2

Location

United Kingdom (England)	8
Australia	4
United Kingdom	4
China	3
United States	3
Connecticut	2
Netherlands	2
New Hampshire	2
New York	2
New Zealand	2
Portugal	2
Rhode Island	2
Vermont	2
Austria	1
Belgium	1
Canada	1
European Union	1
Florida	1
Germany	1
Hong Kong	1
Malaysia	1
New Mexico	1
Singapore	1
South Africa	1
Spain	1
More ▼

Laws, Policies, & Programs

Every Student Succeeds Act…

Assessments and Surveys

National Assessment of…	2
New York State Regents…	2
College Student Experiences…	1
Dale Chall Readability Formula	1
Early Childhood Environment…	1
Flesch Kincaid Grade Level…	1
Flesch Reading Ease Formula	1
Fry Readability Formula	1
Medical College Admission Test	1
Personality Assessment…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 108 results Save | Export

Moderation of Non-Exam Assessments: A Novel Approach Using Comparative Judgement

Peer reviewed

Direct link

Lucy Chambers; Sylvia Vitello; Carmen Vidal Rodeiro – Assessment in Education: Principles, Policy & Practice, 2024

In England, some secondary-level qualifications comprise non-exam assessments which need to undergo moderation before grading. Currently, moderation is conducted at centre (school) level. This raises challenges for maintaining the standard across centres. Recent technological advances enable novel moderation methods that are no longer bound by…

Descriptors: Foreign Countries, Evaluation Methods, Comparative Analysis, Grading

Towards the Automatic Risk of Bias Assessment on Randomized Controlled Trials: A Comparison of RobotReviewer and Humans

Peer reviewed

Direct link

Yuan Tian; Xi Yang; Suhail A. Doi; Luis Furuya-Kanamori; Lifeng Lin; Joey S. W. Kwong; Chang Xu – Research Synthesis Methods, 2024

RobotReviewer is a tool for automatically assessing the risk of bias in randomized controlled trials, but there is limited evidence of its reliability. We evaluated the agreement between RobotReviewer and humans regarding the risk of bias assessment based on 1955 randomized controlled trials. The risk of bias in these trials was assessed via two…

Descriptors: Risk, Randomized Controlled Trials, Classification, Robotics

Simulating the Relationship between Nonword Repetition Performance and Vocabulary Growth in 2-Year-Olds: Evidence from the Language 0-5 Project

Peer reviewed

Direct link

Caroline F. Rowland; Amy Bidgood; Gary Jones; Andrew Jessop; Paula Stinson; Julian M. Pine; Samantha Durrant; Michelle S. Peter – Language Learning, 2025

A strong predictor of children's language is performance on non-word repetition (NWR) tasks. However, the basis of this relationship remains unknown. Some suggest that NWR tasks measure phonological working memory, which then affects language growth. Others argue that children's knowledge of language/language experience affects NWR performance. A…

Descriptors: Vocabulary Development, Comparative Analysis, Computational Linguistics, Language Skills

Students' Comparison Competencies in Geography: Results from an Explorative Assessment Study

Peer reviewed

Direct link

Marine Simon; Alexandra Budke – Journal of Geography in Higher Education, 2024

Comparison is an important geographic method and a common task in geography education. Mastering comparison is a complex competency and written comparisons are challenging tasks both for students and assessors. As yet, however, there is no set test for evaluating comparison competency nor tool for enhancing it. Moreover, little is known about…

Descriptors: Geography Instruction, Student Evaluation, Comparative Analysis, Reliability

Comparative Judgement for Evaluating Young Learners' EFL Writing Performances: Reliability and Teacher Perceptions of Holistic and Dimension-Based Judgements

Peer reviewed

Direct link

Rebecca Sickinger; Tineke Brunfaut; John Pill – Language Testing, 2025

Comparative Judgement (CJ) is an evaluation method, typically conducted online, whereby a rank order is constructed, and scores calculated, from judges' pairwise comparisons of performances. CJ has been researched in various educational contexts, though only rarely in English as a Foreign Language (EFL) writing settings, and is generally agreed to…

Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction

The Concurrent Validity of Comparative Judgement Outcomes Compared with Marks

Download full text

Gill, Tim – Research Matters, 2022

In Comparative Judgement (CJ) exercises, examiners are asked to look at a selection of candidate scripts (with marks removed) and order them in terms of which they believe display the best quality. By including scripts from different examination sessions, the results of these exercises can be used to help with maintaining standards. Results from…

Descriptors: Comparative Analysis, Decision Making, Scripts, Standards

Evaluating Large Language Models in Analysing Classroom Dialogue

Peer reviewed

Direct link

Yun Long; Haifeng Luo; Yu Zhang – npj Science of Learning, 2024

This study explores the use of Large Language Models (LLMs), specifically GPT-4, in analysing classroom dialogue--a key task for teaching diagnosis and quality improvement. Traditional qualitative methods are both knowledge- and labour-intensive. This research investigates the potential of LLMs to streamline and enhance this process. Using…

Descriptors: Classroom Communication, Computational Linguistics, Chinese, Mathematics Instruction

The Effect of Adaptivity on the Reliability Coefficient in Adaptive Comparative Judgement

Peer reviewed

Direct link

Bramley, Tom; Vitello, Sylvia – Assessment in Education: Principles, Policy & Practice, 2019

Comparative Judgement (CJ) is an increasingly widely investigated method in assessment for creating a scale, for example of the quality of essays. One area that has attracted attention in CJ studies is the optimisation of the selection of pairs of objects for judgement. One approach is known as adaptive comparative judgement (ACJ). It has been…

Descriptors: Reliability, Evaluation Methods, Comparative Analysis, Essay Tests

Judges' Views on Pairwise Comparative Judgement and Rank Ordering as Alternatives to Analytical Essay Marking

Download full text

Walland, Emma – Research Matters, 2022

In this article, I report on examiners' views and experiences of using Pairwise Comparative Judgement (PCJ) and Rank Ordering (RO) as alternatives to traditional analytical marking for GCSE English Language essays. Fifteen GCSE English Language examiners took part in the study. After each had judged 100 pairs of essays using PCJ and eight packs of…

Descriptors: Essays, Grading, Writing Evaluation, Evaluators

How Do Judges in Comparative Judgement Exercises Make Their Judgements?

Download full text

Leech, Tony; Chambers, Lucy – Research Matters, 2022

Two of the central issues in comparative judgement (CJ), which are perhaps underexplored compared to questions of the method's reliability and technical quality, are "what processes do judges use to make their decisions" and "what features do they focus on when making their decisions?" This article discusses both, in the…

Descriptors: Comparative Analysis, Decision Making, Evaluators, Reliability

Crowdsourced Adaptive Comparative Judgment: A Community-Based Solution for Proficiency Rating

Peer reviewed

Direct link

Paquot, Magali; Rubin, Rachel; Vandeweerd, Nathan – Language Learning, 2022

The main objective of this Methods Showcase Article is to show how the technique of adaptive comparative judgment, coupled with a crowdsourcing approach, can offer practical solutions to reliability issues as well as to address the time and cost difficulties associated with a text-based approach to proficiency assessment in L2 research. We…

Descriptors: Comparative Analysis, Decision Making, Language Proficiency, Reliability

Structural Variable Validation of an Online Learning Response Behavior (OLRB) Instrument: A Comparison Analysis of Three Extraction Methods of Exploratory Factor Analysis

Peer reviewed

Direct link

Azman Ong, Mohd Hanafi; Mohd Yasin, Norazlina; Ibrahim, Nur Syafikah – Asian Association of Open Universities Journal, 2022

Purpose: Measuring internal response of online learning is seen as fundamental to absorptive capacity which stimulates knowledge assimilation. However, the evaluation of practice and research of validated instruments that could effectively measure online learning response behavior is limited. Thus, in this study, a new instrument was designed…

Descriptors: Online Courses, Student Surveys, Student Attitudes, Factor Analysis

Fine-Tuning the Standard Setting of Objective Structured Practical Examinations in Clinical Anatomy

Peer reviewed

Direct link

Dissabandara, Lakal O.; Nawaratna, Sujeevi; Nirthanan, Selvanayagam – Anatomical Sciences Education, 2023

The objective structured practical examination (OSPE) is a reliable assessment of practical skills in anatomy teaching. It is often administered as low-stake assessments to track progress at multiple time points in anatomy curricula. Standard-setting OSPEs to derive a pass mark and to ensure assessment quality and rigor is a complex task. This…

Descriptors: Standard Setting, Anatomy, Medical Education, Medical Schools

Moderation of Non-Exam Assessments: Is Comparative Judgement a Practical Alternative?

Download full text

Vidal Rodeiro, Carmen; Chambers, Lucy – Research Matters, 2022

Many high-stakes qualifications include non-exam assessments that are marked by teachers. Awarding bodies then apply a moderation process to bring the marking of these assessments to an agreed standard. Comparative Judgement (CJ) is a technique where two (or more) pieces of work are compared at a time, allowing an overall rank order of work to be…

Descriptors: Evaluation Methods, Portfolios (Background Materials), Decision Making, Task Analysis

Reproducibility of Dual-Microphone Voice Range Profile Equipment

Peer reviewed

Direct link

Printz, Trine; Pedersen, Ellen Raben; Juhl, Peter; Nielsen, Troels; Grøntved, Ågot Møller; Godballe, Christian – Journal of Speech, Language, and Hearing Research, 2017

Purpose: The aim of this study was to add further knowledge about the usefulness of the Voice Range Profile (VRP) assessment in clinical settings and research by analyzing VRP dual-microphone equipment precision, reliability, and room effect. Method: Test-retest studies were conducted in an anechoic chamber and an office: (a) comparing sound…

Descriptors: Audio Equipment, Reliability, Accuracy, Comparative Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

ProQuest LLC	6
Research Matters	4
Journal of Speech, Language,…	3
Social Indicators Research	3
Assessment & Evaluation in…	2
Assessment in Education:…	2
Educational Research	2
Educational and Psychological…	2
International Journal of…	2
Journal of Communication…	2
Language Learning	2
Psychological Assessment	2
Quality Assurance in…	2
Academic Medicine	1
Advances in Language and…	1
American Journal of Evaluation	1
Anatomical Sciences Education	1
Asia Pacific Education Review	1
Asian Association of Open…	1
Assessment and Evaluation in…	1
Australian Educational…	1
Behavior Modification	1
Behavioral Disorders	1
Bulletin of the Council for…	1
College Student Experiences…	1
More ▼

Chambers, Lucy	2
Darling-Hammond, Linda	2
Mott, Michael S.	2
Schultz, Douglas G.	2
Abdullah, Firdaus	1
Akbari, Alireza	1
Alexandra Budke	1
Alfonso, Vincent C.	1
Allam, Reynald	1
Alsree, Zubaida	1
Amy Bidgood	1
Anderson, Ronald E.	1
Andrew Jessop	1
Apple, Kristen	1
Armstrong, Elizabeth	1
Arneson, Brian Todd	1
Aziz, Anealka	1
Azman Ong, Mohd Hanafi	1
Bacon, Donald R.	1
Barth, Amy E.	1
Berridge, Damon	1
Bless, Diane M.	1
Bosch, Emma	1
Bramley, Tom	1
More ▼