ERIC - Search Results

Publication Date

In 2026	0
Since 2025	4
Since 2022 (last 5 years)	19
Since 2017 (last 10 years)	51
Since 2007 (last 20 years)	125

Descriptor

Evaluators	238
Models	238
Evaluation Methods	123
Program Evaluation	92
Foreign Countries	33
Higher Education	32
Decision Making	30
Evaluation	27
Evaluation Criteria	23
Feedback (Response)	21
Elementary Secondary Education	20
Case Studies	18
Data Analysis	18
Interrater Reliability	18
Academic Achievement	17
Accuracy	15
Data Collection	15
Evaluation Needs	15
Educational Assessment	14
Item Response Theory	14
Research Methodology	14
Comparative Analysis	13
Evaluation Utilization	13
Professional Development	13
Program Effectiveness	13
More ▼

Education Level

Higher Education	27
Postsecondary Education	19
Elementary Secondary Education	10
Secondary Education	7
High Schools	5
Elementary Education	3
Early Childhood Education	2
Grade 7	2
Grade 8	2
Grade 4	1
Grade 9	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Preschool Education	1
Two Year Colleges	1
More ▼

Audience

Researchers	7
Policymakers	4
Administrators	3
Support Staff	3
Counselors	2
Media Staff	2
Practitioners	2
Teachers	2
Students	1

Location

Rhode Island	6
Texas	5
California	3
Canada	3
New York	3
Oregon	3
Australia	2
Botswana	2
Kentucky	2
Massachusetts	2
Michigan	2
Nebraska	2
Netherlands	2
New Mexico	2
Ohio (Cincinnati)	2
United States	2
Virginia	2
Washington	2
Afghanistan	1
Arizona	1
Arkansas	1
Asia	1
China	1
Delaware	1
Denmark	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	2
No Child Left Behind Act 2001	2
Americans with Disabilities…	1
Education for All Handicapped…	1
Elementary and Secondary…	1
Higher Education Act Title IX	1
Race to the Top	1
Rehabilitation Act 1973	1
Womens Educational Equity Act	1

Assessments and Surveys

National Assessment of…	1
Test of English as a Foreign…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 238 results Save | Export

Employing a Hierarchical Rater Models for Automated Scoring: Scope Review on the Application in Educational Assessment

Peer reviewed
PDF on ERIC

Download full text

Direct link

Akif Avcu – Malaysian Online Journal of Educational Technology, 2025

This scope-review presents the milestones of how Hierarchical Rater Models (HRMs) become operable to used in automated essay scoring (AES) to improve instructional evaluation. Although essay evaluations--a useful instrument for evaluating higher-order cognitive abilities--have always depended on human raters, concerns regarding rater bias,…

Descriptors: Automation, Scoring, Models, Educational Assessment

An Analysis of the Effect of Graph Construction Disclaimers on Visual Analysis

Peer reviewed

Direct link

Keith C. Radley; Evan H. Dart – Journal of Behavioral Education, 2025

Recent research has indicated that the manner in which single-case data are typically displayed for visual analysis may influence rater decisions regarding the effect of an intervention. Subsequently, researchers have encouraged adherence to a standard assembly for linear graphs in order to control these effects. Others, however, have encouraged…

Descriptors: Graphs, Research Design, Visual Aids, Data Analysis

Planning Missing Data Designs for Human Ratings in Creativity Research: A Practical Guide

Peer reviewed

Direct link

Boris Forthmann; Benjamin Goecke; Roger E. Beaty – Creativity Research Journal, 2025

Human ratings are ubiquitous in creativity research. Yet, the process of rating responses to creativity tasks -- typically several hundred or thousands of responses, per rater -- is often time-consuming and expensive. Planned missing data designs, where raters only rate a subset of the total number of responses, have been recently proposed as one…

Descriptors: Creativity, Research, Researchers, Research Methodology

IRT Observed-Score Equating for Rater-Mediated Assessments Using a Hierarchical Rater Model

Peer reviewed

Direct link

Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025

While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…

Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity

Exploring Difficult-to-Score Essays with a Hyperbolic Cosine Accuracy Model and Coh-Metrix Indices

Peer reviewed

Direct link

Wang, Jue; Engelhard, George; Combs, Trenton – Journal of Experimental Education, 2023

Unfolding models are frequently used to develop scales for measuring attitudes. Recently, unfolding models have been applied to examine rater severity and accuracy within the context of rater-mediated assessments. One of the problems in applying unfolding models to rater-mediated assessments is that the substantive interpretations of the latent…

Descriptors: Writing Evaluation, Scoring, Accuracy, Computational Linguistics

Effects of Using Double Ratings as Item Scores on IRT Proficiency Estimation

Peer reviewed

Direct link

Song, Yoon Ah; Lee, Won-Chan – Applied Measurement in Education, 2022

This article presents the performance of item response theory (IRT) models when double ratings are used as item scores over single ratings when rater effects are present. Study 1 examined the influence of the number of ratings on the accuracy of proficiency estimation in the generalized partial credit model (GPCM). Study 2 compared the accuracy of…

Descriptors: Item Response Theory, Item Analysis, Scores, Accuracy

Evaluating Quadratic Weighted Kappa as the Standard Performance Metric for Automated Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Doewes, Afrizal; Kurdhi, Nughthoh Arfawi; Saxena, Akrati – International Educational Data Mining Society, 2023

Automated Essay Scoring (AES) tools aim to improve the efficiency and consistency of essay scoring by using machine learning algorithms. In the existing research work on this topic, most researchers agree that human-automated score agreement remains the benchmark for assessing the accuracy of machine-generated scores. To measure the performance of…

Descriptors: Essays, Writing Evaluation, Evaluators, Accuracy

An Item Response Theory Approach to Enhance Peer Assessment Effectiveness in Massive Open Online Courses

Peer reviewed

Direct link

Nakayama, Minoru; Sciarrone, Filippo; Temperini, Marco; Uto, Masaki – International Journal of Distance Education Technologies, 2022

Massive open on-line courses (MOOCs) are effective and flexible resources to educate, train, and empower populations. Peer assessment (PA) provides a powerful pedagogical strategy to support educational activities and foster learners' success, also where a huge number of learners is involved. Item response theory (IRT) can model students'…

Descriptors: Item Response Theory, Peer Evaluation, MOOCs, Models

Evaluation Is Creation: Self and Social Judgments of Creativity across the Four-C Model

Peer reviewed

Direct link

Denis Dumas; James C. Kaufman – Educational Psychology Review, 2024

Who should evaluate the originality and task-appropriateness of a given idea has been a perennial debate among psychologists of creativity. Here, we argue that the most relevant evaluator of a given idea depends crucially on the level of expertise of the person who generated it. To build this argument, we draw on two complimentary theoretical…

Descriptors: Decision Making, Creativity, Task Analysis, Psychologists

Exploring the Impersonal Judgments and Personal Preferences of Raters in Rater-Mediated Assessments with Unfolding Models

Peer reviewed

Direct link

Wang, Jue; Engelhard, George, Jr. – Educational and Psychological Measurement, 2019

The purpose of this study is to explore the use of unfolding models for evaluating the quality of ratings obtained in rater-mediated assessments. Two different judgmental processes can be used to conceptualize ratings: impersonal judgments and personal preferences. Impersonal judgments are typically expected in rater-mediated assessments, and…

Descriptors: Evaluative Thinking, Preferences, Evaluators, Models

A School of the Arts Embedded Evaluation: Defining a System of Values as Curricular Design

Direct link

Renato Britto Ferreira – ProQuest LLC, 2024

Modern-day educators often share a sense they lack voice and agency with school administration. A classic example is curriculum development, where third-party designers develop uncontextualized curricula, and teachers then must implement the design even if inefficient and ineffective. What happens if this identical situation occurs at the program…

Descriptors: Art Education, School Administration, Curriculum Design, Program Design

Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mengxue; Heffernan, Neil; Lan, Andrew – International Educational Data Mining Society, 2023

Automated scoring of student responses to open-ended questions, including short-answer questions, has great potential to scale to a large number of responses. Recent approaches for automated scoring rely on supervised learning, i.e., training classifiers or fine-tuning language models on a small number of responses with human-provided score…

Descriptors: Scoring, Computer Assisted Testing, Mathematics Instruction, Mathematics Tests

Examining Differential Rater Functioning Using a Between-Subgroup Outfit Approach

Peer reviewed

Direct link

Wind, Stefanie A.; Sebok-Syer, Stefanie S. – Journal of Educational Measurement, 2019

When practitioners use modern measurement models to evaluate rating quality, they commonly examine rater fit statistics that summarize how well each rater's ratings fit the expectations of the measurement model. Essentially, this approach involves examining the unexpected ratings that each misfitting rater assigned (i.e., carrying out analyses of…

Descriptors: Measurement, Models, Evaluators, Simulation

Forced-Choice Ranking Models for Raters' Ranking Data

Peer reviewed

Direct link

Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022

To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…

Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences

The Polymeric Model of School Evaluation in the Era of Accountability

Peer reviewed

Direct link

Zamir, Sara – Quality Assurance in Education: An International Perspective, 2019

Purpose: As the school evaluator's role is multifaceted and the school elevator is the school principal's subordinate, this paper aims to present the school evaluator's complex conduct to achieve a better understanding of his or her functioning. Design/methodology/approach: Theoretical paper. Findings: The two critical dimensions connected to the…

Descriptors: Institutional Evaluation, Accountability, Schools, Evaluators

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 16

American Journal of Evaluation	15
Educational Evaluation and…	9
ProQuest LLC	8
New Directions for Evaluation	7
Evaluation and Program…	6
New Directions for Program…	6
Rhode Island Department of…	6
Evaluation Review	5
Educational and Psychological…	4
International Educational…	4
Journal of Educational…	4
Applied Measurement in…	3
Canadian Journal of Program…	2
ETS Research Report Series	2
Evaluation and Program…	2
Journal of Educational and…	2
Journal of MultiDisciplinary…	2
Language Testing	2
Online Submission	2
Review of Educational Research	2
AIDS Education and Prevention	1
Administration for Children &…	1
Advances in Health Sciences…	1
American Journal of Distance…	1
Applied Developmental Science	1
More ▼

Bhola, H. S.	3
Brown, Robert D.	3
Engelhard, George, Jr.	3
Scriven, Michael	3
Wang, Jue	3
Wind, Stefanie A.	3
Alkin, Marvin C.	2
Caulley, Darrel N.	2
Coryn, Chris L. S.	2
Cousins, J. Bradley	2
Dowdy, Irene	2
Felix, Joseph L.	2
Goe, Laura	2
Good, H. M.	2
Holdheide, Lynn	2
King, Jean A.	2
Mertens, Donna M.	2
Miller, Tricia	2
Morris, Michael	2
Raymond, Mark R.	2
Scheirer, Mary Ann	2
St. John, Mark	2
Thompson, Bruce	2
Urban, Jennifer Brown	2
More ▼

Journal Articles	138
Reports - Research	91
Speeches/Meeting Papers	41
Reports - Evaluative	40
Reports - Descriptive	37
Opinion Papers	22
Guides - Non-Classroom	21
Information Analyses	12
Dissertations/Theses -…	8
Reports - General	8
Tests/Questionnaires	6
Books	5
Collected Works - Proceedings	3
Collected Works - Serials	3
Reference Materials -…	3
Collected Works - General	1
Collected Works - Serial	1
Guides - Classroom - Learner	1
Numerical/Quantitative Data	1
Reports -…	1
More ▼