ERIC - Search Results

Showing all 4 results

Exploring the Impersonal Judgments and Personal Preferences of Raters in Rater-Mediated Assessments with Unfolding Models

Peer reviewed

Wang, Jue; Engelhard, George, Jr. – Educational and Psychological Measurement, 2019

The purpose of this study is to explore the use of unfolding models for evaluating the quality of ratings obtained in rater-mediated assessments. Two different judgmental processes can be used to conceptualize ratings: impersonal judgments and personal preferences. Impersonal judgments are typically expected in rater-mediated assessments, and…

Descriptors: Evaluative Thinking, Preferences, Evaluators, Models

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

Kappa and Rater Accuracy: Paradigms and Parameters

Peer reviewed

Conger, Anthony J. – Educational and Psychological Measurement, 2017

Drawing parallels to classical test theory, this article clarifies the difference between rater accuracy and reliability and demonstrates how category marginal frequencies affect rater agreement and Cohen's kappa. Category assignment paradigms are developed: comparing raters to a standard (index) versus comparing two raters to one another…

Descriptors: Interrater Reliability, Evaluators, Accuracy, Statistical Analysis

Evaluating Rater Accuracy in Rater-Mediated Assessments Using an Unfolding Model

Peer reviewed

Wang, Jue; Engelhard, George, Jr.; Wolfe, Edward W. – Educational and Psychological Measurement, 2016

The number of performance assessments continues to increase around the world, and it is important to explore new methods for evaluating the quality of ratings obtained from raters. This study describes an unfolding model for examining rater accuracy. Accuracy is defined as the difference between observed and expert ratings. Dichotomous accuracy…

Descriptors: Evaluators, Accuracy, Performance Based Assessment, Models
