ERIC - Search Results

Descriptor

Evaluators	8
Performance Based Assessment	3
Rating Scales	3
Scoring	3
Student Evaluation	3
Test Construction	3
Ability	2
Acting	2
Aesthetic Values	2
Art Criticism	2
Comparative Analysis	2
Drama	2
Evaluation Methods	2
High School Students	2
High Schools	2
Interrater Reliability	2
Item Response Theory	2
Matched Groups	2
Reliability	2
Secondary School Teachers	2
Test Use	2
Value Judgment	2
Videotape Recordings	2
Adults	1
Classification	1

Source

Journal of Applied Measurement	2
Applied Measurement in…	1
Educational Assessment	1

Author

Myford, Carol M.	8
Wolfe, Edward W.	3
Moulder, Bradley C.	2
Heller, Joan I.	1
Sheingold, Karen	1

Publication Type

Reports - Research	5
Speeches/Meeting Papers	5
Journal Articles	4
Reports - Descriptive	2
Reports - Evaluative	1

Showing all 8 results

Investigating Design Features of Descriptive Graphic Rating Scales.

Peer reviewed

Myford, Carol M. – Applied Measurement in Education, 2002

Studied the use of descriptive graphic rating scales by 11 raters to evaluate students' work, exploring different design features. Used a Rasch-model based rating scale analysis to determine that all the continuous scales could be considered to have at least five points, and that defined midpoints did not result in higher student separation…

Descriptors: Evaluators, Rating Scales, Reliability, Test Construction

When Raters Disagree, Then What: Examining a Third-rating Discrepancy Resolution Procedure and Its Utility for Identifying Unusual Patterns of Ratings.

Peer reviewed

Myford, Carol M.; Wolfe, Edward W. – Journal of Applied Measurement, 2002

Examined a procedure for identifying and resolving discrepancies in ratings, focusing on the third rater adjudication procedure used in scoring the Test of Spoken English. Results for 1,446 adult examinees demonstrate that implementing a discrepancy resolution procedure is not sufficient in itself for quality control monitoring. (SLD)

Descriptors: Adults, Evaluators, Quality Control, Scoring

Detecting Differential Rater Functioning over Time (DRIFT) Using a Rasch Multi-faceted Rating Scale Model.

Peer reviewed

Wolfe, Edward W.; Moulder, Bradley C.; Myford, Carol M. – Journal of Applied Measurement, 2001

Describes a class of rater effects, differential rater functioning over time (DRIFT), that depicts rater-by-time interactions. Also describes Rasch measurement procedures designed to identify these types of DRIFT in rating data. Applied these procedures to simulated data to show their usefulness in classifying raters as aberrant or non-aberrant…

Descriptors: Evaluators, Interaction, Item Response Theory, Simulation

Detecting Differential Rater Functioning over Time (DRIFT) Using a Rasch Multi-Faceted Rating Scale Model.

Wolfe, Edward W.; Moulder, Bradley C.; Myford, Carol M. – 1999

This paper describes a class of rater effects that depict rater-by-time interactions. This class of rater effects is referred to as differential rater functioning over time (DRIFT). This article describes several types of DRIFT (primacy/recency, differential centrality/extremism, and practice/fatigue) and Rasch measurement procedures designed to…

Descriptors: Classification, Effect Size, Evaluators, Item Response Theory

Constructing Scoring Rubrics: Using "Facets" To Study Design Features of Descriptive Rating Scales.

Myford, Carol M.; And Others – 1996

Developing scoring rubrics to evaluate student work was studied, concentrating on the use of intermediate points in rating scales. How scales that allow for intermediate points between defined categories should be constructed and used was explored. In the recent National Assessment of Educational Progress (NAEP) visual arts field test, researchers…

Descriptors: Evaluators, Rating Scales, Scoring, Scoring Rubrics

Reasoning about Evidence in Portfolios: Cognitive Foundations for Valid and Reliable Assessment.

Peer reviewed

Heller, Joan I.; Sheingold, Karen; Myford, Carol M. – Educational Assessment, 1998

Analyses of 10 raters' reasoning during think-aloud interviews provided evidence to support a model of the fundamental processes involved in rating standards-based, nonprescriptive portfolios. This process model provides a framework within which to conceptualize sound rater reasoning and to identify reasoning that distorts the meaning of scores.…

Descriptors: Elementary Secondary Education, Evaluators, Interviews, Performance Based Assessment

Judging Acting Ability: The Transition from Novice to Expert.

Myford, Carol M. – 1991

The aesthetic judgments of experts (casting directors and high school drama teachers), theater buffs, and novices were compared as they rated high school students' videotaped performances of Shakespearean monologues. It was hypothesized that theater buffs would represent an intermediate stage on the path to developing expertise in judging acting…

Descriptors: Ability, Acting, Aesthetic Values, Art Criticism

Assessment of Acting Ability.

Myford, Carol M. – 1991

The aesthetic judgments of experts (casting directors and high school drama teachers), theater buffs, and novices were compared as they rated the videotaped performances of high school students performing Shakespearean monologues. Focus was on going beyond the determination of between-judge agreement to determine whether there were objective…

Descriptors: Ability, Acting, Aesthetic Values, Art Criticism