NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 8 results Save | Export
Peer reviewed Peer reviewed
Myford, Carol M. – Applied Measurement in Education, 2002
Studied the use of descriptive graphic rating scales by 11 raters to evaluate students' work, exploring different design features. Used a Rasch-model based rating scale analysis to determine that all the continuous scales could be considered to have at least five points, and that defined midpoints did not result in higher student separation…
Descriptors: Evaluators, Rating Scales, Reliability, Test Construction
Peer reviewed Peer reviewed
Myford, Carol M.; Wolfe, Edward W. – Journal of Applied Measurement, 2002
Examined a procedure for identifying and resolving discrepancies in ratings, focusing on the third rater adjudication procedure used in scoring the Test of Spoken English. Results for 1,446 adult examinees demonstrate that implementing a discrepancy resolution procedure is not sufficient in itself for quality control monitoring. (SLD)
Descriptors: Adults, Evaluators, Quality Control, Scoring
Peer reviewed Peer reviewed
Wolfe, Edward W.; Moulder, Bradley C.; Myford, Carol M. – Journal of Applied Measurement, 2001
Describes a class of rater effects, differential rater functioning over time (DRIFT), that depicts rater-by-time interactions. Also describes Rasch measurement procedures designed to identify these types of DRIFT in rating data. Applied these procedures to simulated data to show their usefulness in classifying raters as aberrant or non-aberrant…
Descriptors: Evaluators, Interaction, Item Response Theory, Simulation
Wolfe, Edward W.; Moulder, Bradley C.; Myford, Carol M. – 1999
This paper describes a class of rater effects that depict rater-by-time interactions. This class of rater effects is referred to as differential rater functioning over time (DRIFT). This article describes several types of DRIFT (primacy/recency, differential centrality/extremism, and practice/fatigue) and Rasch measurement procedures designed to…
Descriptors: Classification, Effect Size, Evaluators, Item Response Theory
Myford, Carol M.; And Others – 1996
Developing scoring rubrics to evaluate student work was studied, concentrating on the use of intermediate points in rating scales. How scales that allow for intermediate points between defined categories should be constructed and used was explored. In the recent National Assessment of Educational Progress (NAEP) visual arts field test, researchers…
Descriptors: Evaluators, Rating Scales, Scoring, Scoring Rubrics
Peer reviewed Peer reviewed
Heller, Joan I.; Shiengold, Karen; Myford, Carol M. – Educational Assessment, 1998
Analyses of 10 raters' reasoning during think-aloud interviews provided evidence to support a model of the fundamental processes involved in rating standards-based, nonprescriptive portfolios. This process model provides a framework within which to conceptualize sound-rater reasoning and to identify reasoning that distorts the meaning of scores.…
Descriptors: Elementary Secondary Education, Evaluators, Interviews, Performance Based Assessment
Myford, Carol M. – 1991
The aesthetic judgments of experts (casting directors and high school drama teachers), theater buffs, and novices were compared as they rated high school students' videotaped performances of Shakespearean monologues. It was hypothesized that theater buffs would represent an intermediate stage on the path to developing expertise in judging acting…
Descriptors: Ability, Acting, Aesthetic Values, Art Criticism
Myford, Carol M. – 1991
The aesthetic judgments of experts (casting directors and high school drama teachers), theater buffs, and novices were compared as they rated the videotaped performances of high school students performing Shakespearean monologues. Focus was on going beyond the determination of between-judge agreement to determine whether there were objective…
Descriptors: Ability, Acting, Aesthetic Values, Art Criticism