Showing all 5 results
Peer reviewed
Timmerman, Briana E. Crotwell; Strickland, Denise C.; Johnson, Robert L.; Payne, John R. – Assessment & Evaluation in Higher Education, 2011
We developed a rubric for measuring students' ability to reason and write scientifically. The Rubric for Science Writing (Rubric) was tested in a variety of undergraduate biology laboratory courses (total n = 142 laboratory reports) using science graduate students (teaching assistants) as raters. Generalisability analysis indicates that the Rubric…
Descriptors: Graduate Students, Science Laboratories, Biology, Writing Skills
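The snippet above only names the generalisability analysis; as a rough illustration (not the authors' procedure, and with an invented score matrix rather than the 142 laboratory reports), a dependability (phi) coefficient for a fully crossed reports-by-raters design can be estimated from variance components like this:

```python
import numpy as np

# Hypothetical scores: rows = laboratory reports, columns = raters (invented data).
scores = np.array([
    [3, 4, 3],
    [2, 2, 3],
    [4, 4, 4],
    [1, 2, 1],
    [3, 3, 2],
], dtype=float)

n_p, n_r = scores.shape              # reports (persons) and raters
grand = scores.mean()
p_means = scores.mean(axis=1)        # per-report means
r_means = scores.mean(axis=0)        # per-rater means

# Sums of squares for a fully crossed p x r design with one observation per cell.
ss_p = n_r * ((p_means - grand) ** 2).sum()
ss_r = n_p * ((r_means - grand) ** 2).sum()
ss_e = ((scores - grand) ** 2).sum() - ss_p - ss_r

ms_p = ss_p / (n_p - 1)
ms_r = ss_r / (n_r - 1)
ms_e = ss_e / ((n_p - 1) * (n_r - 1))

# Variance component estimates (negative estimates truncated at zero).
var_e = ms_e
var_p = max((ms_p - ms_e) / n_r, 0.0)
var_r = max((ms_r - ms_e) / n_p, 0.0)

# Dependability (phi) coefficient for a design with n_r raters:
# absolute error includes both the rater and residual components.
phi = var_p / (var_p + (var_r + var_e) / n_r)
print(f"Estimated phi with {n_r} raters: {phi:.2f}")
```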
Peer reviewed
Penny, Jim; Johnson, Robert L.; Gordon, Belita – Journal of Experimental Education, 2000
Used an analytic rubric to score 120 writing samples from Georgia's 11th grade writing assessment. Raters augmented scores by adding a "+" or "-" to the score. Results indicate that this method of augmentation tends to improve most indices of interrater reliability, although the percentage of exact and adjacent agreement…
Descriptors: High School Students, High Schools, Interrater Reliability, Scoring Rubrics
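As a hypothetical sketch of how augmented scores might be compared across raters (the numeric mapping of "+" and "-" and the data are assumptions, not taken from the study), exact and adjacent agreement can be tallied like this:

```python
# Assumed mapping for illustration: a "+" nudges the rubric score up by a third
# of a point, a "-" nudges it down.
AUGMENT = {"-": -1/3, "": 0.0, "+": +1/3}

def to_numeric(augmented_score: str) -> float:
    """Convert a score such as '3+' or '4-' to a numeric value."""
    if augmented_score[-1] in "+-":
        base, sign = augmented_score[:-1], augmented_score[-1]
    else:
        base, sign = augmented_score, ""
    return int(base) + AUGMENT[sign]

# Invented augmented scores from two raters on the same five essays.
rater_a = ["3+", "2", "4-", "3", "1+"]
rater_b = ["3",  "2", "4",  "2", "1+"]

pairs = [(to_numeric(a), to_numeric(b)) for a, b in zip(rater_a, rater_b)]

# Exact agreement: identical augmented scores.
# Adjacent agreement: scores within one whole rubric point of each other.
exact = sum(a == b for a, b in pairs) / len(pairs)
adjacent = sum(abs(a - b) <= 1 for a, b in pairs) / len(pairs)

print(f"exact agreement: {exact:.0%}, adjacent agreement: {adjacent:.0%}")
```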
Peer reviewed
Johnson, Robert L.; Penny, James; Gordon, Belita – Applied Measurement in Education, 2000
Studied four forms of score resolution used by testing agencies and investigated the effect that each has on the interrater reliability associated with the resulting operational scores. Results, based on 120 essays from the Georgia High School Writing Test, show some forms of resolution to be associated with higher reliability and some associated…
Descriptors: Essay Tests, High School Students, High Schools, Interrater Reliability
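The four resolution forms examined in the article are not named in this snippet; the sketch below illustrates two generic resolution strategies (averaging the original raters versus deferring to a third rater when scores are discrepant) purely to show what score resolution involves. The threshold and function names are hypothetical.

```python
def resolve_by_mean(score_1: float, score_2: float) -> float:
    """Operational score taken as the average of the two original raters."""
    return (score_1 + score_2) / 2

def resolve_by_third_rater(score_1: float, score_2: float,
                           third_score: float,
                           discrepancy: float = 2.0) -> float:
    """If the original scores differ by `discrepancy` points or more,
    use a third rater's score; otherwise average the originals."""
    if abs(score_1 - score_2) >= discrepancy:
        return third_score
    return resolve_by_mean(score_1, score_2)

# Example: the raters disagree by two points, so the third rater decides.
print(resolve_by_third_rater(2, 4, third_score=3))   # -> 3
print(resolve_by_mean(3, 4))                          # -> 3.5
```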
Peer reviewed
Penny, Jim; Johnson, Robert L.; Gordon, Belita – Assessing Writing, 2000
Defines a two-stage process by which a holistic rubric is applied to the assessment of open-ended items, such as writing samples. Indicates that the use of rating augmentation can improve the inter-rater reliability of holistic assessments, as indicated by generalizability phi coefficients, correlation coefficients, and percent agreement indices.…
Descriptors: Grade 5, Holistic Evaluation, Intermediate Grades, Reliability
Peer reviewed
Johnson, Robert L.; Penny, James; Gordon, Belita; Shumate, Steven R.; Fisher, Steven P. – Language Assessment Quarterly, 2005
Many studies have indicated that at least 2 raters should score writing assessments to improve interrater reliability. However, even for assessments that characteristically demonstrate high levels of rater agreement, 2 raters of the same essay can occasionally report different, or discrepant, scores. If a single score, typically referred to as an…
Descriptors: Interrater Reliability, Scores, Evaluation, Reliability