Descriptor
Holistic Evaluation | 5 |
Scoring | 4 |
Essay Tests | 2 |
Estimation (Mathematics) | 2 |
Scores | 2 |
Test Reliability | 2 |
Writing Evaluation | 2 |
Analysis of Variance | 1 |
College Faculty | 1 |
Constructed Response | 1 |
English Instruction | 1 |
More ▼ |
Source
Educational and Psychological… | 5 |
Author
Anderson, Judy | 1 |
Brown, Timothy T. | 1 |
Capps, Lee | 1 |
De Ayala, R. J. | 1 |
De Kruif, Renee E. L. | 1 |
Hooper, Stephen R. | 1 |
Levine, Melvin D. | 1 |
Michael, William B. | 1 |
Mitchell, Karen | 1 |
Mongomery, James W. | 1 |
Pomplun, Mark | 1 |
More ▼ |
Publication Type
Journal Articles | 5 |
Reports - Research | 5 |
Education Level
Audience
Location
Kansas | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Medical College Admission Test | 1 |
What Works Clearinghouse Rating

Mitchell, Karen; Anderson, Judy – Educational and Psychological Measurement, 1986
This study examined the reliability of holistic scoring for a sample of essays written during the Spring 1985 MCAT administration. Analysis of variance techniques was used to estimate the reliability of scoring and to partition score variance into that due to level differences between papers and to context-specific factors. (Author/LMO)
Descriptors: Analysis of Variance, Essay Tests, Holistic Evaluation, Medical Education

Swartz, Carl W.; Hooper, Stephen R.; Mongomery, James W.; Wakely, Melissa B.; De Kruif, Renee E. L.; Reed, Martha; Brown, Timothy T.; Levine, Melvin D.; White, Kinnard P. – Educational and Psychological Measurement, 1999
Used generalizability theory to investigate the impact of the number of raters and the type of decision (relative versus absolute) on the reliability of writing scores. Results from 251 middle school students and 20 intermediate grade students show that reliability coefficients decline as the number of raters declines and when absolute decisions…
Descriptors: Estimation (Mathematics), Generalizability Theory, Holistic Evaluation, Intermediate Grades

And Others; Michael, William B. – Educational and Psychological Measurement, 1980
Ratings of student performance for two essay questions rendered by professors of English and by professors in other disciplines were compared for reliability and concurrent validity. It was concluded that the reliability and validity of the ratings of the two groups were nearly comparable. (Author/BW)
Descriptors: College Faculty, English Instruction, Essay Tests, Higher Education

De Ayala, R. J.; And Others – Educational and Psychological Measurement, 1991
To investigate the effect on item parameter estimation of pooling the raters' ratings and to examine the information content of 2 types of writing samples, the partial credit model was fit to 2,000 writing samples from secondary students that had been holistically scored. Implications for test construction are discussed. (SLD)
Descriptors: Estimation (Mathematics), Graphs, Holistic Evaluation, Mathematical Models

Pomplun, Mark; Capps, Lee – Educational and Psychological Measurement, 1999
Studied gender differences in answers to constructed-response mathematics items on approximately 500 papers from grades 7 and 10 from the Kansas Assessment Program. Rubric-relevant variables were highly predictive of holistic scores and accounted for some of the gender differences, especially in grade 7. (SLD)
Descriptors: Constructed Response, Grade 10, Grade 7, High School Students