ERIC - Search Results

Showing all 5 results

Reliability of Holistic Scoring for the MCAT Essay.

Peer reviewed

Mitchell, Karen; Anderson, Judy – Educational and Psychological Measurement, 1986

This study examined the reliability of holistic scoring for a sample of essays written during the Spring 1985 MCAT administration. Analysis of variance techniques were used to estimate the reliability of scoring and to partition score variance into that due to level differences between papers and to context-specific factors. (Author/LMO)

Descriptors: Analysis of Variance, Essay Tests, Holistic Evaluation, Medical Education

Using Generalizability Theory To Estimate the Reliability of Writing Scores Derived from Holistic and Analytical Scoring Methods.

Peer reviewed

Swartz, Carl W.; Hooper, Stephen R.; Montgomery, James W.; Wakely, Melissa B.; De Kruif, Renee E. L.; Reed, Martha; Brown, Timothy T.; Levine, Melvin D.; White, Kinnard P. – Educational and Psychological Measurement, 1999

Used generalizability theory to investigate the impact of the number of raters and the type of decision (relative versus absolute) on the reliability of writing scores. Results from 251 middle school students and 20 intermediate grade students show that reliability coefficients decline as the number of raters declines and when absolute decisions…

Descriptors: Estimation (Mathematics), Generalizability Theory, Holistic Evaluation, Intermediate Grades

A Comparison of the Reliability and Validity of Ratings of Student Performance on Essay Examinations by Professors of English and by Professors in Other Disciplines.

Peer reviewed

Michael, William B.; And Others – Educational and Psychological Measurement, 1980

Ratings of student performance for two essay questions rendered by professors of English and by professors in other disciplines were compared for reliability and concurrent validity. It was concluded that the reliability and validity of the ratings of the two groups were nearly comparable. (Author/BW)

Descriptors: College Faculty, English Instruction, Essay Tests, Higher Education

Partial Credit Analysis of Writing Ability.

Peer reviewed

De Ayala, R. J.; And Others – Educational and Psychological Measurement, 1991

To investigate the effect on item parameter estimation of pooling the raters' ratings and to examine the information content of 2 types of writing samples, the partial credit model was fit to 2,000 writing samples from secondary students that had been holistically scored. Implications for test construction are discussed. (SLD)

Descriptors: Estimation (Mathematics), Graphs, Holistic Evaluation, Mathematical Models

Gender Differences for Constructed-Response Mathematics Items.

Peer reviewed

Pomplun, Mark; Capps, Lee – Educational and Psychological Measurement, 1999

Studied gender differences in answers to constructed-response mathematics items on approximately 500 papers from grades 7 and 10 from the Kansas Assessment Program. Rubric-relevant variables were highly predictive of holistic scores and accounted for some of the gender differences, especially in grade 7. (SLD)

Descriptors: Constructed Response, Grade 10, Grade 7, High School Students