Descriptor
Evaluators | 4 |
Testing Problems | 4 |
Scoring | 3 |
Elementary School Teachers | 2 |
Interrater Reliability | 2 |
Questionnaires | 2 |
Response Style (Tests) | 2 |
Test Construction | 2 |
Test Reliability | 2 |
Behavior Rating Scales | 1 |
Bias | 1 |
Publication Type
Tests/Questionnaires | 4 |
Reports - Research | 3 |
Reports - Evaluative | 1 |
Speeches/Meeting Papers | 1 |
Crews, William E., Jr. – 1991
As part of a study of teacher evaluation of student replies to open-ended questions, a second question--the best method of determining interrater reliability--was examined. The standard method, the Pearson Product-Moment correlation, overestimated the degree of match between researchers' and teachers' scoring of tests. The simpler percent…
Descriptors: Comparative Analysis, Elementary School Teachers, Evaluation Methods, Evaluators
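The contrast this abstract draws between the Pearson correlation and a simpler percent-based measure can be illustrated with a small sketch. The scores below are invented for illustration and are not data from the study. The sketch also assumes that the "percent" measure means exact-match percent agreement, which the truncated abstract does not confirm. The function names `pearson_r` and `percent_agreement` are hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length score lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)


def percent_agreement(x, y):
    """Fraction of items on which both raters gave exactly the same score."""
    matches = sum(1 for a, b in zip(x, y) if a == b)
    return matches / len(x)


# Hypothetical scores: the teacher rates every reply one point higher
# than the researcher does, so the two never assign the same score.
researcher = [1, 2, 3, 4, 5]
teacher = [2, 3, 4, 5, 6]

r = pearson_r(researcher, teacher)
agreement = percent_agreement(researcher, teacher)

print(f"Pearson r: {r:.3f}")
print(f"Exact agreement: {agreement:.0%}")
```

In this made-up case the correlation is essentially perfect even though the raters never agree on a single score, because the Pearson coefficient reflects how consistently the scores rise and fall together and ignores a constant offset between raters.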
Arnold, Voiza; And Others – 1990
In 1990, a study was conducted at Rio Hondo College (Whittier, California) to determine if readers exhibited any bias in scoring test papers that were composed on a word processor as opposed to being written by hand. The study began with the formulation of tentative pilot study questions and the development of procedures to address them. Three…
Descriptors: Bias, Community Colleges, Evaluators, Handwriting
Goldberg, Gail Lynn; Kapinus, Barbara – 1992
The Maryland School Performance Assessment Program (MSPAP) is a relatively new, statewide performance assessment of students in grades 3, 5, and 8. When first administered in May of 1991, the MSPAP included a battery of performance assessment tasks designed to generate written or drawn responses to reading texts. This study evaluated selected…
Descriptors: Comparative Testing, Elementary Education, Elementary School Teachers, Evaluators
Shiflett, Samuel; And Others – 1985
A study was undertaken to improve the measurement of small team performance within the Army. A provisional taxonomy of team-level performance functions was field-validated; criteria and measures of the functions were developed; and their reliability was examined. The provisional taxonomy, used for observing Army field training exercises, was used…
Descriptors: Behavior Rating Scales, Classification, Evaluation Criteria, Evaluators