ERIC - Search Results

Showing all 10 results

Using Standard-Setting Data to Establish Cutoff Scores.

Peer reviewed

Geisinger, Kurt F. – Educational Measurement: Issues and Practice, 1991

Ways to use standard-setting data to adjust cutoff scores on examinations are reviewed. Ten sources of information to be used in determining standards are listed. The decision to modify passing scores should be based on these types of information and consideration of adverse impact or rating process irregularities. (SLD)

Descriptors: Cutting Scores, Evaluation Utilization, Evaluators, Interrater Reliability

Factors Influencing Intrajudge Consistency during Standard-Setting.

Peer reviewed

Plake, Barbara S.; And Others – Educational Measurement: Issues and Practice, 1991

Possible sources of intrajudge inconsistency in standard setting are reviewed, and approaches are presented to improve the accuracy of rating. Procedures for providing judges with feedback through discussion or computerized communication are discussed. Monitoring and maintaining judges' consistency throughout the rating process are essential. (SLD)

Descriptors: Computer Assisted Instruction, Evaluators, Examiners, Feedback

Analysis of Interrater Reliability on the Evaluation of Answers to Open-Ended Questions.

Crews, William E., Jr. – 1991

As part of a study of teacher evaluation of student replies to open-ended questions, a second question--the best method of determining interrater reliability--was examined. The standard method, the Pearson Product-Moment correlation, overestimated the degree of match between researchers' and teachers' scoring of tests. The simpler percent…

Descriptors: Comparative Analysis, Elementary School Teachers, Evaluation Methods, Evaluators

Selection of Judges for Standard-Setting.

Peer reviewed

Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1991

Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)

Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners

Do Students Get Higher Scores on Their Word-Processed Papers? A Study of Bias in Scoring Hand-Written vs. Word-Processed Papers.

Arnold, Voiza; And Others – 1990

In 1990, a study was conducted at Rio Hondo College (Whittier, California) to determine if readers exhibited any bias in scoring test papers that were composed on a word processor as opposed to being written by hand. The study began with the formulation of tentative pilot study questions and the development of procedures to address them. Three…

Descriptors: Bias, Community Colleges, Evaluators, Handwriting

Sources of Variability in the Angoff Standard-Setting Process.

Halpin, Glennelle; McLean, James E. – 1991

Although the standard-setting method of W. H. Angoff (1971) has broad-based support in the research literature, inconsistencies in the resulting standards do occur. Sources of these inconsistencies are examined in a study of judges, competencies (items), rounds (replications), and the interactions among them. A modified Angoff approach was used to…

Descriptors: Analysis of Variance, Error of Measurement, Evaluators, High Schools

Problematic Responses to Reading Performance Assessment Tasks: Sources and Implications.

Goldberg, Gail Lynn; Kapinus, Barbara – 1992

The Maryland School Performance Assessment Program (MSPAP) is a relatively new, statewide performance assessment of students in grades 3, 5, and 8. When first administered in May of 1991, the MSPAP included a battery of performance assessment tasks designed to generate written or drawn responses to reading texts. This study evaluated selected…

Descriptors: Comparative Testing, Elementary Education, Elementary School Teachers, Evaluators

Some Practical Solutions to Standard-Setting Problems: The Georgia Teacher Certification Test Experience.

Cramer, Stephen E. – 1990

A standard-setting procedure was developed for the Georgia Teacher Certification Testing Program as tests in 30 teaching fields were revised. A list of important characteristics of a standard-setting procedure was derived, drawing on the work of R. A. Berk (1986). The best method was found to be a highly formalized judgmental, empirical Angoff…

Descriptors: Computer Assisted Testing, Cutting Scores, Data Collection, Elementary Secondary Education

Decentralized Large Scale Essay Scoring: Methods for Establishing and Evaluating Score Scale Stability and Reading Reliability.

Auchter, Joan Chikos; Patience, Wayne – 1989

The methods used by the General Educational Development Testing Service (GEDTS) to establish and maintain score stability and reading reliability on its direct assessment of writing are described. Using the 1988 site certification and monitoring results of several scoring sites, the focus is on describing how the score scale was established and…

Descriptors: Decentralization, Equivalency Tests, Essay Tests, Evaluators

Reliability of Professionally Scored Data: NAEP-Related Issues.

Kaplan, Bruce A.; Johnson, Eugene G. – 1992

Across the field of educational assessment the case has been made for alternatives to the multiple-choice item type. Most of the alternative types of items require a subjective evaluation by a rater. The reliability of this subjective rating is a key component of these types of alternative items. In this paper, measures of reliability are…

Descriptors: Educational Assessment, Elementary Secondary Education, Estimation (Mathematics), Evaluators