Wind, Stefanie A.; Schumacker, Randall E. – Educational Measurement: Issues and Practice, 2017
The term measurement disturbance has been used to describe systematic conditions that affect a measurement process, resulting in a compromised interpretation of person or item estimates. Measurement disturbances have been discussed in relation to systematic response patterns associated with items and persons, such as start-up, plodding, boredom,…
Descriptors: Measurement, Testing Problems, Writing Tests, Performance Based Assessment

Educational Evaluation and Policy Analysis, 1981
Robert Stake presents his views on responsive evaluation, naturalistic approaches to evaluation, the role of testing in evaluation, and the training of evaluators. (BW)
Descriptors: Evaluators, Interviews, Models, Professional Training
Linacre, John M. – 1989
An accepted criterion for gauging the fairness of examinees' scores, derived from judge-awarded ratings, has been the size of the correlation between the judges and the inter-rater reliability. Various means of achieving inter-rater reliability were reviewed, and a model to measure inter-rater reliability is put forward. Both theoretical and…
Descriptors: Evaluators, Interrater Reliability, Latent Trait Theory, Licensing Examinations (Professions)
Posante, Rebecca – 1981
Policy-making problems facing Louisiana regarding the testing of handicapped students within a state-mandated minimum competency testing program are discussed. The decision-making process is complicated by the fact that two groups (staff administering the test and staff administering programs for the handicapped) with differing…
Descriptors: Agency Cooperation, Decision Making, Disabilities, Elementary Secondary Education

Geisinger, Kurt F. – Educational Measurement: Issues and Practice, 1991
Ways to use standard-setting data to adjust cutoff scores on examinations are reviewed. Ten sources of information to be used in determining standards are listed. The decision to modify passing scores should be based on these types of information and consideration of adverse impact or rating process irregularities. (SLD)
Descriptors: Cutting Scores, Evaluation Utilization, Evaluators, Interrater Reliability
Shapiro, Bernard J. – Highway One, 1985
Describes some forces that have shaped current attitudes toward evaluation of education, presents five guidelines to help in discussions of assessment, and charges teachers with the responsibility of making evaluation effective. (DF)
Descriptors: Accountability, Educational Assessment, Elementary Secondary Education, Evaluation Criteria

Plake, Barbara S.; And Others – Educational Measurement: Issues and Practice, 1991
Possible sources of intrajudge inconsistency in standard setting are reviewed, and approaches are presented to improve the accuracy of rating. Procedures for providing judges with feedback through discussion or computerized communication are discussed. Monitoring and maintaining judges' consistency throughout the rating process are essential. (SLD)
Descriptors: Computer Assisted Instruction, Evaluators, Examiners, Feedback
Crews, William E., Jr. – 1991
As part of a study of teacher evaluation of student replies to open-ended questions, a second question--the best method of determining interrater reliability--was examined. The standard method, the Pearson Product-Moment correlation, overestimated the degree of match between researchers' and teachers' scoring of tests. The simpler percent…
Descriptors: Comparative Analysis, Elementary School Teachers, Evaluation Methods, Evaluators
Anderson, Scarvia B. – 1977
Several issues are related to the use of educational tests. First, test users must be able to choose appropriate tests, interpret scores, and make decisions based on scores. In the field of educational testing, few test users have adequate training in these areas. Second, test makers must clearly specify directions for administration, allowable…
Descriptors: Educational Testing, Elementary Secondary Education, Evaluators, Guides
Raymond, Mark R.; Viswesvaran, Chockalingam – 1991
This study illustrates the use of three least-squares models to control for rater effects in performance evaluation: (1) ordinary least squares (OLS); (2) weighted least squares (WLS); and (3) OLS subsequent to applying a logistic transformation to observed ratings (LOG-OLS). The three models were applied to ratings obtained from four…
Descriptors: Evaluators, Higher Education, Interrater Reliability, Least Squares Statistics
Lai, Morris K. – 1978
Although much has been written about educational evaluation, few guidelines exist for consumers--project directors, school administrators, curriculum developers, legislators, teachers, parents, and boards of education. Several cautions surface from a review of the literature. First, tests that are based on program objectives are most useful to…
Descriptors: Administrator Guides, Cost Estimates, Educational Assessment, Educational Testing

Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1991
Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)
Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners
Ward, Annie W., Ed.; And Others
A number of brief papers are presented to provide guidelines for test directors of school systems. This collection is intended for both newly appointed and experienced directors. Contributions were solicited from practicing directors of testing; the authors include Anthony J. Allen, Margaret Backman, Joan Bollenbacker, Gerald Hanna, James Lawson;…
Descriptors: Administrator Guides, Administrator Role, Educational Testing, Elementary Secondary Education
Engelhard, George, Jr.; And Others – 1989
Whether judges on bias review committees can identify test items that function differently for black and white examinees was studied. Judges (n=42) on three bias review committees were asked to examine a set of items and predict differential item functioning (DIF) without empirical data. Test items from teacher certification tests in the content…
Descriptors: Black Students, Evaluators, Interrater Reliability, Item Analysis
Pimpa, Nattavud – Educational Research for Policy and Practice, 2005
This research examines problems in the national teacher performance appraisal system of the Thai Ministry of Education. It highlights the major problems of the current system by delineating its weaknesses and pitfalls. The findings indicate problems in three major…
Descriptors: Evaluators, Teacher Evaluation, Foreign Countries, Evaluation Problems