ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	1

Descriptor

Evaluators	26
Testing Problems	26
Interrater Reliability	13
Scoring	10
Elementary Secondary Education	7
Evaluation Methods	7
Licensing Examinations…	5
Models	5
Standard Setting (Scoring)	5
Test Interpretation	5
Performance Based Assessment	4
Program Evaluation	4
Test Reliability	4
Testing Programs	4
Cutting Scores	3
Educational Assessment	3
Educational Testing	3
Examiners	3
Latent Trait Theory	3
Reading Tests	3
Standardized Tests	3
State Programs	3
Test Construction	3
Test Items	3
Testing	3
More ▼

Source

Educational Measurement:…	4
Educational Evaluation and…	1
Educational Research for…	1
Highway One	1

Publication Type

Speeches/Meeting Papers	12
Reports - Research	11
Reports - Evaluative	9
Journal Articles	7
Opinion Papers	4
Tests/Questionnaires	4
Guides - General	1
Guides - Non-Classroom	1
Reports - Descriptive	1

Education Level

Elementary Secondary Education

Audience

Location

Louisiana	1
Netherlands	1
Oregon	1
Thailand	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

Alabama High School…	1
General Educational…	1
National Assessment of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 26 results Save | Export

Detecting Measurement Disturbances in Rater-Mediated Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Schumacker, Randall E. – Educational Measurement: Issues and Practice, 2017

The term measurement disturbance has been used to describe systematic conditions that affect a measurement process, resulting in a compromised interpretation of person or item estimates. Measurement disturbances have been discussed in relation to systematic response patterns associated with items and persons, such as start-up, plodding, boredom,…

Descriptors: Measurement, Testing Problems, Writing Tests, Performance Based Assessment

Interview with Robert E. Stake.

Peer reviewed

Educational Evaluation and Policy Analysis, 1981

Robert Stake presents his views on responsive evaluation, naturalistic approaches to evaluation, the role of testing in evaluation, and the training of evaluators. (BW)

Descriptors: Evaluators, Interviews, Models, Professional Training

Objectivity for Judge-Intermediated Certification Examinations.

Download full text

Linacre, John M. – 1989

An accepted criterion for gauging the fairness of examinees' scores, derived from judge-awarded ratings, has been the size of the correlation between the judges and the inter-rater reliability. Various means of achieving inter-rater reliability were reviewed, and a model to measure inter-rater reliability is forwarded. Both theoretical and…

Descriptors: Evaluators, Interrater Reliability, Latent Trait Theory, Licensing Examinations (Professions)

Utilization of Evaluation Results in Joint Policy Making.

Posante, Rebecca – 1981

Policy-making problems being faced in Louisiana regarding the testing of handicapped students within a state mandated minimum competency testing program are dealt with. The decision-making process is complicated by the fact that two groups (staff administering the test and staff administering programs for the handicapped) with differing…

Descriptors: Agency Cooperation, Decision Making, Disabilities, Elementary Secondary Education

Using Standard-Setting Data to Establish Cutoff Scores.

Peer reviewed

Geisinger, Kurt F. – Educational Measurement: Issues and Practice, 1991

Ways to use standard-setting data to adjust cutoff scores on examinations are reviewed. Ten sources of information to be used in determining standards are listed. The decision to modify passing scores should be based on these types of information and consideration of adverse impact or rating process irregularities. (SLD)

Descriptors: Cutting Scores, Evaluation Utilization, Evaluators, Interrater Reliability

A Context for Evaluation.

Shapiro, Bernard J. – Highway One, 1985

Describes some forces that have shaped current attitudes toward evaluation of education, presents five guidelines to help in discussions of assessment, and charges teachers with the responsibility of making evaluation effective. (DF)

Descriptors: Accountability, Educational Assessment, Elementary Secondary Education, Evaluation Criteria

Factors Influencing Intrajudge Consistency during Standard-Setting.

Peer reviewed

Plake, Barbara S.; And Others – Educational Measurement: Issues and Practice, 1991

Possible sources of intrajudge inconsistency in standard setting are reviewed, and approaches are presented to improve the accuracy of rating. Procedures for providing judges with feedback through discussion or computerized communication are discussed. Monitoring and maintaining judges' consistency throughout the rating process are essential. (SLD)

Descriptors: Computer Assisted Instruction, Evaluators, Examiners, Feedback

Analysis of Interrater Reliability on the Evaluation of Answers to Open-Ended Questions.

Crews, William E., Jr. – 1991

As part of a study of teacher evaluation of student replies to open-ended questions, a second question--the best method of determining interrater reliability--was examined. The standard method, the Pearson Product-Moment correlation, overestimated the degree of match between researchers' and teachers' scoring of tests. The simpler percent…

Descriptors: Comparative Analysis, Elementary School Teachers, Evaluation Methods, Evaluators

Issues Related to Test Use.

Anderson, Scarvia B. – 1977

Several issues are related to the use of educational tests. First, test users must be able to choose appropriate tests, interpret scores, and make decisions based on scores. In the field of educational testing, few test users have adequate training in these areas. Second, test makers must clearly specify directions for administration, allowable…

Descriptors: Educational Testing, Elementary Secondary Education, Evaluators, Guides

Least-Squares Models to Correct for Rater Effects in Performance Assessment.

Download full text

Raymond, Mark R.; Viswesvaran, Chockalingam – 1991

This study illustrates the use of three least-squares models to control for rater effects in performance evaluation: (1) ordinary least squares (OLS); (2) weighted least squares (WLS); and (3) OLS subsequent to applying a logistic transformation to observed ratings (LOG-OLS). The three models were applied to ratings obtained from four…

Descriptors: Evaluators, Higher Education, Interrater Reliability, Least Squares Statistics

Consumer's Guide to Educational Evaluation.

Download full text

Lai, Morris K. – 1978

Although much has been written about educational evaluation, few guidelines exist for consumers--project directors, school administrators, curriculum developers, legislators, teachers, parents, and boards of education. Several cautions surface from a review of the literature. First, tests that are based on program objectives are most useful to…

Descriptors: Administrator Guides, Cost Estimates, Educational Assessment, Educational Testing

Selection of Judges for Standard-Setting.

Peer reviewed

Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1991

Issues concerning the selection of judges for standard setting are discussed. Determining the consistency of judges' recommendations, or their congruity with other expert recommendations, would help in selection. Enough judges must be chosen to allow estimation of recommendations by an entire population of judges. (SLD)

Descriptors: Cutting Scores, Evaluation Methods, Evaluators, Examiners

Guide for School Testing Programs.

Ward, Annie W., Ed.; And Others

A number of brief papers are presented to provide guidelines for test directors of school systems. This collection is intended for both newly appointed and experienced directors. Contributions were solicited from practicing directors of testing; the authors include Anthony J. Allen, Margaret Backman, Joan Bollenbacker, Gerald Hanna, James Lawson;…

Descriptors: Administrator Guides, Administrator Role, Educational Testing, Elementary Secondary Education

Accuracy of Bias Review Judges in Identifying Differential Item Functioning on Teacher Certification Tests.

Download full text

Engelhard, George, Jr.; And Others – 1989

Whether judges on bias review committees can identify test items that function differently for black and white examinees was studied. Judges (n=42) on three bias review committees were asked to examine a set of items and predict differential item functioning (DIF) without empirical data. Test items from teacher certification tests in the content…

Descriptors: Black Students, Evaluators, Interrater Reliability, Item Analysis

Teacher Performance Appraisal in Thailand: Poison or Panacea?

Peer reviewed

Direct link

Pimpa, Nattavud – Educational Research for Policy and Practice, 2005

This research focuses on the examination of problems related to the national teacher performance appraisal system by the Thai Ministry of Education. It highlights major problems of the current performance appraisal system by delineating the weaknesses and pitfalls of the current appraisal system. The findings indicate problems to three major…

Descriptors: Evaluators, Teacher Evaluation, Foreign Countries, Evaluation Problems

Previous Page | Next Page »

Pages: 1 | 2

Anderson, Scarvia B.	1
Arnold, Voiza	1
Auchter, Joan Chikos	1
Backman, Margaret E.	1
Cramer, Stephen E.	1
Crews, William E., Jr.	1
Engelhard, George, Jr.	1
Forbes, 2ean W.	1
Geisinger, Kurt F.	1
Goldberg, Gail Lynn	1
Halpin, Glennelle	1
Jaeger, Richard M.	1
Johnson, Eugene G.	1
Kapinus, Barbara	1
Kaplan, Bruce A.	1
Kreeft, Henk	1
Lai, Morris K.	1
Linacre, John M.	1
Lunz, Mary E.	1
McLean, James E.	1
Patience, Wayne	1
Pimpa, Nattavud	1
Plake, Barbara S.	1
Posante, Rebecca	1
More ▼