Descriptor
| Reliability | 7 |
| Standards | 7 |
| Standard Setting (Scoring) | 6 |
| Cutting Scores | 4 |
| Test Items | 3 |
| Validity | 3 |
| Comparative Analysis | 2 |
| Definitions | 2 |
| Evaluators | 2 |
| Mathematical Models | 2 |
| Pass Fail Grading | 2 |
Author
| Impara, James C. | 2 |
| Plake, Barbara S. | 2 |
| Andrew, Barbara J. | 1 |
| Hecht, James T. | 1 |
| Irwin, Patrick | 1 |
| Irwin, Patrick M. | 1 |
| Krippendorff, Klaus | 1 |
| Rothman, Arthur I. | 1 |
| Sigmon, Gary L. | 1 |
| Van der Linden, Wim J. | 1 |
Publication Type
| Reports - Research | 4 |
| Journal Articles | 3 |
| Speeches/Meeting Papers | 3 |
| Reports - Evaluative | 2 |
Audience
| Researchers | 1 |
Location
| Canada | 1 |
Peer reviewed. Plake, Barbara S.; Impara, James C.; Irwin, Patrick M. – Journal of Educational Measurement, 2000
Examined intra- and inter-rater consistency of item performance estimated from an Angoff standard setting over 2 years, with 29 panelists one year, and 30 the next. Results provide evidence that item performance estimates were consistent within and across panels within and across years. Factors that might have influenced this high degree of…
Descriptors: Evaluators, Prediction, Reliability, Standard Setting
Krippendorff, Klaus – 1992
When one wants to set data reliability standards for a class of scientific inquiries or when one needs to compare and select among many different kinds of data with reliabilities that are crucial to a particular research undertaking, then one needs a single reliability coefficient that is adaptable to all or most situations. Work toward this goal…
Descriptors: Definitions, Equations (Mathematics), Mathematical Models, Reliability
Sigmon, Gary L.; And Others – 1983
In recent years educators have been utilizing judgmental methods, such as the ones advocated by Ebel and Angoff, to set minimum competency standards on test items. This study was designed to investigate the reliability and validity of these two procedures in setting minimum levels of performance on 175 vocational evaluator competency statements.…
Descriptors: Comparative Analysis, Evaluation Methods, Evaluators, Minimum Competencies
Plake, Barbara S.; Impara, James C.; Irwin, Patrick – 1999
Judgmental standard setting methods, such as the Angoff method (W. Angoff, 1971), use item performance estimates as the basis for determining the minimum passing score (MPS). Therefore the accuracy of these item performance estimates is crucial to the validity of the resulting MPS. Recent researchers (L. Shepard, 1994; J. Impara, 1997) have called…
Descriptors: Cutting Scores, Estimation (Mathematics), Judges, Performance Factors
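As an illustration of how item performance estimates feed into an MPS, the sketch below uses the commonly described Angoff computation: average each item's estimates across judges, then sum the item means. It is a minimal sketch rather than the procedure used in any study listed here; the function name `angoff_mps` and the sample ratings are hypothetical.

```python
from statistics import mean


def angoff_mps(ratings):
    """Return a minimum passing score (MPS) from Angoff-style ratings.

    ratings[j][i] is judge j's estimate of the probability that a
    minimally competent candidate answers item i correctly.
    (Hypothetical helper for illustration only.)
    """
    if not ratings:
        raise ValueError("at least one judge is required")
    n_items = len(ratings[0])
    if any(len(judge) != n_items for judge in ratings):
        raise ValueError("every judge must rate the same number of items")

    # Average the judges' estimates for each item.
    item_means = [mean(judge[i] for judge in ratings) for i in range(n_items)]

    # The MPS is the expected raw score of a minimally competent candidate.
    return sum(item_means)


# Made-up ratings: two judges, three items.
example_ratings = [
    [0.5, 0.75, 1.0],
    [0.5, 0.25, 1.0],
]
mps = angoff_mps(example_ratings)  # item means 0.5, 0.5, 1.0 sum to 2.0
```

Because the MPS is a plain sum of item means, an error in any single item estimate shifts the passing score directly, which is why the accuracy of the estimates matters for the validity of the result.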
Peer reviewed. Andrew, Barbara J.; Hecht, James T. – Educational and Psychological Measurement, 1976
Results suggest that different groups of judges do set similar examination standards when using the same procedure, and that the average of individual judgments does not differ significantly from group consensus judgments. Significant differences were found, however, between the standards set by the two procedures employed. (RC)
Descriptors: Comparative Analysis, Cutting Scores, Multiple Choice Tests, Pass Fail Grading
Peer reviewed. Van der Linden, Wim J. – Journal of Educational Measurement, 1982
An ignored aspect of standard setting, namely the possibility that Angoff or Nedelsky judges specify inconsistent probabilities (e.g., low probabilities for easy items but large probabilities for hard items) is explored. A latent trait method is proposed to estimate such misspecifications, and an index of consistency is defined. (Author/PN)
Descriptors: Cutting Scores, Latent Trait Theory, Mastery Tests, Mathematical Models
The Consistency and Uncertainty in Examiners' Definitions of Pass/Fail Performance on OSCE Stations.
Peer reviewed. Rothman, Arthur I.; And Others – Evaluation and the Health Professions, 1996
Results of the fall 1993 administration of part two of the Medical Council of Canada's Evaluating Examination for 744 candidates provided evidence of the consistency of the pass/fail and cutting score definitions for the objective-structured clinical examination stations used across examiners. These results support the validity of this…
Descriptors: Cutting Scores, Definitions, Foreign Countries, Medical Education


