Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 10 |
Descriptor
Standard Setting (Scoring) | 37 |
Standards | 16 |
Cutting Scores | 13 |
Decision Making | 10 |
Elementary Secondary Education | 10 |
Evaluators | 10 |
Licensing Examinations… | 10 |
Performance Based Assessment | 9 |
Test Items | 9 |
Evaluation Methods | 8 |
Judges | 7 |
More ▼ |
Source
Applied Measurement in… | 37 |
Author
Plake, Barbara S. | 6 |
Hambleton, Ronald K. | 3 |
Wyse, Adam E. | 3 |
Chang, Lei | 2 |
Clauser, Brian E. | 2 |
Jaeger, Richard M. | 2 |
Kannan, Priya | 2 |
Norcini, John | 2 |
Sgammato, Adrienne | 2 |
Tannenbaum, Richard J. | 2 |
Anderson, David W. | 1 |
More ▼ |
Publication Type
Journal Articles | 37 |
Reports - Research | 19 |
Reports - Evaluative | 14 |
Information Analyses | 5 |
Speeches/Meeting Papers | 5 |
Reports - Descriptive | 4 |
Education Level
Elementary Secondary Education | 2 |
Junior High Schools | 2 |
Middle Schools | 2 |
Secondary Education | 2 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 7 | 1 |
Audience
Teachers | 1 |
Location
Australia | 1 |
Georgia | 1 |
United Kingdom | 1 |
Laws, Policies, & Programs
Assessments and Surveys
National Assessment of… | 2 |
National Teacher Examinations | 1 |
Praxis Series | 1 |
United States Medical… | 1 |
What Works Clearinghouse Rating
Wyse, Adam E. – Applied Measurement in Education, 2020
This article compares cut scores from two variations of the Hofstee and Beuk methods, which determine cut scores by resolving inconsistencies in panelists' judgments about cut scores and pass rates, with the Angoff method. The first variation uses responses to the Hofstee and Beuk percentage correct and pass rate questions to calculate cut scores.…
Descriptors: Cutting Scores, Evaluation Methods, Standard Setting (Scoring), Equations (Mathematics)
Wyse, Adam E. – Applied Measurement in Education, 2018
An important consideration in standard setting is recruiting a group of panelists with different experiences and backgrounds to serve on the standard-setting panel. This study uses data from 14 different Angoff standard settings from a variety of medical imaging credentialing programs to examine whether people with different professional roles and…
Descriptors: Standard Setting (Scoring), Test Construction, Cutting Scores, Accuracy
Wyse, Adam E. – Applied Measurement in Education, 2018
This article discusses regression effects that are commonly observed in Angoff ratings where panelists tend to think that hard items are easier than they are and easy items are more difficult than they are in comparison to estimated item difficulties. Analyses of data from two credentialing exams illustrate these regression effects and the…
Descriptors: Regression (Statistics), Test Items, Difficulty Level, Licensing Examinations (Professions)
Clauser, Jerome C.; Clauser, Brian E.; Hambleton, Ronald K. – Applied Measurement in Education, 2014
The purpose of the present study was to extend past work with the Angoff method for setting standards by examining judgments at the judge level rather than the panel level. The focus was on investigating the relationship between observed Angoff standard setting judgments and empirical conditional probabilities. This relationship has been used as a…
Descriptors: Standard Setting (Scoring), Validity, Reliability, Correlation
Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J.; Katz, Irvin R. – Applied Measurement in Education, 2015
The Angoff method requires experts to view every item on the test and make a probability judgment. This can be time consuming when there are large numbers of items on the test. In this study, a G-theory framework was used to determine if a subset of items can be used to make generalizable cut-score recommendations. Angoff ratings (i.e.,…
Descriptors: Reliability, Standard Setting (Scoring), Cutting Scores, Test Items
Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J. – Applied Measurement in Education, 2015
Establishing cut scores using the Angoff method requires panelists to evaluate every item on a test and make a probability judgment. This can be time-consuming when there are large numbers of items on the test. Previous research using resampling studies suggest that it is possible to recommend stable Angoff-based cut score estimates using a…
Descriptors: Cutting Scores, Test Items, Standard Setting (Scoring), Feasibility Studies
Humphry, Stephen; Heldsinger, Sandra; Andrich, David – Applied Measurement in Education, 2014
One of the best-known methods for setting a benchmark standard on a test is that of Angoff and its modifications. When scored dichotomously, judges estimate the probability that a benchmark student has of answering each item correctly. As in most methods of standard setting, it is assumed implicitly that the unit of the latent scale of the…
Descriptors: Foreign Countries, Standard Setting (Scoring), Judges, Item Response Theory
Hurtz, Gregory M.; Jones, J. Patrick – Applied Measurement in Education, 2009
Standard setting methods such as the Angoff method rely on judgments of item characteristics; item response theory empirically estimates item characteristics and displays them in item characteristic curves (ICCs). This study evaluated several indexes of rater fit to ICCs as a method for judging rater accuracy in their estimates of expected item…
Descriptors: Standard Setting (Scoring), Item Response Theory, Reliability, Measurement
Clauser, Brian E.; Harik, Polina; Margolis, Melissa J.; McManus, I. C.; Mollon, Jennifer; Chis, Liliana; Williams, Simon – Applied Measurement in Education, 2009
Numerous studies have compared the Angoff standard-setting procedure to other standard-setting methods, but relatively few studies have evaluated the procedure based on internal criteria. This study uses a generalizability theory framework to evaluate the stability of the estimated cut score. To provide a measure of internal consistency, this…
Descriptors: Generalizability Theory, Group Discussion, Standard Setting (Scoring), Scoring
Elliott, Stephen N.; Roach, Andrew T. – Applied Measurement in Education, 2007
This article examines three typical approaches to alternate assessment for students with significant cognitive disabilities--portfolios, performance assessments, and rating scales. A detailed analysis of common and unique design features of these approaches is provided, including features of each approach that influence the psychometric quality of…
Descriptors: Psychometrics, Validity, Rating Scales, Alternative Assessment
Brandon, Paul R. – Applied Measurement in Education, 2004
This article reviews the empirical literature on 9 topics about the modified Angoff standard-setting method that have been studied repeatedly in the literature, while taking into consideration the methodological warrant for the findings on the topics. It concludes that we can be reasonably confident about selecting the appropriate number of judges…
Descriptors: Test Items, Standard Setting (Scoring), Research Methodology, Testing

Sireci, Stephen G.; Robin, Frederic; Patelis, Thanos – Applied Measurement in Education, 1999
Presents a procedure for standard setting that involves the cluster analysis of test takers to discover examinee groups that are useful for envisioning marginally competent performance or defining borderline or contrasting groups. Illustrates use of the procedure with a statewide mathematics test, and concludes that cluster analysis is useful in…
Descriptors: Cluster Analysis, Mathematics Tests, Standard Setting (Scoring), Standards

Engelhard, George, Jr.; Anderson, David W. – Applied Measurement in Education, 1998
A new approach for examining the quality of judgments from standard-setting judges using a Binomial Trials Model (BTM) is presented and illustrated with 26 judges from the Georgia High School Graduation Test. Results suggest that the BTM provides information not available from other methods. (SLD)
Descriptors: Graduation Requirements, High Schools, Judges, Standard Setting (Scoring)
Giraud, Gerald; Impara, James C.; Plake, Barbara S. – Applied Measurement in Education, 2005
In cut score setting processes, subject matter experts are asked to make judgments about the likely performance of examinees at a targeted skill level. When cut scores are used in K-12 settings to separate students who have and have not mastered certain skills, the target examinee may be characterized as the barely proficient or barely master…
Descriptors: Elementary Secondary Education, Cutting Scores, Standard Setting (Scoring), Workshops

Goodwin, Laura D. – Applied Measurement in Education, 1999
The relations between Angoff ratings (minimum passing levels) and the actual "p" values for borderline examinees were studied with 115 examinees taking the Certified Financial Planner examination. Findings do not suggest that the Angoff judges' task is nearly impossible, but they do suggest the need to improve standard-setting…
Descriptors: Cutting Scores, Difficulty Level, Judges, Licensing Examinations (Professions)