ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	10

Descriptor

Standard Setting (Scoring)	37
Standards	16
Cutting Scores	13
Decision Making	10
Elementary Secondary Education	10
Evaluators	10
Licensing Examinations…	10
Performance Based Assessment	9
Test Items	9
Evaluation Methods	8
Judges	7
Comparative Analysis	6
Interrater Reliability	6
Credentials	5
Difficulty Level	5
Teacher Evaluation	5
Certification	4
Item Analysis	4
Minimum Competency Testing	4
Probability	4
Reliability	4
Scoring	4
Test Construction	4
Computation	3
Educational Assessment	3
More ▼

Source

Applied Measurement in…

Publication Type

Journal Articles	37
Reports - Research	19
Reports - Evaluative	14
Information Analyses	5
Speeches/Meeting Papers	5
Reports - Descriptive	4

Education Level

Elementary Secondary Education	2
Junior High Schools	2
Middle Schools	2
Secondary Education	2
Early Childhood Education	1
Elementary Education	1
Grade 7	1

Audience

Teachers

Location

Australia	1
Georgia	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
National Teacher Examinations	1
Praxis Series	1
United States Medical…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 37 results Save | Export

Comparing Cut Scores from the Angoff Method and Two Variations of the Hofstee and Beuk Methods

Peer reviewed

Direct link

Wyse, Adam E. – Applied Measurement in Education, 2020

This article compares cut scores from two variations of the Hofstee and Beuk methods, which determine cut scores by resolving inconsistencies in panelists' judgments about cut scores and pass rates, with the Angoff method. The first variation uses responses to the Hofstee and Beuk percentage correct and pass rate questions to calculate cut scores.…

Descriptors: Cutting Scores, Evaluation Methods, Standard Setting (Scoring), Equations (Mathematics)

Examining How Professional Roles and Test Development Experiences Impact Angoff Ratings

Peer reviewed

Direct link

Wyse, Adam E. – Applied Measurement in Education, 2018

An important consideration in standard setting is recruiting a group of panelists with different experiences and backgrounds to serve on the standard-setting panel. This study uses data from 14 different Angoff standard settings from a variety of medical imaging credentialing programs to examine whether people with different professional roles and…

Descriptors: Standard Setting (Scoring), Test Construction, Cutting Scores, Accuracy

Regression Effects in Angoff Ratings: Examples from Credentialing Exams

Peer reviewed

Direct link

Wyse, Adam E. – Applied Measurement in Education, 2018

This article discusses regression effects that are commonly observed in Angoff ratings where panelists tend to think that hard items are easier than they are and easy items are more difficult than they are in comparison to estimated item difficulties. Analyses of data from two credentialing exams illustrate these regression effects and the…

Descriptors: Regression (Statistics), Test Items, Difficulty Level, Licensing Examinations (Professions)

Increasing the Validity of Angoff Standards through Analysis of Judge-Level Internal Consistency

Peer reviewed

Direct link

Clauser, Jerome C.; Clauser, Brian E.; Hambleton, Ronald K. – Applied Measurement in Education, 2014

The purpose of the present study was to extend past work with the Angoff method for setting standards by examining judgments at the judge level rather than the panel level. The focus was on investigating the relationship between observed Angoff standard setting judgments and empirical conditional probabilities. This relationship has been used as a…

Descriptors: Standard Setting (Scoring), Validity, Reliability, Correlation

Evaluating the Consistency of Angoff-Based Cut Scores Using Subsets of Items within a Generalizability Theory Framework

Peer reviewed

Direct link

Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J.; Katz, Irvin R. – Applied Measurement in Education, 2015

The Angoff method requires experts to view every item on the test and make a probability judgment. This can be time consuming when there are large numbers of items on the test. In this study, a G-theory framework was used to determine if a subset of items can be used to make generalizable cut-score recommendations. Angoff ratings (i.e.,…

Descriptors: Reliability, Standard Setting (Scoring), Cutting Scores, Test Items

Evaluating the Operational Feasibility of Using Subsets of Items to Recommend Minimal Competency Cut Scores

Peer reviewed

Direct link

Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J. – Applied Measurement in Education, 2015

Establishing cut scores using the Angoff method requires panelists to evaluate every item on a test and make a probability judgment. This can be time-consuming when there are large numbers of items on the test. Previous research using resampling studies suggest that it is possible to recommend stable Angoff-based cut score estimates using a…

Descriptors: Cutting Scores, Test Items, Standard Setting (Scoring), Feasibility Studies

Requiring a Consistent Unit of Scale between the Responses of Students and Judges in Standard Setting

Peer reviewed

Direct link

Humphry, Stephen; Heldsinger, Sandra; Andrich, David – Applied Measurement in Education, 2014

One of the best-known methods for setting a benchmark standard on a test is that of Angoff and its modifications. When scored dichotomously, judges estimate the probability that a benchmark student has of answering each item correctly. As in most methods of standard setting, it is assumed implicitly that the unit of the latent scale of the…

Descriptors: Foreign Countries, Standard Setting (Scoring), Judges, Item Response Theory

Innovations in Measuring Rater Accuracy in Standard Setting: Assessing "Fit" to Item Characteristic Curves

Peer reviewed

Direct link

Hurtz, Gregory M.; Jones, J. Patrick – Applied Measurement in Education, 2009

Standard setting methods such as the Angoff method rely on judgments of item characteristics; item response theory empirically estimates item characteristics and displays them in item characteristic curves (ICCs). This study evaluated several indexes of rater fit to ICCs as a method for judging rater accuracy in their estimates of expected item…

Descriptors: Standard Setting (Scoring), Item Response Theory, Reliability, Measurement

An Empirical Examination of the Impact of Group Discussion and Examinee Performance Information on Judgments Made in the Angoff Standard-Setting Procedure

Peer reviewed

Direct link

Clauser, Brian E.; Harik, Polina; Margolis, Melissa J.; McManus, I. C.; Mollon, Jennifer; Chis, Liliana; Williams, Simon – Applied Measurement in Education, 2009

Numerous studies have compared the Angoff standard-setting procedure to other standard-setting methods, but relatively few studies have evaluated the procedure based on internal criteria. This study uses a generalizability theory framework to evaluate the stability of the estimated cut score. To provide a measure of internal consistency, this…

Descriptors: Generalizability Theory, Group Discussion, Standard Setting (Scoring), Scoring

Alternate Assessments of Students with Significant Disabilities: Alternative Approaches, Common Technical Challenges

Peer reviewed

Direct link

Elliott, Stephen N.; Roach, Andrew T. – Applied Measurement in Education, 2007

This article examines three typical approaches to alternate assessment for students with significant cognitive disabilities--portfolios, performance assessments, and rating scales. A detailed analysis of common and unique design features of these approaches is provided, including features of each approach that influence the psychometric quality of…

Descriptors: Psychometrics, Validity, Rating Scales, Alternative Assessment

Conclusions about Frequently Studied Modified Angoff Standard-Setting Topics

Peer reviewed

Direct link

Brandon, Paul R. – Applied Measurement in Education, 2004

This article reviews the empirical literature on 9 topics about the modified Angoff standard-setting method that have been studied repeatedly in the literature, while taking into consideration the methodological warrant for the findings on the topics. It concludes that we can be reasonably confident about selecting the appropriate number of judges…

Descriptors: Test Items, Standard Setting (Scoring), Research Methodology, Testing

Using Cluster Analysis To Facilitate Standard Setting.

Peer reviewed

Sireci, Stephen G.; Robin, Frederic; Patelis, Thanos – Applied Measurement in Education, 1999

Presents a procedure for standard setting that involves the cluster analysis of test takers to discover examinee groups that are useful for envisioning marginally competent performance or defining borderline or contrasting groups. Illustrates use of the procedure with a statewide mathematics test, and concludes that cluster analysis is useful in…

Descriptors: Cluster Analysis, Mathematics Tests, Standard Setting (Scoring), Standards

A Binomial Trials Model for Examining the Ratings of Standard-Setting Judges.

Peer reviewed

Engelhard, George, Jr.; Anderson, David W. – Applied Measurement in Education, 1998

A new approach for examining the quality of judgments from standard-setting judges using a Binomial Trials Model (BTM) is presented and illustrated with 26 judges from the Georgia High School Graduation Test. Results suggest that the BTM provides information not available from other methods. (SLD)

Descriptors: Graduation Requirements, High Schools, Judges, Standard Setting (Scoring)

Teachers' Conceptions of the Target Examinee in Angoff Standard Setting

Peer reviewed

Direct link

Giraud, Gerald; Impara, James C.; Plake, Barbara S. – Applied Measurement in Education, 2005

In cut score setting processes, subject matter experts are asked to make judgments about the likely performance of examinees at a targeted skill level. When cut scores are used in K-12 settings to separate students who have and have not mastered certain skills, the target examinee may be characterized as the barely proficient or barely master…

Descriptors: Elementary Secondary Education, Cutting Scores, Standard Setting (Scoring), Workshops

Relations between Observed Item Difficulty Levels and Angoff Minimum Passing Levels for a Group of Borderline Examinees.

Peer reviewed

Goodwin, Laura D. – Applied Measurement in Education, 1999

The relations between Angoff ratings (minimum passing levels) and the actual "p" values for borderline examinees were studied with 115 examinees taking the Certified Financial Planner examination. Findings do not suggest that the Angoff judges' task is nearly impossible, but they do suggest the need to improve standard-setting…

Descriptors: Cutting Scores, Difficulty Level, Judges, Licensing Examinations (Professions)

Previous Page | Next Page »

Pages: 1 | 2 | 3

Plake, Barbara S.	6
Hambleton, Ronald K.	3
Wyse, Adam E.	3
Chang, Lei	2
Clauser, Brian E.	2
Jaeger, Richard M.	2
Kannan, Priya	2
Norcini, John	2
Sgammato, Adrienne	2
Tannenbaum, Richard J.	2
Anderson, David W.	1
Andrich, David	1
Angoff, William H.	1
Berk, Ronald A.	1
Brandon, Paul R.	1
Busch, John Christian	1
Chis, Liliana	1
Clauser, Jerome C.	1
Cohen, Allan S.	1
Crooks, Terence J.	1
Elliott, Stephen N.	1
Engelhard, George, Jr.	1
Ferdous, Abdullah A.	1
Giraud, Gerald	1
More ▼