ERIC - Search Results

Showing 1 to 15 of 66 results

Adding Objectivity to Standard Setting: Evaluating Consequence Using the Conscious and Subconscious Weight Methods

Peer reviewed

Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020

Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high-stakes consequences for the public. Denying graduation or forcing…

Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics

Examining How Professional Roles and Test Development Experiences Impact Angoff Ratings

Peer reviewed

Wyse, Adam E. – Applied Measurement in Education, 2018

An important consideration in standard setting is recruiting a group of panelists with different experiences and backgrounds to serve on the standard-setting panel. This study uses data from 14 different Angoff standard settings from a variety of medical imaging credentialing programs to examine whether people with different professional roles and…

Descriptors: Standard Setting (Scoring), Test Construction, Cutting Scores, Accuracy

Consistency of Angoff-Based Standard-Setting Judgments: Are Item Judgments and Passing Scores Replicable across Different Panels of Experts?

Peer reviewed

Tannenbaum, Richard J.; Kannan, Priya – Educational Assessment, 2015

Angoff-based standard setting is widely used, especially for high-stakes licensure assessments. Nonetheless, some critics have claimed that the judgment task is too cognitively complex for panelists, whereas others have explicitly challenged the consistency in (replicability of) standard-setting outcomes. Evidence of consistency in item judgments…

Descriptors: Standard Setting (Scoring), Reliability, Scores, Licensing Examinations (Professions)

Regression Effects in Angoff Ratings: Examples from Credentialing Exams

Peer reviewed

Wyse, Adam E. – Applied Measurement in Education, 2018

This article discusses regression effects that are commonly observed in Angoff ratings where panelists tend to think that hard items are easier than they are and easy items are more difficult than they are in comparison to estimated item difficulties. Analyses of data from two credentialing exams illustrate these regression effects and the…

Descriptors: Regression (Statistics), Test Items, Difficulty Level, Licensing Examinations (Professions)

The Impact of Examinee Performance Information on Judges' Cut Scores in Modified Angoff Standard-Setting Exercises

Peer reviewed

Margolis, Melissa J.; Clauser, Brian E. – Educational Measurement: Issues and Practice, 2014

This research evaluated the impact of a common modification to Angoff standard-setting exercises: the provision of examinee performance data. Data from 18 independent standard-setting panels across three different medical licensing examinations were examined to investigate whether and how the provision of performance information impacted judgments…

Descriptors: Cutting Scores, Standard Setting (Scoring), Data, Licensing Examinations (Professions)

Increasing the Validity of Angoff Standards through Analysis of Judge-Level Internal Consistency

Peer reviewed

Clauser, Jerome C.; Clauser, Brian E.; Hambleton, Ronald K. – Applied Measurement in Education, 2014

The purpose of the present study was to extend past work with the Angoff method for setting standards by examining judgments at the judge level rather than the panel level. The focus was on investigating the relationship between observed Angoff standard setting judgments and empirical conditional probabilities. This relationship has been used as a…

Descriptors: Standard Setting (Scoring), Validity, Reliability, Correlation

Evaluating the Consistency of Angoff-Based Cut Scores Using Subsets of Items within a Generalizability Theory Framework

Peer reviewed

Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J.; Katz, Irvin R. – Applied Measurement in Education, 2015

The Angoff method requires experts to view every item on the test and make a probability judgment. This can be time consuming when there are large numbers of items on the test. In this study, a G-theory framework was used to determine if a subset of items can be used to make generalizable cut-score recommendations. Angoff ratings (i.e.,…

Descriptors: Reliability, Standard Setting (Scoring), Cutting Scores, Test Items

Evaluating the Operational Feasibility of Using Subsets of Items to Recommend Minimal Competency Cut Scores

Peer reviewed

Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J. – Applied Measurement in Education, 2015

Establishing cut scores using the Angoff method requires panelists to evaluate every item on a test and make a probability judgment. This can be time-consuming when there are large numbers of items on the test. Previous research using resampling studies suggests that it is possible to recommend stable Angoff-based cut score estimates using a…

Descriptors: Cutting Scores, Test Items, Standard Setting (Scoring), Feasibility Studies

Assessing the Viability of External Searchable Resources on the American Board of Family Medicine's Certification Examination

O'Neill, Thomas R.; Peabody, Michael R.; Stelter, Keith L.; Hagen, Michael D. – Online Submission, 2015

(Purpose) The purpose of our study was to assess the need for an external searchable resource to be used in conjunction with the American Board of Family Medicine's (ABFM) Maintenance of Certification for Family Physicians (MC-FP) Examination, discuss the philosophical question of whether an ESR should be allowed on the examination, and outline…

Descriptors: Licensing Examinations (Professions), Family Practice (Medicine), Physicians, Online Searching

Identifying and Evaluating External Validity Evidence for Passing Scores

Peer reviewed

Davis-Becker, Susan L.; Buckendahl, Chad W. – International Journal of Testing, 2013

A critical component of the standard setting process is collecting evidence to evaluate the recommended cut scores and their use for making decisions and classifying students based on test performance. Kane (1994, 2001) proposed a framework by which practitioners can identify and evaluate evidence of the results of the standard setting from (1)…

Descriptors: Standard Setting (Scoring), Evidence, Validity, Cutting Scores

The Effect of Data Format on Integration of Performance Data into Angoff Judgments

Peer reviewed

Clauser, Brian E.; Mee, Janet; Margolis, Melissa J. – International Journal of Testing, 2013

This study investigated the extent to which the performance data format impacted data use in Angoff standard setting exercises. Judges from two standard settings (a total of five panels) were randomly assigned to one of two groups. The full-data group received two types of data: (1) the proportion of examinees selecting each option and (2) plots…

Descriptors: Standard Setting (Scoring), Cutting Scores, Validity, Reliability

Evaluating the Bookmark Standard Setting Method: The Impact of Random Item Ordering

Peer reviewed

Davis-Becker, Susan L.; Buckendahl, Chad W.; Gerrow, Jack – International Journal of Testing, 2011

Throughout the world, cut scores are an important aspect of a high-stakes testing program because they are a key operational component of the interpretation of test scores. One method for setting standards that is prevalent in educational testing programs--the Bookmark method--is intended to be a less cognitively complex alternative to methods…

Descriptors: Standard Setting (Scoring), Cutting Scores, Educational Testing, Licensing Examinations (Professions)

Recommending Cut Scores with a Subset of Items: An Empirical Illustration

Peer reviewed

Buckendahl, Chad W.; Ferdous, Abdullah A.; Gerrow, Jack – Practical Assessment, Research & Evaluation, 2010

Many testing programs face the practical challenge of having limited resources to conduct comprehensive standard setting studies. Some researchers have suggested that replicating a group's recommended cut score on a full-length test may be possible by using a subset of the items. However, these studies were based on simulated data. This study…

Descriptors: Cutting Scores, Test Items, Standard Setting (Scoring), Methods

Judges' Use of Examinee Performance Data in an Angoff Standard-Setting Exercise for a Medical Licensing Examination: An Experimental Study

Peer reviewed

Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009

Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…

Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel

Guidelines for Selecting a Standard Setting Panel for Licensure Testing.

Williamson, David M. – CLEAR Exam Review, 1999

Discusses panels for standard setting and presents 10 guidelines for the selection of panel members for such studies. Panel members should themselves hold the license for which they are producing a cutting score, and they must be familiar with the requirements of the profession and the characteristics of the candidates. (SLD)

Descriptors: Cutting Scores, Evaluators, Licensing Examinations (Professions), Selection
