Publication Date
In 2025: 0
Since 2024: 0
Since 2021 (last 5 years): 0
Since 2016 (last 10 years): 3
Since 2006 (last 20 years): 7
Descriptor
Cutting Scores: 7
Standard Setting (Scoring): 6
Test Items: 3
Data: 2
Evaluation Methods: 2
Expertise: 2
Generalizability Theory: 2
Judges: 2
Licensing Examinations (Professions): 2
Probability: 2
Validity: 2
Source
Educational Measurement: Issues and Practice: 3
Journal of Educational Measurement: 2
Applied Measurement in Education: 1
International Journal of Testing: 1
Author
Clauser, Brian E.: 7
Margolis, Melissa J.: 7
Mee, Janet: 4
Winward, Marcia: 3
Baldwin, Peter: 2
Clauser, Jerome C.: 2
Chis, Liliana: 1
Harik, Polina: 1
McManus, I. C.: 1
Mollon, Jennifer: 1
Williams, Simon: 1
Publication Type
Journal Articles: 7
Reports - Research: 6
Reports - Evaluative: 1
Education Level
Higher Education: 1
Postsecondary Education: 1
Location
United Kingdom: 1
Assessments and Surveys
United States Medical…: 1
Baldwin, Peter; Margolis, Melissa J.; Clauser, Brian E.; Mee, Janet; Winward, Marcia – Educational Measurement: Issues and Practice, 2020
Evidence of the internal consistency of standard-setting judgments is a critical part of the validity argument for tests used to make classification decisions. The bookmark standard-setting procedure is a popular approach to establishing performance standards, but there is relatively little research that reflects on the internal consistency of the…
Descriptors: Standard Setting (Scoring), Probability, Cutting Scores, Evaluation Methods
Clauser, Brian E.; Baldwin, Peter; Margolis, Melissa J.; Mee, Janet; Winward, Marcia – Journal of Educational Measurement, 2017
Validating performance standards is challenging and complex. Because of the difficulties associated with collecting evidence related to external criteria, validity arguments rely heavily on evidence related to internal criteria--especially evidence that expert judgments are internally consistent. Given its importance, it is somewhat surprising…
Descriptors: Evaluation Methods, Standard Setting, Cutting Scores, Expertise
Clauser, Jerome C.; Margolis, Melissa J.; Clauser, Brian E. – Journal of Educational Measurement, 2014
Evidence of stable standard setting results over panels or occasions is an important part of the validity argument for an established cut score. Unfortunately, due to the high cost of convening multiple panels of content experts, standards often are based on the recommendation from a single panel of judges. This approach implicitly assumes that…
Descriptors: Standard Setting (Scoring), Generalizability Theory, Replication (Evaluation), Cutting Scores
Margolis, Melissa J.; Clauser, Brian E. – Educational Measurement: Issues and Practice, 2014
This research evaluated the impact of a common modification to Angoff standard-setting exercises: the provision of examinee performance data. Data from 18 independent standard-setting panels across three different medical licensing examinations were examined to investigate whether and how the provision of performance information impacted judgments…
Descriptors: Cutting Scores, Standard Setting (Scoring), Data, Licensing Examinations (Professions)
Margolis, Melissa J.; Mee, Janet; Clauser, Brian E.; Winward, Marcia; Clauser, Jerome C. – Educational Measurement: Issues and Practice, 2016
Evidence to support the credibility of standard setting procedures is a critical part of the validity argument for decisions made based on tests that are used for classification. One area in which there has been limited empirical study is the impact of standard setting judge selection on the resulting cut score. One important issue related to…
Descriptors: Academic Standards, Standard Setting (Scoring), Cutting Scores, Credibility
Clauser, Brian E.; Mee, Janet; Margolis, Melissa J. – International Journal of Testing, 2013
This study investigated the extent to which the performance data format impacted data use in Angoff standard setting exercises. Judges from two standard-setting exercises (a total of five panels) were randomly assigned to one of two groups. The full-data group received two types of data: (1) the proportion of examinees selecting each option and (2) plots…
Descriptors: Standard Setting (Scoring), Cutting Scores, Validity, Reliability
Clauser, Brian E.; Harik, Polina; Margolis, Melissa J.; McManus, I. C.; Mollon, Jennifer; Chis, Liliana; Williams, Simon – Applied Measurement in Education, 2009
Numerous studies have compared the Angoff standard-setting procedure to other standard-setting methods, but relatively few studies have evaluated the procedure based on internal criteria. This study uses a generalizability theory framework to evaluate the stability of the estimated cut score. To provide a measure of internal consistency, this…
Descriptors: Generalizability Theory, Group Discussion, Standard Setting (Scoring), Scoring
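Note on the generalizability-theory framing in the last entry above: the stability of an Angoff cut score is often summarized by the standard error of the panel's mean rating. The expression below is an illustrative sketch, assuming a fully crossed judges-by-items design with items treated as fixed and judges as random; the notation ($n_j$, $n_i$, and the variance components) is generic and not taken from the cited study.

$$\widehat{SE}(\hat{c}) \;=\; \sqrt{\frac{\hat{\sigma}^2_{j}}{n_j} \;+\; \frac{\hat{\sigma}^2_{ji}}{n_j\, n_i}}$$

Here $\hat{c}$ is the recommended cut score (the mean of all judges' item ratings), $\hat{\sigma}^2_{j}$ is the judge variance component, $\hat{\sigma}^2_{ji}$ is the judge-by-item interaction component (confounded with residual error), and $n_j$ and $n_i$ are the numbers of judges and items.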