Peer reviewed: Fitzpatrick, Anne R. – Review of Educational Research, 1989
Research from social psychology on the effects of group discussion and of exposure to the opinions of other group members on the decisions groups make is reviewed. The pertinence of this research to standard-setting practices is considered, and the implications for the design of standard-setting procedures are explored. (SLD)
Descriptors: Decision Making, Evaluative Thinking, Group Discussion, Group Dynamics
Peer reviewed: Berk, Ronald A. – Applied Measurement in Education, 1995
A brief summary of standard setting knowledge is presented, derived from about 20 methods that utilize a judgmental review process, the approach most relevant to the standard-setting strategies proposed in this special issue. Criteria for judging effectiveness and critiques of the methods discussed in the issue are offered. (SLD)
Descriptors: Criteria, Decision Making, Educational History, Evaluation Methods
Peer reviewed: Hunter, Darryl M.; Gambell, Trevor J. – Canadian Journal of Program Evaluation, 2000
Discusses the roles played by participants and the processes involved in a provincial standards-setting exercise for a large-scale assessment of student skills. Offers perspectives of a policy maker outside the exercise and an insider participant. Describes alternate notions of representativeness, stakes, and significance. (SLD)
Descriptors: Evaluation Methods, Foreign Countries, Policy Formation, Program Evaluation
Peer reviewed: Journal of School Improvement, 2000
States that standard scores are the universal numerical language for reporting and comparison. Discusses what standard scores are, why they are used, and how raw scores are converted to standard scores. Provides contact information for those who would like to further their knowledge on the…
Descriptors: Educational Practices, Elementary Secondary Education, Higher Education, Standard Setting (Scoring)
Lin, Jie – Alberta Journal of Educational Research, 2006
The Bookmark standard-setting procedure was developed to address the perceived problems with the most popular method for setting cut-scores: the Angoff procedure (Angoff, 1971). The purposes of this article are to review the Bookmark procedure and evaluate it in terms of Berk's (1986) criteria for evaluating cut-score setting methods. The…
Descriptors: Standard Setting (Scoring), Cutting Scores, Evaluation Criteria, Evaluation Research
Plake, Barbara S.; Impara, James C. – 1996
This study investigated the intrajudge consistency of Angoff-based item performance estimates. The examination used was a certification examination in an emergency medicine specialty. Ten expert panelists rated the same 24 items twice during an operational standard setting study. Results indicate that the panelists were highly consistent, in terms…
Descriptors: Cutting Scores, Interrater Reliability, Licensing Examinations (Professions), Performance Based Assessment
van der Linden, Wim J. – 1994
Elements of arbitrariness in the standard setting process are explored, and an alternative to the use of cut scores is presented. The first part of the paper analyzes the use of cut scores in large-scale assessments, discussing three different functions: (1) cut scores define the qualifications used in assessments; (2) they simplify the reporting…
Descriptors: Academic Achievement, Criteria, Cutting Scores, Educational Assessment
Livingston, Samuel A. – 1983
Discussed are nine questions regarding standard setting issues in educational testing: (1) Should normative or content-referenced standards be used? (2) Different standard setting methods yield different results. Does this finding present a problem? (3) Assess the adequacy of the grounding of various methods of standard setting in psychological…
Descriptors: Educational Testing, Evaluation, Evaluation Methods, Measurement Objectives
Jones, J. Patrick; And Others – 1988
Three studies assessed the psychometric characteristics of the Direct Standard Setting Method (DSSM). The Angoff technique was also used in each study. The DSSM requires judges to consider an examination 10 items at a time and determine the minimum items in that set a candidate should answer correctly to receive the credential. Nine judges set a…
Descriptors: Certification, Credentials, Cutting Scores, Health Personnel
Harker, Jill K.; Cope, Ronald T. – 1988
Cut scores obtained for licensure tests using different judgmental methods of standard setting (holistic, test blueprint, Angoff, and modified Angoff) were compared. Nineteen educators and practitioners participated in this study as judges. Pre- and post-test feedback (feedback of total- and low-group item p-value) ratings were obtained under the…
Descriptors: Cutting Scores, Feedback, Holistic Evaluation, Interrater Reliability
Petry, John R. – 1984
This paper is a report of a study designed to develop recommendations on minimum qualifying scores for National Teacher Examinations (NTE) that are valid for certification and endorsement in Tennessee. The functions performed in the review of the NTE Core Battery and Specialty Area tests were conceptualized as panel activities. The number of…
Descriptors: Cutting Scores, Elementary Secondary Education, Occupational Tests, Standard Setting (Scoring)
Francis, Alexandria S.; Holmes, Susan E. – 1983
Discrepancies among the standards produced by different criterion-referenced standard-setting techniques may be the result of a failure to adequately define the minimally competent candidate. Current research in this area is reviewed in terms of three categories: studies in which no formal assistance in conceptualization is given to judges,…
Descriptors: Certification, Criterion Referenced Tests, Cutting Scores, Interrater Reliability
Kane, Michael; Wilson, Jennifer – 1982
This paper evaluates the magnitude of the total error in estimates of the difference between an examinee's domain score and the cutoff score. An observed score based on a random sample of items from the domain, and an estimated cutoff score derived from a judgmental standard setting procedure are assumed. The work of Brennan and Lockwood (1980) is…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, Mastery Tests
Reid, Jerry B. – 1985
This report investigates an area of uncertainty in using the Angoff method for setting standards, namely whether or not a judge's conceptualizations of borderline group performance are realistic. Ratings are usually made with reference to the performance of this hypothetical group; therefore, the Angoff method's success is dependent on this point.…
Descriptors: Certification, Cutting Scores, Difficulty Level, Interrater Reliability
Livingston, Samuel A. – 1982
In the specific methods of standard setting in testing, judgments about individuals being tested, contrasting groups being tested, and judgments about the test items are discussed. In judgments about individual test-takers, assumptions are presented based on the knowledge and skills the test is intended to measure, the test-takers' skills at the…
Descriptors: Academic Standards, Elementary Secondary Education, Evaluation Criteria, Evaluators

