Publication Date
In 2025 | 2 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 11 |
Since 2016 (last 10 years) | 50 |
Since 2006 (last 20 years) | 150 |
Descriptor
Standard Setting (Scoring) | 502 |
Cutting Scores | 228 |
Standards | 165 |
Elementary Secondary Education | 107 |
Test Items | 92 |
Evaluation Methods | 90 |
Academic Standards | 79 |
Scoring | 75 |
Minimum Competency Testing | 70 |
Licensing Examinations… | 66 |
Educational Assessment | 64 |
More ▼ |
Source
Author
Publication Type
Education Level
Location
Canada | 10 |
Australia | 8 |
Tennessee | 8 |
United Kingdom | 7 |
California | 4 |
Kansas | 4 |
Massachusetts | 4 |
New Jersey | 4 |
United States | 4 |
Illinois | 3 |
Michigan | 3 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

Zieky, Michael – Studies in Educational Evaluation, 1989
Problems inherent in setting standards/passing scores for criterion-referenced tests are discussed; and traditional methods of setting standards are reviewed. Three acceptable methods based on judgments of questions are discussed; their authors include, respectively: (1) W. H. Angoff (1971); (2) R. L. Ebel (1972); and (3) L. Nedelsky (1954). (SLD)
Descriptors: Criterion Referenced Tests, Cutting Scores, Evaluation Methods, Standard Setting (Scoring)
Williamson, David M. – CLEAR Exam Review, 1999
Discusses panels for standard setting and presents 10 guidelines for the selection of panel members for such studies. Panel members should themselves hold the license for which they are producing a cutting score, and they must be familiar with the requirements of the profession and the characteristics of the candidates. (SLD)
Descriptors: Cutting Scores, Evaluators, Licensing Examinations (Professions), Selection
Radwan, Nizam; Rogers, W. Todd – Alberta Journal of Educational Research, 2006
The recent increase in the use of constructed-response items in educational assessment and the dissatisfaction with the nature of the decision that the judges must make using traditional standard-setting methods created a need to develop new and effective standard-setting procedures for tests that include both multiple-choice and…
Descriptors: Criticism, Cutting Scores, Educational Assessment, Standard Setting (Scoring)
Lin, Jie – Alberta Journal of Educational Research, 2006
The Bookmark standard-setting procedure was developed to address the perceived problems with the most popular method for setting cut-scores: the Angoff procedure (Angoff, 1971). The purposes of this article are to review the Bookmark procedure and evaluate it in terms of Berk's (1986) criteria for evaluating cut-score setting methods. The…
Descriptors: Standard Setting (Scoring), Cutting Scores, Evaluation Criteria, Evaluation Research

Norcini, John J.; And Others – Journal of Educational Measurement, 1988
Two studies of medical certification examinations were undertaken to assess standard setting using Angoff's Method. Results indicate that (1) specialization within broad content areas does not affect an expert's estimates of the performance of the borderline group; and (2) performance data should be provided during the standard-setting process.…
Descriptors: Certification, Cutting Scores, Licensing Examinations (Professions), Medicine
Impara, James C.; Plake, Barbara S. – 2000
This paper reports the results of using several alternative methods of setting cut scores. The methods used were: (1) a variation of the Angoff method (1971); (2) a variation of the borderline group method; and (3) an advanced impact method (G. Dillon, 1996). The results discussed are from studies undertaken to set the cut scores for fourth grade…
Descriptors: Cutting Scores, Intermediate Grades, Mathematics Tests, Scoring Formulas

Sireci, Stephen G.; Robin, Frederic; Patelis, Thanos – Applied Measurement in Education, 1999
Presents a procedure for standard setting that involves the cluster analysis of test takers to discover examinee groups that are useful for envisioning marginally competent performance or defining borderline or contrasting groups. Illustrates use of the procedure with a statewide mathematics test, and concludes that cluster analysis is useful in…
Descriptors: Cluster Analysis, Mathematics Tests, Standard Setting (Scoring), Standards
Wood, Timothy J.; Humphrey-Murto, Susan M.; Norman, Geoffrey R. – Advances in Health Sciences Education, 2006
When setting standards, administrators of small-scale OSCEs often face several challenges, including a lack of resources, a lack of available expertise in statistics, and difficulty in recruiting judges. The Modified Borderline-Group Method is a standard setting procedure that compensates for these challenges by using physician examiners and is…
Descriptors: Intervals, Standard Setting (Scoring), Measures (Individuals), Examiners
Reid, Jerry B. – 1984
While standard setting procedures are typically discussed in terms of deriving a reasonable cutting score for a given form of a test, the situation may be structured such that the standard has been mandated without regard to the test form itself. This situation may result either through legislative or policy actions and may be a fait accompli by…
Descriptors: Certification, Cutting Scores, Policy, Scores
Halpin, Glennelle; Halpin, Gerald – 1983
Research indicating that different cut-off points result from the use of different standard-setting techniques leaves decision makers with a disturbing dilemma: Which standard-setting method is best? This investigation of the reliability and validity of 10 different standard-setting approaches was designed to provide information that might help…
Descriptors: Adults, Comparative Analysis, Cutting Scores, Language Arts
Meskauskas, John A. – 1978
The ways in which standard-setting procedures are carried out in the medical specialty certification area are described. Three experiments that have been conducted with alternatives to currently used normative standards are examined, and reasons why these experiments have had limited success are suggested in this speech. The defensibility of…
Descriptors: Certification, Higher Education, Medical Education, Physicians

Norcini, John J.; And Others – Journal of Educational Measurement, 1988
Multiple matrix sampling is applied to a variation of Angoff's standard setting method. Thirty-six experts (internists) and 190 items were divided into five groups, and borderline examinee performance was estimated. There was some variability in the cutting scores produced by the individual groups, but various components were well estimated. (SLD)
Descriptors: Cutting Scores, Minimum Competency Testing, Physicians, Sampling

Edwards, Sarah – Voices from the Middle, 2002
Considers how educators can best prepare their students to meet the demands of test standards without sacrificing what they know to be effective teaching. Provides examples of how addressing the standards improved both the author's teaching and student learning by providing a focus and a challenge that moved her out of her comfort zone. (SG)
Descriptors: Academic Achievement, Instructional Effectiveness, Instructional Innovation, Middle Schools

Engelhard, George, Jr.; Anderson, David W. – Applied Measurement in Education, 1998
A new approach for examining the quality of judgments from standard-setting judges using a Binomial Trials Model (BTM) is presented and illustrated with 26 judges from the Georgia High School Graduation Test. Results suggest that the BTM provides information not available from other methods. (SLD)
Descriptors: Graduation Requirements, High Schools, Judges, Standard Setting (Scoring)

Hambleton, Ronald K.; Brennan, Robert L.; Brown, William; Dodd, Barbara; Forsyth, Robert A.; Mehrens, William A.; Nellhaus, Jeff; Reckase, Mark; Rindone, Douglas; van der Linden, Wim J.; Zwick, Rebecca – Educational Measurement: Issues and Practice, 2000
Responds to a negative evaluation of the National Assessment of Educational Progress (NAEP) by the National Academy of Sciences (NAS) and asserts that a review of the evidence for the NAEP performance standards indicates that there is support for the current approach to NAEP standard setting. Considers the scholarship of the NAS evaluation…
Descriptors: Academic Achievement, Elementary Secondary Education, Program Evaluation, Standard Setting (Scoring)