McClarty, Katie Larsen – Measurement: Interdisciplinary Research and Perspectives, 2013
The construct map is a promising tool for organizing the data standard-setting panelists interpret. The challenge in applying construct maps to standard-setting procedures will be the judicious selection of data to include within this organizing framework. Therefore, this commentary focuses on decisions about what to include in the construct map.…
Descriptors: Standard Setting (Scoring), Maps, Validity, Evidence
Mee, Janet; Clauser, Brian E.; Margolis, Melissa J. – Educational Measurement: Issues and Practice, 2013
Despite being widely used and frequently studied, the Angoff standard setting procedure has received little attention with respect to an integral part of the process: how judges incorporate examinee performance data in the decision-making process. Without performance data, subject matter experts have considerable difficulty accurately making the…
Descriptors: Standard Setting (Scoring), Judges, Data, Decision Making
Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J.; Katz, Irvin R. – Applied Measurement in Education, 2015
The Angoff method requires experts to view every item on the test and make a probability judgment. This can be time-consuming when there are large numbers of items on the test. In this study, a G-theory framework was used to determine whether a subset of items can be used to make generalizable cut-score recommendations. Angoff ratings (i.e.,…
Descriptors: Reliability, Standard Setting (Scoring), Cutting Scores, Test Items
Kannan, Priya; Sgammato, Adrienne; Tannenbaum, Richard J. – Applied Measurement in Education, 2015
Establishing cut scores using the Angoff method requires panelists to evaluate every item on a test and make a probability judgment. This can be time-consuming when there are large numbers of items on the test. Previous research using resampling studies suggests that it is possible to recommend stable Angoff-based cut score estimates using a…
Descriptors: Cutting Scores, Test Items, Standard Setting (Scoring), Feasibility Studies
Shulruf, Boaz; Jones, Phil; Turner, Rolf – Higher Education Studies, 2015
The determination of Pass/Fail decisions over Borderline grades (i.e., grades which do not clearly distinguish between the competent and incompetent examinees) has been an ongoing challenge for academic institutions. This study utilises the Objective Borderline Method (OBM) to determine examinee ability and item difficulty, and from that…
Descriptors: Undergraduate Students, Pass Fail Grading, Decision Making, Probability
Schulz, E. Matthew – Measurement: Interdisciplinary Research and Perspectives, 2013
In this article, E. Matthew Schulz responds to Adam Wyse's article, "Construct Maps as a Foundation for Standard Setting." In doing so, he asserts that one of the most important ideas in Wyse's work is that information used in standard setting needs to be better represented through the use of graphics. However, he's not…
Descriptors: Standard Setting (Scoring), Maps, Item Response Theory, Test Items
Sorensen, Henry L. – ProQuest LLC, 2013
Cut-score setting processes are used to establish the passing standards for all kinds of tests in education and for credentialing. While experts use their best efforts to guide cut-score setting processes to generate valid and reliable results, cut-score participants often have a difficult time understanding the standard at which the cut score is…
Descriptors: Cutting Scores, Standard Setting (Scoring), Comparative Analysis, Difficulty Level
O'Neill, Thomas R.; Peabody, Michael R.; Stelter, Keith L.; Hagen, Michael D. – Online Submission, 2015
(Purpose) The purpose of our study was to assess the need for an external searchable resource (ESR) to be used in conjunction with the American Board of Family Medicine's (ABFM) Maintenance of Certification for Family Physicians (MC-FP) Examination, discuss the philosophical question of whether an ESR should be allowed on the examination, and outline…
Descriptors: Licensing Examinations (Professions), Family Practice (Medicine), Physicians, Online Searching
Shin, Sun-Young; Lidster, Ryan – Language Testing, 2017
In language programs, it is crucial to place incoming students into appropriate levels to ensure that course curriculum and materials are well targeted to their learning needs. Deciding how and where to set cutscores on placement tests is thus of central importance to programs, but previous studies in educational measurement disagree as to which…
Descriptors: Language Tests, English (Second Language), Standard Setting (Scoring), Student Placement
Shulruf, Boaz; Booth, Roger; Baker, Heather; Bagg, Warwick; Barrow, Mark – Journal of Further and Higher Education, 2017
Decisions about progress through an academic programme are made by Boards of Examiners, on the basis of students' course assessments. For most students such pass/fail grading decisions are straightforward. However, for those students whose results are borderline (either at a pass/fail boundary or boundaries between grades) the exercise of some…
Descriptors: Medical Education, Decision Making, Student Promotion, Pass Fail Grading
Skaggs, Gary – Measurement: Interdisciplinary Research and Perspectives, 2013
The construct map is a particularly good way to approach instrument development, and this author states that he was delighted to read Adam Wyse's thoughts about how to use construct maps for standard setting. For a number of popular standard-setting methods, Wyse shows how typical feedback to panelists fits within a construct map framework.…
Descriptors: Standard Setting (Scoring), Maps, Test Construction, Measurement
Shulruf, Boaz; Turner, Rolf; Poole, Phillippa; Wilkinson, Tim – Advances in Health Sciences Education, 2013
The decision to pass or fail a medical student is a "high stakes" one. The aim of this study is to introduce and demonstrate the feasibility and practicality of a new objective standard-setting method for determining the pass/fail cut-off score from borderline grades. Three methods for setting up pass/fail cut-off scores were compared: the…
Descriptors: Standard Setting (Scoring), Probability, Medical Schools, Medical Students
Humphry, Stephen; Heldsinger, Sandra; Andrich, David – Applied Measurement in Education, 2014
One of the best-known methods for setting a benchmark standard on a test is that of Angoff and its modifications. When items are scored dichotomously, judges estimate the probability that a benchmark student will answer each item correctly. As in most methods of standard setting, it is assumed implicitly that the unit of the latent scale of the…
Descriptors: Foreign Countries, Standard Setting (Scoring), Judges, Item Response Theory
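The Angoff judgment described in the abstract above (each judge estimates the probability that a benchmark student answers each item correctly) can be illustrated with a minimal sketch. It assumes the common textbook form of the basic calculation: sum each judge's item probabilities to get that judge's implied raw cut score, then average across judges. The function name `angoff_cut_score` and the ratings are hypothetical and are not taken from any of the listed articles, which may study modified variants.

```python
from statistics import mean


def angoff_cut_score(ratings):
    """Return a raw cut score from Angoff-style ratings.

    ratings[j][i] is judge j's estimated probability (0 to 1) that a
    benchmark (borderline) examinee answers item i correctly.
    Assumption: basic unmodified Angoff, dichotomously scored items.
    """
    if not ratings or not ratings[0]:
        raise ValueError("ratings must contain at least one judge and one item")
    n_items = len(ratings[0])
    for row in ratings:
        if len(row) != n_items:
            raise ValueError("every judge must rate the same number of items")
        if any(p < 0.0 or p > 1.0 for p in row):
            raise ValueError("each rating must be a probability between 0 and 1")
    # Each judge's implied cut score is the expected number of items correct.
    judge_cuts = [sum(row) for row in ratings]
    # The panel's recommended cut score is the mean across judges.
    return mean(judge_cuts)


# Hypothetical ratings: three judges, four items.
example_ratings = [
    [0.6, 0.7, 0.5, 0.8],
    [0.5, 0.8, 0.4, 0.9],
    [0.7, 0.6, 0.6, 0.7],
]
example_cut = angoff_cut_score(example_ratings)
```

With these made-up ratings every judge's probabilities sum to about 2.6, so the recommended raw cut score is about 2.6 out of 4 items. Operational panels usually add discussion rounds and performance data before ratings are finalized.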
Kingston, Neal M.; Tiemann, Gail C.; Loughran, Jessica T. – Measurement: Interdisciplinary Research and Perspectives, 2013
The authors of this article comment on "Construct Maps as a Foundation for Standard Setting," by Adam E. Wyse (this issue) in which Wyse presents construct maps, a visual display of a variety of sources of evidence that support standard-setting decisions, and shows how this approach could be used with a variety of existing…
Descriptors: Standard Setting (Scoring), Maps, Methods, Misconceptions
Davis-Becker, Susan L.; Buckendahl, Chad W. – International Journal of Testing, 2013
A critical component of the standard setting process is collecting evidence to evaluate the recommended cut scores and their use for making decisions and classifying students based on test performance. Kane (1994, 2001) proposed a framework by which practitioners can identify and evaluate evidence of the results of the standard setting from (1)…
Descriptors: Standard Setting (Scoring), Evidence, Validity, Cutting Scores