Humphry, Stephen; Heldsinger, Sandra; Andrich, David – Applied Measurement in Education, 2014
One of the best-known methods for setting a benchmark standard on a test is that of Angoff and its modifications. When items are scored dichotomously, judges estimate the probability that a benchmark student will answer each item correctly. As in most methods of standard setting, it is assumed implicitly that the unit of the latent scale of the…
Descriptors: Foreign Countries, Standard Setting (Scoring), Judges, Item Response Theory
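As a rough illustration of the Angoff judgments described in the entry above, the sketch below averages the judges' probability estimates for each item and sums those averages to get a raw-score cut point. It shows the basic Angoff computation only, not the procedure studied in the cited article. The function name `angoff_cut_score` and the layout of `ratings` are assumptions made for this example.

```python
from statistics import mean


def angoff_cut_score(ratings):
    """Return a raw-score cut point from Angoff-style judgments.

    ratings[j][i] is judge j's estimated probability (0 to 1) that a
    benchmark student answers dichotomously scored item i correctly.
    This data layout is hypothetical, chosen for the illustration.
    """
    if not ratings:
        raise ValueError("at least one judge is required")
    n_items = len(ratings[0])
    if any(len(judge) != n_items for judge in ratings):
        raise ValueError("every judge must rate every item")
    for judge in ratings:
        for p in judge:
            if not 0.0 <= p <= 1.0:
                raise ValueError("ratings must be probabilities in [0, 1]")

    # Average across judges for each item, then sum across items:
    # the result is the expected raw score of the benchmark student.
    item_means = [mean(judge[i] for judge in ratings) for i in range(n_items)]
    return sum(item_means)


if __name__ == "__main__":
    # Two hypothetical judges rating a three-item test.
    example = [
        [0.50, 0.75, 1.00],
        [0.50, 0.25, 1.00],
    ]
    print(angoff_cut_score(example))
```

Summing the per-item means treats every item as worth one raw-score point, which matches the dichotomous scoring the abstract mentions.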
Kingston, Neal M.; Tiemann, Gail C.; Loughran, Jessica T. – Measurement: Interdisciplinary Research and Perspectives, 2013
The authors of this article comment on "Construct Maps as a Foundation for Standard Setting," by Adam E. Wyse (this issue) in which Wyse presents construct maps, a visual display of a variety of sources of evidence that support standard-setting decisions, and shows how this approach could be used with a variety of existing…
Descriptors: Standard Setting (Scoring), Maps, Methods, Misconceptions
Davis-Becker, Susan L.; Buckendahl, Chad W. – International Journal of Testing, 2013
A critical component of the standard setting process is collecting evidence to evaluate the recommended cut scores and their use for making decisions and classifying students based on test performance. Kane (1994, 2001) proposed a framework by which practitioners can identify and evaluate evidence of the results of the standard setting from (1)…
Descriptors: Standard Setting (Scoring), Evidence, Validity, Cutting Scores
Lim, Gad S.; Geranpayeh, Ardeshir; Khalifa, Hanan; Buckendahl, Chad W. – International Journal of Testing, 2013
Standard setting theory has largely developed with reference to a typical situation, determining a level or levels of performance for one exam for one context. However, standard setting is now being used with international reference frameworks, where some parameters and assumptions of classical standard setting do not hold. We consider the…
Descriptors: Standard Setting (Scoring), Validity, Models, Language Tests
Clauser, Brian E.; Mee, Janet; Margolis, Melissa J. – International Journal of Testing, 2013
This study investigated the extent to which the performance data format impacted data use in Angoff standard setting exercises. Judges from two standard settings (a total of five panels) were randomly assigned to one of two groups. The full-data group received two types of data: (1) the proportion of examinees selecting each option and (2) plots…
Descriptors: Standard Setting (Scoring), Cutting Scores, Validity, Reliability
Cravens, Xiu Chen; Goldring, Ellen B.; Porter, Andrew C.; Polikoff, Morgan S.; Murphy, Joseph; Elliott, Stephen N. – Educational Administration Quarterly, 2013
Purpose: Performance evaluation informs professional development and helps school personnel improve student learning. Although psychometric literature indicates that a rational, sound, and coherent standard-setting process adds to the credibility of an assessment, few studies have empirically examined the decision-making process. This article…
Descriptors: Instructional Leadership, Cutting Scores, Decision Making, Standard Setting (Scoring)
Hoover, William Brian; French, Brian F.; Field, William E.; Tormoehlen, Roger L. – Journal of Agricultural Education, 2012
Minimum passing scores for the Gearing Up for Safety: Production Agriculture Safety Training for Youth curriculum (Gearing Up for Safety) were set in 2006 with widely used and established procedures by efforts of subject matter experts (French, Breidenbach et al., 2007; French, Field, and Tormoehlen, 2006, 2007). While providing a research-based…
Descriptors: Agriculture, Safety, Safety Education, Agricultural Production
Wyse, Adam E.; Bunch, Michael B.; Deville, Craig; Viger, Steven G. – Educational and Psychological Measurement, 2014
This article describes a novel variation of the Body of Work method that uses construct maps to overcome problems of transparency, rater inconsistency, and score gaps commonly occurring with the Body of Work method. The Body of Work method with construct maps was implemented to set cut scores for two separate K-12 assessment programs in a large…
Descriptors: Standard Setting (Scoring), Educational Assessment, Elementary Secondary Education, Measurement
Çetin, Sevda; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2013
In this research, the cut score of a foundation university was recalculated with the Bookmark method and with the Angoff method, both of which are standard-setting methods, and the resulting cut scores were compared with the current proficiency score. Thus, the final cut score was found to be 27.87 with the cooperative work of 17 experts through the Angoff…
Descriptors: Standard Setting (Scoring), Comparative Analysis, Cutting Scores, Correlation
Schafer, William D.; Hou, Xiaodong – Practical Assessment, Research & Evaluation, 2011
This study discusses and presents an example of a use of spline functions to establish and report test scores using a moderated system of any number of cut scores. Our main goals include studying the need for and establishing moderated standards and creating a reporting scale that is referenced to all the standards. Our secondary goals are to make…
Descriptors: Cutting Scores, Standard Setting (Scoring), Achievement Tests, National Competency Tests
Figueras, Neus; Kaftandjieva, Felianka; Takala, Sauli – Canadian Modern Language Review, 2013
The article addresses some problems and options in setting standards on language tests and examinations. More specifically, it reports on a set of three workshops conducted in the European context where standard setting in language education typically concerns linking tests and examinations to the Council of Europe's "Common European…
Descriptors: Reading Comprehension, Reading Tests, Cutting Scores, Academic Standards
GED Testing Service, 2014
This manual was written to provide technical information regarding the General Educational Development (GED®) test as evidence that the GED® test is technically sound. Throughout this manual, documentation is provided regarding the development of the GED® test and data collection activities, as well as evidence of reliability and validity. This…
Descriptors: High School Equivalency Programs, Equivalency Tests, Testing Programs, Test Validity
Northwest Evaluation Association, 2015
Recently, the Smarter Balanced Assessment Consortium (Smarter Balanced) released a document that established initial performance levels and the associated threshold scale scores for the Smarter Balanced assessment. The report included estimated percentages of students expected to perform at each of the four performance levels, reported by grade…
Descriptors: Standard Setting, Standard Setting (Scoring), Pretesting, Cutting Scores
Hsieh, Mingchuan – Language Testing, 2013
When implementing standard setting procedures, there are two major concerns: variance between panelists and efficiency in conducting multiple rounds of judgments. With regard to the former, there is concern over the consistency of the cutoff scores made by different panelists. If the cut scores show an inordinately wide range then further rounds…
Descriptors: Item Response Theory, Standard Setting (Scoring), Language Tests, English (Second Language)
Pitoniak, Mary J.; Yeld, Nan – International Journal of Testing, 2013
Criterion-referenced assessments have become more common around the world, with performance standards being set to distinguish levels of student performance. However, use of standard setting methods developed in the United States may be complicated by factors related to the political and educational contexts within another country. In…
Descriptors: Standard Setting (Scoring), Criterion Referenced Tests, Benchmarking, Student Evaluation

