Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 17 |
Descriptor
Standard Setting | 23 |
Cutting Scores | 6 |
Evaluation Methods | 6 |
Evaluation | 4 |
Testing Programs | 4 |
Validity | 4 |
Item Response Theory | 3 |
Psychometrics | 3 |
Reader Response | 3 |
Standards | 3 |
Test Items | 3 |
More ▼ |
Source
Educational Measurement:… | 23 |
Author
Bunch, Michael B. | 2 |
Reckase, Mark D. | 2 |
Sireci, Stephen G. | 2 |
Baron, Patricia | 1 |
Burt, Winona M. | 1 |
Camara, Wayne | 1 |
Chudowsky, Naomi | 1 |
Cizek, Gregory J. | 1 |
Cook, Robert | 1 |
Elliott, Stuart | 1 |
Geisinger, Kurt F. | 1 |
More ▼ |
Publication Type
Journal Articles | 23 |
Reports - Descriptive | 9 |
Reports - Evaluative | 5 |
Reports - Research | 5 |
Opinion Papers | 4 |
Education Level
Elementary Secondary Education | 2 |
Elementary Education | 1 |
Grade 3 | 1 |
Higher Education | 1 |
Postsecondary Education | 1 |
Audience
Teachers | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Baron, Patricia; Sireci, Stephen G.; Slater, Sharon C. – Educational Measurement: Issues and Practice, 2021
Since the No Child Left Behind Act (No Child Left Behind [NCLB], 2001) was enacted, the Bookmark method has been used in many state standard setting studies (Karantonis and Sireci; Zieky, Perie, and Livingston). The purpose of the current study is to evaluate the criticism that when panelists are presented with data during the Bookmark standard…
Descriptors: State Standards, Standard Setting, Evaluators, Training
Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023
The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…
Descriptors: Item Response Theory, Standard Setting, Testing, Sampling
Bunch, Michael B. – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Michael Bunch provides an in-depth, step-by-step look at how standard setting is done. It does not focus on any specific procedure or methodology (e.g., modified Angoff, bookmark, and body of work) but on the practical tasks that must be completed for any standard setting activity. Dr. Bunch carries the…
Descriptors: Standard Setting, Cutting Scores, Scores, Reports
Lewis, Daniel; Cook, Robert – Educational Measurement: Issues and Practice, 2020
In this paper we assert that the practice of principled assessment design renders traditional standard-setting methodology redundant at best and contradictory at worst. We describe the rationale for, and methodological details of, Embedded Standard Setting (ESS; previously, Engineered Cut Scores. Lewis, 2016), an approach to establish performance…
Descriptors: Standard Setting, Evaluation, Cutting Scores, Performance Based Assessment
Camara, Wayne – Educational Measurement: Issues and Practice, 2014
This article reviews the intended uses of these college- and career-readiness assessments with the goal of articulating an appropriate validity argument to support such uses. These assessments differ fundamentally from today's state assessments employed for state accountability. Current assessments are used to determine if students have…
Descriptors: College Readiness, Career Readiness, Aptitude Tests, Test Use
Geisinger, Kurt F.; McCormick, Carina M. – Educational Measurement: Issues and Practice, 2010
Standard-setting studies utilizing procedures such as the Bookmark or Angoff methods are just one component of the complete standard-setting process. Decision makers ultimately must determine what they believe to be the most appropriate standard or cut score to use, employing the input of the standard-setting panelists as one piece of information…
Descriptors: Standard Setting (Scoring), Measurement, Cutting Scores, Educational Policy
Nichols, Paul; Twing, Jon; Mueller, Canda D.; O'Malley, Kimberly – Educational Measurement: Issues and Practice, 2010
Some writers in the measurement literature have been skeptical of the meaningfulness of achievement standards and described the standard-setting process as blatantly arbitrary. We argue that standard setting is more appropriately conceived of as a measurement process similar to student assessment. The construct being measured is the panelists'…
Descriptors: Scaling, Achievement, Standard Setting (Scoring), Measurement
Hein, Serge F.; Skaggs, Gary – Educational Measurement: Issues and Practice, 2010
Increasingly, research has focused on the cognitive processes associated with various standard-setting activities. This qualitative study involved an examination of 16 third-grade reading teachers' experiences with the cognitive task of conceptualizing an entire classroom of hypothetical target students when the single-passage bookmark method or…
Descriptors: Focus Groups, Standard Setting, Interviews, Reading Teachers
Lissitz, Robert W.; Wei, Hua – Educational Measurement: Issues and Practice, 2008
In this article we address the issue of consistency in standard setting in the context of an augmented state testing program. Information gained from the external NRT scores is used to help make an informed decision on the determination of cut scores on the state test. The consistency of cut scores on the CRT across grades is maintained by forcing…
Descriptors: Testing Programs, State Programs, Standard Setting, Reliability
Burt, Winona M.; Stapleton, Laura M. – Educational Measurement: Issues and Practice, 2010
The purpose of this study was to investigate the connotation of performance labels used in standard setting. For example, do the performance labels "basic," "proficient," and "advanced" hold different connotations than "limited knowledge," "satisfactory," and "distinguished"? If these…
Descriptors: Standard Setting, Definitions, High Stakes Tests, Measures (Individuals)
Karantonis, Ana; Sireci, Stephen G. – Educational Measurement: Issues and Practice, 2006
The Bookmark method for setting standards on educational tests is currently one of the most popular standard-setting methods. However, research to support the method is scarce. In this report, we review the published and unpublished literature on this method as well as some seminal work in the area of evaluating standard-setting studies. Our…
Descriptors: Academic Standards, Educational Testing, Literature Reviews, Validity
Cizek, Gregory J.; Bunch, Michael B.; Koons, Heather – Educational Measurement: Issues and Practice, 2004
This module describes some common standard-setting procedures used to derive performance levels for achievement tests in education, licensure, and certification. Upon completing the module, readers will be able to: describe what standard setting is; understand why standard setting is necessary; recognize some of the purposes of standard setting;…
Descriptors: Achievement Tests, Standard Setting, Academic Standards, Academic Achievement

Jaeger, Richard M. – Educational Measurement: Issues and Practice, 1990
Means of establishing standards for teacher certification tests are discussed. Focus is on the requirements and implications of the 1985 "Standards for Educational and Psychological Tests" and the 1978 "Uniform Guidelines on Employee Selection Procedures" that apply to the establishment of teacher certification test standards.…
Descriptors: Civil Rights Legislation, Guidelines, Higher Education, Licensing Examinations (Professions)

Haertel, Edward H. – Educational Measurement: Issues and Practice, 2002
Outlines a framework for considering the validity of standards-based score interpretations and then considers the potential roles of different stakeholder groups and other participants in that process. Suggests study of a new standard-setting method, the "briefing book," which would describe alternative cut scores. (SLD)
Descriptors: Accountability, Cutting Scores, Elementary Secondary Education, High Stakes Tests
Huynh, Huynh – Educational Measurement: Issues and Practice, 2006
By analyzing the Fisher information allotted to the correct response of a Rasch binary item, Huynh (1994) established the response probability criterion 0.67 (RP67) for standard settings based on bookmarks and item mapping. The purpose of this note is to help clarify the conceptual and psychometric framework of the RP criterion.
Descriptors: Probability, Standard Setting, Item Response Theory, Psychometrics
Previous Page | Next Page ยป
Pages: 1 | 2