Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 9 |
Descriptor
Educational Assessment | 64 |
Standard Setting (Scoring) | 64 |
Elementary Secondary Education | 27 |
Performance Based Assessment | 22 |
Standards | 22 |
Academic Standards | 19 |
Cutting Scores | 18 |
Evaluation Methods | 15 |
Academic Achievement | 14 |
Scoring | 14 |
Test Construction | 12 |
More ▼ |
Source
Author
Plake, Barbara S. | 5 |
Hambleton, Ronald K. | 4 |
Reckase, Mark D. | 3 |
Chelimsky, Eleanor | 2 |
Cizek, Gregory J. | 2 |
Impara, James C. | 2 |
Jaeger, Richard M. | 2 |
Kahl, Stuart R. | 2 |
Linn, Robert L. | 2 |
Baker, Eva L. | 1 |
Baldwin, Su G. | 1 |
More ▼ |
Publication Type
Education Level
Elementary Education | 3 |
Elementary Secondary Education | 3 |
High Schools | 2 |
Higher Education | 2 |
Intermediate Grades | 2 |
Secondary Education | 2 |
Adult Education | 1 |
Grade 4 | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
Grade 9 | 1 |
More ▼ |
Location
Canada | 3 |
United States | 3 |
China | 1 |
Connecticut | 1 |
Europe | 1 |
Louisiana | 1 |
Maine | 1 |
Michigan | 1 |
Minnesota | 1 |
New Hampshire | 1 |
New Jersey | 1 |
More ▼ |
Laws, Policies, & Programs
Carl D Perkins Vocational and… | 1 |
Improving Americas Schools… | 1 |
Assessments and Surveys
National Assessment of… | 11 |
Test of English for… | 1 |
What Works Clearinghouse Rating
Schmidgall, Jonathan – Educational Testing Service, 2021
The redesigned "TOEIC Bridge"® tests are designed to measure the reading, listening, speaking, and writing proficiency of beginning to low-intermediate English learners in the context of everyday adult life. This report describes the comprehensive and multifaceted process used to enhance the meaningfulness of TOEIC Bridge test score…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Language Proficiency
Foley, Brett P. – Practical Assessment, Research & Evaluation, 2016
There is always a chance that examinees will answer multiple choice (MC) items correctly by guessing. Design choices in some modern exams have created situations where guessing at random through the full exam--rather than only for a subset of items where the examinee does not know the answer--can be an effective strategy to pass the exam. This…
Descriptors: Guessing (Tests), Multiple Choice Tests, Case Studies, Test Construction
Wyse, Adam E.; Bunch, Michael B.; Deville, Craig; Viger, Steven G. – Educational and Psychological Measurement, 2014
This article describes a novel variation of the Body of Work method that uses construct maps to overcome problems of transparency, rater inconsistency, and scores gaps commonly occurring with the Body of Work method. The Body of Work method with construct maps was implemented to set cut-scores for two separate K-12 assessment programs in a large…
Descriptors: Standard Setting (Scoring), Educational Assessment, Elementary Secondary Education, Measurement
Iyioke, Ifeoma Chika – ProQuest LLC, 2013
This dissertation describes a design for training, in accordance with probability judgment heuristics principles, for the Angoff standard setting method. The new training with instruction, practice, and feedback tailored to the probability judgment heuristics principles was called the Heuristic training and the prevailing Angoff method training…
Descriptors: Standard Setting (Scoring), Probability, Heuristics, Training
Hsieh, Mingchuan – Language Assessment Quarterly, 2013
The Yes/No Angoff and Bookmark method for setting standards on educational assessment are currently two of the most popular standard-setting methods. However, there is no research into the comparability of these two methods in the context of language assessment. This study compared results from the Yes/No Angoff and Bookmark methods as applied to…
Descriptors: Standard Setting (Scoring), Comparative Analysis, Language Tests, Multiple Choice Tests
Rodeck, Elaine M.; Chin, Tzu-Yun; Davis, Susan L.; Plake, Barbara S. – Journal of Applied Testing Technology, 2008
This study examined the relationships between the evaluations obtained from standard setting panelists and changes in ratings between different rounds of a standard setting study that involved setting standards on different language versions of an exam. We investigated panelists' evaluations to determine if their perceptions of the standard…
Descriptors: Mathematics Tests, Standard Setting (Scoring), French, Evaluation Research
Clauser, Brian E.; Mee, Janet; Baldwin, Su G.; Margolis, Melissa J.; Dillon, Gerard F. – Journal of Educational Measurement, 2009
Although the Angoff procedure is among the most widely used standard setting procedures for tests comprising multiple-choice items, research has shown that subject matter experts have considerable difficulty accurately making the required judgments in the absence of examinee performance data. Some authors have viewed the need to provide…
Descriptors: Standard Setting (Scoring), Program Effectiveness, Expertise, Health Personnel
Ferrara, Steve; Perie, Marianne; Johnson, Eugene – Journal of Applied Testing Technology, 2008
Psychometricians continue to introduce new approaches to setting cut scores for educational assessments in an attempt to improve on current methods. In this paper we describe the Item-Descriptor (ID) Matching method, a method based on IRT item mapping. In ID Matching, test content area experts match items (i.e., their judgments about the knowledge…
Descriptors: Test Results, Test Content, Testing Programs, Educational Testing
Radwan, Nizam; Rogers, W. Todd – Alberta Journal of Educational Research, 2006
The recent increase in the use of constructed-response items in educational assessment and the dissatisfaction with the nature of the decision that the judges must make using traditional standard-setting methods created a need to develop new and effective standard-setting procedures for tests that include both multiple-choice and…
Descriptors: Criticism, Cutting Scores, Educational Assessment, Standard Setting (Scoring)

Kane, Michael – Educational Assessment, 1998
Examines criteria for choosing between test-centered and examinee-centered methods of standard setting in empirical terms and in terms of whether the method is consistent with the model of achievement underlying test design and interpretation and the assessment methods being used. Contains 35 references. (Author/SLD)
Descriptors: Academic Achievement, Criteria, Educational Assessment, Evaluation Methods

Hamilton, J. S.; McLone, R. R. – Studies in Educational Evaluation, 1989
Influences on the educational validity of examinations are reviewed. Changes occurring in approaches to standard setting are traced. A view of reliability is presented, with emphasis on assessment of project work, which often involves individual investigation and design by students. A consistency index formula for grading standards is presented.…
Descriptors: Cutting Scores, Educational Assessment, Elementary Secondary Education, Standard Setting (Scoring)
Cizek, Gregory J. – 1995
The concept of due process provides an analogy for the process of standard setting that emphasizes many of the procedural and substantive elements of the process over technical and statistical concerns. Surely such concerns can and should continue to be addressed. However, a sound rationale for standard setting does not rest on this foundation.…
Descriptors: Criteria, Decision Making, Due Process, Educational Assessment
Reckase, Mark D. – 1998
Standard setting is a fairly widespread activity in educational and psychological measurement, but there is no formal psychometric theory to guide the development of standard setting methodology. This paper presents a conceptual framework for such a psychometric theory and uses the conceptual framework to analyze a number of methods for setting…
Descriptors: Educational Assessment, Evaluation Methods, Judges, Measurement Techniques
van der Linden, Wim J. – 1994
Elements of arbitrariness in the standard setting process are explored, and an alternative to the use of cut scores is presented. The first part of the paper analyzes the use of cut scores in large-scale assessments, discussing three different functions: (1) cut scores define the qualifications used in assessments; (2) they simplify the reporting…
Descriptors: Academic Achievement, Criteria, Cutting Scores, Educational Assessment

Tyler, Ralph W. – Educational Measurement: Issues and Practice, 1983
Following a review of educational standard-setting experiences in the United States, the procedures used by the National Assessment of Educational Progress (NAEP) to arrive at a consensus on educational objectives are described. These objectives represent the only definitions of educational quality that are widely accepted by professional and lay…
Descriptors: Academic Achievement, Academic Standards, Educational Assessment, Educational Objectives