Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 13 |
Descriptor
Evaluators | 20 |
Standard Setting | 20 |
Cutting Scores | 6 |
Decision Making | 5 |
English (Second Language) | 5 |
Evaluation Methods | 5 |
Foreign Countries | 5 |
Language Tests | 5 |
Test Items | 5 |
Item Analysis | 4 |
Second Language Learning | 4 |
More ▼ |
Source
Author
Clauser, Jerome C. | 2 |
Impara, James C. | 2 |
Pill, John | 2 |
Plake, Barbara S. | 2 |
Bacon, Donald R. | 1 |
Baldwin, Peter | 1 |
Baron, Patricia | 1 |
Batty, Aaron Olaf | 1 |
Beywl, Wolfgang | 1 |
Buckendahl, Chad W. | 1 |
Chinn, Roberta N. | 1 |
More ▼ |
Publication Type
Journal Articles | 19 |
Reports - Research | 14 |
Reports - Evaluative | 4 |
Opinion Papers | 2 |
Reports - Descriptive | 1 |
Speeches/Meeting Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 4 |
Adult Education | 2 |
Postsecondary Education | 2 |
Audience
Location
Australia | 1 |
Europe | 1 |
Japan | 1 |
Switzerland | 1 |
Thailand | 1 |
Laws, Policies, & Programs
Assessments and Surveys
International English… | 1 |
Test of English as a Foreign… | 1 |
Test of English for… | 1 |
United States Medical… | 1 |
What Works Clearinghouse Rating
Fisne, Fatima Nur; Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022
Performance standards have important consequences for all the stakeholders in the assessment of L2 academic writing. These standards not only describe the level of writing performance but also provide a basis for making evaluative decisions on the academic writing. Such a high-stakes role of the performance standards requires the enhancement of…
Descriptors: Standard Setting, Writing Evaluation, Academic Language, English (Second Language)
Baron, Patricia; Sireci, Stephen G.; Slater, Sharon C. – Educational Measurement: Issues and Practice, 2021
Since the No Child Left Behind Act (No Child Left Behind [NCLB], 2001) was enacted, the Bookmark method has been used in many state standard setting studies (Karantonis and Sireci; Zieky, Perie, and Livingston). The purpose of the current study is to evaluate the criticism that when panelists are presented with data during the Bookmark standard…
Descriptors: State Standards, Standard Setting, Evaluators, Training
Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020
An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…
Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting
Batty, Aaron Olaf; Haug, Tobias; Ebling, Sarah; Tissi, Katja; Sidler-Miserez, Sandra – Language Testing, 2023
Sign languages present particular challenges to language assessors in relation to variation in signs, weakly defined citation forms, and a general lack of standard-setting work even in long-established measures of productive sign proficiency. The present article addresses and explores these issues via a mixed-methods study of a human-rated…
Descriptors: Sign Language, Language Tests, Standard Setting, Barriers
Peabody, Michael R.; Wind, Stefanie A. – Journal of Educational Measurement, 2019
Setting performance standards is a judgmental process involving human opinions and values as well as technical and empirical considerations. Although all cut score decisions are by nature somewhat arbitrary, they should not be capricious. Judges selected for standard-setting panels should have the proper qualifications to make the judgments asked…
Descriptors: Standard Setting, Decision Making, Performance Based Assessment, Evaluators
White, Mark C. – Educational Researcher, 2018
Raters must score accurately and consistently for classroom observation scores to be valid. This requires (a) a standard defining when scoring is accurate and consistent enough and (b) measuring and remediating rater performance against that standard. Current practice has focused on this second problem to the exclusion of the first. My goal here…
Descriptors: Evaluators, Standard Setting, Classroom Observation Techniques, Scoring
Clauser, Jerome C.; Hambleton, Ronald K.; Baldwin, Peter – Educational and Psychological Measurement, 2017
The Angoff standard setting method relies on content experts to review exam items and make judgments about the performance of the minimally proficient examinee. Unfortunately, at times content experts may have gaps in their understanding of specific exam content. These gaps are particularly likely to occur when the content domain is broad and/or…
Descriptors: Scores, Item Analysis, Classification, Decision Making
Wudthayagorn, Jirada – LEARN Journal: Language Education and Acquisition Research Network, 2018
The purpose of this study was to map the Chulalongkorn University Test of English Proficiency, or the CU-TEP, to the Common European Framework of Reference (CEFR) by employing a standard setting methodology. Thirteen experts judged 120 items of the CU-TEP using the Yes/No Angoff technique. The experts decided whether or not a borderline student at…
Descriptors: Guidelines, Rating Scales, English (Second Language), Language Tests
Pill, John; McNamara, Tim – Language Testing, 2016
This paper considers how to establish the minimum required level of professionally relevant oral communication ability in the medium of English for health practitioners with English as an additional language (EAL) to gain admission to practice in jurisdictions where English is the dominant language. A theoretical concern is the construct of…
Descriptors: Specialists, Standard Setting, Language Tests, English (Second Language)
Bacon, Donald R.; Paul, Pallab; Stewart, Kim A.; Mukhopadhyay, Kausiki – Journal of Marketing Education, 2012
Much has been written about the evaluation of faculty research productivity in promotion and tenure decisions, including many articles that seek to determine the rank of various marketing journals. Yet how faculty evaluators combine journal quality, quantity, and author contribution to form judgments of a scholar's performance is unclear. A…
Descriptors: Productivity, Evaluators, Models, Marketing
Pill, John; Harding, Luke – Language Testing, 2013
This study identifies a unique context for exploring lay understandings of language testing and, by extension, for characterizing the nature of language assessment literacy among non-practitioners, stemming from data in an inquiry into the registration processes and support for overseas trained doctors by the Australian House of Representatives…
Descriptors: Language Tests, Testing, Foreign Nationals, Foreign Medical Graduates
Kozaki, Yoko – Language Assessment Quarterly, 2010
This article describes an alternative approach to setting standards for performance assessments. The procedure was designed for use in low-budget, relatively low-stakes contexts where it is not possible to bring expert judges together. The procedure that allowed participant judges to work individually throughout the process was an effort to…
Descriptors: Performance Based Assessment, Standard Setting, Decision Making, Certification
Verheggen, M. M.; Muijtjens, A. M. M.; Os, J. Van; Schuwirth, L. W. T. – Advances in Health Sciences Education, 2008
Background: To establish credible, defensible and acceptable passing scores for written tests is a challenge for health profession educators. Angoff procedures are often used to establish pass/fail decisions for written and performance tests. In an Angoff procedure judges' expertise and professional skills are assumed to influence their ratings of…
Descriptors: Health Occupations, Performance Tests, Scoring, Item Response Theory

Myford, Carol M.; Wolfe, Edward W. – Journal of Applied Measurement, 2002
Examined a procedure for identifying and resolving discrepancies in ratings, focusing on the third rater adjudication procedure used in scoring the Test of Spoken English. Results for 1,446 adult examinees demonstrate that implementing a discrepancy resolution procedure is not sufficient in itself for quality control monitoring. (SLD)
Descriptors: Adults, Evaluators, Quality Control, Scoring

Plake, Barbara S.; Impara, James C.; Irwin, Patrick M. – Journal of Educational Measurement, 2000
Examined intra- and inter-rater consistency of item performance estimated from an Angoff standard setting over 2 years, with 29 panelists one year, and 30 the next. Results provide evidence that item performance estimates were consistent within and across panels within and across years. Factors that might have influenced this high degree of…
Descriptors: Evaluators, Prediction, Reliability, Standard Setting
Previous Page | Next Page ยป
Pages: 1 | 2