ERIC - Search Results

Showing 1 to 15 of 20 results

Standard Setting in Academic Writing Assessment through Objective Standard Setting Method

Peer reviewed

Fisne, Fatima Nur; Sata, Mehmet; Karakaya, Ismail – International Journal of Assessment Tools in Education, 2022

Performance standards have important consequences for all the stakeholders in the assessment of L2 academic writing. These standards not only describe the level of writing performance but also provide a basis for making evaluative decisions on the academic writing. Such a high-stakes role of the performance standards requires the enhancement of…

Descriptors: Standard Setting, Writing Evaluation, Academic Language, English (Second Language)

Evaluating Panelists' Understanding of Standard Setting Data

Peer reviewed

Baron, Patricia; Sireci, Stephen G.; Slater, Sharon C. – Educational Measurement: Issues and Practice, 2021

Since the No Child Left Behind Act (No Child Left Behind [NCLB], 2001) was enacted, the Bookmark method has been used in many state standard setting studies (Karantonis and Sireci; Zieky, Perie, and Livingston). The purpose of the current study is to evaluate the criticism that when panelists are presented with data during the Bookmark standard…

Descriptors: State Standards, Standard Setting, Evaluators, Training

Examining the Precision of Cut Scores within a Generalizability Theory Framework: A Closer Look at the Item Effect

Peer reviewed

Clauser, Brian E.; Kane, Michael; Clauser, Jerome C. – Journal of Educational Measurement, 2020

An Angoff standard setting study generally yields judgments on a number of items by a number of judges (who may or may not be nested in panels). Variability associated with judges (and possibly panels) contributes error to the resulting cut score. The variability associated with items plays a more complicated role. To the extent that the mean item…

Descriptors: Cutting Scores, Generalization, Decision Making, Standard Setting

Challenges in Rating Signed Production: A Mixed-Methods Study of a Swiss German Sign Language Form-Recall Vocabulary Test

Peer reviewed

Batty, Aaron Olaf; Haug, Tobias; Ebling, Sarah; Tissi, Katja; Sidler-Miserez, Sandra – Language Testing, 2023

Sign languages present particular challenges to language assessors in relation to variation in signs, weakly defined citation forms, and a general lack of standard-setting work even in long-established measures of productive sign proficiency. The present article addresses and explores these issues via a mixed-methods study of a human-rated…

Descriptors: Sign Language, Language Tests, Standard Setting, Barriers

Exploring the Influence of Judge Proficiency on Standard-Setting Judgments

Peer reviewed

Peabody, Michael R.; Wind, Stefanie A. – Journal of Educational Measurement, 2019

Setting performance standards is a judgmental process involving human opinions and values as well as technical and empirical considerations. Although all cut score decisions are by nature somewhat arbitrary, they should not be capricious. Judges selected for standard-setting panels should have the proper qualifications to make the judgments asked…

Descriptors: Standard Setting, Decision Making, Performance Based Assessment, Evaluators

Rater Performance Standards for Classroom Observation Instruments

Peer reviewed

White, Mark C. – Educational Researcher, 2018

Raters must score accurately and consistently for classroom observation scores to be valid. This requires (a) a standard defining when scoring is accurate and consistent enough and (b) measuring and remediating rater performance against that standard. Current practice has focused on this second problem to the exclusion of the first. My goal here…

Descriptors: Evaluators, Standard Setting, Classroom Observation Techniques, Scoring

The Effect of Rating Unfamiliar Items on Angoff Passing Scores

Peer reviewed

Clauser, Jerome C.; Hambleton, Ronald K.; Baldwin, Peter – Educational and Psychological Measurement, 2017

The Angoff standard setting method relies on content experts to review exam items and make judgments about the performance of the minimally proficient examinee. Unfortunately, at times content experts may have gaps in their understanding of specific exam content. These gaps are particularly likely to occur when the content domain is broad and/or…

Descriptors: Scores, Item Analysis, Classification, Decision Making

Mapping the CU-TEP to the Common European Framework of Reference (CEFR)

Peer reviewed

Wudthayagorn, Jirada – LEARN Journal: Language Education and Acquisition Research Network, 2018

The purpose of this study was to map the Chulalongkorn University Test of English Proficiency, or the CU-TEP, to the Common European Framework of Reference (CEFR) by employing a standard setting methodology. Thirteen experts judged 120 items of the CU-TEP using the Yes/No Angoff technique. The experts decided whether or not a borderline student at…

Descriptors: Guidelines, Rating Scales, English (Second Language), Language Tests

How Much Is Enough? Involving Occupational Experts in Setting Standards on a Specific-Purpose Language Test for Health Professionals

Peer reviewed

Pill, John; McNamara, Tim – Language Testing, 2016

This paper considers how to establish the minimum required level of professionally relevant oral communication ability in the medium of English for health practitioners with English as an additional language (EAL) to gain admission to practice in jurisdictions where English is the dominant language. A theoretical concern is the construct of…

Descriptors: Specialists, Standard Setting, Language Tests, English (Second Language)

A New Tool for Identifying Research Standards and Evaluating Research Performance

Peer reviewed

Bacon, Donald R.; Paul, Pallab; Stewart, Kim A.; Mukhopadhyay, Kausiki – Journal of Marketing Education, 2012

Much has been written about the evaluation of faculty research productivity in promotion and tenure decisions, including many articles that seek to determine the rank of various marketing journals. Yet how faculty evaluators combine journal quality, quantity, and author contribution to form judgments of a scholar's performance is unclear. A…

Descriptors: Productivity, Evaluators, Models, Marketing

Defining the Language Assessment Literacy Gap: Evidence from a Parliamentary Inquiry

Peer reviewed

Pill, John; Harding, Luke – Language Testing, 2013

This study identifies a unique context for exploring lay understandings of language testing and, by extension, for characterizing the nature of language assessment literacy among non-practitioners, stemming from data in an inquiry into the registration processes and support for overseas trained doctors by the Australian House of Representatives…

Descriptors: Language Tests, Testing, Foreign Nationals, Foreign Medical Graduates

An Alternative Decision-Making Procedure for Performance Assessments: Using the Multifaceted Rasch Model to Generate Cut Estimates

Peer reviewed

Kozaki, Yoko – Language Assessment Quarterly, 2010

This article describes an alternative approach to setting standards for performance assessments. The procedure was designed for use in low-budget, relatively low-stakes contexts where it is not possible to bring expert judges together. The procedure that allowed participant judges to work individually throughout the process was an effort to…

Descriptors: Performance Based Assessment, Standard Setting, Decision Making, Certification

Is an Angoff Standard an Indication of Minimal Competence of Examinees or of Judges?

Peer reviewed

Verheggen, M. M.; Muijtjens, A. M. M.; Os, J. Van; Schuwirth, L. W. T. – Advances in Health Sciences Education, 2008

Background: To establish credible, defensible and acceptable passing scores for written tests is a challenge for health profession educators. Angoff procedures are often used to establish pass/fail decisions for written and performance tests. In an Angoff procedure judges' expertise and professional skills are assumed to influence their ratings of…

Descriptors: Health Occupations, Performance Tests, Scoring, Item Response Theory

When Raters Disagree, Then What: Examining a Third-rating Discrepancy Resolution Procedure and Its Utility for Identifying Unusual Patterns of Ratings.

Peer reviewed

Myford, Carol M.; Wolfe, Edward W. – Journal of Applied Measurement, 2002

Examined a procedure for identifying and resolving discrepancies in ratings, focusing on the third rater adjudication procedure used in scoring the Test of Spoken English. Results for 1,446 adult examinees demonstrate that implementing a discrepancy resolution procedure is not sufficient in itself for quality control monitoring. (SLD)

Descriptors: Adults, Evaluators, Quality Control, Scoring

Consistency of Angoff-based Predictions of Item Performance: Evidence of Technical Quality of Results from the Angoff Standard Setting Method.

Peer reviewed

Plake, Barbara S.; Impara, James C.; Irwin, Patrick M. – Journal of Educational Measurement, 2000

Examined intra- and inter-rater consistency of item performance estimated from an Angoff standard setting over 2 years, with 29 panelists one year, and 30 the next. Results provide evidence that item performance estimates were consistent within and across panels within and across years. Factors that might have influenced this high degree of…

Descriptors: Evaluators, Prediction, Reliability, Standard Setting

