ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	14
Since 2006 (last 20 years)	25

Descriptor

Cutting Scores	38
Standard Setting (Scoring)	19
Evaluation Methods	10
Testing Problems	8
Models	7
Test Items	7
Measurement Objectives	6
Standard Setting	6
Standards	6
Test Use	6
Decision Making	5
Minimum Competency Testing	5
Scoring	5
Student Evaluation	5
Validity	5
Academic Standards	4
Accountability	4
Educational Testing	4
Elementary Secondary Education	4
Psychometrics	4
Test Construction	4
Test Interpretation	4
Test Validity	4
Testing Programs	4
Academic Achievement	3
More ▼

Source

Educational Measurement:…

Publication Type

Journal Articles	38
Reports - Research	19
Reports - Evaluative	8
Reports - Descriptive	6
Opinion Papers	5
Guides - Non-Classroom	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	5
Elementary Education	3
Higher Education	2
Postsecondary Education	2
Secondary Education	2
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Junior High Schools	1
Middle Schools	1
More ▼

Audience

Location

Canada	1
Germany	1
Maryland	1
Nebraska	1
New Hampshire	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

National Assessment of…

What Works Clearinghouse Rating

Showing 1 to 15 of 38 results Save | Export

Using Classification Tree Models to Determine Course Placement

Peer reviewed

Direct link

Lee, Chansoon – Educational Measurement: Issues and Practice, 2022

Appropriate placement into courses at postsecondary institutions is critical for the success of students in terms of retention and graduation rates. To reduce the number of students who are misplaced, using multiple measures in placing students is encouraged. However, in practice most postsecondary schools utilize only a few measures to determine…

Descriptors: Classification, Models, Student Placement, College Students

Applying a Mixture Rasch Model-Based Approach to Standard Setting

Peer reviewed

Direct link

Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023

The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…

Descriptors: Item Response Theory, Standard Setting, Testing, Sampling

Setting and Validating Multiple Standards on a Multistage-Adaptive Test

Peer reviewed

Direct link

Lewis, Jennifer; Lim, Hwanggyu; Padellaro, Frank; Sireci, Stephen G.; Zenisky, April L. – Educational Measurement: Issues and Practice, 2022

Setting cut scores on (MSTs) is difficult, particularly when the test spans several grade levels, and the selection of items from MST panels must reflect the operational test specifications. In this study, we describe, illustrate, and evaluate three methods for mapping panelists' Angoff ratings into cut scores on the scale underlying an MST. The…

Descriptors: Cutting Scores, Adaptive Testing, Test Items, Item Analysis

A Problem with the Bookmark Procedure's Correction for Guessing

Peer reviewed

Direct link

Baldwin, Peter – Educational Measurement: Issues and Practice, 2021

In the Bookmark standard-setting procedure, panelists are instructed to consider what examinees know rather than what they might attain by guessing; however, because examinees sometimes do guess, the procedure includes a correction for guessing. Like other corrections for guessing, the Bookmark's correction assumes that examinees either know the…

Descriptors: Guessing (Tests), Student Evaluation, Evaluation Methods, Standard Setting (Scoring)

The Choice of Response Probability in Bookmark Standard Setting: An Experimental Study

Peer reviewed

Direct link

Baldwin, Peter; Margolis, Melissa J.; Clauser, Brian E.; Mee, Janet; Winward, Marcia – Educational Measurement: Issues and Practice, 2020

Evidence of the internal consistency of standard-setting judgments is a critical part of the validity argument for tests used to make classification decisions. The bookmark standard-setting procedure is a popular approach to establishing performance standards, but there is relatively little research that reflects on the internal consistency of the…

Descriptors: Standard Setting (Scoring), Probability, Cutting Scores, Evaluation Methods

A Critical Look into the Beuk Standard-Setting Method

Peer reviewed

Direct link

Wyse, Adam E. – Educational Measurement: Issues and Practice, 2020

One commonly used compromise standard-setting method is the Beuk (1984) method. A key assumption of the Beuk method is that the emphasis given to the pass rate and the percent correct ratings should be proportional to the extent that the panelists agree on their ratings. However, whether the slope of Beuk line reflects the emphasis that panelists…

Descriptors: Standard Setting (Scoring), Cutting Scores, Weighted Scores, Evaluation Methods

Using Diagnostic Profiles to Describe Borderline Performance in Standard Setting

Peer reviewed

Direct link

Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020

In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group study of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…

Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles

Digital Module 14: Planning and Conducting Standard Setting

Peer reviewed

Direct link

Bunch, Michael B. – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Michael Bunch provides an in-depth, step-by-step look at how standard setting is done. It does not focus on any specific procedure or methodology (e.g., modified Angoff, bookmark, and body of work) but on the practical tasks that must be completed for any standard setting activity. Dr. Bunch carries the…

Descriptors: Standard Setting, Cutting Scores, Scores, Reports

Embedded Standard Setting: Aligning Standard-Setting Methodology with Contemporary Assessment Design Principles

Peer reviewed

Direct link

Lewis, Daniel; Cook, Robert – Educational Measurement: Issues and Practice, 2020

In this paper we assert that the practice of principled assessment design renders traditional standard-setting methodology redundant at best and contradictory at worst. We describe the rationale for, and methodological details of, Embedded Standard Setting (ESS; previously, Engineered Cut Scores. Lewis, 2016), an approach to establish performance…

Descriptors: Standard Setting, Evaluation, Cutting Scores, Performance Based Assessment

Condensed Mastery Profile Method for Setting Standards for Diagnostic Assessment Systems

Peer reviewed

Direct link

Clark, A. K.; Nash, B.; Karvonen, M.; Kingston, N. – Educational Measurement: Issues and Practice, 2017

The purpose of this study was to develop a standard-setting method appropriate for use with a diagnostic assessment that produces profiles of student mastery rather than a single raw or scale score value. The condensed mastery profile method draws from established holistic standard-setting methods to use rounds of range finding and pinpointing to…

Descriptors: Diagnostic Tests, Standard Setting (Scoring), Cutting Scores, Performance

An Investigation of Undefined Cut Scores with the Hofstee Standard-Setting Method

Peer reviewed

Direct link

Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2017

This article provides an overview of the Hofstee standard-setting method and illustrates several situations where the Hofstee method will produce undefined cut scores. The situations where the cut scores will be undefined involve cases where the line segment derived from the Hofstee ratings does not intersect the score distribution curve based on…

Descriptors: Cutting Scores, Evaluation Methods, Standard Setting (Scoring), Comparative Analysis

Five Methods for Estimating Angoff Cut Scores with IRT

Peer reviewed

Direct link

Wyse, Adam E. – Educational Measurement: Issues and Practice, 2017

This article illustrates five different methods for estimating Angoff cut scores using item response theory (IRT) models. These include maximum likelihood (ML), expected a priori (EAP), modal a priori (MAP), and weighted maximum likelihood (WML) estimators, as well as the most commonly used approach based on translating ratings through the test…

Descriptors: Cutting Scores, Item Response Theory, Bayesian Statistics, Maximum Likelihood Statistics

Comparability in Balanced Assessment Systems for State Accountability

Peer reviewed

Direct link

Evans, Carla M.; Lyons, Susan – Educational Measurement: Issues and Practice, 2017

The purpose of this study was to test methods that strengthen the comparability claims about annual determinations of student proficiency in English language arts, math, and science (Grades 3-12) in the New Hampshire Performance Assessment of Competency Education (NH PACE) pilot project. First, we examined the literature in order to define…

Descriptors: Academic Achievement, Language Arts, Mathematics Achievement, Science Achievement

The Impact of Examinee Performance Information on Judges' Cut Scores in Modified Angoff Standard-Setting Exercises

Peer reviewed

Direct link

Margolis, Melissa J.; Clauser, Brian E. – Educational Measurement: Issues and Practice, 2014

This research evaluated the impact of a common modification to Angoff standard-setting exercises: the provision of examinee performance data. Data from 18 independent standard-setting panels across three different medical licensing examinations were examined to investigate whether and how the provision of performance information impacted judgments…

Descriptors: Cutting Scores, Standard Setting (Scoring), Data, Licensing Examinations (Professions)

The Issue of Range Restriction in Bookmark Standard Setting

Peer reviewed

Direct link

Wyse, Adam E. – Educational Measurement: Issues and Practice, 2015

This article uses data from a large-scale assessment program to illustrate the potential issue of range restriction with the Bookmark method in the context of trying to set cut scores to closely align with a set of college and career readiness benchmarks. Analyses indicated that range restriction issues existed across different response…

Descriptors: Cutting Scores, Alignment (Education), College Readiness, Career Readiness

Previous Page | Next Page »

Pages: 1 | 2 | 3

Wyse, Adam E.	4
Clauser, Brian E.	3
Margolis, Melissa J.	3
Baldwin, Peter	2
Geisinger, Kurt F.	2
Linn, Robert L.	2
Mee, Janet	2
Mehrens, William A.	2
Winward, Marcia	2
Babcock, Ben	1
Buckendahl, Chad W.	1
Bunch, Michael B.	1
Cangelosi, James S.	1
Childs, Ruth A.	1
Cizek, Gregory J.	1
Clark, A. K.	1
Clauser, Jerome C.	1
Cook, Robert	1
Davidson, Anne H.	1
Evans, Carla M.	1
Ferrara, Steve	1
Fisher, Thomas H.	1
Gaertner, Matthew N.	1
Haertel, Edward H.	1
More ▼