ERIC - Search Results

Publication Date

In 2026	0
Since 2025	2
Since 2022 (last 5 years)	10
Since 2017 (last 10 years)	49
Since 2007 (last 20 years)	145

Descriptor

Standard Setting (Scoring)	505
Cutting Scores	228
Standards	165
Elementary Secondary Education	108
Test Items	94
Evaluation Methods	91
Academic Standards	79
Scoring	76
Minimum Competency Testing	70
Licensing Examinations…	66
Educational Assessment	65
Higher Education	63
Test Construction	61
Criterion Referenced Tests	60
Interrater Reliability	60
Foreign Countries	59
Comparative Analysis	57
Performance Based Assessment	56
Academic Achievement	55
Test Validity	54
Evaluators	53
Scores	50
Testing Programs	50
Student Evaluation	48
Decision Making	42
More ▼

Education Level

Higher Education	28
Elementary Secondary Education	22
Secondary Education	19
Postsecondary Education	17
Elementary Education	16
Middle Schools	11
Junior High Schools	9
High Schools	8
Intermediate Grades	8
Grade 5	6
Grade 8	5
Adult Education	4
Grade 3	4
Grade 4	4
Grade 6	4
Grade 7	4
Early Childhood Education	3
Primary Education	2
Grade 11	1
Grade 12	1
Grade 2	1
Grade 9	1
High School Equivalency…	1
Kindergarten	1
Two Year Colleges	1
More ▼

Audience

Researchers	28
Policymakers	7
Practitioners	6
Administrators	4
Teachers	3
Students	1

Location

Canada	10
Australia	8
Tennessee	8
United Kingdom	7
California	4
Kansas	4
Massachusetts	4
New Jersey	4
United States	4
Illinois	3
Michigan	3
Minnesota	3
North Carolina	3
Taiwan	3
Arizona	2
China	2
Georgia	2
Germany	2
Indiana	2
Kentucky	2
Louisiana	2
Maine	2
Maryland	2
Nebraska	2
Netherlands	2
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	5
Comprehensive Education…	3
Carl D Perkins Vocational and…	1
Education Consolidation…	1
Improving Americas Schools…	1
Improving Americas Schools…	1
Job Training Partnership Act…	1
Lau v Nichols	1

What Works Clearinghouse Rating

Showing 46 to 60 of 505 results Save | Export

The Reliability of Setting Grade Boundaries Using Comparative Judgement

Peer reviewed

Direct link

Benton, Tom; Elliott, Gill – Research Papers in Education, 2016

In recent years the use of expert judgement to set and maintain examination standards has been increasingly criticised in favour of approaches based on statistical modelling. This paper reviews existing research on this controversy and attempts to unify the evidence within a framework where expertise is utilised in the form of comparative…

Descriptors: Reliability, Expertise, Mathematical Models, Standard Setting (Scoring)

Non-Numeric Intrajudge Consistency Feedback in an Angoff Procedure

Peer reviewed

Direct link

Harrison, George M. – Journal of Educational Measurement, 2015

The credibility of standard-setting cut scores depends in part on two sources of consistency evidence: intrajudge and interjudge consistency. Although intrajudge consistency feedback has often been provided to Angoff judges in practice, more evidence is needed to determine whether it achieves its intended effect. In this randomized experiment with…

Descriptors: Interrater Reliability, Standard Setting (Scoring), Cutting Scores, Feedback (Response)

An Examination of the Replicability of Angoff Standard Setting Results within a Generalizability Theory Framework

Peer reviewed

Direct link

Clauser, Jerome C.; Margolis, Melissa J.; Clauser, Brian E. – Journal of Educational Measurement, 2014

Evidence of stable standard setting results over panels or occasions is an important part of the validity argument for an established cut score. Unfortunately, due to the high cost of convening multiple panels of content experts, standards often are based on the recommendation from a single panel of judges. This approach implicitly assumes that…

Descriptors: Standard Setting (Scoring), Generalizability Theory, Replication (Evaluation), Cutting Scores

(Re)Shaping Educational Research through 'Programmification': Institutional Expansion, Change, and Translation in Norway

Peer reviewed

Direct link

Zapp, Mike; Helgetun, Jo B.; Powell, Justin J. W. – European Journal of Education, 2018

Educational research in Norway has experienced unprecedented structural expansion and cognitive shifts over the last two decades because of greater state investments and the strategic use of extensive and multi-year thematic programmes to fund research projects. Using a neo-institutionalist framework, we examine institutionalisation dynamics in…

Descriptors: Foreign Countries, Educational Research, Institutional Characteristics, Organizational Change

Toward Education Quality Improvement in China: A Brief Overview of the National Assessment of Education Quality

Peer reviewed

Direct link

Jiang, Yu; Zhang, Jiahui; Xin, Tao – Journal of Educational and Behavioral Statistics, 2019

This article is an overview of the National Assessment of Education Quality (NAEQ) of China in reading, mathematics, sciences, arts, physical education, and moral education at Grades 4 and 8. After a review of the background and history of NAEQ, we present the assessment framework with students' holistic development at the core and the design for…

Descriptors: Foreign Countries, Educational Quality, Educational Improvement, National Competency Tests

The Cut-Score Operating Function: A New Tool to Aid in Standard Setting

Peer reviewed

Direct link

Grabovsky, Irina; Wainer, Howard – Journal of Educational and Behavioral Statistics, 2017

In this essay, we describe the construction and use of the Cut-Score Operating Function in aiding standard setting decisions. The Cut-Score Operating Function shows the relation between the cut-score chosen and the consequent error rate. It allows error rates to be defined by multiple loss functions and will show the behavior of each loss…

Descriptors: Cutting Scores, Standard Setting (Scoring), Decision Making, Error Patterns

Modeling for Directly Setting Theory-Based Performance Levels

Peer reviewed
PDF on ERIC

Download full text

Torres Irribarra, David; Diakow, Ronli; Freund, Rebecca; Wilson, Mark – Grantee Submission, 2015

This paper presents the Latent Class Level-PCM as a method for identifying and interpreting latent classes of respondents according to empirically estimated performance levels. The model, which combines elements from latent class models and reparameterized partial credit models for polytomous data, can simultaneously (a) identify empirical…

Descriptors: Item Response Theory, Test Items, Statistical Analysis, Models

Getting Lucky: How Guessing Threatens the Validity of Performance Classifications

Peer reviewed
PDF on ERIC

Download full text

Foley, Brett P. – Practical Assessment, Research & Evaluation, 2016

There is always a chance that examinees will answer multiple choice (MC) items correctly by guessing. Design choices in some modern exams have created situations where guessing at random through the full exam--rather than only for a subset of items where the examinee does not know the answer--can be an effective strategy to pass the exam. This…

Descriptors: Guessing (Tests), Multiple Choice Tests, Case Studies, Test Construction

Teeter-Totters Have Two Ends

Peer reviewed

Direct link

Popham, W. James – Measurement: Interdisciplinary Research and Perspectives, 2013

The author recalls that as a child, he grooved on teeter-totters. Also known as a seesaw, a teeter-totter is a long, narrow board that's elevated with a pivot point in the middle so that as one end goes down the other end goes up. When going up or going down, sometimes quite rapidly, teeter-totters can provide their two riders with some…

Descriptors: Standard Setting (Scoring), Maps, Performance, Standards

The Impact of Examinee Performance Information on Judges' Cut Scores in Modified Angoff Standard-Setting Exercises

Peer reviewed

Direct link

Margolis, Melissa J.; Clauser, Brian E. – Educational Measurement: Issues and Practice, 2014

This research evaluated the impact of a common modification to Angoff standard-setting exercises: the provision of examinee performance data. Data from 18 independent standard-setting panels across three different medical licensing examinations were examined to investigate whether and how the provision of performance information impacted judgments…

Descriptors: Cutting Scores, Standard Setting (Scoring), Data, Licensing Examinations (Professions)

Spring 2018 NSCAS Summative ELA, Mathematics, and Science Technical Report

Download full text

Nebraska Department of Education, 2018

The 2018 Nebraska Student-Centered Assessment System (NSCAS) Summative technical report documents the processes and procedures implemented to support the Spring 2018 NSCAS Summative English Language Arts (ELA), Mathematics, and Science assessments by NWEA under the supervision of the Nebraska Department of Education (NDE). The technical report…

Descriptors: Summative Evaluation, Language Tests, English, Mathematics Tests

The Issue of Range Restriction in Bookmark Standard Setting

Peer reviewed

Direct link

Wyse, Adam E. – Educational Measurement: Issues and Practice, 2015

This article uses data from a large-scale assessment program to illustrate the potential issue of range restriction with the Bookmark method in the context of trying to set cut scores to closely align with a set of college and career readiness benchmarks. Analyses indicated that range restriction issues existed across different response…

Descriptors: Cutting Scores, Alignment (Education), College Readiness, Career Readiness

Setting Cut Scores on an EFL Placement Test Using the Prototype Group Method: A Receiver Operating Characteristic (ROC) Analysis

Peer reviewed

Direct link

Eckes, Thomas – Language Testing, 2017

This paper presents an approach to standard setting that combines the prototype group method (PGM; Eckes, 2012) with a receiver operating characteristic (ROC) analysis. The combined PGM-ROC approach is applied to setting cut scores on a placement test of English as a foreign language (EFL). To implement the PGM, experts first named learners whom…

Descriptors: English (Second Language), Language Tests, Cutting Scores, Standard Setting (Scoring)

The Role of Construct Maps in Standard Setting

Peer reviewed

Direct link

Kane, Michael T.; Tannenbaum, Richard J. – Measurement: Interdisciplinary Research and Perspectives, 2013

The authors observe in this commentary that construct maps can help standard-setting panels to make realistic and internally consistent recommendations for performance-level descriptions (PLDs) and cut-scores, but the benefits may not be realized if policymakers do not fully understand the rationale for the recommendations provided by the…

Descriptors: Standard Setting (Scoring), Maps, Cutting Scores, Policy

Construct Maps for the Road Ahead

Peer reviewed

Direct link

Bunch, Michael B. – Measurement: Interdisciplinary Research and Perspectives, 2013

In this issue of "Measurement: Interdisciplinary Research and Perspectives," Adam E. Wyse provides a thorough review of research to date on the use of construct maps in standard setting. He juxtaposes concepts and methods in ways that make their connections to one another clearer and more obvious than they might otherwise have been. In…

Descriptors: Standard Setting (Scoring), Maps, Validity, Design

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 34

Applied Measurement in…	37
Educational Measurement:…	34
Journal of Educational…	28
Educational and Psychological…	24
Measurement:…	13
International Journal of…	9
Evaluation and the Health…	8
Practical Assessment,…	7
Educational Assessment	5
Educational Evaluation and…	5
Advances in Health Sciences…	4
Review of Educational Research	4
Studies in Educational…	4
Alberta Journal of…	3
Assessment & Evaluation in…	3
Assessment in Education:…	3
Educational Testing Service	3
Journal of Applied Testing…	3
Journal of Educational and…	3
Language Assessment Quarterly	3
Language Testing	3
New Meridian Corporation	3
Online Submission	3
ProQuest LLC	3
Academic Medicine	2
More ▼

Plake, Barbara S.	36
Hambleton, Ronald K.	17
Impara, James C.	16
Jaeger, Richard M.	15
Wyse, Adam E.	12
Clauser, Brian E.	9
Margolis, Melissa J.	9
Giraud, Gerald	8
Livingston, Samuel A.	8
Norcini, John J.	8
Buckendahl, Chad W.	7
Busch, John Christian	7
Bowman, Harry L.	6
Cizek, Gregory J.	6
Ferdous, Abdullah A.	6
Reckase, Mark D.	6
Tannenbaum, Richard J.	6
Chang, Lei	5
Kane, Michael	5
Linn, Robert L.	5
Mee, Janet	5
Sireci, Stephen G.	5
Halpin, Gerald	4
Kane, Michael T.	4
More ▼

Journal Articles	269
Reports - Research	213
Reports - Evaluative	171
Speeches/Meeting Papers	166
Reports - Descriptive	67
Opinion Papers	34
Information Analyses	17
Tests/Questionnaires	17
Guides - Non-Classroom	14
Numerical/Quantitative Data	12
Guides - General	5
Legal/Legislative/Regulatory…	4
Collected Works - General	3
Collected Works - Serials	3
Dissertations/Theses -…	3
Reports - General	3
Book/Product Reviews	2
Books	2
Collected Works - Proceedings	2
Historical Materials	2
ERIC Digests in Full Text	1
ERIC Publications	1
Guides - Classroom - Teacher	1
More ▼

National Assessment of…	39
National Teacher Examinations	16
Alabama High School…	4
Praxis Series	2
Test of English as a Foreign…	2
United States Medical…	2
edTPA (Teacher Performance…	2
Advanced Placement…	1
California Basic Educational…	1
College Board Achievement…	1
General Educational…	1
International English…	1
Iowa Tests of Basic Skills	1
Massachusetts Comprehensive…	1
New Jersey College Basic…	1
Pre Professional Skills Tests	1
SAT (College Admission Test)	1
TerraNova Multiple Assessments	1
Test of English for…	1
Wechsler Adult Intelligence…	1
More ▼