ERIC - Search Results

Publication Date

In 2025	2
Since 2024	4
Since 2021 (last 5 years)	11
Since 2016 (last 10 years)	50
Since 2006 (last 20 years)	150

Descriptor

Standard Setting (Scoring)	502
Cutting Scores	228
Standards	165
Elementary Secondary Education	107
Test Items	92
Evaluation Methods	90
Academic Standards	79
Scoring	75
Minimum Competency Testing	70
Licensing Examinations…	66
Educational Assessment	64
Higher Education	63
Interrater Reliability	60
Criterion Referenced Tests	59
Foreign Countries	59
Test Construction	59
Comparative Analysis	57
Performance Based Assessment	56
Academic Achievement	55
Evaluators	53
Test Validity	53
Scores	50
Testing Programs	50
Student Evaluation	47
Decision Making	42
More ▼

Education Level

Higher Education	28
Elementary Secondary Education	21
Secondary Education	18
Postsecondary Education	17
Elementary Education	15
Middle Schools	10
High Schools	8
Intermediate Grades	8
Junior High Schools	8
Grade 5	6
Grade 8	5
Adult Education	4
Grade 3	4
Grade 4	4
Grade 6	4
Grade 7	4
Early Childhood Education	3
Primary Education	2
Grade 11	1
Grade 12	1
Grade 2	1
Grade 9	1
High School Equivalency…	1
Kindergarten	1
Two Year Colleges	1
More ▼

Audience

Researchers	28
Policymakers	7
Practitioners	6
Administrators	4
Teachers	3
Students	1

Location

Canada	10
Australia	8
Tennessee	8
United Kingdom	7
California	4
Kansas	4
Massachusetts	4
New Jersey	4
United States	4
Illinois	3
Michigan	3
Minnesota	3
North Carolina	3
Taiwan	3
Arizona	2
China	2
Georgia	2
Germany	2
Indiana	2
Kentucky	2
Louisiana	2
Maine	2
Maryland	2
Netherlands	2
Nevada	2
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	5
Comprehensive Education…	3
Carl D Perkins Vocational and…	1
Education Consolidation…	1
Improving Americas Schools…	1
Improving Americas Schools…	1
Job Training Partnership Act…	1
Lau v Nichols	1

What Works Clearinghouse Rating

Standard Setting (Scoring) X

Showing 1 to 15 of 502 results Save | Export

Embedding Embedded Standard Setting: An Application of Cross-Classified Item Response Theory. CRESST Report 876

Download full text

Yun-Kyung Kim; Li Cai – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2025

This paper introduces an application of cross-classified item response theory (IRT) modeling to an assessment utilizing the embedded standard setting (ESS) method (Lewis & Cook). The cross-classified IRT model is used to treat both item and person effects as random, where the item effects are regressed on the target performance levels (target…

Descriptors: Standard Setting (Scoring), Item Response Theory, Test Items, Difficulty Level

The Response Vector for Mastery Method of Standard Setting

Peer reviewed

Direct link

Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2022

Proposed is a new method of standard setting referred to as response vector for mastery (RVM) method. Under the RVM method, the task of panelists that participate in the standard setting process does not involve conceptualization of a borderline examinee and probability judgments as it is the case with the Angoff and bookmark methods. Also, the…

Descriptors: Standard Setting (Scoring), Cutting Scores, Computation, Mastery Learning

Setting and Validating Multiple Standards on a Multistage-Adaptive Test

Peer reviewed

Direct link

Lewis, Jennifer; Lim, Hwanggyu; Padellaro, Frank; Sireci, Stephen G.; Zenisky, April L. – Educational Measurement: Issues and Practice, 2022

Setting cut scores on (MSTs) is difficult, particularly when the test spans several grade levels, and the selection of items from MST panels must reflect the operational test specifications. In this study, we describe, illustrate, and evaluate three methods for mapping panelists' Angoff ratings into cut scores on the scale underlying an MST. The…

Descriptors: Cutting Scores, Adaptive Testing, Test Items, Item Analysis

A Problem with the Bookmark Procedure's Correction for Guessing

Peer reviewed

Direct link

Baldwin, Peter – Educational Measurement: Issues and Practice, 2021

In the Bookmark standard-setting procedure, panelists are instructed to consider what examinees know rather than what they might attain by guessing; however, because examinees sometimes do guess, the procedure includes a correction for guessing. Like other corrections for guessing, the Bookmark's correction assumes that examinees either know the…

Descriptors: Guessing (Tests), Student Evaluation, Evaluation Methods, Standard Setting (Scoring)

The Choice of Response Probability in Bookmark Standard Setting: An Experimental Study

Peer reviewed

Direct link

Baldwin, Peter; Margolis, Melissa J.; Clauser, Brian E.; Mee, Janet; Winward, Marcia – Educational Measurement: Issues and Practice, 2020

Evidence of the internal consistency of standard-setting judgments is a critical part of the validity argument for tests used to make classification decisions. The bookmark standard-setting procedure is a popular approach to establishing performance standards, but there is relatively little research that reflects on the internal consistency of the…

Descriptors: Standard Setting (Scoring), Probability, Cutting Scores, Evaluation Methods

A Critical Look into the Beuk Standard-Setting Method

Peer reviewed

Direct link

Wyse, Adam E. – Educational Measurement: Issues and Practice, 2020

One commonly used compromise standard-setting method is the Beuk (1984) method. A key assumption of the Beuk method is that the emphasis given to the pass rate and the percent correct ratings should be proportional to the extent that the panelists agree on their ratings. However, whether the slope of Beuk line reflects the emphasis that panelists…

Descriptors: Standard Setting (Scoring), Cutting Scores, Weighted Scores, Evaluation Methods

Comparing Cut Scores from the Angoff Method and Two Variations of the Hofstee and Beuk Methods

Peer reviewed

Direct link

Wyse, Adam E. – Applied Measurement in Education, 2020

This article compares cut scores from two variations of the Hofstee and Beuk methods, which determine cut scores by resolving inconsistencies in panelists' judgments about cut scores and pass rates, with the Angoff method. The first variation uses responses to the Hofstee and Beuk percentage correct and pass rate questions to calculate cut scores.…

Descriptors: Cutting Scores, Evaluation Methods, Standard Setting (Scoring), Equations (Mathematics)

Using Diagnostic Profiles to Describe Borderline Performance in Standard Setting

Peer reviewed

Direct link

Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020

In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group study of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…

Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles

The Riddle Knowledge Inference Test (R-Kit)

Peer reviewed

Direct link

Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025

Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…

Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability

Making the Grade with Recreational Therapy Accreditation: Comparing the NCTRC Pass Rates of CAAHEP/CARTE Accredited Programs to National Averages

Peer reviewed

Direct link

David Loy; Rhonda Nelson; Jared Allsop; Carol Johnston – Schole: A Journal of Leisure Studies and Recreation Education, 2024

Accreditation is a critical process in maintaining standards of consistency and excellence in the academic preparation of students for their chosen profession. While academic programs, professional associations, and credentialing organizations all recognize the importance of programmatic accreditation in recreational therapy professional…

Descriptors: Therapeutic Recreation, Accreditation (Institutions), Scores, Tests

Examining the Impact of a Consensus Approach to Content Alignment Studies

Peer reviewed
PDF on ERIC

Download full text

Russell, Michael; Moncaleano, Sebastian – Practical Assessment, Research & Evaluation, 2020

Although both content alignment and standard-setting procedures rely on content-expert panel judgements, only the latter employs discussion among panel members. This study employed a modified form of the Webb methodology to examine content alignment for twelve tests administered as part of the Massachusetts Comprehensive Assessment System (MCAS).…

Descriptors: Test Content, Test Items, Discussion, Test Validity

Mapping "TOEFL® Essentials"™ Test Scores to the Canadian Language Benchmarks. "TOEFL"® Research Report. TOEFL-RR-100. ETS Research Report No. RR-22-16

Peer reviewed
PDF on ERIC

Download full text

Papageorgiou, Spiros; Davis, Larry; Ohta, Renka; Gomez, Pablo Garcia – ETS Research Report Series, 2022

In this research report, we describe a study to map the scores of the "TOEFL® Essentials"™ test to the Canadian Language Benchmarks (CLB). The TOEFL Essentials test is a four-skills assessment of foundational English language skills and communication abilities in academic and general (daily life) contexts. At the time of writing this…

Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning

Adding Objectivity to Standard Setting: Evaluating Consequence Using the Conscious and Subconscious Weight Methods

Peer reviewed

Direct link

Leventhal, Brian C.; Grabovsky, Irina – Educational Measurement: Issues and Practice, 2020

Standard setting is arguably one of the most subjective techniques in test development and psychometrics. The decisions when scores are compared to standards, however, are arguably the most consequential outcomes of testing. Providing licensure to practice in a profession has high stake consequences for the public. Denying graduation or forcing…

Descriptors: Standard Setting (Scoring), Weighted Scores, Test Construction, Psychometrics

It's Not Just Angoff: Misperceptions of Hard and Easy Items in Bookmark-Type Ratings

Peer reviewed

Direct link

Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2020

A common belief is that the Bookmark method is a cognitively simpler standard-setting method than the modified Angoff method. However, a limited amount of research has investigated panelist's ability to perform well the Bookmark method, and whether some of the challenges panelists face with the Angoff method may also be present in the Bookmark…

Descriptors: Standard Setting (Scoring), Evaluation Methods, Testing Problems, Test Items

Applicability of Two Standard Setting Methods for Enhancing the Reporting of Assessment Results within the South African Education Context

Peer reviewed
PDF on ERIC

Download full text

Moloi, Qetelo; Kanjee, Anil – South African Journal of Education, 2021

The study reported on here contributes to the growing body of knowledge on the use of standard setting methods for improving the reporting and utility value of assessment results in South Africa as well as for addressing the conceptual shortcomings of the Curriculum and Assessment Policy Statement (CAPS) reporting framework. Using data from the…

Descriptors: Foreign Countries, Standard Setting (Scoring), Student Evaluation, Elementary School Students

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 34

Applied Measurement in…	37
Educational Measurement:…	34
Journal of Educational…	28
Educational and Psychological…	24
Measurement:…	13
International Journal of…	9
Evaluation and the Health…	8
Practical Assessment,…	7
Educational Assessment	5
Educational Evaluation and…	5
Advances in Health Sciences…	4
Review of Educational Research	4
Studies in Educational…	4
Alberta Journal of…	3
Assessment & Evaluation in…	3
Assessment in Education:…	3
Educational Testing Service	3
Journal of Educational and…	3
Language Assessment Quarterly	3
Language Testing	3
New Meridian Corporation	3
Online Submission	3
ProQuest LLC	3
Academic Medicine	2
Applied Psychological…	2
More ▼

Plake, Barbara S.	36
Hambleton, Ronald K.	17
Impara, James C.	16
Jaeger, Richard M.	15
Wyse, Adam E.	12
Clauser, Brian E.	9
Margolis, Melissa J.	9
Giraud, Gerald	8
Livingston, Samuel A.	8
Norcini, John J.	8
Buckendahl, Chad W.	7
Busch, John Christian	7
Bowman, Harry L.	6
Cizek, Gregory J.	6
Ferdous, Abdullah A.	6
Reckase, Mark D.	6
Tannenbaum, Richard J.	6
Chang, Lei	5
Kane, Michael	5
Linn, Robert L.	5
Mee, Janet	5
Sireci, Stephen G.	5
Halpin, Gerald	4
Kane, Michael T.	4
More ▼

Journal Articles	268
Reports - Research	211
Reports - Evaluative	171
Speeches/Meeting Papers	166
Reports - Descriptive	66
Opinion Papers	34
Information Analyses	17
Tests/Questionnaires	17
Guides - Non-Classroom	14
Numerical/Quantitative Data	11
Guides - General	5
Legal/Legislative/Regulatory…	4
Collected Works - General	3
Collected Works - Serials	3
Dissertations/Theses -…	3
Reports - General	3
Book/Product Reviews	2
Books	2
Collected Works - Proceedings	2
Historical Materials	2
ERIC Digests in Full Text	1
ERIC Publications	1
Guides - Classroom - Teacher	1
More ▼

National Assessment of…	39
National Teacher Examinations	16
Alabama High School…	4
Praxis Series	2
Test of English as a Foreign…	2
United States Medical…	2
edTPA (Teacher Performance…	2
Advanced Placement…	1
California Basic Educational…	1
College Board Achievement…	1
General Educational…	1
International English…	1
Iowa Tests of Basic Skills	1
Massachusetts Comprehensive…	1
New Jersey College Basic…	1
Pre Professional Skills Tests	1
SAT (College Admission Test)	1
TerraNova Multiple Assessments	1
Test of English for…	1
Wechsler Adult Intelligence…	1
More ▼