Loy, David; Nelson, Rhonda; Allsop, Jared; Johnston, Carol – Schole: A Journal of Leisure Studies and Recreation Education, 2024
Accreditation is a critical process in maintaining standards of consistency and excellence in the academic preparation of students for their chosen profession. While academic programs, professional associations, and credentialing organizations all recognize the importance of programmatic accreditation in recreational therapy professional…
Descriptors: Therapeutic Recreation, Accreditation (Institutions), Scores, Tests
Papageorgiou, Spiros; Davis, Larry; Ohta, Renka; Gomez, Pablo Garcia – ETS Research Report Series, 2022
In this research report, we describe a study to map the scores of the TOEFL® Essentials™ test to the Canadian Language Benchmarks (CLB). The TOEFL Essentials test is a four-skills assessment of foundational English language skills and communication abilities in academic and general (daily life) contexts. At the time of writing this…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Liu, Ren; Qian, Hong; Luo, Xiao; Woo, Ada – Educational and Psychological Measurement, 2018
Subscore reporting under item response theory models has always been a challenge partly because the test length of each subdomain is limited for precisely locating individuals on multiple continua. Diagnostic classification models (DCMs), providing a pass/fail decision and associated probability of pass on each subdomain, are promising…
Descriptors: Classification, Probability, Pass Fail Grading, Scores
Sinclair, Andrea L., Ed.; Thacker, Arthur, Ed. – Human Resources Research Organization (HumRRO), 2019
These are the appendices for the technical report, "An Investigation of the Comparability of Commission-Approved Teaching Performance Assessment Models." California's Commission on Teacher Credentialing (Commission) requires all programs of preliminary multiple and single subject teacher preparation to use a Commission-approved Teaching…
Descriptors: Performance Based Assessment, Preservice Teachers, Models, Scoring Rubrics
Papageorgiou, Spiros; Tannenbaum, Richard J. – Language Assessment Quarterly, 2016
Although there has been substantial work on argument-based approaches to validation as well as standard-setting methodologies, it might not always be clear how standard setting fits into argument-based validity. The purpose of this article is to address this lack in the literature, with a specific focus on topics related to argument-based…
Descriptors: Standard Setting (Scoring), Language Tests, Test Validity, Test Construction
Tannenbaum, Richard J.; Kannan, Priya – Educational Assessment, 2015
Angoff-based standard setting is widely used, especially for high-stakes licensure assessments. Nonetheless, some critics have claimed that the judgment task is too cognitively complex for panelists, whereas others have explicitly challenged the consistency in (replicability of) standard-setting outcomes. Evidence of consistency in item judgments…
Descriptors: Standard Setting (Scoring), Reliability, Scores, Licensing Examinations (Professions)
Shulruf, Boaz; Poole, Phillippa; Jones, Philip; Wilkinson, Tim – Assessment & Evaluation in Higher Education, 2015
A new probability-based standard setting technique, the Objective Borderline Method (OBM), was introduced recently. This was based on a mathematical model of how test scores relate to student ability. The present study refined the model and tested it using 2500 simulated data-sets. The OBM was feasible to use. On average, the OBM performed well…
Descriptors: Probability, Methods, Standard Setting (Scoring), Scores
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content
Shulruf, Boaz; Turner, Rolf; Poole, Phillippa; Wilkinson, Tim – Advances in Health Sciences Education, 2013
The decision to pass or fail a medical student is a "high stakes" one. The aim of this study is to introduce and demonstrate the feasibility and practicality of a new objective standard-setting method for determining the pass/fail cut-off score from borderline grades. Three methods for setting up pass/fail cut-off scores were compared: the…
Descriptors: Standard Setting (Scoring), Probability, Medical Schools, Medical Students
Northwest Evaluation Association, 2015
Recently, the Smarter Balanced Assessment Consortium (Smarter Balanced) released a document that established initial performance levels and the associated threshold scale scores for the Smarter Balanced assessment. The report included estimated percentages of students expected to perform at each of the four performance levels, reported by grade…
Descriptors: Standard Setting, Standard Setting (Scoring), Pretesting, Cutting Scores
Khatimin, Nuraini; Aziz, Azrilah Abdul; Zaharim, Azami; Yasin, Siti Hanani Mat – International Education Studies, 2013
Measuring and evaluating students' achievement is important for ensuring that students really understand the course content and for monitoring their achievement level. Performance is reflected not only in the number of high achievers but also in the quality of the grades obtained; does the grade "A" truly…
Descriptors: Standard Setting, Item Response Theory, Measurement Objectives, Measurement Techniques
Gotham, Katherine; Pickles, Andrew; Lord, Catherine – Journal of Autism and Developmental Disorders, 2009
The aim of this study is to standardize Autism Diagnostic Observation Schedule (ADOS) scores within a large sample to approximate an autism severity metric. Using a dataset of 1,415 individuals aged 2-16 years with autism spectrum disorders (ASD) or nonspectrum diagnoses, a subset of 1,807 assessments from 1,118 individuals with ASD were divided…
Descriptors: Autism, Severity (of Disability), Scores, Pervasive Developmental Disorders
Judd, Wallace – Practical Assessment, Research & Evaluation, 2009
Over the past twenty years in performance testing a specific item type with distinguishing characteristics has arisen time and time again. It's been invented independently by dozens of test development teams. And yet this item type is not recognized in the research literature. This article is an invitation to investigate the item type, evaluate…
Descriptors: Test Items, Test Format, Evaluation, Item Analysis
MacCann, Robert G. – Educational and Psychological Measurement, 2008
It is shown that the Angoff and bookmarking cut scores are examples of true score equating that in the real world must be applied to observed scores. In the context of defining minimal competency, the percentage "failed" by such methods is a function of the length of the measuring instrument. It is argued that this length is largely…
Descriptors: True Scores, Cutting Scores, Minimum Competencies, Scores
Bechger, Timo M.; Kuijper, Henk; Maris, Gunter – Language Assessment Quarterly, 2009
This article reports on two related studies carried out to link the State examination of Dutch as a second language to the Common European Framework of Reference for languages (CEFR). In the first study, key persons from institutions for higher education were asked to determine the minimally required language level of beginning students. In the…
Descriptors: Second Language Learning, Standard Setting (Scoring), Indo European Languages, Guidelines