Nicolas Rochat; Laurent Lima; Pascal Bressoux – Journal of Psychoeducational Assessment, 2025
Inference is considered an important factor in comprehension models and has been described as a causal factor in predicting comprehension. To date, specific tests for inference are rare and often rely on specific thematic texts. This reliance on thematic inference may raise some concerns as inference is related to prior text-specific knowledge.…
Descriptors: Inferences, Reading Comprehension, Reading Tests, Test Reliability
Russell, Michael; Moncaleano, Sebastian – Practical Assessment, Research & Evaluation, 2020
Although both content alignment and standard-setting procedures rely on content-expert panel judgements, only the latter employs discussion among panel members. This study employed a modified form of the Webb methodology to examine content alignment for twelve tests administered as part of the Massachusetts Comprehensive Assessment System (MCAS).…
Descriptors: Test Content, Test Items, Discussion, Test Validity
Sondergeld, Toni A.; Stone, Gregory E.; Kruse, Lance M. – Educational Policy, 2020
Assessment and evaluation at all levels of educational systems have become policy priorities for many countries. Two common reasons for this are student learning expectations and accountability. Although much effort has been put into the creation and refinement of content standards, standardized tests, and methods for using testing results, there…
Descriptors: Standard Setting (Scoring), Criterion Referenced Tests, Multiple Choice Tests, Student Evaluation
Papageorgiou, Spiros; Tannenbaum, Richard J. – Language Assessment Quarterly, 2016
Although there has been substantial work on argument-based approaches to validation as well as standard-setting methodologies, it might not always be clear how standard setting fits into argument-based validity. The purpose of this article is to address this lack in the literature, with a specific focus on topics related to argument-based…
Descriptors: Standard Setting (Scoring), Language Tests, Test Validity, Test Construction
Foley, Brett P. – Practical Assessment, Research & Evaluation, 2016
There is always a chance that examinees will answer multiple choice (MC) items correctly by guessing. Design choices in some modern exams have created situations where guessing at random through the full exam--rather than only for a subset of items where the examinee does not know the answer--can be an effective strategy to pass the exam. This…
Descriptors: Guessing (Tests), Multiple Choice Tests, Case Studies, Test Construction
Nebraska Department of Education, 2018
The 2018 Nebraska Student-Centered Assessment System (NSCAS) Summative technical report documents the processes and procedures implemented to support the Spring 2018 NSCAS Summative English Language Arts (ELA), Mathematics, and Science assessments by NWEA under the supervision of the Nebraska Department of Education (NDE). The technical report…
Descriptors: Summative Evaluation, Language Tests, English, Mathematics Tests
GED Testing Service, 2014
This manual was written to provide technical information regarding the General Educational Development (GED®) test as evidence that the GED® test is technically sound. Throughout this manual, documentation is provided regarding the development of the GED® test and data collection activities, as well as evidence of reliability and validity. This…
Descriptors: High School Equivalency Programs, Equivalency Tests, Testing Programs, Test Validity
Morgan, Deanna L. – National Center for Postsecondary Research, 2010
Cut scores are used in a variety of circumstances to aid in decision making through the establishment of a clear cut line between adjacent categories. Community colleges regularly use cut scores on placement tests to decide the appropriate course for each beginning student: the first college-level course or a developmental course, depending on…
Descriptors: Standard Setting (Scoring), Cutting Scores, Psychometrics, Best Practices
Florez, Ida Rose – Civil Rights Project / Proyecto Derechos Civiles, 2010
The Arizona English Language Learners Assessment (AZELLA) is used by the Arizona Department of Education to determine which children should receive English support services. AZELLA results are used to determine if children are either proficient in English or have English language skills in one of four pre-proficient categories (pre-emergent,…
Descriptors: Validity, Second Language Learning, Cutting Scores, Kindergarten
Lin, Jie – Alberta Journal of Educational Research, 2006
The Bookmark standard-setting procedure was developed to address the perceived problems with the most popular method for setting cut-scores: the Angoff procedure (Angoff, 1971). The purposes of this article are to review the Bookmark procedure and evaluate it in terms of Berk's (1986) criteria for evaluating cut-score setting methods. The…
Descriptors: Standard Setting (Scoring), Cutting Scores, Evaluation Criteria, Evaluation Research
Haertel, Edward H.; Lorie, William A. – Measurement: Interdisciplinary Research and Perspectives, 2004
Standards-based score reports interpret test performance with reference to cut scores defining categories like "below basic," "proficient," or "master." This article first develops a conceptual framework for validity arguments supporting such interpretations, then presents three applications. Two of these serve to introduce new standard-setting…
Descriptors: Scores, Test Interpretation, Test Validity, Standard Setting (Scoring)
Hamilton, J. S.; McLone, R. R. – Studies in Educational Evaluation, 1989
Influences on the educational validity of examinations are reviewed. Changes occurring in approaches to standard setting are traced. A view of reliability is presented, with emphasis on assessment of project work, which often involves individual investigation and design by students. A consistency index formula for grading standards is presented.…
Descriptors: Cutting Scores, Educational Assessment, Elementary Secondary Education, Standard Setting (Scoring)
Journal of School Improvement, 2000
States that standard scores are the universal numerical language for reporting and comparisons. Discusses what standard scores are and why they are used, along with how raw scores are converted to standard scores. Provides contact information for those who would like to further their knowledge on the…
Descriptors: Educational Practices, Elementary Secondary Education, Higher Education, Standard Setting (Scoring)
Schoon, Craig G.; And Others – 1988
The determination of appropriate cut scores is a critical step in the development of licensing and certification examinations. Passing point methodologies based on the estimation of item difficulties are underlain by the estimation of the probability of a correct response to items by a hypothetically minimally competent candidate. The Angoff…
Descriptors: Cutting Scores, Difficulty Level, Estimation (Mathematics), Item Analysis
Jaeger, Richard M. – 1982
The implicit definition of competence and the inferential chain that links the standard-setting process to the decision outcomes of the method are considered for two classes of standard-setting procedures: those involving data-free judgments of items and those involving data-based judgment of items. The major underlying assumptions of competence…
Descriptors: Competence, Evaluation Methods, Graduation Requirements, High Schools