Publication Date
In 2025 | 12 |
Since 2024 | 40 |
Since 2021 (last 5 years) | 124 |
Since 2016 (last 10 years) | 321 |
Since 2006 (last 20 years) | 702 |
Descriptor
Cutting Scores | 1728 |
Test Validity | 641 |
Test Reliability | 574 |
Evaluation Criteria | 485 |
Aptitude Tests | 445 |
Norms | 441 |
Job Skills | 434 |
Personnel Evaluation | 431 |
Job Applicants | 429 |
Career Guidance | 425 |
Standard Setting (Scoring) | 228 |
More ▼ |
Source
Author
Publication Type
Education Level
Elementary Education | 157 |
Higher Education | 138 |
Postsecondary Education | 111 |
Elementary Secondary Education | 106 |
Secondary Education | 106 |
Middle Schools | 89 |
Grade 3 | 87 |
Grade 4 | 82 |
Grade 8 | 81 |
Grade 5 | 79 |
Grade 6 | 68 |
More ▼ |
Audience
Researchers | 58 |
Practitioners | 14 |
Policymakers | 11 |
Teachers | 11 |
Administrators | 5 |
Students | 4 |
Parents | 1 |
Location
California | 29 |
Florida | 28 |
Texas | 22 |
Canada | 16 |
Massachusetts | 15 |
New York | 15 |
North Carolina | 14 |
United Kingdom | 14 |
Washington | 13 |
Arizona | 12 |
Pennsylvania | 12 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Does not meet standards | 3 |
Wyse, Adam E. – Applied Measurement in Education, 2020
This article compares cut scores from two variations of the Hofstee and Beuk methods, which determine cut scores by resolving inconsistencies in panelists' judgments about cut scores and pass rates, with the Angoff method. The first variation uses responses to the Hofstee and Beuk percentage correct and pass rate questions to calculate cut scores.…
Descriptors: Cutting Scores, Evaluation Methods, Standard Setting (Scoring), Equations (Mathematics)
Morris, Nicole M.; Ingram, Paul B.; Mitchell, Sean M.; Victor, Sarah E. – Measurement and Evaluation in Counseling and Development, 2023
We investigated the validity and screening effectiveness of the PHQ-2 and PHQ-9 scores in 229 college students in a cross-sectional design. PHQ associations with Minnesota Multiphasic Personality Inventory-3 internalizing scales suggest PHQ scores are effective screening tools for college students and may aid in effective triage and service needs.
Descriptors: Personality Measures, Test Validity, College Students, Screening Tests
Oliva, Jose M.; Blanco, Ángel – European Journal of Science and Mathematics Education, 2023
A questionnaire was recently developed for the use with the Spanish-speaking, and evidence have been provided about the construct internal validity by means of structural equation modelling. In this paper, two research questions were considered: (i) What new evidence does application of the Rasch model provide regarding the validity of this…
Descriptors: Spanish Speaking, High School Students, College Students, Item Response Theory
D. Betsy McCoach; Anthony J. Gambino; Scott J. Peters; Daniel Long; Del Siegle – Annenberg Institute for School Reform at Brown University, 2023
Teacher rating scales (TRS) are often used to make service eligibility decisions for exceptional learners. Although TRS are regularly used to identify student exceptionalism either as part of an informal nomination process or through behavioral rating scales, there is little research documenting the between-teacher variance in teacher ratings or…
Descriptors: Rating Scales, Student Evaluation, Academically Gifted, Ability Identification
Alexis Clyde; Danna Bismar; Gabrielle Agnew; Laura E. Kuper – Journal of Autism and Developmental Disorders, 2024
Autism spectrum disorder (ASD) and ASD symptoms are overrepresented among gender-diverse youth across studies. Gender-diverse and ASD youth are at risk for anxiety, but anxiety is unclear among gender-diverse youth with ASD. The Social Communication Questionnaire (SCQ) is a commonly used ASD screener, including in multidisciplinary…
Descriptors: Autism Spectrum Disorders, Identification, Accuracy, Interpersonal Competence
Blake H. Heller – Annenberg Institute for School Reform at Brown University, 2024
In 2016, the GED® introduced college readiness benchmarks designed to identify testers who are academically prepared for credit-bearing college coursework. The benchmarks are promoted as awarding college credits or exempting "college-ready" GED® graduates from remedial coursework. I show descriptive evidence that those identified as…
Descriptors: High School Equivalency Programs, College Readiness, Eligibility, Benchmarking
Weiss, Brandi A.; Dardick, William – Journal of Experimental Education, 2021
Classification measures and entropy variants can be used as indicators of model fit for logistic regression. These measures rely on a cut-point, "c," to determine predicted group membership. While recommendations exist for determining the location of the cut-point, these methods are primarily anecdotal. The current study used Monte Carlo…
Descriptors: Cutting Scores, Regression (Statistics), Classification, Monte Carlo Methods
Prentza, Alexandra; Tafiadis, Dionysios; Chondrogianni, Vasiliki; Tsimpli, Ianthi-Maria – Journal of Psycholinguistic Research, 2022
This study provides a preliminary validation of a Greek Sentence Repetition Task (SRT) with a sample of 110 monolingual and bilingual typically developing (TLD) children and examines the test's ability to distinguish between Greek monolingual children and age-matched Albanian-Greek bilinguals using a Receiver Operating Characteristics (ROC)…
Descriptors: Greek, Sentences, Repetition, Monolingualism
Melissa G. Wolf; Daniel McNeish – Grantee Submission, 2023
To evaluate the fit of a confirmatory factor analysis model, researchers often rely on fit indices such as SRMR, RMSEA, and CFI. These indices are frequently compared to benchmark values of 0.08, 0.06, and 0.96, respectively, established by Hu and Bentler (1999). However, these indices are affected by model characteristics and their sensitivity to…
Descriptors: Programming Languages, Cutting Scores, Benchmarking, Factor Analysis
Kaj Sparle Christensen; Ole Jakob Storebø; Bo Bach – Journal of Attention Disorders, 2025
Objective: This study examines the validity of the ASRS-5 as a new screening tool for ADHD and evaluates its proposed screening cut-off in a general population context. Method: A nationally representative sample of 2,002 individuals aged 18 to 80 years was surveyed using the ASRS-5, with complete data obtained from 714 participants. Psychometric…
Descriptors: Foreign Countries, Construct Validity, Psychometrics, Item Analysis
Hakan Baran; Murat Akyildiz – Turkish Online Journal of Distance Education, 2025
Evaluation decisions regarding students' success in Open Education faculties such as pass/fail based on cut-off scores affect the quality of these systems. The qualification of Open Education students to obtain a bachelor's or associate's degree is determined by their passing grade. The purpose of this study was to investigate whether the minimum…
Descriptors: Open Universities, Academic Standards, Cutting Scores, Evaluation Methods
Acar, Selcuk; Branch, Marcus J.; Burnett, Cyndi; Cabra, John F. – Gifted Child Quarterly, 2021
Originality is scored based on standard zero-originality lists (ZOLs) in the Torrance Tests of Creative Thinking (TTCT). The applicability of those ZOLs to diverse groups has not been examined. We examined the consistency of TTCT-Figural's sample-based (SB) ZOLs and the published ZOLs based on a sample of predominantly African American college…
Descriptors: Creative Thinking, Creativity Tests, African American Students, College Students
Henry May; Aly Blakeney; Pragya Shrestha; Mia Mazal; Nicole Kennedy – Journal of Research on Educational Effectiveness, 2024
To estimate the long-term effects of the Reading Recovery® intervention, a regression discontinuity design (RD) was implemented in a randomly selected sample of Reading Recovery schools during each year of the federally-funded i3 Scale-Up external evaluation (2011-2015) and also in one additional cohort during the 2016-2017 school year. Long-term…
Descriptors: Reading Programs, Outcomes of Education, Elementary School Students, Reading Tests
Christine M. White; Christopher Schatschneider – Contemporary School Psychology, 2024
Universal screening to predict students' risk for reading problems is a foundational component of the Multi-Tiered Systems of Support framework and is required by law in many US states. School or district administrators are tasked with selecting screening assessments that are both technically adequate and feasible given the resources of their…
Descriptors: Screening Tests, Reading Tests, Reading Difficulties, Classification
Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Educational Measurement: Issues and Practice, 2020
In test-centered standard-setting methods, borderline performance can be represented by many different profiles of strengths and weaknesses. As a result, asking panelists to estimate item or test performance for a hypothetical group study of borderline examinees, or a typical borderline examinee, may be an extremely difficult task and one that can…
Descriptors: Standard Setting (Scoring), Cutting Scores, Testing Problems, Profiles