Yoo Jeong Jang – ProQuest LLC, 2022
Despite the increasing demand for diagnostic information, observed subscores have often been reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…
Descriptors: Classification, Accuracy, Item Response Theory, Correlation
Kim, Stella Y.; Lee, Won-Chan – Journal of Educational Measurement, 2020
The current study aims to evaluate the performance of three non-IRT procedures (i.e., normal approximation, Livingston-Lewis, and compound multinomial) for estimating classification indices when the observed score distribution shows atypical patterns: (a) bimodality, (b) structural (i.e., systematic) bumpiness, or (c) structural zeros (i.e., no…
Descriptors: Classification, Accuracy, Scores, Cutting Scores
Bunch, Michael B. – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Michael Bunch provides an in-depth, step-by-step look at how standard setting is done. It does not focus on any specific procedure or methodology (e.g., modified Angoff, bookmark, and body of work) but on the practical tasks that must be completed for any standard setting activity. Dr. Bunch carries the…
Descriptors: Standard Setting, Cutting Scores, Scores, Reports
Ockey, Gary J.; Vo, Sonca; Baghestani, Shireen – TESOL Quarterly: A Journal for Teachers of English to Speakers of Other Languages and of Standard English as a Second Dialect, 2021
Various approaches help determine students' need for language support courses. Some programs use standardized language proficiency test scores, which are available from the admissions process; others use local placement tests, which are typically designed to be aligned with local language needs and ESL course curricula. Using local placement tests…
Descriptors: Cutting Scores, Standardized Tests, Placement Tests, Student Evaluation
Klingbeil, David A.; Osman, David J.; Carrigan, Jamison E.; Paly, Benjamin J.; Berry-Corie, Kimberly – School Psychology, 2021
Multiple popular math curriculum-based measures have recently been revised in ways that may improve their utility for universal screening. However, the applied use of these tools has yet to be evaluated independently. We conducted a retrospective analysis of the diagnostic accuracy of prior year statewide test results and aimswebPlus math…
Descriptors: Screening Tests, Elementary School Students, Middle School Students, Curriculum Based Assessment
Moloi, Qetelo; Kanjee, Anil – South African Journal of Education, 2021
The study reported on here contributes to the growing body of knowledge on the use of standard setting methods for improving the reporting and utility value of assessment results in South Africa as well as for addressing the conceptual shortcomings of the Curriculum and Assessment Policy Statement (CAPS) reporting framework. Using data from the…
Descriptors: Foreign Countries, Standard Setting (Scoring), Student Evaluation, Elementary School Students
Young, Stephanie R.; Maddocks, Danika L. S.; Carrigan, Jamison E. – Gifted Child Quarterly, 2021
Research on high-ability postsecondary students has increased in recent years; yet identifying such students can be challenging. The International Cognitive Ability Resource (ICAR) is an online, open-access tool designed to facilitate measurement of cognitive abilities in research. We evaluated whether the ICAR is appropriate to identify…
Descriptors: Cognitive Ability, Academically Gifted, College Students, Identification
Wolkowitz, Amanda A.; Foley, Brett; Zurn, Jared – Practical Assessment, Research & Evaluation, 2023
The purpose of this study is to introduce a method for converting scored 4-option multiple-choice (MC) items into scored 3-option MC items without re-pretesting the 3-option MC items. This study describes a six-step process for achieving this goal. Data from a professional credentialing exam was used in this study and the method was applied to 24…
Descriptors: Multiple Choice Tests, Test Items, Accuracy, Test Format
Mahmut Sami Koyuncu – Open Journal for Educational Research, 2023
This study aims to demonstrate the optimal way to determine the cut-off score to be used to interpret the total scores obtained from an achievement test or scale using the Artificial Neural Networks method. To this end, the multiple-choice item responses in the Booklet-11 Mathematics subtest at the 8th grade level in the TIMSS 2015 Turkey sample…
Descriptors: Standard Setting, Artificial Intelligence, Mathematics Education, Foreign Countries
Toland, Michael D.; Grisham, Jennifer; Waddell, Misti; Crawford, Rebecca; Dueber, David M. – Topics in Early Childhood Special Education, 2022
Rasch and classification analyses on a field-test version of the third edition of the Assessment, Evaluation, and Programming System (AEPS-3), a curriculum-based assessment used to assess young children from birth to age 6 years, were conducted. First, an evaluation of the psychometric properties of data from each developmental area of an AEPS-3…
Descriptors: Curriculum Based Assessment, Field Tests, Young Children, Item Response Theory
Florence Ran; Hojung Lee – Society for Research on Educational Effectiveness, 2022
Background/Context: The landscape of college remediation programs experienced significant shifts from prerequisite models to corequisite models in the past few years across the nation. The traditional prerequisite models required students placed below college level to pass a sequence of remedial courses before they could enroll in college-level…
Descriptors: Required Courses, Prerequisites, Remedial Programs, Remedial Instruction
Benedetti, Andrea; Levis, Brooke; Rücker, Gerta; Jones, Hayley E.; Schumacher, Martin; Ioannidis, John P. A.; Thombs, Brett – Research Synthesis Methods, 2020
Selective cutoff reporting in primary diagnostic accuracy studies with continuous or ordinal data may result in biased estimates when meta-analyzing studies. Collecting individual participant data (IPD) and estimating accuracy across all relevant cutoffs for all studies can overcome such bias but is labour intensive. We meta-analyzed the…
Descriptors: Cutting Scores, Diagnostic Tests, Screening Tests, Meta Analysis
Henry May; Aly Blakeney; Pragya Shrestha; Mia Mazal; Nicole Kennedy – Grantee Submission, 2023
To estimate the long-term effects of the Reading Recovery® intervention, a regression discontinuity design (RD) was implemented in a randomly selected sample of Reading Recovery schools during each year of the federally-funded i3 Scale-Up external evaluation (2011-2015) and also in one additional cohort during the 2016-17 school year. Long-term…
Descriptors: Reading Programs, Outcomes of Education, Elementary School Students, Reading Tests
Christine M. White; Christopher Schatschneider – Grantee Submission, 2023
Universal screening to predict students' risk for reading problems is a foundational component of the Multi-Tiered Systems of Support framework and is required by law in many US states. School or district administrators are tasked with selecting screening assessments that are both technically adequate and feasible given the resources of their…
Descriptors: Screening Tests, Reading Tests, Reading Difficulties, Classification
Takeshi Terada – ProQuest LLC, 2021
Since the No Child Left Behind (NCLB) Act required classifications of students' performance levels, test scores have been used to measure students' achievement; in particular, test scores are used to determine whether students reach a proficiency level in the state assessment. Accordingly, school districts have started using benchmark assessments…
Descriptors: State Standards, Progress Monitoring, Achievement Gains, Cutting Scores