Publication Date
In 2025 | 238 |
Since 2024 | 1095 |
Audience
Policymakers | 8 |
Teachers | 6 |
Administrators | 5 |
Researchers | 1 |
Location
China | 35 |
Turkey | 30 |
Texas | 26 |
Indonesia | 22 |
California | 20 |
Thailand | 20 |
Iran | 14 |
Tennessee | 14 |
Canada | 13 |
Japan | 13 |
South Korea | 10 |
Laws, Policies, & Programs
Every Student Succeeds Act… | 5 |
Head Start | 5 |
Elementary and Secondary… | 4 |
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Pell Grant Program | 1 |
Rehabilitation Act 1973… | 1 |
What Works Clearinghouse Rating
Meets WWC Standards with or without Reservations | 1 |
David Furjanic; Christopher Ives; David Fainstein; Patrick C. Kennedy; Gina Biancarosa – Elementary School Journal, 2024
The COVID-19 pandemic disrupted school, work, and daily life on a global scale. In the wake of this unprecedented health crisis, schools across the United States were forced to abruptly adapt their educational delivery models. Understanding how student learning trajectories shifted throughout the ongoing pandemic is critical for equipping…
Descriptors: COVID-19, Pandemics, Scores, Reading Fluency
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
Kelly Edwards; James Soland – Educational Assessment, 2024
Classroom observational protocols, in which raters observe and score the quality of teachers' instructional practices, are often used to evaluate teachers for consequential purposes despite evidence that scores from such protocols are frequently driven by factors, such as rater and temporal effects, that have little to do with teacher quality. In…
Descriptors: Classroom Observation Techniques, Teacher Evaluation, Accuracy, Scores
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Lara Climer – ProQuest LLC, 2024
This quantitative study investigated the relationship between institutional control and NCLEX first-time pass rates during 2017-2021 for BSN, ADN, and LVN programs in the United States while controlling for school and program level characteristics. The study included five years of data gathered from nursing regulatory agencies and the Integrated…
Descriptors: Licensing Examinations (Professions), Nursing, Data, Institutional Characteristics
Cristan Farmer; Aaron J. Kaat; Michael C. Edwards; Luc Lecavalier – American Journal on Intellectual and Developmental Disabilities, 2024
Measurement invariance (MI) is a psychometric property of an instrument indicating the degree to which scores from an instrument are comparable across groups. In recent years, there has been a marked uptick in publications using MI in intellectual and developmental disability (IDD) samples. Our goal here is to provide an overview of why MI is…
Descriptors: Measurement, Psychometrics, Scores, Intellectual Disability
Deborah J. Harris – Educational Measurement: Issues and Practice, 2024
This article is based on my 2023 NCME Presidential Address, where I talked a bit about my journey into the profession, and more substantively about comparable scores. Specifically, I discussed some of the different ways 'comparable scores' are defined, highlighted some areas I think we as a profession need to pay more attention to when considering…
Descriptors: Scores, Comparative Analysis, Speeches, Career Development
Zafer Ozen; Nielsen Pereira; Tugce Karatas; Hernán Castillo-Hermosilla; Yukiko Maeda – Gifted Child Quarterly, 2025
Cognitive Abilities Test (CogAT) is one of the most frequently used gifted identification tools. In this meta-analytic study, we investigated empirical evidence of the validity of CogAT, in relation to different types of instruments. After reviewing 1,480 studies, a total of 24 with 33 effect sizes were included in the meta-analysis. According to…
Descriptors: Test Validity, Cognitive Tests, Disability Identification, Scores
Abigail R. Vild; Maggie E. Wilson; Christopher A. Was – Journal of Research in Education, 2025
Theories of self-regulated learning suggest a positive link between knowledge monitoring accuracy (the ability to predict test performance) and performance on tests. Put differently, students who accurately monitor their knowledge of course content more efficiently regulate study of course materials. However, a plethora of literature indicates…
Descriptors: Student Satisfaction, Undergraduate Students, Scores, Prediction
Stefan O'Grady – TESOL Journal, 2025
Task-based language assessment represents a major component of task-based language teaching syllabi. Current perspectives emphasise the importance of tasks in the assessment process, suggesting that adherence to influential models of language production during task design yields predictable test outcomes. The current study contends that the…
Descriptors: Task Analysis, Language Tests, Evaluators, Rating Scales
Lauren E. Bates; Sarah J. Myers; Edward L. DeLosh; Matthew G. Rhodes – Psychology Learning and Teaching, 2025
The present work assessed a quizzing method that combines the benefits of retrieval practice and feedback, whereby learners must continue taking quizzes until they achieve a perfect score with feedback provided (i.e., "mastery quizzing"). Across four experiments (n = 952; age 18-76, M = 37.10, SD = 11.61; 50% female, 48% male, 2% other…
Descriptors: Mastery Tests, Retention (Psychology), Evaluation Methods, Adults
Blake H. Heller – Annenberg Institute for School Reform at Brown University, 2024
In 2016, the GED® introduced college readiness benchmarks designed to identify testers who are academically prepared for credit-bearing college coursework. The benchmarks are promoted as awarding college credits or exempting "college-ready" GED® graduates from remedial coursework. I show descriptive evidence that those identified as…
Descriptors: High School Equivalency Programs, College Readiness, Eligibility, Benchmarking
Marion Durbahn; Michael Rodgers; Marijana Macis; Elke Peters – Studies in Second Language Acquisition, 2024
This study aimed to investigate the relationship between lexical coverage and TV viewing comprehension. Previous studies have indicated that 95% to 98% of lexical coverage may be needed for reading comprehension (Hu & Nation, 2000). To understand informal listening passages, lower coverage figures (95%-90%) may suffice. However, no study has…
Descriptors: Television Viewing, Lexicology, Comprehension, Visual Aids
Kristen Bottema-Beutel; Shannon Crowley LaPoint; So Yoon Kim; Sarah Mohiuddin; Qun Yu; Rachael McKinnon – Exceptional Children, 2024
In this secondary analysis of a previously conducted systematic review, we analyze social validity assessments in intervention research for transition-age autistic youth. Social validity is concerned with the acceptability of the intervention goals, the acceptability and feasibility of the intervention procedures, and the perceived importance of…
Descriptors: Autism Spectrum Disorders, Intervention, Validity, Psychometrics
Bahar Saberzadeh-Ardestani; Ali Reza Sima; Bardia Khosravi; Meredith Young; Sara Mortaz Hejri – Advances in Health Sciences Education, 2024
Few studies have engaged in data-driven investigations of the presence, or frequency, of what could be considered retaliatory assessor behaviour in Multi-source Feedback (MSF) systems. In this study, authors explored how assessors scored others if, before assessing others, they received their own assessment score. The authors examined assessments…
Descriptors: Feedback (Response), Scores, Evaluators, Behavior