Publication Date
In 2025 | 218 |
Since 2024 | 789 |
Since 2021 (last 5 years) | 3388 |
Since 2016 (last 10 years) | 9518 |
Since 2006 (last 20 years) | 18916 |
Descriptor
Scores | 21862 |
Foreign Countries | 7732 |
Correlation | 4450 |
Comparative Analysis | 4114 |
Statistical Analysis | 3562 |
Academic Achievement | 3529 |
Teaching Methods | 3003 |
Student Attitudes | 2675 |
Measures (Individuals) | 2557 |
Second Language Learning | 2361 |
Gender Differences | 2238 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Teachers | 118 |
Practitioners | 100 |
Researchers | 72 |
Administrators | 55 |
Policymakers | 29 |
Counselors | 19 |
Students | 8 |
Media Staff | 6 |
Parents | 4 |
Community | 2 |
Support Staff | 1 |
More ▼ |
Location
Turkey | 1107 |
China | 429 |
Australia | 400 |
Canada | 391 |
United States | 321 |
United Kingdom | 311 |
Iran | 299 |
California | 293 |
Texas | 279 |
Netherlands | 263 |
Taiwan | 263 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 22 |
Meets WWC Standards with or without Reservations | 40 |
Does not meet standards | 48 |
David Furjanic; Christopher Ives; David Fainstein; Patrick C. Kennedy; Gina Biancarosa – Elementary School Journal, 2024
The COVID-19 pandemic disrupted school, work, and daily life on a global scale. In the wake of this unprecedented health crisis, schools across the United States were forced to abruptly adapt their educational delivery models. Understanding how student learning trajectories shifted throughout the ongoing pandemic is critical for equipping…
Descriptors: COVID-19, Pandemics, Scores, Reading Fluency
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
Kelly Edwards; James Soland – Educational Assessment, 2024
Classroom observational protocols, in which raters observe and score the quality of teachers' instructional practices, are often used to evaluate teachers for consequential purposes despite evidence that scores from such protocols are frequently driven by factors, such as rater and temporal effects, that have little to do with teacher quality. In…
Descriptors: Classroom Observation Techniques, Teacher Evaluation, Accuracy, Scores
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
van der Linden, Wim J. – Journal of Educational and Behavioral Statistics, 2022
The current literature on test equating generally defines it as the process necessary to obtain score comparability between different test forms. The definition is in contrast with Lord's foundational paper which viewed equating as the process required to obtain comparability of measurement scale between forms. The distinction between the notions…
Descriptors: Equated Scores, Test Items, Scores, Probability
Bal-Sezerel, Bilge; Atesgöz, N. Nazli; Kirisçi, Nilgün – Journal of Theoretical Educational Science, 2023
The Flynn effect, which advocated that there was a rise in the global IQ score, was widely accepted by the relevant scientific community. However, there are recent research findings that this effect has been reversed. In this study, both Flynn and anti-Flynn effects were investigated. The purpose of this study is to analyze students' general,…
Descriptors: Intelligence Tests, Scores, Elementary School Students, Intelligence Quotient
Cristan Farmer; Aaron J. Kaat; Michael C. Edwards; Luc Lecavalier – American Journal on Intellectual and Developmental Disabilities, 2024
Measurement invariance (MI) is a psychometric property of an instrument indicating the degree to which scores from an instrument are comparable across groups. In recent years, there has been a marked uptick in publications using MI in intellectual and developmental disability (IDD) samples. Our goal here is to provide an overview of why MI is…
Descriptors: Measurement, Psychometrics, Scores, Intellectual Disability
Deborah J. Harris – Educational Measurement: Issues and Practice, 2024
This article is based on my 2023 NCME Presidential Address, where I talked a bit about my journey into the profession, and more substantively about comparable scores. Specifically, I discussed some of the different ways 'comparable scores' are defined, highlighted some areas I think we as a profession need to pay more attention to when considering…
Descriptors: Scores, Comparative Analysis, Speeches, Career Development
Zafer Ozen; Nielsen Pereira; Tugce Karatas; Hernán Castillo-Hermosilla; Yukiko Maeda – Gifted Child Quarterly, 2025
Cognitive Abilities Test (CogAT) is one of the most frequently used gifted identification tools. In this meta-analytic study, we investigated empirical evidence of the validity of CogAT, in relation to different types of instruments. After reviewing 1,480 studies, a total of 24 with 33 effect sizes were included in the meta-analysis. According to…
Descriptors: Test Validity, Cognitive Tests, Disability Identification, Scores
Abigail R. Vild; Maggie E. Wilson; Christopher A. Was – Journal of Research in Education, 2025
Theories of self-regulated learning suggest a positive link between knowledge monitoring accuracy (the ability to predict test performance) and performance on tests. Put differently, students who accurately monitor their knowledge of course content more efficiently regulate study of course materials. However, a plethora of literature indicates…
Descriptors: Student Satisfaction, Undergraduate Students, Scores, Prediction
Stefan O'Grady – TESOL Journal, 2025
Task-based language assessment represents a major component of task-based language teaching syllabi. Current perspectives emphasise the importance of tasks in the assessment process, suggesting that adherence to influential models of language production during task design yields predictable test outcomes. The current study contends that the…
Descriptors: Task Analysis, Language Tests, Evaluators, Rating Scales
Lauren E. Bates; Sarah J. Myers; Edward L. DeLosh; Matthew G. Rhodes – Psychology Learning and Teaching, 2025
The present work assessed a quizzing method that combines the benefits of retrieval practice and feedback, whereby learners must continue taking quizzes until they achieve a perfect score with feedback provided (i.e., "mastery quizzing"). Across four experiments (n = 952; age 18-76, M = 37.10, SD = 11.61; 50% female, 48% male, 2% other…
Descriptors: Mastery Tests, Retention (Psychology), Evaluation Methods, Adults
Soland, James – Educational Measurement: Issues and Practice, 2023
Most individuals who take, interpret, design, or score tests are aware that examinees do not always provide full effort when responding to items. However, many such individuals are not aware of how pervasive the issue is, what its consequences are, and how to address it. In this digital ITEMS module, Dr. James Soland will help fill these gaps in…
Descriptors: Student Behavior, Tests, Scores, Incidence
Marion Durbahn; Michael Rodgers; Marijana Macis; Elke Peters – Studies in Second Language Acquisition, 2024
This study aimed to investigate the relationship between lexical coverage and TV viewing comprehension. Previous studies have indicated that 95% to 98% of lexical coverage may be needed for reading comprehension (Hu & Nation, 2000). To understand informal listening passages, lower coverage figures (95%-90%) may suffice. However, no study has…
Descriptors: Television Viewing, Lexicology, Comprehension, Visual Aids
Shao, Lucy; Levine, Richard A.; Guarcello, Maureen A.; Wilke, Morten C.; Stronach, Jeanne; Frazee, James P.; Fan, Juanjuan – International Journal of Artificial Intelligence in Education, 2023
Propensity score matching and weighting methods are applied to balance covariates and reduce selection bias in the analysis of observational study data, and ultimately estimate a treatment effect. We wish to evaluate the impact of a Supplemental Instruction (SI) program on student success in an Introductory Statistics course. In such student…
Descriptors: Statistical Bias, Probability, Scores, Weighted Scores