Publication Date
In 2025 | 227 |
Since 2024 | 858 |
Since 2021 (last 5 years) | 3728 |
Since 2016 (last 10 years) | 10252 |
Since 2006 (last 20 years) | 18354 |
Descriptor
Scores | 21330 |
Foreign Countries | 7446 |
Correlation | 4506 |
Comparative Analysis | 4144 |
Academic Achievement | 3853 |
Statistical Analysis | 3719 |
Teaching Methods | 2920 |
Student Attitudes | 2672 |
Gender Differences | 2375 |
Second Language Learning | 2253 |
Measures (Individuals) | 2244 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 21330 |
Journal Articles | 17653 |
Tests/Questionnaires | 1155 |
Speeches/Meeting Papers | 931 |
Numerical/Quantitative Data | 727 |
Information Analyses | 181 |
Reports -… | 26 |
Reports - Evaluative | 24 |
Opinion Papers | 20 |
Books | 18 |
Dissertations/Theses -… | 15 |
More ▼ |
Education Level
Audience
Practitioners | 185 |
Policymakers | 176 |
Researchers | 161 |
Teachers | 79 |
Administrators | 35 |
Counselors | 19 |
Community | 10 |
Parents | 6 |
Students | 6 |
Media Staff | 3 |
Support Staff | 1 |
More ▼ |
Location
Turkey | 1104 |
China | 399 |
Australia | 377 |
Canada | 370 |
California | 349 |
Texas | 321 |
Iran | 299 |
United States | 282 |
Florida | 281 |
United Kingdom | 278 |
Taiwan | 257 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 42 |
Meets WWC Standards with or without Reservations | 78 |
Does not meet standards | 65 |
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
Kelly Edwards; James Soland – Educational Assessment, 2024
Classroom observational protocols, in which raters observe and score the quality of teachers' instructional practices, are often used to evaluate teachers for consequential purposes despite evidence that scores from such protocols are frequently driven by factors, such as rater and temporal effects, that have little to do with teacher quality. In…
Descriptors: Classroom Observation Techniques, Teacher Evaluation, Accuracy, Scores
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Bal-Sezerel, Bilge; Atesgöz, N. Nazli; Kirisçi, Nilgün – Journal of Theoretical Educational Science, 2023
The Flynn effect, which advocated that there was a rise in the global IQ score, was widely accepted by the relevant scientific community. However, there are recent research findings that this effect has been reversed. In this study, both Flynn and anti-Flynn effects were investigated. The purpose of this study is to analyze students' general,…
Descriptors: Intelligence Tests, Scores, Elementary School Students, Intelligence Quotient
Abigail R. Vild; Maggie E. Wilson; Christopher A. Was – Journal of Research in Education, 2025
Theories of self-regulated learning suggest a positive link between knowledge monitoring accuracy (the ability to predict test performance) and performance on tests. Put differently, students who accurately monitor their knowledge of course content more efficiently regulate study of course materials. However, a plethora of literature indicates…
Descriptors: Student Satisfaction, Undergraduate Students, Scores, Prediction
Stefan O'Grady – TESOL Journal, 2025
Task-based language assessment represents a major component of task-based language teaching syllabi. Current perspectives emphasise the importance of tasks in the assessment process, suggesting that adherence to influential models of language production during task design yields predictable test outcomes. The current study contends that the…
Descriptors: Task Analysis, Language Tests, Evaluators, Rating Scales
Lauren E. Bates; Sarah J. Myers; Edward L. DeLosh; Matthew G. Rhodes – Psychology Learning and Teaching, 2025
The present work assessed a quizzing method that combines the benefits of retrieval practice and feedback, whereby learners must continue taking quizzes until they achieve a perfect score with feedback provided (i.e., "mastery quizzing"). Across four experiments (n = 952; age 18-76, M = 37.10, SD = 11.61; 50% female, 48% male, 2% other…
Descriptors: Mastery Tests, Retention (Psychology), Evaluation Methods, Adults
Blake H. Heller – Annenberg Institute for School Reform at Brown University, 2024
In 2016, the GED® introduced college readiness benchmarks designed to identify testers who are academically prepared for credit-bearing college coursework. The benchmarks are promoted as awarding college credits or exempting "college-ready" GED® graduates from remedial coursework. I show descriptive evidence that those identified as…
Descriptors: High School Equivalency Programs, College Readiness, Eligibility, Benchmarking
Marion Durbahn; Michael Rodgers; Marijana Macis; Elke Peters – Studies in Second Language Acquisition, 2024
This study aimed to investigate the relationship between lexical coverage and TV viewing comprehension. Previous studies have indicated that 95% to 98% of lexical coverage may be needed for reading comprehension (Hu & Nation, 2000). To understand informal listening passages, lower coverage figures (95%-90%) may suffice. However, no study has…
Descriptors: Television Viewing, Lexicology, Comprehension, Visual Aids
Kristen Bottema-Beutel; Shannon Crowley LaPoint; So Yoon Kim; Sarah Mohiuddin; Qun Yu; Rachael McKinnon – Exceptional Children, 2024
In this secondary analysis of a previously conducted systematic review, we analyze social validity assessments in intervention research for transition-age autistic youth. Social validity is concerned with the acceptability of the intervention goals, the acceptability and feasibility of the intervention procedures, and the perceived importance of…
Descriptors: Autism Spectrum Disorders, Intervention, Validity, Psychometrics
Bahar Saberzadeh-Ardestani; Ali Reza Sima; Bardia Khosravi; Meredith Young; Sara Mortaz Hejri – Advances in Health Sciences Education, 2024
Few studies have engaged in data-driven investigations of the presence, or frequency, of what could be considered retaliatory assessor behaviour in Multi-source Feedback (MSF) systems. In this study, authors explored how assessors scored others if, before assessing others, they received their own assessment score. The authors examined assessments…
Descriptors: Feedback (Response), Scores, Evaluators, Behavior
Karen B. Schmaling; Gabriel R. Evenson; Blake K. Marble; Stephen A. Gallo – Research Evaluation, 2024
Peer review is integral to the evaluation of grant proposals. Reviewer perceptions and characteristics have received limited study, especially their associations with reviewers' evaluations. This mixed methods study analyzed the unstructured comments of 270 experienced peer reviewers after they scored proposals based on mock overall evaluations…
Descriptors: Peer Evaluation, Grants, Evaluation Research, Program Proposals
Danielle S. McNamara; Micah Watanabe; Linh Huynh; Kathryn S. McCarthy; Larua K. Allen; Joseph P. Magliano – Grantee Submission, 2023
Writing an integrated essay based on multiple-documents requires students to both comprehend the documents and integrate the documents into a coherent essay. In the current study, we examined the effects of summarization as a potential reading strategy to enhance participants' multiple-document comprehension and integrated essay writing.…
Descriptors: Reading Strategies, Reading Comprehension, Essays, Scores
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2024
Rapid guessing (RG) is a form of non-effortful responding that is characterized by short response latencies. This construct-irrelevant behavior has been shown in previous research to bias inferences concerning measurement properties and scores. To mitigate these deleterious effects, a number of response time threshold scoring procedures have been…
Descriptors: Reaction Time, Scores, Item Response Theory, Guessing (Tests)
Gerhard Tutz; Pascal Jordan – Journal of Educational and Behavioral Statistics, 2024
A general framework of latent trait item response models for continuous responses is given. In contrast to classical test theory (CTT) models, which traditionally distinguish between true scores and error scores, the responses are clearly linked to latent traits. It is shown that CTT models can be derived as special cases, but the model class is…
Descriptors: Item Response Theory, Responses, Scores, Models