Publication Date
| In 2026 | 1 |
| Since 2025 | 672 |
| Since 2022 (last 5 years) | 4054 |
| Since 2017 (last 10 years) | 11845 |
| Since 2007 (last 20 years) | 29242 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Policymakers | 492 |
| Practitioners | 488 |
| Researchers | 349 |
| Teachers | 336 |
| Administrators | 189 |
| Parents | 68 |
| Community | 67 |
| Students | 45 |
| Counselors | 33 |
| Media Staff | 7 |
| Support Staff | 4 |
| More ▼ | |
Location
| Turkey | 1166 |
| Texas | 790 |
| California | 740 |
| Florida | 603 |
| United States | 572 |
| Canada | 516 |
| Australia | 504 |
| China | 490 |
| North Carolina | 441 |
| New York | 384 |
| United Kingdom | 380 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 65 |
| Meets WWC Standards with or without Reservations | 112 |
| Does not meet standards | 116 |
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023
Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…
Descriptors: Test Reliability, Achievement Tests, Computation, Test Items
Kim, Stella Y.; Lee, Won-Chan; Kolen, Michael J. – Educational and Psychological Measurement, 2020
A theoretical and conceptual framework for true-score equating using a simple-structure multidimensional item response theory (SS-MIRT) model is developed. A true-score equating method, referred to as the SS-MIRT true-score equating (SMT) procedure, also is developed. SS-MIRT has several advantages over other complex multidimensional item response…
Descriptors: Item Response Theory, Equated Scores, True Scores, Accuracy
Wyse, Adam E. – Educational Measurement: Issues and Practice, 2020
One commonly used compromise standard-setting method is the Beuk (1984) method. A key assumption of the Beuk method is that the emphasis given to the pass rate and the percent correct ratings should be proportional to the extent that the panelists agree on their ratings. However, whether the slope of Beuk line reflects the emphasis that panelists…
Descriptors: Standard Setting (Scoring), Cutting Scores, Weighted Scores, Evaluation Methods
Kwok, Elaine; Feiner, Hannah; Grauzer, Jeffrey; Kaat, Aaron; Roberts, Megan Y. – Journal of Speech, Language, and Hearing Research, 2022
Purpose: Norm-referenced, standardized measures are tools designed to characterize a child's abilities relative to their same-age peers, but they also have been used to measure changes in skills during intervention. This study compared the psychometric properties of four types of available scores from one commonly used standardized measure, the…
Descriptors: Language Tests, Preschool Children, Norm Referenced Tests, Standardized Tests
Allen, Jeff – ACT, Inc., 2022
The COVID-19 pandemic caused widespread disruptions to the educational system in Arkansas and across the United States. At the onset of the pandemic in March 2020, schools in Arkansas were forced to replace on-site instruction with virtual instruction. During the 2020-2021 academic year, there were three student instructional options:…
Descriptors: COVID-19, Pandemics, Academic Achievement, Electronic Learning
Megan Kuhfeld; James Soland; Karyn Lewis – Annenberg Institute for School Reform at Brown University, 2022
The COVID-19 pandemic has been a seismic and on-going disruption to K-12 schooling. Using test scores from 5.4 million U.S. students in grades 3-8, we tracked changes in math and reading achievement across the first two years of the pandemic. Average fall 2021 math test scores in grades 3-8 were 0.20-27 standard deviations (SDs) lower relative to…
Descriptors: COVID-19, Pandemics, Scores, Elementary School Students
Gregory Chernov – Evaluation Review, 2025
Most existing solutions to the current replication crisis in science address only the factors stemming from specific poor research practices. We introduce a novel mechanism that leverages the experts' predictive abilities to analyze the root causes of replication failures. It is backed by the principle that the most accurate predictor is the most…
Descriptors: Replication (Evaluation), Prediction, Scientific Research, Failure
Masahiro Hirai; Ayaka Ikeda; Takeo Kato; Takahiro Ikeda; Kosuke Asada; Yoko Hakuno; Kanae Matsushima; Tomonari Awaya; Shin Okazaki; Toshihiro Kato; Toshio Heike; Masatoshi Hagiwara; Takanori Yamagata; Kiyotaka Tomiwa; Ryo Kimura – Journal of Autism and Developmental Disorders, 2025
Purpose: With the current study, we aimed to reveal the similarities and differences in sensory profiles between Williams syndrome (WS) and autism spectrum disorder. Methods: Using the sensory profile questionnaire completed by the caregivers, we analyzed the WS (n = 60, 3.4-19.8 years) and autistic (n = 39, 4.2-14.0 years) groups. Results: The…
Descriptors: Sensory Experience, Profiles, Autism Spectrum Disorders, Genetic Disorders
Wei Ping Sze; Jane Warren; Carol Sacchett; Wendy Best – International Journal of Language & Communication Disorders, 2025
Background: Current clinical approaches to the treatment of spoken word-finding difficulties in acquired aphasia encourage multimodal cueing, especially the joint application of written and spoken forms. Research that exclusively examines the effects and mechanisms of written cues is limited, with most studies engaging written forms only as part…
Descriptors: Oral Language, Chronic Illness, Aphasia, Orthographic Symbols
Mohammad Ghulam Ali – Online Submission, 2025
This research article establishes the relationship between key performance indicators and the academic and research quality performance and assessment and quality assurance of any large multidisciplinary academic and research institution or university. The indicators in terms of qualitative and quantitative are being proposed below and are…
Descriptors: Research Universities, Reputation, Educational Quality, Educational Assessment
Min Wu; Peiyao Tian; Daner Sun; Dan Feng; Ma Luo – International Journal of Science and Mathematics Education, 2025
This study aimed to develop a comprehensive diagnostic tool for assessing upper-secondary school students' understanding of isomers, expanding upon existing two- and three-tier conceptual diagnostic methods. By incorporating 'Confidence Rating Factor' tiers within the answer and reason sections, a four-tier test was designed and developed. This…
Descriptors: Secondary School Science, High School Students, Scientific Concepts, Science Tests
Hamed Ghaemi; Robert Kirkpatrick – Language Testing in Asia, 2025
This study examines the impact of teacher metapathy, or teachers' capacity to comprehend and sympathize with students' emotional and cognitive requirements, on the academic outcomes of IELTS applicants, such as total band scores, motivation, and language learning orientation. Ten IELTS teachers from five language training centers and 100 IELTS…
Descriptors: Second Language Learning, Language Tests, English (Second Language), Scores
Owen Henkel; Hannah Horne-Robinson; Libby Hills; Bill Roberts; Josh McGrane – International Journal of Artificial Intelligence in Education, 2025
This paper reports on a set of three recent experiments utilizing large-scale speech models to assess the oral reading fluency (ORF) of students in Ghana. While ORF is a well-established measure of foundational literacy, assessing it typically requires one-on-one sessions between a student and a trained rater, a process that is time-consuming and…
Descriptors: Foreign Countries, Oral Reading, Reading Fluency, Literacy
Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025
Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…
Descriptors: Scores, Test Theory, Test Items, Testing
Jiawei Xiong; George Engelhard; Allan S. Cohen – Measurement: Interdisciplinary Research and Perspectives, 2025
It is common to find mixed-format data results from the use of both multiple-choice (MC) and constructed-response (CR) questions on assessments. Dealing with these mixed response types involves understanding what the assessment is measuring, and the use of suitable measurement models to estimate latent abilities. Past research in educational…
Descriptors: Responses, Test Items, Test Format, Grade 8

Peer reviewed
Direct link
