Publication Date
In 2025 | 1 |
Since 2024 | 7 |
Since 2021 (last 5 years) | 21 |
Since 2016 (last 10 years) | 49 |
Since 2006 (last 20 years) | 87 |
Descriptor
Interrater Reliability | 130 |
Literature Reviews | 38 |
Research Methodology | 29 |
Meta Analysis | 24 |
Educational Research | 22 |
Intervention | 19 |
Evaluation Methods | 18 |
Research Design | 16 |
Journal Articles | 15 |
Test Validity | 15 |
Coding | 14 |
More ▼ |
Source
Author
Publication Type
Information Analyses | 130 |
Journal Articles | 109 |
Reports - Research | 68 |
Speeches/Meeting Papers | 13 |
Reports - Evaluative | 12 |
Opinion Papers | 3 |
Dissertations/Theses | 1 |
Dissertations/Theses -… | 1 |
Tests/Questionnaires | 1 |
Education Level
Audience
Researchers | 4 |
Practitioners | 1 |
Location
China | 3 |
South Africa | 3 |
Australia | 2 |
Canada | 2 |
Israel | 2 |
Netherlands | 2 |
Turkey | 2 |
United Kingdom | 2 |
United States | 2 |
Argentina | 1 |
Belgium | 1 |
More ▼ |
Laws, Policies, & Programs
Americans with Disabilities… | 1 |
Education for All Handicapped… | 1 |
Elementary and Secondary… | 1 |
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Rehabilitation Act 1973… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Does not meet standards | 1 |
Julia Brochey-Taylor; Joseph A. Taylor – Educational Research and Reviews, 2024
The purpose of this synthesis study was to assess the reliability and validity of the Draw-A-Scientist Test (DAST) and its variations across multiple studies, aiming to understand limitations and propose modifications for future application within and beyond the science domain. Given the existence of multiple DAST versions, this study quantified…
Descriptors: Cognitive Tests, Freehand Drawing, Personality Measures, Projective Measures
Kim, Soo Youn; Lecavalier, Luc – Journal of Autism and Developmental Disorders, 2022
The current review examined the use of self-report measures in autistic individuals in the context of psychiatric assessments. It focused on inter-rater agreement, internal consistency, test-retest reliability, and criterion validity with clinical diagnoses. It also gathered information on constructs measured, the nature of the samples, and the…
Descriptors: Measurement Techniques, Autism Spectrum Disorders, Psychiatric Services, Literature Reviews
Alexandra M. Pierce; Lisa M. H. Sanetti; Melissa A. Collier-Meek; Austin H. Johnson – Grantee Submission, 2024
Visual analysis is the primary methodology used to determine treatment effects from graphed single-case design data. Previous studies have demonstrated mixed findings related to interrater agreement between both expert and novice visual analysts, which represents a critical limitation of visual analysis and supports calls for also presenting…
Descriptors: Graphs, Interrater Reliability, Statistical Analysis, Expertise
Elizabeth J. Preas; Mary E. Halbur; Regina A. Carroll – Analysis of Verbal Behavior, 2024
Procedural fidelity refers to the degree to which procedures for an assessment or intervention (i.e., independent variables) are implemented consistent with the prescribed protocols. Procedural fidelity is an important factor in demonstrating the internal validity of an experiment and clinical treatments. Previous reviews evaluating the inclusion…
Descriptors: Verbal Communication, Behavioral Science Research, Periodicals, Fidelity
Bryce D. McLeod; Nicole Porter; Aaron Hogue; Emily M. Becker-Haimes; Amanda Jensen-Doss – Grantee Submission, 2023
Objective: The precise measurement of treatment fidelity (quantity and quality in the delivery of treatment strategies in an intervention) is essential for intervention development, evaluation, and implementation. Various informants are used in fidelity assessment (e.g., observers, practitioners [clinicians, teachers], clients), but these…
Descriptors: Measurement, Fidelity, Educational Research, Evidence Based Practice
Cheung, Kason Ka Ching; Tai, Kevin W. H. – Research in Science & Technological Education, 2023
Background: Intercoder reliability is a statistic commonly reported by researchers to demonstrate the rigour of coding procedures during data analysis. Its importance is debatable in the analysis of qualitative interview data. It raises a question on whether researchers should identify the same codes and themes in a transcript or they should…
Descriptors: Interrater Reliability, Data Analysis, Interviews, Research Methodology
Liu, Yilan; Lee, Sue Ann S.; Chen, Wenjun – Journal of Speech, Language, and Hearing Research, 2022
Introduction: Assessment of resonance characteristics is essential in research and clinical practice in individuals with velopharyngeal impairment. The purpose of this study was to systematically review correlations between auditory perceptual ratings and nasalance scores obtained by a nasometer in individuals with resonance disorders and to…
Descriptors: Correlation, Auditory Perception, Meta Analysis, Guidelines
Kübra Karakaya Özyer – Journal of Educators Online, 2025
This meta-analytic study investigates the impact of online peer assessment on academic achievement in higher education. By synthesizing 20 effect sizes, we provide a comprehensive understanding of how online peer assessment influences student learning outcomes. The findings reveal a statistically significant positive effect (Hedges's g = 0.672),…
Descriptors: Electronic Learning, Peer Evaluation, Higher Education, Meta Analysis
Chao Han; Binghan Zheng; Mingqing Xie; Shirong Chen – Interpreter and Translator Trainer, 2024
Human raters' assessment of interpreting is a complex process. Previous researchers have mainly relied on verbal reports to examine this process. To advance our understanding, we conducted an empirical study, collecting raters' eye-movement and retrospection data in a computerised interpreting assessment in which three groups of raters (n = 35)…
Descriptors: Foreign Countries, College Students, College Graduates, Interrater Reliability
Fuentes, Milton A.; Reyes-Portillo, Jazmin A.; Tineo, Petty; Gonzalez, Kenny; Butt, Mamona – Hispanic Journal of Behavioral Sciences, 2021
While skin color is relevant and important in the Latinx community, as it is associated with colorism, little is known about how often it is measured or the best way to measure it. This article presents results from two studies examining these key concerns in three prominent journals, where Latinx research is typically published (i.e., the…
Descriptors: Hispanic Americans, Measures (Individuals), Undergraduate Students, Social Bias
Norouzian, Reza – Studies in Second Language Acquisition, 2021
There has recently been a surge of interest in improving the replicability of second language (L2) research. However, less attention is paid to replicability in the context of L2 meta-analyses. I argue that conducting interrater reliability (IRR) analyses is a key step toward improving the replicability of L2 meta-analyses. To that end, I first…
Descriptors: Interrater Reliability, Second Languages, Language Research, Meta Analysis
Catharine Lory; Emily Gregori – Behavioral Disorders, 2024
Systematic reviews of single-case experimental research (SCER) in special education often use the What Works Clearinghouse (WWC) Standards to assess the methodological rigor of studies within a given literature base. While significant changes were made between the two most recent versions of the WWC standards, no research to date has evaluated the…
Descriptors: Program Effectiveness, Standards, Evidence, Case Studies
Lambert, Matthew C.; Sointu, Erkko T.; Epstein, Michael H. – International Journal of School & Educational Psychology, 2019
Child assessment practices have undergone, and are continuing to undergo, significant changes. Among the most prominent changes is the movement toward measuring child well-being, in general, and emotional and behavioral strengths, in particular. The Behavioral and Emotional Rating Scale (BERS) is a strength-based instrument which is widely used in…
Descriptors: Behavior Rating Scales, Translation, Psychometrics, Scores
Patel, Priya; Lee, Seungmin; Myers, Nicholas D.; Lee, Mei-Hua – Journal of Motor Learning and Development, 2021
Missing data incidents are common in experimental studies of motor learning and development. Inadequate handling of missing data may lead to serious problems, such as addition of bias, reduction in power, and so on. Thus, this study aimed to conduct a systematic review of the past (2007) and present (2017) practices used for reporting and…
Descriptors: Motor Development, Research Reports, Periodicals, Research Methodology
Moeyaert, Mariola; Yang, Panpan; Xu, Xinyun; Kim, Esther – Grantee Submission, 2021
Hierarchical linear modeling (HLM) has been recommended as a meta-analytic technique for the quantitative synthesis of single-case experimental design (SCED) studies. The HLM approach is flexible and can model a variety of different SCED data complexities, such as intervention heterogeneity. A major advantage of using HLM is that participant…
Descriptors: Meta Analysis, Case Studies, Research Design, Hierarchical Linear Modeling