ERIC - Search Results

Publication Date

In 2025	27
Since 2024	95
Since 2021 (last 5 years)	356
Since 2016 (last 10 years)	878
Since 2006 (last 20 years)	2091

Descriptor

Interrater Reliability	3093
Foreign Countries	642
Evaluation Methods	501
Test Reliability	498
Test Validity	406
Correlation	401
Scoring	336
Comparative Analysis	327
Scores	321
Validity	309
Student Evaluation	301
Measures (Individuals)	298
Evaluators	291
Rating Scales	282
Statistical Analysis	268
Higher Education	263
Psychometrics	238
Observation	228
Reliability	228
Scoring Rubrics	214
Test Construction	212
Teaching Methods	208
English (Second Language)	203
Writing Evaluation	202
Intervention	200

Education Level

Higher Education	562
Postsecondary Education	408
Elementary Education	280
Secondary Education	177
Early Childhood Education	142
Elementary Secondary Education	119
Middle Schools	108
High Schools	84
Preschool Education	72
Junior High Schools	64
Adult Education	58
Primary Education	55
Kindergarten	45
Grade 4	41
Grade 5	40
Intermediate Grades	40
Grade 1	36
Grade 6	35
Grade 8	32
Grade 3	30
Grade 7	27
Grade 2	25
Grade 10	13
Grade 9	11
Two Year Colleges	8

Audience

Researchers	130
Practitioners	42
Teachers	22
Administrators	11
Counselors	3
Policymakers	2

Location

Australia	56
Turkey	52
United Kingdom	46
Canada	45
Netherlands	40
California	37
China	37
United States	30
United Kingdom (England)	24
Taiwan	23
Japan	22
Pennsylvania	22
Florida	21
Germany	21
Sweden	21
Iran	19
North Carolina	19
Hong Kong	17
Texas	17
Georgia	16
South Korea	16
Israel	15
New Zealand	14
Washington	14
South Africa	13

Laws, Policies, & Programs

No Child Left Behind Act 2001	13
Individuals with Disabilities…	7
Race to the Top	3
Elementary and Secondary…	2
American Recovery and…	1
Americans with Disabilities…	1
Education Consolidation…	1
Education for All Handicapped…	1
Elementary and Secondary…	1
Improving Americas Schools…	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1
Pell Grant Program	1
Rehabilitation Act 1973…	1
Stewart B McKinney Homeless…	1
Temporary Assistance for…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	3
Meets WWC Standards with or without Reservations	3
Does not meet standards	3

Active filter: Interrater Reliability

Showing 16 to 30 of 3,093 results

Assessing Inter-Rater Agreement of the Intellectual Disability-Frailty Index Short Form: A Descriptive Pilot Study

Peer reviewed

Heather Hirst; Jennifer Campbell; Samantha Chamberlin; Ibukun Olagunju; Frank Bird; James K. Luiselli – Journal of Intellectual Disabilities, 2024

Frailty is a health concern for many adults with intellectual disability and should be measured to detect at-risk conditions, monitor disease, plan treatment, and gauge mortality. This descriptive pilot study evaluated measurement consistency (inter-rater agreement) of the Intellectual Disability-Frailty Index Short Form among multiple assessors…

Descriptors: Adults, Intellectual Disability, Physical Health, Aging (Individuals)

Self-Assessment Survey: Evaluation of a Revised Measure Assessing Positive Behavioral Interventions and Supports

Peer reviewed

Angus Kittelman; Sara Izzard; Kent McIntosh; Kelsey R. Morris; Timothy J. Lewis – Assessment for Effective Intervention, 2024

The purpose of this study was to evaluate the psychometric properties of the Self-Assessment Survey (SAS) 4.0, an updated measure assessing implementation fidelity of positive behavioral interventions and supports (PBIS). A total of 627 school personnel from 33 schools in six U.S. states completed the SAS 4.0 during the 2021-2022 school year. We…

Descriptors: Positive Behavior Supports, Teachers, Self Evaluation (Individuals), Test Reliability

A Data-Driven Approach for the Identification of Features for Automated Feedback on Academic Essays

Peer reviewed

Abbas, Mohsin; van Rosmalen, Peter; Kalz, Marco – IEEE Transactions on Learning Technologies, 2023

For predicting and improving the quality of essays, text analytic metrics (surface, syntactic, morphological, and semantic features) can be used to provide formative feedback to the students in higher education. In this study, the goal was to identify a sufficient number of features that exhibit a fair proxy of the scores given by the human raters…

Descriptors: Feedback (Response), Automation, Essays, Scoring

Monitoring Rater Quality in Observational Systems: Issues Due to Unreliable Estimates of Rater Quality

Peer reviewed

Mark White; Matt Ronfeldt – Educational Assessment, 2024

Standardized observation systems seek to reliably measure a specific conceptualization of teaching quality, managing rater error through mechanisms such as certification, calibration, validation, and double-scoring. These mechanisms both support high quality scoring and generate the empirical evidence used to support the scoring inference (i.e.,…

Descriptors: Interrater Reliability, Quality Control, Teacher Effectiveness, Error Patterns

Agree to Disagree: Multiple Methods to Assess Rater Agreement during Student Teaching

Peer reviewed

Elayne P. Colón; Lori M. Dassa; Thomas M. Dana; Nathan P. Hanson – Action in Teacher Education, 2024

To meet accreditation expectations, teacher preparation programs must demonstrate their candidates are evaluated using summative assessment tools that yield sound, reliable, and valid data. These tools are primarily used by the clinical experience team -- university supervisors and mentor teachers. Institutional beliefs regarding best practices…

Descriptors: Student Teachers, Teacher Interns, Evaluation Methods, Interrater Reliability

Statistical Inference for G-Indices of Agreement

Peer reviewed

Bonett, Douglas G. – Journal of Educational and Behavioral Statistics, 2022

The limitations of Cohen's κ are reviewed and an alternative G-index is recommended for assessing nominal-scale agreement. Maximum likelihood estimates, standard errors, and confidence intervals for a two-rater G-index are derived for one-group and two-group designs. A new G-index of agreement for multirater designs is proposed. Statistical…

Descriptors: Statistical Inference, Statistical Data, Interrater Reliability, Design

Evaluating Mathematics Lessons for Cognitive Demand: Applying a Discursive Lens to the Process of Achieving Inter-Rater Reliability

Peer reviewed

Weingarden, Merav; Heyd-Metzuyanim, Einat – Journal of Mathematics Teacher Education, 2023

In this study, we examine "what went wrong" in our professional development program for encouraging cognitively demanding instruction, focusing on the difficulties we encountered in using an observational tool for evaluating this type of instruction and reaching inter-rater reliability. We do so through the lens of a discursive theory of…

Descriptors: Mathematics Instruction, Interrater Reliability, Cognitive Processes, Difficulty Level

What Is the Status of Multi-Informant Treatment Fidelity Research?

Peer reviewed
PDF on ERIC

Bryce D. McLeod; Nicole Porter; Aaron Hogue; Emily M. Becker-Haimes; Amanda Jensen-Doss – Grantee Submission, 2023

Objective: The precise measurement of treatment fidelity (quantity and quality in the delivery of treatment strategies in an intervention) is essential for intervention development, evaluation, and implementation. Various informants are used in fidelity assessment (e.g., observers, practitioners [clinicians, teachers], clients), but these…

Descriptors: Measurement, Fidelity, Educational Research, Evidence Based Practice

Interrater Reliability of the FOCUS-34: Parent-to-Parent and Parent-to-Clinician

Peer reviewed

Barbara Jane Cunningham; Peter Rosenbaum; Anastasia Nepotiuk; Nancy Thomas-Stonell – Communication Disorders Quarterly, 2024

This brief report presents interrater reliability data for the Focus on the Outcomes of Communication Under Six (FOCUS-34) between parents, and between parents and speech-language pathologists (SLPs). Reliability for all three raters combined was good to excellent across three assessments. Reliability for pairs of raters was variable but generally…

Descriptors: Interrater Reliability, Outcome Measures, Preschool Children, Parents

Communal Factors in Rater Severity and Consistency over Time in High-Stakes Oral Assessment

Peer reviewed

Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024

This article focuses on rater severity and consistency and their relation to major changes in the rating system in a high-stakes testing context. The study is based on longitudinal data collected from 2009 to 2019 from the second language (L2) Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. We investigated…

Descriptors: Foreign Countries, Interrater Reliability, Evaluators, Item Response Theory

Reliable Assessment of Pain Behaviour in Adults with Profound Intellectual and Multiple Disabilities: The Development of an Instruction Protocol

Peer reviewed

Enninga, Annemieke; Waninge, Aly; Post, Wendy J.; van der Putten, Annette A. J. – Journal of Applied Research in Intellectual Disabilities, 2023

Background: Persons with profound intellectual and multiple disabilities (PIMD) are vulnerable when it comes to experiencing pain. Reliable assessment of pain-related behaviour in these persons is difficult. Aim: To determine how pain items can be reliably scored in adults with PIMD. Methods: We developed an instruction protocol for the…

Descriptors: Test Reliability, Pain, Behavior, Adults

Citation Metrics and Boyer's Model of Scholarship: How Do Bibliometrics and Altmetrics Respond to Research Impact?

Peer reviewed

Gilstrap, Donald L.; Whitver, Sara Maurice; Scalfani, Vincent F.; Bray, Nathaniel J. – Innovative Higher Education, 2023

This article explores how well bibliometrics and altmetrics reflect research impact in relation to Boyer's Model of Scholarship. Indices used for both types of metrics are explored and discussed while including an analysis on primary methodological works performed on each in the literature to date. As confirmatory in nature, we chose as our…

Descriptors: Bibliometrics, Models, Scholarship, Research

Which Blueberries Are Better Value? The Development and Validation of the Functional Numeracy Assessment for Adults with Aphasia

Peer reviewed

Ichikowitz, Kerri; Bruce, Carolyn; Meitanis, Vanessa; Cheung, Kelly; Kim, Yekyung; Talbourdet, Esther; Newton, Caroline – International Journal of Language & Communication Disorders, 2023

Background: People with aphasia (PWA) can experience functional numeracy difficulties, that is, problems understanding or using numbers in everyday life, which can have numerous negative impacts on their daily lives. There is growing interest in designing functional numeracy interventions for PWA; however, there are limited suitable assessments…

Descriptors: Test Construction, Test Validity, Numeracy, Adults

Measuring and Visualizing Coders' Reliability: New Approaches and Guidelines from Experimental Data

Peer reviewed

Lamprianou, Iasonas – Sociological Methods & Research, 2023

This study investigates inter- and intracoder reliability, proposing a new approach based on social network analysis (SNA) and exponential random graph models (ERGM). During a recent exit poll, the responses of voters to two open-ended questions were recorded. A coding experiment was conducted where a group of coders coded a sample of text…

Descriptors: Interrater Reliability, Coding, Social Networks, Network Analysis

Do Mathematicians and Undergraduates Agree about Explanation Quality?

Peer reviewed

Evans, Tanya; Mejía-Ramos, Juan Pablo; Inglis, Matthew – Educational Studies in Mathematics, 2022

Offering explanations is a central part of teaching mathematics, and understanding those explanations is a vital activity for learners. Given this, it is natural to ask what makes a good mathematical explanation. This question has received surprisingly little attention in the mathematics education literature, perhaps because the field has no…

Descriptors: Mathematics, Professional Personnel, Undergraduate Students, Mathematics Activities

Source

ProQuest LLC	86
Educational and Psychological…	61
Journal of Speech, Language,…	61
Journal of Autism and…	56
Grantee Submission	40
Language Testing	37
Online Submission	35
Assessment & Evaluation in…	33
International Journal of…	33
Research in Developmental…	31
Applied Measurement in…	28
Assessment for Effective…	26
Advances in Health Sciences…	25
ETS Research Report Series	25
Journal of Educational…	24
Educational Measurement:…	22
Measurement in Physical…	20
Language Assessment Quarterly	19
Psychology in the Schools	19
Topics in Early Childhood…	19
Psychological Assessment	18
Educational Assessment	16
Autism: The International…	15
Journal of Consulting and…	15
Personnel Psychology	15

Author

Lunz, Mary E.	10
Wind, Stefanie A.	10
Engelhard, George, Jr.	8
Epstein, Michael H.	8
Ingham, Roger J.	8
Johnson, Evelyn S.	8
Matson, Johnny L.	7
McLeod, Bryce D.	7
Moylan, Laura A.	7
Cason, Carolyn L.	6
Cordes, Anne K.	6
Jaeger, Richard M.	6
Johnson, Robert L.	6
Lecavalier, Luc	6
Plake, Barbara S.	6
Tasse, Marc J.	6
Wyse, Adam E.	6
Zheng, Yuzhu	6
Aman, Michael G.	5
Barton, Erin E.	5
Cason, Gerald J.	5
Coniam, David	5
Conroy, Maureen A.	5
Crawford, Angela R.	5

Publication Type

Journal Articles	2526
Reports - Research	2212
Reports - Evaluative	515
Speeches/Meeting Papers	272
Reports - Descriptive	163
Tests/Questionnaires	162
Information Analyses	129
Dissertations/Theses -…	89
Opinion Papers	61
Numerical/Quantitative Data	31
Guides - Non-Classroom	11
Books	7
Collected Works - General	3
Guides - Classroom - Teacher	3
Non-Print Media	3
Book/Product Reviews	2
Collected Works - Serials	2
Dissertations/Theses	2
ERIC Digests in Full Text	2
ERIC Publications	2
Guides - General	2
Reports - General	2
Collected Works - Proceedings	1
Reference Materials -…	1
Reference Materials - General	1

Assessments and Surveys

Test of English as a Foreign…	29
Child Behavior Checklist	18
National Assessment of…	14
Vineland Adaptive Behavior…	14
Autism Diagnostic Observation…	13
Strengths and Difficulties…	10
Woodcock Johnson Tests of…	10
Peabody Picture Vocabulary…	9
Wechsler Intelligence Scale…	9
Behavior Assessment System…	8
Dynamic Indicators of Basic…	8
Early Childhood Environment…	8
Graduate Record Examinations	8
SAT (College Admission Test)	8
International English…	6
Teacher Performance…	6
Advanced Placement…	5
Behavioral and Emotional…	5
Childhood Autism Rating Scale	5
Conners Teacher Rating Scale	5
Draw a Person Test	5
Raven Progressive Matrices	5
ACT Assessment	4
ACTFL Oral Proficiency…	4
Battelle Developmental…	4