Publication Date
In 2025 | 27 |
Since 2024 | 95 |
Since 2021 (last 5 years) | 356 |
Since 2016 (last 10 years) | 878 |
Since 2006 (last 20 years) | 2091 |
Descriptor
Interrater Reliability | 3093 |
Foreign Countries | 642 |
Evaluation Methods | 501 |
Test Reliability | 498 |
Test Validity | 406 |
Correlation | 401 |
Scoring | 336 |
Comparative Analysis | 327 |
Scores | 321 |
Validity | 309 |
Student Evaluation | 301 |
Audience
Researchers | 130 |
Practitioners | 42 |
Teachers | 22 |
Administrators | 11 |
Counselors | 3 |
Policymakers | 2 |
Location
Australia | 56 |
Turkey | 52 |
United Kingdom | 46 |
Canada | 45 |
Netherlands | 40 |
California | 37 |
China | 37 |
United States | 30 |
United Kingdom (England) | 24 |
Taiwan | 23 |
Japan | 22 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 3 |
Meets WWC Standards with or without Reservations | 3 |
Does not meet standards | 3 |
Brent J. Goertzen; Kaley Klaus – Research & Practice in Assessment, 2023
When evaluating student learning, educators often employ scoring rubrics, for which quality can be determined through evaluating validity and reliability. This article discusses the norming process utilized in a graduate organizational leadership program for a capstone scoring rubric. Concepts of validity and reliability are discussed, as is the…
Descriptors: Graduate Students, Graduate Study, Graduate School Faculty, Scoring Rubrics
Mahr, Tristan J.; Berisha, Visar; Kawabata, Kan; Liss, Julie; Hustad, Katherine C. – Journal of Speech, Language, and Hearing Research, 2021
Purpose: Acoustic measurement of speech sounds requires first segmenting the speech signal into relevant units (words, phones, etc.). Manual segmentation is cumbersome and time consuming. Forced-alignment algorithms automate this process by aligning a transcript and a speech sample. We compared the phoneme-level alignment performance of five…
Descriptors: Speech, Young Children, Automation, Phonemes
Leung, Yeptain; Oates, Jennifer; Chan, Siew-Pang; Papp, Viktória – Journal of Speech, Language, and Hearing Research, 2021
Purpose: The aim of the study was to examine associations between speaking fundamental frequency (f_os), vowel formant frequencies (F), listener perceptions of speaker gender, and vocal femininity-masculinity. Method: An exploratory study was undertaken to examine associations between f_os, F_1-F_3,…
Descriptors: Acoustics, Speech, Vowels, Femininity
McCarthy, Kathryn S.; Magliano, Joseph P.; Snyder, Jacob O.; Kenney, Elizabeth A.; Newton, Natalie N.; Perret, Cecile A.; Knezevic, Melanie; Allen, Laura K.; McNamara, Danielle S. – Grantee Submission, 2021
The objective in the current paper is to examine the processes of how our research team negotiated meaning using an iterative design approach as we established, developed, and refined a rubric to capture comprehension processes and strategies evident in students' verbal protocols. The overarching project comprises multiple data sets, multiple…
Descriptors: Scoring Rubrics, Interrater Reliability, Design, Learning Processes
Swapneel Thite; Jayashri Ravishankar; Inmaculada Tomeo-Reyes; Araceli Martinez Ortiz – European Journal of Engineering Education, 2024
Effectively working in an engineering workplace requires strong teamwork skills, yet the existing literature within various disciplines reveals discrepancies in evaluating these skills. This complicates the design of a generic teamwork peer evaluation tool for engineering students. This study aims to address this gap by introducing the DRIVE…
Descriptors: Scoring Rubrics, Evaluation Methods, Peer Evaluation, Teamwork
Janice Kinghorn; Katherine McGuire; Bethany L. Miller; Aaron Zimmerman – Assessment Update, 2024
In this article, the authors share their reflections on how different experiences and paradigms have broadened their understanding of the work of assessment in higher education. As they collaborated to create a panel for the 2024 International Conference on Assessing Quality in Higher Education, they recognized that they, as assessment…
Descriptors: Higher Education, Assessment Literacy, Evaluation Criteria, Evaluation Methods
Cristina Menescardi; Aida Carballo-Fazanes; Núria Ortega-Benavent; Isaac Estevan – Journal of Motor Learning and Development, 2024
The Canadian Agility and Movement Skill Assessment (CAMSA) is a valid and reliable circuit-based test of motor competence which can be used to assess children's skills in a live or recorded performance and then coded. We aimed to analyze the intrarater reliability of the CAMSA scores (total, time, and skill score) and time measured, by comparing…
Descriptors: Interrater Reliability, Evaluators, Scoring, Psychomotor Skills
Elizabeth Choi-Tucci; John Sideris; Cristin Holland; Grace T. Baranek; Linda R. Watson – Journal of Speech, Language, and Hearing Research, 2025
Purpose: Intentional communication acts, or purposefully directed vocalizations and gestures, are particularly difficult for infants at elevated likelihood for eventual diagnosis of autism. The ability to measure and track intentional communication in infancy thus has the potential to aid early identification and intervention efforts. This study…
Descriptors: Infants, Autism Spectrum Disorders, Caregiver Child Relationship, Nonverbal Communication
Roessger, Kevin M. – Adult Learning, 2020
Practitioners often struggle to assess reflective learning in the workplace because of difficulties conceptualizing reflection and its effects in the workplace. This article addresses this problem by offering a pragmatic approach to assessment that asks practitioners to specify why they are using reflection, what they are hoping to gain from it,…
Descriptors: Workplace Learning, Evaluation Methods, Reflection, Adult Education
Gyamfi, George; Hanna, Barbara E.; Khosravi, Hassan – Assessment & Evaluation in Higher Education, 2022
Rubrics have been suggested as a means to foster students' evaluative judgement, the capacity to appraise their own work and that of others; however, empirical evidence of rubrics' effectiveness is still emerging. This paper contributes findings from a randomised controlled experiment on the effect of rubrics on evaluative judgement. Participants…
Descriptors: Scoring Rubrics, Evaluative Thinking, Peer Evaluation, Undergraduate Students
Purwadi; Saputra, Wahyu N. E.; Handaka, Irvan B.; Barida, Muya; Wahyudi, Amien; Widyastuti, Dian A.; Agungbudiprabowo; Rodhiya, Zaenab A. – Pegem Journal of Education and Instruction, 2022
This study aims to identify the acceptability and effectiveness of peace guidance based on the perspective of Markesot. This model seeks to reduce student aggressiveness. This study uses the research and development stages by adapting the Borg & Gall model. The participants of this study were 275 students who were taken randomly. The study…
Descriptors: Peace, Guidance, Models, Interrater Reliability
Bodfish, James W.; Lecavalier, Luc; Harrop, Clare; Dallman, Aaron; Kalburgi, Sahana Nagabhushan; Hollway, Jill; Faldowski, Richard; Boyd, Brian A. – Journal of Autism and Developmental Disorders, 2022
For individuals with autism spectrum disorder (ASD), behavioral inflexibility can affect multiple domains of functioning and family life. The objective of this study was to develop and validate a clinical interview version of the Behavioral Inflexibility Scale. Trained interviewers conducted interviews with parents of 144 children with ASD and 70…
Descriptors: Children, Autism, Pervasive Developmental Disorders, Child Behavior
Kaila L. Stipancic; Mojgan Golzy; Yunxin Zhao; Louise Pinkerton; Andrea Rohl; Mili Kuruvilla-Dugdale – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Auditory training has been shown to reduce rater variability in perceptual voice assessment. Because rater variability is also a central issue in the auditory-perceptual assessment of dysarthria, this study sought to determine if training produces a meaningful change in rater reliability, criterion validity, and scaling magnitude of four…
Descriptors: Auditory Training, Auditory Perception, Program Effectiveness, Speech Impairments
Kelly Little; Yongyue Qi; Vanessa D. Jewell – Journal of Occupational Therapy Education, 2023
The Occupation-Centered Intervention Assessment (OCIA) was developed as a reflective tool for students to improve their comprehension of occupation-centered practice. Finding new and innovative ways to incorporate occupation-centered assignments can serve as a strategy to develop student integration of occupation-centered practice and allow…
Descriptors: Occupational Therapy, Allied Health Occupations Education, Interrater Reliability, Intervention
King-Dow Su – Journal of Baltic Science Education, 2024
Building 21st-century life science skills requires educating participants according to STEM abilities. Therefore, this research aimed to examine the effectiveness and feasibility of the STEM ability assessment framework in the practical learning environment. The study uses a STEM coffee preparation experiential activity with a Royal Belgian siphon…
Descriptors: STEM Education, Content Validity, Instructional Effectiveness, Interrater Reliability