Elayne P. Colón; Lori M. Dassa; Thomas M. Dana; Nathan P. Hanson – Action in Teacher Education, 2024
To meet accreditation expectations, teacher preparation programs must demonstrate their candidates are evaluated using summative assessment tools that yield sound, reliable, and valid data. These tools are primarily used by the clinical experience team -- university supervisors and mentor teachers. Institutional beliefs regarding best practices…
Descriptors: Student Teachers, Teacher Interns, Evaluation Methods, Interrater Reliability
Verdugo, Miguel Angel; Vicente, Eva; Guillén, Verónica Marina; Sánchez, Sergio; Ibáñez, Alba; Gómez, Laura Elisabet – International Journal of Developmental Disabilities, 2023
Background: Appropriate supports and instructional practices contribute to the development of self-determination. Also, research shows that the promotion of skills related to self-determination has been linked to the achievement of desired outcomes over the different life stages. Advances in self-determination require the development of assessment…
Descriptors: Measures (Individuals), Self Determination, Intellectual Disability, Test Reliability
Huscroft-D'Angelo, Jacqueline; Wery, Jessica; Martin, Jodie Diane; Pierce, Corey; Crawford, Lindy – Behavioral Disorders, 2021
"The Scales for Assessing Emotional Disturbance--Third Edition Rating Scale" (SAED-3 RS; Epstein et al.) is a standardized, norm-referenced measure designed to aid in the identification process by providing useful data to professionals determining eligibility of students with an emotional disturbance (ED). Three studies are reported to…
Descriptors: Measures (Individuals), Emotional Disturbances, Test Reliability, Interrater Reliability
Anna Kay Steadman – ProQuest LLC, 2023
The Performance Assessment and Evaluation System (PAES) is used by all major universities in the state of Utah to measure the effective teaching skills of preservice candidates as they progress through their teaching preparation program. The resulting ratings are used to make high-stakes decisions relating to course completion as well as…
Descriptors: Preservice Teachers, Student Evaluation, Teaching Skills, Elementary School Teachers
Fuentes, Milton A.; Reyes-Portillo, Jazmin A.; Tineo, Petty; Gonzalez, Kenny; Butt, Mamona – Hispanic Journal of Behavioral Sciences, 2021
While skin color is relevant and important in the Latinx community, as it is associated with colorism, little is known about how often it is measured or the best way to measure it. This article presents results from two studies examining these key concerns in three prominent journals, where Latinx research is typically published (i.e., the…
Descriptors: Hispanic Americans, Measures (Individuals), Undergraduate Students, Social Bias
Arielle Boguslav; Julie Cohen – Journal of Teacher Education, 2024
Teacher preparation programs are increasingly expected to use data on preservice teacher (PST) skills to drive program improvement and provide targeted supports. Observational ratings are especially vital, but also prone to measurement issues. Scores may be influenced by factors unrelated to PSTs' instructional skills, including rater standards.…
Descriptors: Preservice Teachers, Measures (Individuals), Evaluation Problems, Teaching Skills
Toma, Radu Bogdan – Technology, Knowledge and Learning, 2023
The development of computational thinking skills is attracting attention worldwide. The use of visual or block-based coding in primary schools has gained momentum. Yet, students' acceptance of such coding environments has been neglected in the literature. This study presents a measurement instrument that will allow pursuing such an endeavor. The…
Descriptors: Computation, Thinking Skills, Coding, Measurement
King-Dow Su – Journal of Baltic Science Education, 2024
Building 21st-century life science skills requires educating participants according to STEM abilities. Therefore, this research aimed to examine the effectiveness and feasibility of the STEM ability assessment framework in the practical learning environment. The study uses STEM coffee preparation experiential activity with a Royal Belgian siphon…
Descriptors: STEM Education, Content Validity, Instructional Effectiveness, Interrater Reliability
Gao, Ruiqin; Raygoza, Alyssa; Distefano, Christine; Greer, Fred; Dowdy, Erin – School Psychology International, 2022
The Pediatric Symptom Checklist-17 (PSC-17) is a popular screening instrument used by parents and clinicians to assess children's behavioral functioning. However, more schools are examining the potential of the PSC-17 as part of a Multi-Tier System of Support framework. To investigate the potential of the PSC-17 in the schools, a sample of 1,779…
Descriptors: Check Lists, Measures (Individuals), Screening Tests, Child Behavior
Davidow, Jason H.; Ye, Jun; Edge, Robin L. – International Journal of Language & Communication Disorders, 2023
Background: Speech-language pathologists often multitask in order to be efficient with their commonly large caseloads. In stuttering assessment, multitasking often involves collecting multiple measures simultaneously. Aims: The present study sought to determine reliability when collecting multiple measures simultaneously versus individually.…
Descriptors: Graduate Students, Measurement, Reliability, Group Activities
Dankiw, Kylie A.; Baldock, Katherine L.; Kumar, Saravana; Tsiros, Margarita D. – Australasian Journal of Early Childhood, 2021
Identifying and describing children's play behaviours is an important component of evaluating child development. The Behaviour Mapping Schedule is a direct observational tool which aims to describe and quantify children's play behaviours but is yet to undergo reliability testing. This study aimed to determine the intra- and inter-rater reliability…
Descriptors: Interrater Reliability, Classification, Child Behavior, Play
Starmer, Heather M.; Arrese, Loni; Langmore, Susan; Ma, Yifei; Murray, Joseph; Patterson, Joanne; Pisegna, Jessica; Roe, Justin; Tabor-Gray, Lauren; Hutcheson, Katherine – Journal of Speech, Language, and Hearing Research, 2021
Purpose: While flexible endoscopic evaluation of swallowing (FEES) is a common clinical procedure used in the head and neck cancer (HNC) population, extant outcome measures for FEES such as bolus-level penetration-aspiration and residue scores are not well suited as global patient-level endpoint measures of dysphagia severity in cooperative group…
Descriptors: Medical Evaluation, Physical Disabilities, Safety, Efficiency
Mary M. Stone; Sudi Kash; Teresa Butler; Karolina Callahan; Miguel A. Verdugo; Laura E. Gómez – Journal of Developmental and Physical Disabilities, 2020
Quality of life (QoL) is a key outcome used to monitor service planning and delivery for individuals with Intellectual and Developmental Disabilities (IDD). Unfortunately, many current instruments used to measure QoL have psychometric and content limitations and none are suitable for use with individuals with the lowest levels of functioning and…
Descriptors: Quality of Life, Autism Spectrum Disorders, Residential Care, Measures (Individuals)
McDonald, Margarethe; Kwon, Taeahn; Kim, Hyunji; Lee, Youngki; Ko, Eon-Suk – Journal of Speech, Language, and Hearing Research, 2021
Purpose: The algorithm of the Language ENvironment Analysis (LENA) system for calculating language environment measures was trained on American English; thus, its validity with other languages cannot be assumed. This article evaluates the accuracy of the LENA system applied to Korean. Method: We sampled sixty 5-min recording clips involving 38 key…
Descriptors: Computational Linguistics, Korean, Audio Equipment, Accuracy
An, Mihee; Nord, Jayden; Koziol, Natalie A.; Dusing, Stacey C.; Kane, Audrey E.; Lobo, Michele A.; McCoy, Sarah W.; Harbourne, Regina T. – Grantee Submission, 2021
Aim: To describe the development of an intervention-specific fidelity measure and its utilization and to determine whether the newly developed Sitting Together and Reaching to Play (START-Play) intervention was implemented as intended. Also, to quantify differences between START-Play and usual early intervention (uEI) services. Method: A fidelity…
Descriptors: Test Construction, Measures (Individuals), Fidelity, Early Intervention