Publication Date
In 2025 | 285 |
Since 2024 | 1149 |
Since 2021 (last 5 years) | 3719 |
Since 2016 (last 10 years) | 7918 |
Since 2006 (last 20 years) | 15095 |
Descriptor
Test Reliability | 14751 |
Test Validity | 10028 |
Reliability | 9655 |
Foreign Countries | 6903 |
Test Construction | 4695 |
Validity | 4150 |
Measures (Individuals) | 3801 |
Factor Analysis | 3768 |
Psychometrics | 3447 |
Interrater Reliability | 3093 |
Correlation | 3027 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 705 |
Practitioners | 449 |
Teachers | 206 |
Administrators | 122 |
Policymakers | 66 |
Counselors | 42 |
Students | 37 |
Parents | 11 |
Community | 7 |
Media Staff | 5 |
Support Staff | 5 |
More ▼ |
Location
Turkey | 1274 |
Australia | 432 |
Canada | 375 |
China | 346 |
United States | 268 |
United Kingdom | 250 |
Taiwan | 227 |
Indonesia | 223 |
Netherlands | 218 |
California | 212 |
Spain | 210 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 8 |
Meets WWC Standards with or without Reservations | 9 |
Does not meet standards | 6 |
Kaila L. Stipancic; Mojgan Golzy; Yunxin Zhao; Louise Pinkerton; Andrea Rohl; Mili Kuruvilla-Dugdale – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Auditory training has been shown to reduce rater variability in perceptual voice assessment. Because rater variability is also a central issue in the auditory-perceptual assessment of dysarthria, this study sought to determine if training produces a meaningful change in rater reliability, criterion validity, and scaling magnitude of four…
Descriptors: Auditory Training, Auditory Perception, Program Effectiveness, Speech Impairments
Pearson, Terry – FORUM: for promoting 3-19 comprehensive education, 2023
Ofsted has frequently defended the judgements made during inspections by claiming that inspection ratings are reliable, as shown by the results from the collection of studies the inspectorate has conducted. I outline the inspectorate's view of reliability and problematise the studies that it has carried out, noting that these provide insufficient…
Descriptors: Inspection, Interrater Reliability, Decision Making, Value Judgment
Süreyya Yörük; Sedat Sen – Creativity Research Journal, 2023
The Creative Achievement Questionnaire (CAQ) is widely used to measure the creative achievement levels of individuals. Previous studies reported a varying range of reliability coefficients for the CAQ. To this date, no study has investigated the variability in the reliability coefficients of the CAQ. A random-effects reliability generalization…
Descriptors: Reliability, Generalization, Meta Analysis, Creativity
Emma Healy – ProQuest LLC, 2024
The shortage of autism specialists and lack of culturally sensitive autism assessment tools are helping to perpetuate racial and ethnic disparities in autism identification and treatment. Using DisCrit as a framework, this quantitative study examined the utility of one autism assessment tool, the Social Responsiveness Scale, second edition (SRS-2)…
Descriptors: Autism Spectrum Disorders, Student Evaluation, Diagnostic Tests, Disability Identification
Kristen Bottema-Beutel; Shannon Crowley LaPoint; So Yoon Kim; Sarah Mohiuddin; Qun Yu; Rachael McKinnon – Exceptional Children, 2024
In this secondary analysis of a previously conducted systematic review, we analyze social validity assessments in intervention research for transition-age autistic youth. Social validity is concerned with the acceptability of the intervention goals, the acceptability and feasibility of the intervention procedures, and the perceived importance of…
Descriptors: Autism Spectrum Disorders, Intervention, Validity, Psychometrics
Jiayu Zhai; Vahid Aryadoust – Metacognition and Learning, 2024
Metacognitive awareness is essential in regulating second language (L2) listening and has been predominantly assessed by a multidimensional instrument named the Metacognitive Awareness Listening Questionnaire (MALQ). Since previous studies have yielded inconclusive evidence concerning the generalization of MALQ, it is important to examine the…
Descriptors: Metacognition, Second Language Learning, Listening, Test Reliability
Amanda M. Snyder – ProQuest LLC, 2024
The ever-changing advances in technology require digital literacy skills for success in the workplace. To determine the critical digital literacy skills needed in the workplace today, the development of a reliable, valid instrument occurred using the nine steps of scale development by DeVellis and Thorpe (2021). Based on the SkillRise (2020a)…
Descriptors: Digital Literacy, Measures (Individuals), Job Skills, Test Reliability
Paul Alexander Siegel – ProQuest LLC, 2024
While multimodality and multiliteracies has been a concept for 25 years (Kalantzis & Cope, 2023; The New London Group, 1996), research on and application of the concept within text complexity measures has been limited. Attempts to assess multiliteracies and multimodality (Jacobs, 2013; Schmerbeck & Lucht, 2017; Wyatt-Smith & Kimber,…
Descriptors: Multiple Literacies, Learning Modalities, Test Validity, Test Reliability
Michael T. Kalkbrenner – Measurement and Evaluation in Counseling and Development, 2024
The purpose of this instructional piece was to provide a nontechnical synthesis of common internal consistency reliability estimates used in professional counseling and in related fields. The article begins with an overview of coefficients alpha, omega, omega hierarchical, and H, with guidelines for their selection. Next, I provide recommendations…
Descriptors: Reliability, Counseling, Cutting Scores, High Stakes Tests
Huscroft-D'Angelo, Jacqueline; Wery, Jessica; Martin, Jodie Diane; Pierce, Corey; Crawford, Lindy – Behavioral Disorders, 2021
"The Scales for Assessing Emotional Disturbance--Third Edition Rating Scale" (SAED-3 RS; Epstein et al.) is a standardized, norm-referenced measure designed to aid in the identification process by providing useful data to professionals determining eligibility of students with an emotional disturbance (ED). Three studies are reported to…
Descriptors: Measures (Individuals), Emotional Disturbances, Test Reliability, Interrater Reliability
Todaro, Francesca; Pizzorni, Nicole; Scarponi, Letizia; Ronzoni, Clara; Huckabee, Maggie-Lee; Schindler, Antonio – International Journal of Language & Communication Disorders, 2021
Background: The Test of Masticating and Swallowing Solids (TOMASS) is an international standardized swallowing assessment tool. However, its psychometric characteristics have not been analysed in patients with dysphagia. Aims: To analyse TOMASS's (1) inter- and intra-rater reliability in a clinical population of patients with dysphagia, (2)…
Descriptors: Physical Disabilities, Test Reliability, Test Validity, Standardized Tests
Anthony S. Bryk; Angel Yee-Lam Li; Stuart Luppescu; Mai Anh Bui – Peabody Journal of Education, 2025
This is the second article in a series of three in this special issue on establishing a boundary object to foster network health and development. The first article laid out the theoretical rationale for an Improvement Network Health and Development Framework. This article details the efforts to develop a set of practical measures tied to this…
Descriptors: Validity, Networks, Measurement Techniques, Reliability
Alaa Eldin A. Ayoub; Muneera R. Ghablan; Eid G. Abo Hamza; Ahmed M. Abdulla Alabbasi – European Journal of STEM Education, 2025
This study describes the development of the science, technology, engineering, and mathematics (STEM) Scale, intended to assess parental attitudes toward school programs designed to deliver STEM, and evaluates its psychometric properties. The study group included 400 parents of students (138 males and 262 females) enrolled in STEM programs…
Descriptors: STEM Education, Test Construction, Parent Attitudes, Psychometrics
Todd Grindal; Sarah Nixon Gerard; Anne Partika; Nancy Perez; Gullnar Syed; Morgan Solender; Anna Mark – SRI Education, a Division of SRI International, 2025
Accurate, reliable, and scalable measurement of classroom quality represent a critical tool for ensuring that young children benefit from early learning programs. The Early Childhood Classroom Observation (ECCO) study was designed to better understand how video recordings can support high-quality measurement of pre-kindergarten (pre-K) classrooms,…
Descriptors: Classroom Observation Techniques, Video Technology, Preschool Education, Reliability
Van Elsen, Joris; Faddar, Jerich; Appels, Lies; De Maeyer, Sven; Vanhoof, Jan; Van Petegem, Peter – School Effectiveness and School Improvement, 2023
In order to support research on school effectiveness, there is a need for valid and reliable instruments to assess policymaking capacities of schools. Increasingly, policymaking is seen as a shared responsibility of the entire pedagogical team of a school. In this article, data were analysed from a sample of 1,696 (care) teachers coordinators and…
Descriptors: Educational Policy, Policy Formation, Questionnaires, School Effectiveness