Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Georgios Zacharis; Stamatios Papadakis – Educational Process: International Journal, 2025
Background/purpose: Generative artificial intelligence (GenAI) is often promoted as a transformative tool for assessment, yet evidence of its validity compared to human raters remains limited. This study examined whether an AI-based rater could be used interchangeably with trained faculty in scoring complex coursework. Materials/methods:…
Descriptors: Artificial Intelligence, Technology Uses in Education, Computer Assisted Testing, Grading
Kristen Bottema-Beutel; Shannon Crowley LaPoint; So Yoon Kim; Sarah Mohiuddin; Qun Yu; Rachael McKinnon – Exceptional Children, 2024
In this secondary analysis of a previously conducted systematic review, we analyze social validity assessments in intervention research for transition-age autistic youth. Social validity is concerned with the acceptability of the intervention goals, the acceptability and feasibility of the intervention procedures, and the perceived importance of…
Descriptors: Autism Spectrum Disorders, Intervention, Validity, Psychometrics
Jiayu Zhai; Vahid Aryadoust – Metacognition and Learning, 2024
Metacognitive awareness is essential in regulating second language (L2) listening and has been predominantly assessed by a multidimensional instrument named the Metacognitive Awareness Listening Questionnaire (MALQ). Since previous studies have yielded inconclusive evidence concerning the generalization of MALQ, it is important to examine the…
Descriptors: Metacognition, Second Language Learning, Listening, Test Reliability
Amanda M. Snyder – ProQuest LLC, 2024
The ever-changing advances in technology require digital literacy skills for success in the workplace. To determine the critical digital literacy skills needed in the workplace today, the development of a reliable, valid instrument occurred using the nine steps of scale development by DeVellis and Thorpe (2021). Based on the SkillRise (2020a)…
Descriptors: Digital Literacy, Measures (Individuals), Job Skills, Test Reliability
Paul Alexander Siegel – ProQuest LLC, 2024
While multimodality and multiliteracies has been a concept for 25 years (Kalantzis & Cope, 2023; The New London Group, 1996), research on and application of the concept within text complexity measures has been limited. Attempts to assess multiliteracies and multimodality (Jacobs, 2013; Schmerbeck & Lucht, 2017; Wyatt-Smith & Kimber,…
Descriptors: Multiple Literacies, Learning Modalities, Test Validity, Test Reliability
Michael T. Kalkbrenner – Measurement and Evaluation in Counseling and Development, 2024
The purpose of this instructional piece was to provide a nontechnical synthesis of common internal consistency reliability estimates used in professional counseling and in related fields. The article begins with an overview of coefficients alpha, omega, omega hierarchical, and H, with guidelines for their selection. Next, I provide recommendations…
Descriptors: Reliability, Counseling, Cutting Scores, High Stakes Tests
Todd Grindal; Sarah Nixon Gerard; Anne Partika; Nancy Perez; Gullnar Syed; Morgan Solender; Anna Mark – SRI Education, a Division of SRI International, 2025
Accurate, reliable, and scalable measurement of classroom quality represents a critical tool for ensuring that young children benefit from early learning programs. The Early Childhood Classroom Observation (ECCO) study was designed to better understand how video recordings can support high-quality measurement of pre-kindergarten (pre-K) classrooms,…
Descriptors: Classroom Observation Techniques, Video Technology, Preschool Education, Reliability
Patricia Ayllón-Salas; Mirian Hervás-Torres; José L. Arco-Tirado; Francisco D. Fernández-Martín – Journal of Psychoeducational Assessment, 2025
Despite the evolution of the grit conceptualization over the years, the psychometric validity of scales and construct structure remain unclear. Consequently, this study aims to provide new evidence that broadens the current understanding of the grit's dimensional nature in the Spanish population by examining the psychometric properties of the…
Descriptors: Measures (Individuals), Undergraduate Students, Psychometrics, Factor Structure
Alaa Eldin A. Ayoub; Muneera R. Ghablan; Eid G. Abo Hamza; Ahmed M. Abdulla Alabbasi – European Journal of STEM Education, 2025
This study describes the development of the science, technology, engineering, and mathematics (STEM) Scale, intended to assess parental attitudes toward school programs designed to deliver STEM, and evaluates its psychometric properties. The study group included 400 parents of students (138 males and 262 females) enrolled in STEM programs…
Descriptors: STEM Education, Test Construction, Parent Attitudes, Psychometrics
Anthony S. Bryk; Angel Yee-Lam Li; Stuart Luppescu; Mai Anh Bui – Peabody Journal of Education, 2025
This is the second article in a series of three in this special issue on establishing a boundary object to foster network health and development. The first article laid out the theoretical rationale for an Improvement Network Health and Development Framework. This article details the efforts to develop a set of practical measures tied to this…
Descriptors: Validity, Networks, Measurement Techniques, Reliability
Cemre Yaren Güngörenler; Tülay Tarsuslu – Measurement in Physical Education and Exercise Science, 2025
The aim of this study is to investigate the test-retest reliability of the Closed Kinetic Chain Upper Extremity Stability Test (CKCUEST) and modified-CKCUEST in children aged 7-10 years and to compare the two test versions within the same group. The study was completed with fifty-three children. Average, normalized, and power scores were obtained…
Descriptors: Test Reliability, Physical Activities, Performance Tests, Children
Heena Suthar; Krisha Thiagarajah; Ibraheem Karaye; Zayra Teresa Lopez-Ixta; Trishnee Bhurosy – Journal of American College Health, 2025
Objective: To measure the interrater reliability of assessing the frequency of vegetable intake using mobile photos and descriptions. Design: Repeated measures design. Setting: A Midwestern university. Participants: Undergraduate students (N = 165). Measurable Outcome/Analysis: Number of times each of these vegetable subgroups were consumed daily:…
Descriptors: Interrater Reliability, Incidence, Food, Eating Habits
Sevinc Zeynep Kavruk; Figen Turan – Psychology in the Schools, 2025
This study adapts the "Scales for Identifying Gifted Students (SIGS-2)" into Turkish for use from preschool onward, specifically during the candidate nomination stage. Conducted with 974 parents (675 mothers, 299 fathers) of children aged 5-10, it employs Confirmatory Factor Analysis (CFA) to evaluate the scale's structure and…
Descriptors: Foreign Countries, Rating Scales, Academically Gifted, Psychometrics
Vanessa Gonçalves Coutinho de Oliveira; Letícia Colombo de Oliveira; Bruna Reclusa Martinez; Thiago Melo Malheiros de Souza; Nelson Carvas Junior; Liu Chiao Yi – Measurement in Physical Education and Exercise Science, 2025
The study aimed to analyze, synthesize, and investigate the measurement properties of clinical tests that assess foot posture in children and adolescents. The study included research published in scientific journals that analyzed the measurement properties of clinical tests, focusing on the validity, reliability, responsiveness, or specificity of…
Descriptors: Human Body, Human Posture, Children, Adolescents
Mutia Wati; Rahmah Johar; Marwan Ramli; Mailizar – SAGE Open, 2025
Learning behavior refers to students' preparedness to embrace various learning forms and techniques, encompassing skills, activities, creativity, and motivation. Positive learning behavior improves efficiency, discipline, and academic skills, while negative learning behavior results in a diminished grasp of the essence of learning and cultivates…
Descriptors: Test Validity, Questionnaires, Student Behavior, Test Reliability

