Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Fangxing Bai; Ben Kelcey; Amota Ataneka; Yanli Xie; Kyle Cox; Nianbo Dong – Society for Research on Educational Effectiveness, 2024
Purpose: Multisite mediation studies are a cornerstone in mapping out developmental processes because they probe the mechanisms of a treatment while creating key opportunities to learn from and about variation in those mechanisms across sites. Despite the prevalence of multisite studies, a significant gap in the literature is how to plan such…
Descriptors: Randomized Controlled Trials, Mediation Theory, Statistical Analysis, Robustness (Statistics)
John Gero; Julie Milovanovic – Creativity Research Journal, 2024
In this paper, we explore measurements of design creativity through metrics related to the processes used in designing and relate them to the metrics used in psychology for idea creativity, i.e., novelty and fluency. Our goal was to test the reliability of psychometric measures of creativity to assess creativity in team design. We studied 19 teams…
Descriptors: Correlation, Creativity, Psychology, Psychometrics
Janis, Ilyana – Field Methods, 2022
Dependability (also known as consistency) is one of four criteria in rigor and trustworthiness in qualitative research. In this article, the process of establishing consistency is discussed through the lenses of constructivism and interpretivism, as the observed social reality is viewed as epistemologically counter-intuitive. Two strategies were…
Descriptors: Reliability, Qualitative Research, Case Studies, Data Collection
Goldfarb, Jake H.; Orpella, Joan; Jackson, Eric S. – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Most neural and physiological research on stuttering focuses on the fluent speech of speakers who stutter due to the difficulty associated with eliciting stuttering reliably in the laboratory. We previously introduced an approach to elicit stuttered speech in the laboratory in adults who stutter. The purpose of this study was to determine…
Descriptors: Adolescents, Children, Stuttering, Laboratory Experiments
Jordan, Altricia – ProQuest LLC, 2023
Data science, as a discipline, can be used in any area. However, to utilize data science techniques, data scientists must be taught domain knowledge, referred to as a partner discipline, in the area in which the techniques are to be utilized. Using a quantitative analysis of publicly available information and survey methodology, this…
Descriptors: Data Science, Training, Scientists, Reliability
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023
Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…
Descriptors: Test Reliability, Achievement Tests, Computation, Test Items
Xiong, Yao; Schunn, Christian D.; Wu, Yong – Journal of Computer Assisted Learning, 2023
Background: For peer assessment, reliability (i.e., consistency in ratings across peers) and validity (i.e., consistency of peer ratings with instructors or experts) are frequently examined in the research literature to address a central concern of instructors and students. Although the average levels are generally promising, both reliability and…
Descriptors: Peer Evaluation, Computer Assisted Testing, Test Reliability, Test Validity
Abbas, Mohsin; van Rosmalen, Peter; Kalz, Marco – IEEE Transactions on Learning Technologies, 2023
For predicting and improving the quality of essays, text analytic metrics (surface, syntactic, morphological, and semantic features) can be used to provide formative feedback to the students in higher education. In this study, the goal was to identify a sufficient number of features that exhibit a fair proxy of the scores given by the human raters…
Descriptors: Feedback (Response), Automation, Essays, Scoring
Aktas, Fatma Nur – Acta Didactica Napocensia, 2023
This phenomenological study aims to examine prospective elementary mathematics teachers' proving and proof evaluation and their thoughts on convincing according to proof type and argument type. The participants were eight prospective teachers. The data collection tools were semi-structured group interviews, interview video recordings and the…
Descriptors: Persuasive Discourse, Mathematical Logic, Logical Thinking, Visual Aids
Pin, Tamis W.; So, Vincent K. K.; Siu, Cynthia S. H.; Yip, Sheila S. N.; Cheung, Stella See-wing; Kan, Jenny Yim-mui – Journal of Autism and Developmental Disorders, 2021
This study examined the reliability and validity of the new Social Motor Function Classification System for Children with Autism Spectrum Disorders (SMFCS-ASD). The SMFCS-ASD reliability was examined on 25 children (62.4 months, SD 7.8) with ASD among six physical therapists. The validity study involved 1001 children (57.0 months, SD 9.9) with ASD using the…
Descriptors: Autism, Pervasive Developmental Disorders, Children, Classification
Özaydin, Zeynep; Arslan, Çigdem – Journal of Theoretical Educational Science, 2022
The aim of this study is to develop a rubric to assess mathematical reasoning competence. Since the aim is to assess a competency, the frameworks of the PISA exams in the literature, which give an important place to competencies, have been examined. Due to its focus and in-depth analysis of mathematical reasoning, each of the actions expected from…
Descriptors: Foreign Countries, Scoring Rubrics, Mathematical Logic, Competence
Osman Birgin; Elif Seval Peker – Psychology in the Schools, 2025
The aim of this study was to develop an instrument for assessing sixth-grade students' number sense skills in fractions and decimals. This study was conducted on 452 sixth graders (10-11 years old) from the western region of Turkey. The construct validity of the number sense test (NST) was examined via exploratory factor analysis (EFA) and…
Descriptors: Foreign Countries, Grade 6, Test Construction, Mathematics Education
Mehmet Emin Ören; Servet Atik – International Journal of Assessment Tools in Education, 2025
This study aimed to adapt the DigiFuehr 2.0 Scale developed by Claassen et al. (2023) to Turkish and to conduct validity and reliability studies on three groups of participants consisting of teachers. In the study, exploratory and confirmatory factor analyses were performed in line with translation study, linguistic application, and…
Descriptors: Test Reliability, Test Validity, Test Construction, Translation
Hongwei Yang; Müslim Alanoglu; Songül Karabatak; Kelly D. Bradley – International Journal of Assessment Tools in Education, 2025
The study took a Rasch measurement theory approach to validating the 10-item Digital Literacy Scale (DLS) using the unidimensional rating scale model (RSM). To that end, the study used the data from a sample of online Turkish university students. The study began the Rasch analysis with all 10 items in the scale and, to improve in the local…
Descriptors: Digital Literacy, Measures (Individuals), Test Validity, Foreign Countries
Joseph F. Mirabelli; Eileen M. Johnson; Sara R. Vohra; Jeanne L. Sanders; Karin J. Jensen – International Journal of STEM Education, 2025
Background: Undergraduate engineering students report increased rates of mental health distress. Evidence suggests that these students experience high stress, which can perpetuate mental health challenges. Further, engineering students may engage in help-seeking and self-care activities more rarely than students in other disciplines. We…
Descriptors: Undergraduate Students, Engineering Education, Mental Health, Stress Variables
