Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Noller, Christine; Berry, David C. – Athletic Training Education Journal, 2020
Context: Health care organizations are integrating employee training and educational programs to designate themselves as high-reliability organizations (HROs). HROs continually strive to evaluate and create an environment in which potential problems are anticipated, detected early, and virtually always responded to early enough to prevent…
Descriptors: Athletics, Allied Health Occupations Education, Reliability, Health Services
Forthmann, Boris; Paek, Sue Hyeon; Dumas, Denis; Barbot, Baptiste; Holling, Heinz – British Journal of Educational Psychology, 2020
Background: The originality of divergent thinking (DT) production is one of the most critical indicators of creative potential. It is commonly scored using the statistical infrequency of responses relative to all responses provided in a given sample. Aims: Response frequency estimates vary in terms of measurement precision. This issue has been…
Descriptors: Creative Thinking, Creativity Tests, Item Response Theory, Scores
Monfort-Pañego, Manuel; Miñana-Signes, Vicente – Measurement in Physical Education and Exercise Science, 2020
The purpose of this study was to develop a questionnaire to assess body posture habits in adolescents' daily activities. To develop and assess the instrument we used the Delphi method, and a test-retest reliability design. The questionnaire consisted of 31 questions with a 4-level Likert scale. One hundred and sixty-eight students were studied, 72…
Descriptors: Psychometrics, Content Validity, Questionnaires, Human Body
Collins, Peter J.; Hahn, Ulrike – Journal of Experimental Psychology: Learning, Memory, and Cognition, 2020
We gain much of our knowledge from other people. Because people are fallible--they lie, mislead, and are mistaken--it seems essential to monitor their claims and their reliability as sources of information. An intuitive way to do this is to draw on our expectations about claims and sources: to perform expectation-based updating (Hahn, Merdes,…
Descriptors: Pragmatics, Reliability, Trust (Psychology), Cooperation
Joseph, Gail; Soderberg, Janet S.; Stull, Sara; Cummings, Kevin; McCutchen, Deborah; Han, Rachel J. – Early Education and Development, 2020
Research Findings: This study explores the inter-rater reliability of WaKIDS, Washington State's kindergarten entry assessment (KEA). Specifically, we analyze (1) the extent to which teachers' assessments are in agreement with a master code, (2) how often inaccurate assessment decisions lead to misidentification of school readiness, and (3)…
Descriptors: Interrater Reliability, School Readiness, Kindergarten, Evaluation Problems
Chiu, Loren Z. F.; Daehlin, Torstein E. – Measurement in Physical Education and Exercise Science, 2020
Males (n = 29) and females (n = 34) performed vertical jumps. Jump height was estimated from force platform data using five numerical methods and compared using intraclass correlation (ρ), and linear and rank regression standard error of estimate (SEE). Take-off velocity plus center of mass height at take-off and mechanical work…
Descriptors: Physical Activities, Scientific Concepts, Computation, Motion
Umucu, Emre; Wu, Jia-Rung; Sanchez, Jennifer; Brooks, Jessica M.; Chiu, Chung-Yi; Tu, Wei-Mo; Chan, Fong – Journal of American College Health, 2020
Objective: The current study aims to validate the PERMA-Profiler, a well-known well-being measure, among a sample of student veterans. Participants: A sample of 205 student veterans were recruited from universities across the United States. Method: Cross-sectional research design was used in this study. Measurement structure of the PERMA-Profiler…
Descriptors: Test Validity, Measures (Individuals), Well Being, Veterans
Özdemir, Ali Selman; Durhan, Tebessüm Ayyildiz; Akgül, Beyza Merve – Asian Journal of Education and Training, 2020
The aim of this study is to provide the validity and reliability analysis of the "Serious Leisure Inventory and Measurement (Short Form)" (SLIM) and introduce it to the literature. Data were obtained from 285 university students, and the KMO-Bartlett test was performed to check the sample size (0.89; 2506.309, p < 0.001). A three subdimensions…
Descriptors: Test Validity, Test Reliability, Leisure Time, College Students
Hong, Maxwell; Steedle, Jeffrey T.; Cheng, Ying – Educational and Psychological Measurement, 2020
Insufficient effort responding (IER) affects many forms of assessment in both educational and psychological contexts. Much research has examined different types of IER, IER's impact on the psychometric properties of test scores, and preprocessing procedures used to detect IER. However, there is a gap in the literature in terms of practical advice…
Descriptors: Responses, Psychometrics, Test Validity, Test Reliability
Goldhaber, Dan; Grout, Cyrus; Wolf, Malcom; Martinkova, Patricia – National Center for Analysis of Longitudinal Data in Education Research (CALDER), 2020
There is growing interest in using measures of teacher applicant quality to improve hiring decisions, but the statistical properties of such measures are poorly understood. We present evidence on structured ratings solicited from teacher applicants' references. We find that the reference ratings capture only one underlying dimension of applicant…
Descriptors: Job Applicants, Teacher Selection, Interrater Reliability, Decision Making
Chalmers, R. Philip – Educational and Psychological Measurement, 2018
This article discusses the theoretical and practical contributions of Zumbo, Gadermann, and Zeisser's family of ordinal reliability statistics. Implications, interpretation, recommendations, and practical applications regarding their ordinal measures, particularly ordinal alpha, are discussed. General misconceptions relating to this family of…
Descriptors: Misconceptions, Test Theory, Test Reliability, Statistics
McNeish, Daniel; Dumas, Denis – Journal of Educational Measurement, 2018
Dynamic measurement modeling (DMM) is a recent framework for measuring developing constructs whose manifestation occurs after an assessment is administered (e.g., learning capacity). Empirical studies have suggested that DMM may improve consequential validity of test scores because DMM learning capacity estimates were shown to be much less related…
Descriptors: Measurement Techniques, Test Reliability, Accuracy, Computation
María del Carmen García-Mendoza; Águeda Parra Jiménez; Enrique Bernardino Arranz Freijo; Jeffrey Arnett; Inmaculada Sánchez Queija – International Journal of Behavioral Development, 2024
During emerging adulthood, family relationships remain salient. This study examined, from a gender perspective, continuity/discontinuity and stability/instability in family relationships, in a two-time repeated-measures study with Spanish emerging adult college students. It also analyzed the implications of the quality of parent–child…
Descriptors: Foreign Countries, College Students, Young Adults, Family Relationship
Ken Ardon – Pioneer Institute for Public Policy Research, 2024
This paper reviews overall student performance as well as the performance of student subgroups on the assessment system developed in response to the Massachusetts Education Reform Act of 1993 (MERA), the Massachusetts Comprehensive Assessment System (MCAS). Comparing students in Massachusetts to students in the rest of the United States or against…
Descriptors: Accuracy, Test Reliability, Elementary Secondary Education, Achievement Tests
Luis J. Martín-Antón; Juan A. Valdivieso; Juan-Carlos García-Alonso; Miguel Angel Carbonero-Martín; María-Consuelo Saíz-Manzanares – Social Psychology of Education: An International Journal, 2024
Evaluating teachers' social-emotional competence is key to studying the effectiveness of education systems. This competence tends to be measured through self-reports, which might lead to a distorted vision. As an alternative, situational judgement tests have emerged. The present work seeks to adapt the Test of Regulation in and Understanding of…
Descriptors: Teacher Evaluation, Social Emotional Learning, Teacher Competencies, Spanish

