Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedTraub, Ross E.; Rowley, Glenn L. – Educational Measurement: Issues and Practice, 1991
The idea of test consistency is illustrated, with reference to two sets of test scores. A mathematical model is used to explain the relative consistency and relative inconsistency of measurements, and a means of indexing reliability is derived using the model. Practical aspects of estimating reliability are considered. (TJH)
Descriptors: Mathematical Models, Test Reliability, True Scores
Peer reviewedRudner, Lawrence M. – Educational Measurement: Issues and Practice, 2001
Identifies and evaluates alternative methods for weighting tests. Presents formulas for composite reliability and validity as a function of component weights and suggests a rational process that identifies and considers trade-offs in determining weights. Discusses drawbacks to implicit weighting and explicit weighting and the difficulty of…
Descriptors: Reliability, Test Construction, Test Items, Validity
Peer reviewedLindell, Michael K. – Applied Psychological Measurement, 2001
Developed an index for assessing interrater agreement with respect to a single target using a multi-item rating scale. The variance of rater mean scale scores is used as the numerator of the agreement index. Studied four variants of a disattenuated agreement index that vary in the random response term used as the denominator. (SLD)
Descriptors: Evaluation Methods, Interrater Reliability, Rating Scales
Peer reviewedVacha-Haase, Tammi; Kogan, Lori R.; Thompson, Bruce – Educational and Psychological Measurement, 2000
Investigated how dissimilar in composition and variability samples inducting reliability coefficients from prior studies were from the cited prior samples from which coefficients were generalized. Results from 20 articles show that citing reliability coefficients from prior studies as the basis for concluding new scores are reliable is only…
Descriptors: Reliability, Sampling, Scores, Test Manuals
Peer reviewedFan, Xitao; Chen, Michael – Educational and Psychological Measurement, 2000
Provides a sample of seven published studies in different disciplines that inappropriately generalized reliability coefficients involving several raters to scores generated by a single rater. Score reliability when only one rater is used for scoring is lower than the score reliability for which two raters are used. (SLD)
Descriptors: Interrater Reliability, Research Reports, Scores, Scoring
Peer reviewedGump, Linda S.; Baker, Richard C.; Roll, Samuel – Adolescence, 2000
Describes the development of the Moral Justification Scale, an objective measure of justice and care orientations. The scale was administered to 100 college students. Results imply that the Moral Justification Scale shows promise as an easily administered, objectively scored measure of Giligan's constructs of care and justice. (Author/MKA)
Descriptors: Measures (Individuals), Moral Development, Reliability, Validity
Peer reviewedRaykov, Tenko – Multivariate Behavioral Research, 1997
The population discrepancy between Cronbach's Coefficient Alpha (L. Cronbach, 1951) and scale reliability with fixed congeneric measure, uncorrelated errors, and sampling of subjects was studied. The difference is expressed in terms of the individual component violations of the assumption of equal tau-equivalence that is necessary and sufficient…
Descriptors: Error of Measurement, Reliability, Sampling, Scaling
Peer reviewedBuhi, Eric R. – Journal of School Health, 2005
A number of school-based programs address sexual violence by focusing on adolescents' attitudes about rape or acceptance of rape myths. However, really problems exist in the literature regarding measurement of rape myth acceptance, including issues of reliability and validity. This paper addresses measurement reliability issues and reviews…
Descriptors: Mythology, Violence, Sexual Harassment, Reliability
Zinbarg, Richard E.; Revelle, William; Yovel, Iftah; Li, Wen – Psychometrika, 2005
We make theoretical comparisons among five coefficients--Cronbach's [alpha], Revelle's [beta], McDonald's [omega][sub h], and two alternative conceptualizations of reliability. Though many end users and psychometricians alike may not distinguish among these five coefficients, we demonstrate formally their nonequivalence. Specifically, whereas…
Descriptors: Psychometrics, Test Reliability, Rating Scales, Scores
Raykov, Tenko; Marcoulides, George A. – Structural Equation Modeling: A Multidisciplinary Journal, 2006
A covariance structure modeling perspective on reliability estimation can be used to construct a formal approach to estimation of reliability in multilevel models. This article presents a didactic discussion of the relation between a structural modeling procedure for scale reliability estimation and the notion of reliability of observed means in…
Descriptors: Structural Equation Models, Reliability, Interdisciplinary Approach
Noreau, Luc; Lepage, Celine; Boissiere, Lucie; Picard, Roger; Fougeyrollas, Patrick; Mathieu, Jean; Desmarais, Gilbert; Nadeau, Line – Developmental Medicine & Child Neurology, 2007
The objectives of this study were: (1) to examine the psychometric properties of the Assessment of Life Habits (LIFE-H) for children; and (2) to draw a profile of the level of participation among children of 5 to 13 years of age with various impairments. The research team adapted the adult version of the LIFE-H in order to render it more…
Descriptors: Genetic Disorders, Head Injuries, Neurological Impairments, Measurement Techniques
Murdock, Linda C.; Cost, Hollie C.; Tieso, Carol – Focus on Autism and Other Developmental Disabilities, 2007
The "Social-Communication Assessment Tool" (S-CAT) was created as a direct observation instrument to quantify specific social and communication deficits of children with autism spectrum disorders (ASD) within educational settings. In this pilot study, the instrument's content validity and interrater reliability were investigated to determine the…
Descriptors: Nonverbal Communication, Autism, Content Validity, Test Validity
Beg, Mohsan R.; Casey, Joseph E.; Saunders, Cory D. – Assessment, 2007
The purpose of this study was to produce a typology of behavior problems in preschool children. Distinct subtypes were identified through the use of cluster analytic techniques on data from the Behavior Assessment System for Children (BASC)--Parent Rating Scales. Analyses were based on archival data collected on a sample of 268 children, aged 2 to…
Descriptors: Behavior Problems, Rating Scales, Classification, Preschool Children
Utley, Juliana – School Science and Mathematics, 2007
The purpose of this study was to develop and establish the validity and reliability of an instrument to measure students' attitudes toward geometry. Participants consisted of 264 undergraduate students from two universities, one in the Midwest and one in the Southwest. The instrument is a 5-point Likert-scaled survey consisting of 32 statements…
Descriptors: Undergraduate Students, Student Attitudes, Attitude Measures, Geometry
Roisman, Glenn I.; Fraley, R. Chris; Belsky, Jay – Developmental Psychology, 2007
This study is the first to examine the latent structure of individual differences reflected in the Adult Attachment Interview (AAI; C. George, N. Kaplan, & M. Main, 1985), a commonly used and well-validated measure designed to assess an adult's current state of mind regarding childhood experiences with caregivers. P. E. Meehl's (1995)…
Descriptors: Caregivers, Attachment Behavior, Individual Differences, Adults

Direct link
