Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedKraemer, Helena Chmura – Psychometrika, 1979
Coefficient Kappa is generally defined in terms of procedures of computation rather than in terms of a population. Here a population definition is proposed. Factors influencing the magnitude of Kappa are identified. Strategies to improve reliability are proposed, including that of combining multiple unreliable diagnoses. (Author/CTM)
Descriptors: Correlation, Models, Reliability, Research Design
Peer reviewedMilligan, Glenn W. – Psychometrika, 1981
A Monte Carlo evaluation of 30 internal criteria for cluster analysis was conducted using four hierarchical clustering techniques. The results indicated that a subset of internal criteria was identified which appear to be valid indices of correct cluster recovery. (Author/JKS)
Descriptors: Cluster Analysis, Criteria, Reliability, Validity
Peer reviewedten Berge, Jos M. F.; And Others – Psychometrika, 1981
Several algorithms for computing the greatest lower bound to reliability or the constrained minimum-trace communality solution in factor analysis have been developed. The convergence properties of these methods are examined. A uniqueness proof for the desired solution is offered. (Author/JKS)
Descriptors: Algorithms, Factor Analysis, Test Reliability
Peer reviewedSmith, Philip L. – Journal of Educational Measurement, 1981
This study explores a strategy for improving the stability of variance component estimates when only small samples are available, using a series of small, less complex generalizability (G) study designs as a surrogate for a single large design. (Author/BW)
Descriptors: Models, Reliability, Research Design, Sampling
Peer reviewedVegelius, Jan – Educational and Psychological Measurement, 1980
One argument against the G index is that, unlike phi, it is not a correlation coefficient; yet, G conforms to the Kendall and E-coefficient definitions. The G index is also equal to the Pearson product moment correlation coefficient obtained from double scoring. (Author/CP)
Descriptors: Correlation, Mathematical Formulas, Test Reliability
Peer reviewedHartmann, Donald P. – Educational and Psychological Measurement, 1976
In at least two situations, use of the Spearman-Brown prophesy formula yields overestimates of the interobserver reliability of composite scores. The appropriate formulas for estimating the interobserver reliability of composite scores, as well as efficient means of estimating the elements in these formulas are presented. (Author)
Descriptors: Correlation, Measurement Techniques, Observation, Reliability
Peer reviewedVacha-Haase, Tammi; Henson, Robin K.; Caruso, John C. – Educational and Psychological Measurement, 2002
Reliability generalization (RG) is a measurement meta-analytic method used to explore the variability in score reliability estimates and to characterize possible sources of the variance. Summarizes some RG considerations, and suggests how confidence intervals might be portrayed graphically. (SLD)
Descriptors: Generalization, Meta Analysis, Reliability, Scores
Peer reviewedHanson, William E.; Curry, Kyle T.; Bandalos, Deborah L. – Educational and Psychological Measurement, 2002
Used reliability generalization to study five versions of the Working Alliance Inventory (A. Horvath, 1981; WAI), analyzing 67 internal consistency estimates, 6 interrater reliability estimates, and 4 study characteristics. In general WAI scale scores appear to be robust. (SLD)
Descriptors: Generalization, Meta Analysis, Reliability, Scores
Peer reviewedSchuster, Christof – Journal of Educational and Behavioral Statistics, 2001
If two raters assign targets to categories, the ratings can be arranged in a two-dimensional contingency table. This article presents a model for the frequencies in such a contingency table for which Cohen's kappa is a parameter. Illustrates the model using data from a study of the psychobiology of depression. (Author/SLD)
Descriptors: Depression (Psychology), Interrater Reliability, Models
Peer reviewedBeretvas, S. Natasha; Pastor, Dena A. – Educational and Psychological Measurement, 2003
Describes how the assumptions underlying the use of multiple regression are not satisfied in reliability generalization studies and introduces mixed effects modeling to overcome many shortcomings of traditional approaches. Provides an example using results from the Beck Depression Inventory. (SLD)
Descriptors: Generalization, Models, Regression (Statistics), Reliability
Peer reviewedLevin, Joseph – Educational and Psychological Measurement, 1993
Average reliability of the coefficient is suggested as a multivariate extension of the classical reliability coefficient. It is shown that this coefficient is identical to the mean redundancy of the observed variables conditional on the true variables. (Author/SLD)
Descriptors: Correlation, Multivariate Analysis, Profiles, Reliability
Peer reviewedTarter, Ralph E.; And Others – Journal of Child and Adolescent Substance Abuse, 1994
Examines psychometric reliability of Drug Use Screening Inventory (DUSI) utilizing adolescents with DSM-III-R diagnosis of Psychoactive Substance Use Disorder. Concludes that split-half, internal, and test-retest reliability is superior. Suggests that DUSI may be useful for identifying and quantifying substance use and related problems. Includes…
Descriptors: Adolescents, Alcohol Abuse, Test Reliability
Peer reviewedten Berge, Jos M. F.; Hofstee, Willem K. B. – Psychometrika, 1999
H. Kaiser (1992) has shown that the sum of coefficients alpha of a set of principal components does not change when the components are transformed by an orthogonal rotation. In this paper, the rotational invariance and the successive alpha-optimality are integrated and generalized in a simultaneous approach. (SLD)
Descriptors: Factor Structure, Orthogonal Rotation, Reliability
Peer reviewedFeldt, Leonard S.; Ankenmann, Robert D. – Applied Psychological Measurement, 1998
Developed a graphical method of determining adequate sample size based on the power of L. Feldt's (1969) test of the difference between two values of Cronbach's alpha coefficient. Discusses assumptions on which this approach is based. (SLD)
Descriptors: Comparative Analysis, Reliability, Sample Size
Peer reviewedLunz, Mary E. – Popular Measurement, 1999
Describes a study of judge leniency and consistency using a Rasch approach and involving 4,683 candidates and 53 judges. (SLD)
Descriptors: Interrater Reliability, Judges, Longitudinal Studies


