Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedBlau, Gary J. – Journal of Vocational Behavior, 1988
Examined the reliability and validity of a career commitment measure using employees (N=266) of newspaper and insurance companies. Results showed career commitment could be reliably measured and was operationally distinct from job involvement and organizational commitment. Discusses findings in terms of meaning of career commitment. (Author/ABL)
Descriptors: Careers, Employees, Test Reliability, Test Validity
Peer reviewedBerry, Kenneth J.; Mielke, Paul W., Jr. – Educational and Psychological Measurement, 1988
Cohen's kappa statistic is frequently used to measure agreement between two observers using categorical polytomies. Cohen's statistic is: shown to be inherently multivariate in nature; expanded to analyze ordinal and interval data; and extended to over two observers. A non-asymptotic test of significance is provided for the generalized statistic.…
Descriptors: Equations (Mathematics), Interrater Reliability, Multivariate Analysis
Peer reviewedvan der Linden, Wim J.; Boekkooi-Timminga, Ellen – Applied Psychological Measurement, 1988
Gulliksen's matched random subtests method is a graphical method to split a test into parallel test halves, allowing maximization of coefficient alpha as a lower bound to the classical test reliability coefficient. This problem is formulated as a zero-one programing problem solvable by algorithms that already exist. (TJH)
Descriptors: Algorithms, Equations (Mathematics), Programing, Test Reliability
Peer reviewedMerriam, Sharan B. – PAACE Journal of Lifelong Learning, 1995
Deals with issues of validity and reliability in qualitative research in education. Discusses philosophical assumptions underlying the concepts of internal validity, reliability, and external validity or generalizability. Presents strategies congruent with a qualitative research perspective for ensuring the rigor and trustworthiness of findings.…
Descriptors: Educational Research, Qualitative Research, Reliability, Validity
Peer reviewedRosenthal, James A. – Social Work Research, 1994
Notes that conventional practice in social work research is to recommend level of reliability close to 0.80 as minimum standard. Contends that needed reliability varies by situation: that in situations in which important decisions about individuals are being made, 0.90 provides better standard; whereas in descriptive survey research with large…
Descriptors: Reliability, Research, Social Work, Statistical Analysis
Peer reviewedWoo-Kyoung, Ahn; And Others – Cognition, 1995
Presents a series of four studies testing the hypothesis that people seek out and prefer information about causal mechanisms rather than information about covariation. Concludes that people attempt to seek out causal mechanisms in developing a causal explanation for a specific event. (DR)
Descriptors: College Students, Information Seeking, Motivation, Reliability
Peer reviewedZimmerman, Donald W. – Journal of Educational Measurement, 1994
An alternative formula is presented for the reliability of a difference score that contains the correlation between true scores instead of the correlation between observed scores. This approach provides more useful information and yields values that are not as anomalous as those usually obtained. (SLD)
Descriptors: Correlation, Equations (Mathematics), Reliability, Research Methodology
Peer reviewedTraub, Ross E.; Rowley, Glenn L. – Educational Measurement: Issues and Practice, 1991
The idea of test consistency is illustrated, with reference to two sets of test scores. A mathematical model is used to explain the relative consistency and relative inconsistency of measurements, and a means of indexing reliability is derived using the model. Practical aspects of estimating reliability are considered. (TJH)
Descriptors: Mathematical Models, Test Reliability, True Scores
Peer reviewedRudner, Lawrence M. – Educational Measurement: Issues and Practice, 2001
Identifies and evaluates alternative methods for weighting tests. Presents formulas for composite reliability and validity as a function of component weights and suggests a rational process that identifies and considers trade-offs in determining weights. Discusses drawbacks to implicit weighting and explicit weighting and the difficulty of…
Descriptors: Reliability, Test Construction, Test Items, Validity
Peer reviewedLindell, Michael K. – Applied Psychological Measurement, 2001
Developed an index for assessing interrater agreement with respect to a single target using a multi-item rating scale. The variance of rater mean scale scores is used as the numerator of the agreement index. Studied four variants of a disattenuated agreement index that vary in the random response term used as the denominator. (SLD)
Descriptors: Evaluation Methods, Interrater Reliability, Rating Scales
Peer reviewedVacha-Haase, Tammi; Kogan, Lori R.; Thompson, Bruce – Educational and Psychological Measurement, 2000
Investigated how dissimilar in composition and variability samples inducting reliability coefficients from prior studies were from the cited prior samples from which coefficients were generalized. Results from 20 articles show that citing reliability coefficients from prior studies as the basis for concluding new scores are reliable is only…
Descriptors: Reliability, Sampling, Scores, Test Manuals
Peer reviewedFan, Xitao; Chen, Michael – Educational and Psychological Measurement, 2000
Provides a sample of seven published studies in different disciplines that inappropriately generalized reliability coefficients involving several raters to scores generated by a single rater. Score reliability when only one rater is used for scoring is lower than the score reliability for which two raters are used. (SLD)
Descriptors: Interrater Reliability, Research Reports, Scores, Scoring
Peer reviewedGump, Linda S.; Baker, Richard C.; Roll, Samuel – Adolescence, 2000
Describes the development of the Moral Justification Scale, an objective measure of justice and care orientations. The scale was administered to 100 college students. Results imply that the Moral Justification Scale shows promise as an easily administered, objectively scored measure of Giligan's constructs of care and justice. (Author/MKA)
Descriptors: Measures (Individuals), Moral Development, Reliability, Validity
Peer reviewedRaykov, Tenko – Multivariate Behavioral Research, 1997
The population discrepancy between Cronbach's Coefficient Alpha (L. Cronbach, 1951) and scale reliability with fixed congeneric measure, uncorrelated errors, and sampling of subjects was studied. The difference is expressed in terms of the individual component violations of the assumption of equal tau-equivalence that is necessary and sufficient…
Descriptors: Error of Measurement, Reliability, Sampling, Scaling
Peer reviewedBuhi, Eric R. – Journal of School Health, 2005
A number of school-based programs address sexual violence by focusing on adolescents' attitudes about rape or acceptance of rape myths. However, really problems exist in the literature regarding measurement of rape myth acceptance, including issues of reliability and validity. This paper addresses measurement reliability issues and reviews…
Descriptors: Mythology, Violence, Sexual Harassment, Reliability


