Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
[Peer reviewed] Barber, Jacques P.; Foltz, Carol; Weinryb, Robert M. – Journal of Counseling Psychology, 1998
An initial evaluation of the psychometric properties of the Central Relationship Questionnaire (CRQ) is presented. Central relationship patterns refer to people's characteristic ways of relating to others. Results indicate that the CRQ components could be differentiated into meaningful subscales. Validity, reliability, and specific suggestions for…
Descriptors: Cognitive Structures, Interpersonal Relationship, Psychometrics, Reliability
[Peer reviewed] Li, Heng; Wainer, Howard – Journal of Educational and Behavioral Statistics, 1997
Provides a general mathematical framework that can be specialized to four different reliability coefficients. Consideration of this general framework makes it easier to convey to students the individual character of the formulations of reliability and the extent of their underlying similarity. (SLD)
Descriptors: Mathematical Models, Reliability, Teaching Methods, Test Theory
Arnold, Margery E. – Research in the Schools, 1996
This paper explains how different factors affect classical reliability estimates, such as test-retest, interrater, internal consistency, and equivalent forms coefficients. The limitations of classical test theory are explored, and the advantages of generalizability theory are discussed. Concrete examples are used. (SLD)
Descriptors: Estimation (Mathematics), Generalizability Theory, Reliability, Test Theory
[Peer reviewed] Bandalos, Deborah L.; Enders, Craig K. – Applied Measurement in Education, 1996
Computer simulation indicated that reliability increased with the degree of similarity between underlying and observed distributions when the observed categorical distribution was deliberately constructed to match the shape of the underlying distribution of the trait being measured. Reliability also increased with correlation among variables and…
Descriptors: Computer Simulation, Correlation, Likert Scales, Reliability
[Peer reviewed] Hogan, Thomas P.; Benjamin, Amy; Brezinski, Kristen L. – Educational and Psychological Measurement, 2000
Examined the frequency of use of various types of reliability coefficients for a sample of 696 tests in a directory published by the American Psychological Association. Coefficient alpha was the overwhelming favorite, and several measures treated almost universally in textbooks were rarely or never used. Identifies problems encountered in the…
Descriptors: Reliability, Research Reports, Scores, Test Items
[Peer reviewed] Ford, Norma; Murphy, Gai – British Journal of Educational Technology, 2002
Evaluates a training Web site for professional development of health and safety enforcement officers in the United Kingdom and reviews the use of knowledge elicitation in work-based training in accident investigations. Results showed that the training was realistic and that the embedded discussion facility had the potential to improve enforcement…
Descriptors: Discussion, Professional Development, Realism, Reliability
[Peer reviewed] Buboltz, Walter C., Jr.; Thomas, Adrian; Donnell, Alison J. – Journal of Counseling & Development, 2002
Psychological reactance is an important construct for social scientists. The measure most often used to tap psychological reactance is the Therapeutic Reactance Scale (TRS). However, little research to date has examined the psychometric properties of the TRS. Eight hundred and eighty-three individuals completed the TRS and their responses were…
Descriptors: Factor Structure, Psychological Testing, Psychometrics, Test Reliability
[Peer reviewed] Sanderson, Patricia – Educational Research, 2000
Factor analysis of an initial sample of 368 11- to 16-year-olds and a test with 1,668 confirmed the reliability and validity of a dance attitude instrument. Two subscales, ballet and male dancers, produced valid measurements of attitudes, but dance performance and presentation scales were less reliable. (SK)
Descriptors: Adolescents, Attitudes, Dance, Foreign Countries
[Peer reviewed] Lindell, Michael K.; Brandt, Christina J.; Whitney, David J. – Applied Psychological Measurement, 1999
Proposes a revised index of interrater agreement for multi-item ratings of a single target. This index is an inverse linear function of the ratio of the average obtained variance to the variance of the uniformly distributed random error. Discusses the importance of sample size for the index. (SLD)
Descriptors: Error of Measurement, Interrater Reliability, Sample Size
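The verbal description in the abstract above (an inverse linear function of the ratio of the average obtained variance to the uniform-error variance) can be illustrated with a minimal sketch. It rests on assumptions not stated in the abstract: the index is taken to be one minus that ratio, the uniform-error variance is taken to be the variance of a discrete uniform distribution over the scale points, (A² − 1)/12, and sample variances are used. The function names are hypothetical.

```python
from statistics import mean, variance


def uniform_error_variance(num_options: int) -> float:
    """Variance of a discrete uniform distribution over num_options scale points."""
    return (num_options ** 2 - 1) / 12


def revised_agreement_index(ratings, num_options):
    """Agreement index for multi-item ratings of a single target.

    ratings: list of raters, each a list of item ratings (same item order).
    Assumed form: 1 - (average item variance across raters / uniform variance).
    """
    items = list(zip(*ratings))  # regroup the ratings item by item
    avg_obtained_var = mean(variance(item) for item in items)
    return 1 - avg_obtained_var / uniform_error_variance(num_options)


# Three raters, three items, 5-point scale (made-up illustrative data).
ratings = [
    [4, 5, 4],
    [4, 4, 5],
    [5, 4, 4],
]
index = revised_agreement_index(ratings, num_options=5)
print(round(index, 3))
```

Under these assumptions, identical ratings from all raters give an index of 1, and the index falls linearly as the average item variance approaches the uniform-error variance.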
[Peer reviewed] Musante, Linda; Treiber, Frank A.; Davis, Harry C.; Thompson, William O.; Waller, Jennifer L. – Assessment, 1999
Findings related to internal consistency, temporal stability, and principal components structures suggest that the Anger Expression Scale (C. Spielberger and others, 1985) and the Pediatric Anger Expression Scale (G. Jacobs and others, 1989), studied with a sample of 415 youth with a mean age of 14.7 years, are acceptably reliable. (SLD)
Descriptors: Adolescents, Anger, Factor Structure, Reliability
[Peer reviewed] Komaroff, Eugene – Applied Psychological Measurement, 1997
Evaluated coefficient alpha through simulation under violations of two classical test theory assumptions: essential tau-equivalence and uncorrelated errors. Discusses the interactive effects of both violations with true and error scores. Provides empirical evidence of the derivation of M. Novick and C. Lewis (1993). (SLD)
Descriptors: Correlation, Reliability, Simulation, Test Theory
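For readers unfamiliar with the statistic evaluated in the record above, the following is a minimal sketch of the standard coefficient alpha formula, k/(k − 1) × (1 − sum of item variances / variance of total scores). It is not the simulation design used in the study; the function name and the data are hypothetical.

```python
from statistics import variance


def coefficient_alpha(scores):
    """Coefficient alpha for a respondents-by-items score table.

    scores: list of respondents, each a list of k item scores.
    """
    k = len(scores[0])
    # Variance of each item across respondents.
    item_vars = [variance(column) for column in zip(*scores)]
    # Variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Two items that differ only by a constant (made-up illustrative data).
scores = [
    [1, 2],
    [2, 3],
    [3, 4],
]
alpha = coefficient_alpha(scores)
print(alpha)
```

In this toy data the two items are perfectly correlated with equal variances, so alpha reaches its maximum of 1; noisier or less homogeneous items would lower it.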
[Peer reviewed] Lehtokangas, Raija; Jarvelin, Kalervo – Journal of Documentation, 2001
Investigates the consistency of different newspapers in their choice of words when writing about the same news events based on a study of three Finnish newspapers. Concludes that expression inconsistency is a sign of a retrieval problem and that query expansion based on semantic relationships can significantly improve retrieval performance on free…
Descriptors: Foreign Countries, Information Retrieval, Newspapers, Reliability
[Peer reviewed] Vacha-Haase, Tammi; Kogan, Lori R.; Tani, Crystal R.; Woodall, Renee A. – Educational and Psychological Measurement, 2001
Used reliability generalization to explore the variance of scores on 10 Minnesota Multiphasic Personality Inventory (MMPI) clinical scales drawing on 1,972 articles in the literature on the MMPI. Results highlight the premise that scores, not tests, are reliable or unreliable, and they show that study characteristics do influence scores on the…
Descriptors: Clinical Diagnosis, Diagnostic Tests, Generalization, Reliability
Schuster, Christof; Smith, David A. – Psychometrika, 2005
The rater agreement literature is complicated by the fact that it must accommodate at least two different properties of rating data: the number of raters (two versus more than two) and the rating scale level (nominal versus metric). While kappa statistics are most widely used for nominal scales, intraclass correlation coefficients have been…
Descriptors: Psychometrics, Statistics, Rating Scales, Correlation
Hall, Kendra M.; Markham, Janet C.; Culatta, Barbara – Communication Disorders Quarterly, 2005
In the present study, the authors investigated the initial development of the Early Expository Comprehension Assessment (EECA) by examining its reliability. The EECA consists of a compare/contrast passage, manipulatives to represent the information in the paragraph, and three response tasks ("Retelling, Mapping, and Comparing"). The authors…
Descriptors: Statistical Analysis, Computation, Preschool Children, Test Reliability