Peer reviewed: VanLeeuwen, Dawn M. – Journal of Agricultural Education, 1997
Generalizability Theory can be used to assess reliability in the presence of multiple sources and different types of error. It provides a flexible alternative to Classical Theory and can handle estimation of interrater reliability with any number of raters. (SK)
Descriptors: Error of Measurement, Generalizability Theory, Interrater Reliability, Measurement Techniques
Peer reviewed: Tisak, John; Tisak, Marie S. – Applied Psychological Measurement, 1996
Dynamic generalizations of reliability and validity that will incorporate longitudinal or developmental models, using latent curve analysis, are discussed. A latent curve model formulated to depict change is incorporated into the classical definitions of reliability and validity. The approach is illustrated with sociological and psychological…
Descriptors: Definitions, Development, Longitudinal Studies, Models
Peer reviewed: Collins, Linda M. – Applied Psychological Measurement, 1996
The clarification provided by Williams and Zimmerman on the reliability of gain scores is translated into recognizable patterns of change that tend to produce reliable or unreliable gain scores. The relevance of the traditional idea of reliability to the measurement of change is also discussed. (SLD)
Descriptors: Achievement Gains, Change, Measurement Techniques, Reliability
Coscarelli, William; Shrock, Sharon – Performance Improvement Quarterly, 2002
Discusses problems in using traditional measures of reliability for criterion-referenced tests (CRTs) and describes two approaches to reliability for CRTs: estimates sensitive to all measures of error; and estimates of consistency in test outcome. Compares the two approaches and proposes recommendations for interpretation and use. (Author/LRW)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Measurement Techniques, Test Reliability
Peer reviewed: McCoach, D. Betsy; Siegle, Del – Educational and Psychological Measurement, 2002
Designed an instrument to measure adolescents' attitudes toward school and teachers, goal-valuation, motivation, and general academic self-perceptions. Tested the developed survey with samples of 146, 200, and 299 high school students. Findings suggest the scale has adequate construct validity, criterion-related validity, and internal consistency…
Descriptors: Adolescents, Attitude Measures, Low Achievement, Reliability
Peer reviewed: McLearen, Alix M. – Journal of Offender Rehabilitation
Compares detection rates of the Referral Decision Scale (RDS) with a short, officer-administered booking questionnaire at a low capacity jail. Although RDS produced a higher number of false positives, it correctly identified more mentally ill inmates than did the booking procedure. Results suggest that combining both instruments may provide the…
Descriptors: Disability Identification, Measures (Individuals), Mental Disorders, Prisoners
Peer reviewed: Forjaz, Maria Joao; Cano, Pedro Martinez; Cervera-Enguix, Salvador – American Journal of Family Therapy, 2002
Tests the properties of a Spanish version of the Family Adaptability and Cohesion Scales III (FACES III) in a Spanish sample. Confirmatory factor analyses replicated the factor structure of the original American version. Reliability coefficients were higher for the cohesion than the adaptability scale. Convergent and discriminant validity was…
Descriptors: Factor Analysis, Foreign Countries, Spanish, Test Reliability
Peer reviewed: Zea, Maria Cecilia; Asner-Self, Kimberly K.; Birman, Dina; Buki, Lydia P. – Cultural Diversity & Ethnic Minority Psychology, 2003
Two studies were conducted to develop and examine internal consistencies and validate the Abbreviated Multidimensional Acculturation Scale. Findings indicated good internal reliabilities for all 3 subscales. Adequate concurrent validity was established with length of residence in the United States. The scale also showed adequate convergent and…
Descriptors: Acculturation, Concurrent Validity, Hispanic Americans, Measures (Individuals)
Peer reviewed: Yuan, Ke-Hai; Bentler, Peter M. – Psychometrika, 2002
Examined the asymptotic distributions of three reliability coefficient estimates: (1) sample coefficient alpha; (2) reliability estimate of a composite score following factor analysis; and (3) maximal reliability of a linear combination of item scores after factor analysis. Findings show that normal theory based asymptotic distributions for these…
Descriptors: Estimation (Mathematics), Factor Analysis, Reliability, Robustness (Statistics)
Wang, Greg – Educational Technology, 2003
Discusses the money wasted on ineffective training programs and the resulting surge in interest among training professionals in conducting learning evaluation and return on investment (ROI) measurement. Describes Kirkpatrick's four-level evaluation concept; questions regarding measurement validity and reliability; and new developments in learning…
Descriptors: Educational Assessment, Evaluation Methods, Measurement Techniques, Models
Peer reviewed: Feldt, Leonard S. – Applied Measurement in Education, 2002
Considers the degree of bias in testlet-based alpha (internal consistency reliability) through hypothetical examples and real test data from four tests of the Iowa Tests of Basic Skills. Presents a simple formula for computing a testlet-based congeneric coefficient. (SLD)
Descriptors: Estimation (Mathematics), Reliability, Statistical Bias, Test Format
Peer reviewed: Henson, Robin K.; Thompson, Bruce – Measurement and Evaluation in Counseling and Development, 2002
T. Vacha-Haase (1998) proposed her "reliability generalization" methodology to characterize (a) typical score reliability for a measure across studies, (b) the variability of score reliabilities, and (c) what measurement protocol features predict the variability in score reliabilities across administration. The present article provides…
Descriptors: Error of Measurement, Generalization, Psychometrics, Research Methodology
Praeger, Charles E. – School Business Affairs, 2002
Discusses the advantages of metal building and roofing systems, especially the use of steel. Considers such factors as installation ease and design flexibility, reliability and durability, and cost-effectiveness. (PKP)
Descriptors: Cost Effectiveness, Elementary Secondary Education, Reliability, Roofing
Peer reviewed: Pfeiffer, Karin A.; Pivarnik, James M.; Womack, Christopher J.; Reeves, Mathew J.; Malina, Robert M. – Medicine & Science in Sports & Exercise, 2002
Investigated the reliability and validity of the Borg and OMNI rating of perceived exertion (RPE) scales in adolescent girls during treadmill exercise. Girls were randomly assigned to one of the RPE scales during various treadmill exercise conditions. Results indicated that the OMNI cycle pictorial scale was reliable and valid for use with…
Descriptors: Adolescents, Exercise Physiology, Females, Perception
Peer reviewed: Iwata, Brian A.; And Others – Journal of Applied Behavior Analysis, 1990
A measure, the Self-Injury Trauma Scale, is described for classifying and quantifying surface tissue damage. The scale permits differentiation of self-injurious behavior according to topography, location of the injury on the body, type of injury, number of injuries, and estimate of severity. High interrater reliability has been found. (Author/DB)
Descriptors: Developmental Disabilities, Measures (Individuals), Medical Evaluation, Reliability


