Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedHumes, Larry E.; Kirn, Elizabeth U. – Journal of Speech and Hearing Disorders, 1990
The study examined the test-retest reliability in unaided and aided sound-field thresholds and the functional gain values derived from these measurements in 24 hearing-impaired adults. Test-retest standard deviations were slightly larger for functional gain than for unaided thresholds. (Author/DB)
Descriptors: Adults, Auditory Evaluation, Auditory Tests, Hearing Impairments
Peer reviewedHaith, Marshall M.; McCarty, Michael E. – Developmental Psychology, 1990
A total of 45 3-month-olds were observed for stability in forming visual expectations. Findings indicate that infant performance in the Visual Expectation Paradigm is reliable as early as 3 months. Individual differences exist in infants' tendency to form visual expectations. (RH)
Descriptors: Expectation, Individual Differences, Infants, Performance Factors
Peer reviewedJones, Russell A.; And Others – Multivariate Behavioral Research, 1989
The stability of dimensions extracted from a body of free response data was studied using 1,523 expressions of concern and questions raised by 271 elderly persons and analyzed by 2 groups of experimenters. The structures of resulting multidimensional configurations obtained by the 2 groups were identical. (SLD)
Descriptors: Data Analysis, Hypothesis Testing, Multidimensional Scaling, Older Adults
Peer reviewedHorowitz, Leonard M.; And Others – Journal of Consulting and Clinical Psychology, 1989
Developed method for aggregating psychodynamic formulations of independent clinicians. Panels of clinicians observed videotaped interviews of patients and wrote individual formulations which were combined into consensual formulation. Other clinical raters read each consensual formulation and judged whether each problem was apt to be distressing…
Descriptors: Clinical Diagnosis, Interpersonal Relationship, Interrater Reliability, Psychological Evaluation
Peer reviewedTsui, Anne S.; Ohlott, Patricia – Personnel Psychology, 1988
To test model of general managerial effectiveness, superiors (N=271), subordinates (N=605), and peers (N=469) rated 344 managers. Study designed to test three specific hypotheses on criterion type and criterion weights found consensus in effectiveness models of superiors, subordinates, and peers. Consensus among different raters was high on both…
Descriptors: Administrator Effectiveness, Congruence (Psychology), Evaluation Problems, Interrater Reliability
Peer reviewedCronbach, Lee J. – Psychometrika, 1988
A coefficient derived from communalities of test parts represents greatest lower bound to Guttman's "immediate retest reliability." Constrained minimum trace factor analysis allows a consistent estimate of the greatest defensible internal-consistency coefficient. In modest size samples, this analysis capitalizes on chance, suggesting an…
Descriptors: Estimation (Mathematics), Evaluation Methods, Factor Analysis, Psychometrics
Canady, Robert Lynn; Hotchkiss, Phyllis Riley – Phi Delta Kappan, 1989
Identifies counterproductive grading policies and practices, such as varying grading scales; worshipping averages; using zeros indiscriminantly; following the assign, test, grade, and teach pattern; failing to match testing to teaching; ambushing students; grading first efforts; establishing inconsistent criteria; and failing to recognize…
Descriptors: Elementary Secondary Education, Evaluation Criteria, Failure, Grading
Peer reviewedJefferson, T. R.; And Others – Psychometrika, 1989
The problem of scaling ordinal categorical data observed over two or more sets of categories measuring a single characteristic is addressed. Scaling is obtained by solving a constrained entropy model. A Kullback-Leibler statistic is generated that operationalizes a measure for the strength of consistency among the sets of categories. (TJH)
Descriptors: Classification, Entropy, Mathematical Models, Matrices
Peer reviewedCresswell, M. J. – Educational Review, 1988
The author suggests combining grades from component assessments to provide an overall student assessment. He explores the concept of reliability and concludes that the overall assessment will be reliable only if the number of grades used to report component achievements equals or exceeds the number used to report overall achievement. (Author/CH)
Descriptors: Evaluation Problems, Grades (Scholastic), Holistic Evaluation, Reliability
Peer reviewedFabbris, Luigi; Gallo, Francesca – Educational and Psychological Measurement, 1993
New coefficients of agreement are suggested for the measure of intraclass consistency between observations on two variables. The coefficients are derived from a general coefficient for measuring intraclass dependence in a bivariate analysis context. Various coefficients for the univariate agreement analysis are shown to be cases of the suggested…
Descriptors: Correlation, Equations (Mathematics), Interrater Reliability, Judges
Peer reviewedKrus, David J.; Helmstadter, Gerald C. – Educational and Psychological Measurement, 1993
Negative coefficients of reliability, sometimes returned by the standard formula for estimation of the internal-consistency reliability, are neither theoretically nor numerically correct. Alternative strategies for test development in this special case are suggested. (Author)
Descriptors: Estimation (Mathematics), Reliability, Test Construction, Test Use
Peer reviewedAgrawal, Divyakant; El Abbadi, Amr – Information Systems, 1995
Proposes a new lock primitive called ordered sharing that allows increased concurrency in database systems. Reliability and performance issues of the proposed protocol are addressed, a simulation study that demonstrates that ordered sharing results in improved performance in database systems is described; and use in several representative database…
Descriptors: Databases, Mathematical Formulas, Models, Performance
Peer reviewedHagner, David C.; Helm, David T. – Rehabilitation Counseling Bulletin, 1994
Outlines major features of qualitative research methods and rehabilitation research contexts for which these methods are particularly appropriate. Presents representative examples of qualitative rehabilitation research. Presents strategies for handling threats to reliability and validity within qualitative tradition and criteria for assessing…
Descriptors: Higher Education, Postsecondary Education, Qualitative Research, Rehabilitation
Peer reviewedKuder, Frederic – Educational and Psychological Measurement, 1991
Recommendations are made for the appropriate use and identification of traditional Kuder-Richardson formulas for the estimation of reliability. "Alpha" should be used for reliabilities estimated for tests or scales composed of items yielding scores distributed on more than two points. (SLD)
Descriptors: Estimation (Mathematics), Evaluation Methods, Mathematical Formulas, Scores
Peer reviewedCorty, Eric; And Others – Journal of Consulting and Clinical Psychology, 1993
Examined interrater reliability of diagnoses made on basis of structured interview for psychiatric patients with and without psychoactive substance use disorders (PSUDs). Results from 47 pairs of ratings by 9 clinical interviewers revealed that interrater reliability for non-PSUD psychiatric diagnoses was quite high when patient had no diagnosable…
Descriptors: Clinical Diagnosis, Interrater Reliability, Patients, Psychiatric Hospitals


