Publication Date
| In 2026 | 3 |
| Since 2025 | 656 |
| Since 2022 (last 5 years) | 3157 |
| Since 2017 (last 10 years) | 7398 |
| Since 2007 (last 20 years) | 15036 |
Descriptor
| Test Reliability | 15028 |
| Test Validity | 10265 |
| Reliability | 9757 |
| Foreign Countries | 7137 |
| Test Construction | 4821 |
| Validity | 4191 |
| Measures (Individuals) | 3876 |
| Factor Analysis | 3822 |
| Psychometrics | 3520 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1326 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 251 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedKrus, David J.; Blackman, Harold S. – Applied Measurement in Education, 1988
Test homogeneity and internal consistency reliability indices were developed on the basis of theoretical considerations of properties of hierarchical structures of data matrices. This reconceptualization, in terms of ordinal test theory, has potential for explication of the mutual relationship of test reliability and homogeneity. (TJH)
Descriptors: Equations (Mathematics), Statistics, Test Reliability, Test Theory
Peer reviewedBolton, Brian – Measurement and Evaluation in Counseling and Development, 1988
Examined reliability of United States Employment Service Interest Inventory (USES-II) by twice administering USES-II to 100 vocational rehabilitation clients. Retest reliability coefficients for the 12 scales of the USES-II ranged from .73 to .88 with a median of .83 for this population. Findings support use of USES-II in occupational exploration…
Descriptors: Interest Inventories, Psychometrics, Test Reliability, Vocational Rehabilitation
Peer reviewedFeldt, Leonard S. – Applied Measurement in Education, 1990
Sampling theory for the intraclass reliability coefficient, a Spearman-Brown extrapolation of alpha to a single measurement for each examinee, is less recognized and less cited than that of coefficient alpha. Techniques for constructing confidence intervals and testing hypotheses for the intraclass coefficient are presented. (SLD)
Descriptors: Hypothesis Testing, Measurement Techniques, Reliability, Sampling
The Descriptive Use of Absolute Differences between Pairs of Scores with a Common Mean and Variance.
Peer reviewedMcGraw, Kenneth O.; Wong, S. P. – Journal of Educational Statistics, 1994
Similarity between pairs of scores with a common mean and variance can be expressed in terms of the absolute differences between them. The distribution of such absolute differences between pairs of normally distributed scores with a common mean and variance is discussed, with procedures for calculating moments and areas within this distribution.…
Descriptors: Correlation, Reliability, Scores, Statistical Distributions
Peer reviewedAndrews, Christopher – CD-ROM Professional, 1991
Describes the mastering and replication process for CD-ROMs. Costs are discussed, the production cycle of a CD-ROM is explained, packaging is described, testing to increase the reliability of discs is discussed, and a directory of mastering and replication facilities is provided. (LRW)
Descriptors: Costs, Optical Data Disks, Production Techniques, Reliability
Peer reviewedLester, David – Omega: Journal of Death and Dying, 1991
Published Lester Attitude toward Death Scale for first time, together with data on its reliability and validity. Notes that scale is different from other fear of death scales in its use of scaled value approach that permits measure of inconsistency in attitudes. (Author)
Descriptors: Attitude Measures, Death, Test Reliability, Test Validity
Peer reviewedNichols, Paul; Kuehl, Barbara Jean – Applied Measurement in Education, 1999
An approach is presented that can predict internal consistency of cognitively complex assessments on two dimensions, those of adding tasks with similar or different solution strategies and adding test takers with different solution strategies. Data from the 1992 National Assessment of Educational Progress mathematics assessment are used to…
Descriptors: Cognitive Tests, Mathematics Tests, Prediction, Test Reliability
Peer reviewedLi, Mao-Neng Fred; Lautenschlager, Gary J. – Educational and Psychological Measurement, 1999
Describes a Statistical Analysis System (SAS) MACRO for computing various indices of interrater agreement, including a new generalizability coefficient, for categorical data in a single-facet, crossed design. (Author/SLD)
Descriptors: Classification, Generalizability Theory, Interrater Reliability, Qualitative Research
Peer reviewedGreen, Samuel B.; Hershberger, Scott L. – Structural Equation Modeling, 2000
Proposes true score models that can account for correlated errors and their effect on coefficient alpha. These models allow random measurement errors on earlier items to affect directly or indirectly the scores on later items. Conditions under which coefficient alpha may yield spuriously high estimates or reliability are discussed. (SLD)
Descriptors: Correlation, Error of Measurement, Reliability, True Scores
Peer reviewedStephenson, Agnes S.; Elmore, Patricia B.; Evans, John Andrew – Measurement and Evaluation in Counseling and Development, 2000
Standard-setting techniques concerning testing, comparisons of these techniques, and methods for assessing interrater and intrarater reliability are described. These techniques are discussed in relation to counselor education programs. (Author/MKA)
Descriptors: Counselor Training, Higher Education, Reliability, Standards
Peer reviewedSawilowsky, Shlomo S. – Educational and Psychological Measurement, 2000
Reviews issues, raised in part because of "Educational and Psychological Measurement" (EPM) policies, regarding "test reliability," which is psychometric terminology, and "score reliability," score-centric terminology. Discusses datametrics and provides a critique of T. Vacha-Haase's proposed meta-analytic reliability generalization via…
Descriptors: Generalization, Meta Analysis, Psychometrics, Reliability
Peer reviewedThompson, Bruce; Vacha-Haase, Tammi – Educational and Psychological Measurement, 2000
Responds to criticisms of some "Educational and Psychological Measurement" policies and the reliability generalization meta-analytic methods of T. Vacha-Haase. Explores consequences of misunderstanding score reliability. (SLD)
Descriptors: Generalization, Meta Analysis, Psychometrics, Reliability
Peer reviewedSawilowsky, Shlomo S. – Educational and Psychological Measurement, 2000
B. Thompson and T. Vacha-Haase have examined the statement "the reliability of the test" with emphasis on the following three words: (1) the first "the"; (2) "test"; and (3) the second "the." This discussion focuses instead on the word "reliability." (Author)
Descriptors: Generalization, Meta Analysis, Psychometrics, Reliability
Peer reviewedStone, Gregory E. – Popular Measurement, 2000
Discusses the fundamental qualities that must be used to judge the merit of performance standards: validity, reliability, and genuineness. The Rasch-based objective model has demonstrated these qualities, but other models should also be evaluated for these characteristics. (SLD)
Descriptors: Models, Performance Based Assessment, Reliability, Standards
Peer reviewedGoodwin, Laura D.; Goodwin, William L. – School Psychology Quarterly, 1999
Presents frequently encountered measurement misconceptions and various measurement "rules." Origins of the misconceptions and rules are described, along with the reasons why they are problematic. Alternate approaches or considerations are given. Misconceptions discussed pertain to the estimation of internal consistency reliability and item…
Descriptors: Factor Analysis, Measures (Individuals), Psychology, Reliability


