Publication Date
| In 2026 | 6 |
| Since 2025 | 481 |
| Since 2022 (last 5 years) | 1960 |
| Since 2017 (last 10 years) | 4532 |
| Since 2007 (last 20 years) | 7017 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10022 |
| Test Construction | 4374 |
| Foreign Countries | 3840 |
| Psychometrics | 2435 |
| Factor Analysis | 2302 |
| Measures (Individuals) | 1787 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1264 |
| Factor Structure | 1249 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 840 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 163 |
| Spain | 131 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 103 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Test Service Bulletin, 1952
Some aspects of test reliability are discussed. Topics covered are: (1) how high should a reliability coefficient be?; (2) two factors affecting the interpretation of reliability coefficients--range of talent and interval between testings; (3) some common misconceptions--reliability of speed tests, part vs. total reliability, reliability for what…
Descriptors: Bulletins, Correlation, Scores, Statistical Analysis
Kennedy, Beth T. – 1972
Issues related to the evaluation of instructional programs developed under the auspices of the Southwest Educational Development Laboratory are briefly discussed. The Laboratory develops criterion-referenced tests which form an integral part of each instructional program. The importance of examining the reliability and validity of these tests is…
Descriptors: Criterion Referenced Tests, Evaluation Methods, Instructional Programs, Test Reliability
Ellis, E. N. – 1975
Concern over the reading and writing programs in Vancouver, British Columbia Schools culminated in the establishment in June 1974 of a Task Force on English. In response to the request from the Task Force for a survey of the writing ability of Grade 11 students, a committee of English Department Heads assisted in developing an instrument and the…
Descriptors: Essay Tests, Grade 11, Scoring, Secondary Education
Peer reviewedNeill, John A.; Jackson, Douglas N. – Educational and Psychological Measurement, 1976
Illustrates a multivariate approach to item analysis. Previous formulation is extended by investigating techniques simultaneously taking into account scale variance with the goal of reducing the average correlation between scales. Study examines problems in determining optimum values for combinations of item parameters selected for personality…
Descriptors: Correlation, Factor Structure, Item Analysis, Personality Measures
Peer reviewedCarroll, C. Dennis – Educational and Psychological Measurement, 1976
A computer program for item evaluation, reliability estimation, and test scoring is described. The program contains a variable format procedure allowing flexible input of responses. Achievement tests and affective scales may be analyzed. (Author)
Descriptors: Achievement Tests, Affective Measures, Computer Programs, Item Analysis
Peer reviewedBrennan, Robert L. – Educational and Psychological Measurement, 1975
Variance components from split-plot factorial design (SPF) were used to estimate reliability for schools and persons within schools. Reliability for persons within SPF and randomized block design (RB) schools were compared and reliability for SPF and RB design schools were compared. (Author/BJG)
Descriptors: Analysis of Variance, Evaluation Methods, Schools, Statistical Analysis
Peer reviewedMehrabian, Albert; Hines, Melissa – Educational and Psychological Measurement, 1978
Reliability and validity data are reported for a questionnaire measure of individual differences in dominance-submissiveness. The 48-item questionnaire, which was balanced for response bias, had high internal consistency and correlated highly with other available measure of dominance. (Author/JKS)
Descriptors: Higher Education, Individual Characteristics, Personality Measures, Questionnaires
Peer reviewedHuynh, Huynh – Psychometrika, 1978
The use of Cohen's kappa index as a measure of the reliability of multiple classifications is developed. Special cases of the index as well as the effects of test length on the index are also explored. (JKS)
Descriptors: Career Development, Classification, Mastery Tests, Test Length
Peer reviewedKrashen, Stephen D. – Language Learning, 1978
Cites evidence showing that the "natural order" found using the Bilingual Syntax Measure to measure morpheme order is not an artifact of the test. (Author/AM)
Descriptors: Language Acquisition, Language Tests, Morphemes, Second Language Learning
Peer reviewedHuck, Schuyler W. – Educational and Psychological Measurement, 1978
A modification of Hoyt's analysis of variance model for test analysis was proposed by Lu. A difficulty that may be encountered in using Lu's modification is examined, and a solution is proposed. (JKS)
Descriptors: Analysis of Variance, Difficulty Level, Item Analysis, Test Items
Peer reviewedSchulman, Robert S. – Psychometrika, 1978
Ordinal measurement is the rank ordering of individuals in a population. For ordinal measurement, the concept of an individual propensity distribution is his or her true score. Estimation of, as well as other aspects of the distribution, are discussed. (Author/JKS)
Descriptors: Correlation, Measurement, Nonparametric Statistics, Probability
Peer reviewedWilliams, Richard H.; Zimmerman, Donald W. – Educational and Psychological Measurement, 1977
The usual formulas for the reliability of differences between two test scores are based on the assumption that the error scores are uncorrelated. Formulas are presented for the general case where this assumption is unnecessary. (Author/JKS)
Descriptors: Correlation, Error of Measurement, Error Patterns, Scores
Peer reviewedJackson, Paul H.; Agunwamba, Christian C. – Psychometrika, 1977
Finding and interpreting lower bounds for reliability coefficients for tests with nonhomogenous items has been a problem for psychometricians. This paper presents a mathematical formula for finding the greatest lower bound for such a coefficient. (Author/JKS)
Descriptors: Comparative Analysis, Mathematical Models, Measurement, Test Interpretation
Peer reviewedKuncel, Ruth Boutin – Educational and Psychological Measurement, 1977
A method to improve personality inventory validity by presenting positively keyed items from a single trait in descending order of endorsement and encouraging subjects to respond in terms of the resulting Guttman scale is presented. Experimental results show increases in reliability and validity. (Author/JKS)
Descriptors: Adults, Item Analysis, Personality Measures, Postsecondary Education
Peer reviewedParish, Thomas S.; Eads, Gerald M. – Educational and Psychological Measurement, 1977
The Personal Attribute Inventory was compared to the Adjective Check List as a measure of self concept on a sample of college students. The Personal Attribute Inventory showed strong correlation to two subscales of the Adjective Check List, and more stability over time. (JKS)
Descriptors: College Students, Higher Education, Self Concept Measures, Test Reliability


