Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Turner, Carol J.; Record, Albert L. – Measurement and Evaluation in Guidance, 1981
Evaluated the reliability and convergent validity of three rating scales designed to measure self-concept clarity. The ratings of self, peer, and professional judges were compared. Results supported that a basic difference exists between a participant's self-reported constructs and an observer's judgment of that participant's constructs.…
Descriptors: Comparative Analysis, Evaluation Methods, Evaluators, Peer Evaluation
Peer reviewedJacobson, Neil S.; Moore, Danny – Journal of Consulting and Clinical Psychology, 1981
Examined the reliability of spouses as observers of the behaviors that occur in their own marital relationships. Distressed and nondistressed couples collected data in the home. Across the entire checklist, nondistressed couples exhibited significantly greater consensus than did distressed couples, based on both percentage agreement and kappa.…
Descriptors: Behavior Patterns, Check Lists, Congruence (Psychology), Data Collection
Peer reviewedDucroquet, Lucile – System, 1980
Examines objectivity as a criterion for excellency in language tests through a critical presentation of the characteristics of objective tests. This examination leads to a discussion of what the best approach to testing language might be, endorsing a combination of integrative and objective techniques. (Author/MES)
Descriptors: Evaluation, Language Tests, Objective Tests, Second Language Instruction
Peer reviewedCuellar, Israel; And Others – Hispanic Journal of Behavioral Sciences, 1980
Describes an acculturation scale for Mexican Americans which can be administered in English, Spanish, or both languages; is applicable with both normal and psychiatric populations; and differentiates five distinct types of Mexican Americans based on four major factors. Includes reliability and validity data and a copy of the scale. (Author/SB)
Descriptors: Acculturation, Clinical Psychology, English, Measures (Individuals)
Peer reviewedBradley, Fred O.; And Others – Journal of Consulting and Clinical Psychology, 1980
No WISC-R IQ scale is immune to serious scoring errors. Inspection of the standard deviations reveals that the score an examinee receives for a given performance on WISC-R content can easily vary by six to eight IQ points. (Author)
Descriptors: Children, Diagnostic Tests, Elementary Secondary Education, Error of Measurement
Peer reviewedAnderson, Jack D.; And Others – Journal of Speech and Hearing Disorders, 1980
A reliability coefficient of 0.91 was obtained between pre- and posttest administrations. Internal stability was highest for total score, the subtests measuring form tests and function words, and grammatical categories. Low coefficients were obtained for the subtests of morphological construction and syntactic structure. (Author/DLS)
Descriptors: Auditory Perception, Exceptional Child Research, Language Skills, Listening Comprehension
Peer reviewedGanopole, Selina J. – Journal of Educational Measurement, 1980
The fundamental Reading Competencies Test assesses proficiency with respect to well-defined functional reading skills of high school students. Validity and reliability data for the test are presented. (Author/JKS)
Descriptors: Criterion Referenced Tests, Functional Reading, High Schools, Minimum Competency Testing
Peer reviewedKlein, Alice E. – Educational and Psychological Measurement, 1979
The ability of the Stanford Early School Achievement Test (SESAT) to predict scores on the Stanford Achievement Test (SAT) was assessed, using pupils in grades 1 and 2 from a large midwest suburban school district. Observed SESAT-SAT correlations ranged from .257 to .723. Author/CTM)
Descriptors: Achievement Tests, Correlation, Grade 1, Grade 2
Peer reviewedReynolds, William M.; Greco, Victor T. – Educational and Psychological Measurement, 1980
Data from 182 teachers showed the Educational Attitude Survey to be a factorially valid and highly reliable measure. Its 16 items yield three scores: administrative aspects of mainstreaming, educational aspects, and total score. (Author/CP)
Descriptors: Attitude Measures, Elementary Secondary Education, Factor Structure, Mainstreaming
Peer reviewedCudeck, Robert; And Others – Applied Psychological Measurement, 1980
Tailored testing by Cliff's method of implied orders was simulated through the use of responses gathered during conventional administration of the Stanford-Binet Intelligence Scale. Tailoring eliminated approximately half the responses with only modest decreases in score reliability. (Author/BW)
Descriptors: Adaptive Testing, Computer Assisted Testing, Elementary Secondary Education, Intelligence Tests
Peer reviewedBray, James H.; And Others – Applied Psychological Measurement, 1980
The reliability of the four-factor model of the survey of study habits and attitudes (SSHA) was investigated. The reliabilities of the scales were marginal as measured by coefficient alpha. The hierarchical model of the SSHA was not supported by confirmatory factor analysis. (Author/BW)
Descriptors: Factor Analysis, Factor Structure, Higher Education, Learning Processes
Peer reviewedPrediger, Dale J. – Journal of Vocational Behavior, 1980
Holland types characterizing 34 occupational groups are reported for Self-Directed Search (SDS) standard scores. Results are compared with the Holland types obtained raw scores. Results imply that SDS standard scores are more accurate than raw scores in describing the Holland types of occupational groups. (Author)
Descriptors: Career Guidance, Interest Inventories, Job Applicants, Job Search Methods
Crocker, Linda; Benson, Jeri – Measurement and Evaluation in Guidance, 1980
Test papers of seventh-grade examinees were scored using students' initial responses, then rescored using their final responses on a subtest of the Metropolitan Achievement Test. Results indicate that reliability coefficients and item discriminations are not adversely affected by examinee response changes. (Author)
Descriptors: Attitude Change, Counselors, Junior High School Students, Response Style (Tests)
Peer reviewedKarnes, Frances A.; Brown, K. Eliot – Psychology in the Schools, 1981
A study to develop a short form of the Wechsler Intelligence Scale for Children-Revised (WISC-R) for the intellectually gifted showed the Vocabulary and Block Design comprise the best two-subtest short form. The Similarities, Vocabulary, Block Design, and Object Assembly tetrad could be most useful in time and reliability. (Author)
Descriptors: Academically Gifted, Elementary Secondary Education, Intelligence Tests, Screening Tests
Peer reviewedBerk, Ronald A. – Journal of Educational Measurement, 1980
A dozen different approaches that yield 13 reliability indices for criterion-referenced tests were identified and grouped into three categories: threshold loss function, squared-error loss function, and domain score estimation. Indices were evaluated within each category. (Author/RL)
Descriptors: Classification, Criterion Referenced Tests, Cutting Scores, Evaluation Methods


