Publication Date
| In 2026 | 7 |
| Since 2025 | 690 |
| Since 2022 (last 5 years) | 3191 |
| Since 2017 (last 10 years) | 7432 |
| Since 2007 (last 20 years) | 15070 |
Descriptor
| Test Reliability | 15055 |
| Test Validity | 10290 |
| Reliability | 9763 |
| Foreign Countries | 7150 |
| Test Construction | 4828 |
| Validity | 4192 |
| Measures (Individuals) | 3880 |
| Factor Analysis | 3826 |
| Psychometrics | 3532 |
| Interrater Reliability | 3126 |
| Correlation | 3040 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1329 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 224 |
| Spain | 218 |
| California | 215 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewed: Martin, Richard M.; And Others – Developmental Psychology, 1977
This study provides an independent assessment of the reliability, validity, and design of the Defining Issues Test. (Author/SB)
Descriptors: Measurement Instruments, Moral Development, Research, Test Reliability
Peer reviewed: Safrit, Margaret J.; Wood, Terry M. – Research Quarterly for Exercise and Sport, 1987
An investigation of the reliability and validity of the Health-Related Physical Fitness Test found the instrument to be highly reliable for the 11- through 14-year-olds who served as subjects. (Author/CB)
Descriptors: Physical Fitness, Physical Health, Preadolescents, Secondary Education
Peer reviewed: Newman, Jody L.; Fuqua, Dale R. – Counselor Education and Supervision, 1986
Examined the effects of order of stimulus presentation on observer ratings of counseling performance. Results revealed a statistically significant interaction between quality of performance and the order in which the performances were rated. (Author/ABB)
Descriptors: Counselor Evaluation, Counselor Performance, Interrater Reliability, Observation
Peer reviewed: Compas, Bruce E.; And Others – Journal of Consulting and Clinical Psychology, 1987
Conducted four studies to develop Adolescent Perceived Events Scale (APES), measure of major and daily stressful events during adolescence. Describes test construction, test-retest reliability, and concurrent validity of APES. Summarizes subsequent research showing APES to be significantly related to behavior problems and psychological…
Descriptors: Adolescents, Stress Variables, Test Construction, Test Reliability
Peer reviewed: Horejsi, Charles; And Others – Child Welfare, 1987
Discusses the use of protocols, which are concise, written descriptions of preferred practice principles and procedures to be used by child welfare workers to ameliorate situations where mistakes or omissions may have serious consequences. Explains a procedure for development of protocols and presents an example. (NH)
Descriptors: Child Abuse, Child Welfare, Methods, Reliability
Peer reviewed: Ansorge, Charles J.; Scheer, John K. – Research Quarterly for Exercise and Sport, 1988
Analysis of gymnastics judges' scores of their own and other countries' gymnasts' performance during the 1984 Olympic Games indicated that the judges were biased in favor of their own country's gymnasts. (Author/CB)
Descriptors: Bias, Competition, Gymnastics, International Relations
Peer reviewed: Braun, Henry I. – Journal of Educational Statistics, 1988
A statistical experiment was conducted in an operational setting to determine the contributions of different sources of variability to the unreliability of scoring of essays and other free-response questions. Partially balanced incomplete block designs facilitated the unbiased estimation of certain main effects without requiring readers to assess the…
Descriptors: Essay Tests, Grading, Reliability, Scoring
Peer reviewed: Calhoun, Angela; And Others – Volta Review, 1988
Twenty normal-hearing, sighted subjects (ages 20-42) viewed soundless videotapes of a speaker reading lists from the two forms of the Utley Lipreading Test and three from Harris' revised Central Institute for the Deaf (CID) Everyday Sentences. Results do not support the interchange of Utley and CID sentences for test-retest comparisons of…
Descriptors: Hearing Impairments, Lipreading, Perception Tests, Test Reliability
Peer reviewed: Russell, Elbert W.; Levy, Marie – Journal of Consulting and Clinical Psychology, 1987
Implemented a method of abbreviating the Category Test of the Halstead-Reitan Neuropsychological Test Battery. The revision shortened the scales and reorganized Subtests 5 and 6 into two new scales using separate principles. Demonstrated it to be as accurate as the full test in predicting the presence or absence of brain damage in subjects.…
Descriptors: Neurological Impairments, Predictive Measurement, Psychological Testing, Psychometrics
Peer reviewed: Williford, A. Michael – Journal of College Admissions, 1986
Examined descriptive information about marketing, enrollment management, institutional planning and factors affecting them. A factor analysis of statistically appropriate variables identified factors associated with a state of symbiosis between marketing and institutional planning. (Author/BL)
Descriptors: College Administration, College Planning, Colleges, Higher Education
Folea, Richard V. – American School Board Journal, 1986
Computerized records are not necessarily accurate. Outlines how to do a short evaluation that checks the accuracy of student records. Includes discussion of how to improve student record-keeping. (MD)
Descriptors: Computers, Elementary Secondary Education, Evaluation, Recordkeeping
Peer reviewed: Vollmerhausen, Susan; And Others – Journal of Clinical Psychology, 1986
Compared Kennedy and Elder's (1982) Wechsler Intelligence Scale for Children (Revised) regression model with Kaufman's (1976) linear equating model. The Kennedy and Elder abbreviated form and the Kaufman abbreviated form attained a high degree of association, suggesting that both models are equally effective. (Author/BL)
Descriptors: Comparative Analysis, Institutionalized Persons, Models, Special Education
Peer reviewed: McGuire, Dennis P. – Psychometrika, 1986
A small data set is used to show that correlations and standard deviations measured within an explicitly selected group need not be smaller than those within an applicant population. Both validity and reliability estimates within a selected group can exceed those within an applicant population. (Author/LMO)
Descriptors: Correlation, Reliability, Sample Size, Sampling
Peer reviewed: Wainer, Howard – Journal of Educational Measurement, 1986
An example demonstrates and explains that summary statistics commonly used to measure test quality can be seriously misleading and that summary statistics for the whole test are not sufficient for judging the quality of the test. (Author/LMO)
Descriptors: Correlation, Item Analysis, Statistical Bias, Statistical Studies
Carey, John C.; And Others – Journal of College Student Personnel, 1986
Describes the development of a highly reliable 28-item scale to measure rapport between college roommates. This instrument should be useful in both research and practice. (Author/BL)
Descriptors: College Students, Higher Education, Measures (Individuals), Rapport


