Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedNewstead, Stephen E.; Dennis, Ian – Assessment and Evaluation in Higher Education, 1990
Three studies investigating the existence of sex bias in the grading of undergraduate students, by examining interrater reliability for blind and non-blind grading, are reported. Negative evidence found in the results and the confusing picture presented by previous research indicate little firm evidence of sex bias in grading. (Author/MSE)
Descriptors: Evaluation Methods, Grading, Higher Education, Interrater Reliability
Peer reviewedChapman, David W. – International Review of Education/Internationale Zeitschrift fuer Erziehungswissenschaft/Revue Internationale de Pedagogie, 1991
Reports findings from a study of the confidence expressed by ministry-level decision makers in five developing countries (i.e., Somalia, Botswana, Liberia, Yemen, and Nepal) about the quality of the national-level education data available to them and reasons for the perceived 16-40 percent error rate. (DMM)
Descriptors: Comparative Education, Data Collection, Developing Nations, Educational Research
Peer reviewedSherman, Thomas M. – Journal of General Education, 1991
Reviews commercially available instruments to assess students' study skills/habits/behavior, finding that none completely meets the standards set by the American Psychological Association or the American Educational Research Association. Offers guidance on selecting and applying these instruments in college settings. (DMM)
Descriptors: Comparative Analysis, Higher Education, Standardized Tests, Standards
Peer reviewedMcCroskey, James C. – Communication Quarterly, 1992
Outlines and discusses the nature and assumptions of the Willingness to Communicate scale. Discusses data relating to the reliability and validity of the instrument. Concludes that the scale is of sufficient quality to be recommended for research and screening purposes. (SR)
Descriptors: Communication Apprehension, Educational Research, Higher Education, Interpersonal Communication
Peer reviewedSturmey, Peter; And Others – Journal of Autism and Developmental Disorders, 1992
Analyses of the internal consistency of three autism scales--the Autism Behavior Checklist (ABC), the Real Life Rating Scale (RLRS), and the Childhood Autism Rating Scale (CARS)--were conducted with 34 children with pervasive developmental disabilities. Good internal consistency was found for the CARS. Adequate full-scale consistency was found for…
Descriptors: Autism, Behavior Rating Scales, Children, Screening Tests
Peer reviewedZimmerman, Donald W.; And Others – Educational and Psychological Measurement, 1993
Coefficient alpha was examined through computer simulation as an estimate of test reliability under violation of two assumptions. Coefficient alpha underestimated reliability under violation of the assumption of essential tau-equivalence of subtest scores and overestimated it under violation of the assumption of uncorrelated subtest error scores.…
Descriptors: Computer Simulation, Estimation (Mathematics), Mathematical Models, Robustness (Statistics)
Parry, Scott B. – Training, 1993
When using assessment instruments for personnel purposes, identify the purposes of assessment, specify criteria for excellent performance, determine behavior standards, and select appropriate methodology. Correlation analysis is a useful technique for establishing the validity of an instrument. (SK)
Descriptors: Correlation, Employment Practices, Evaluation Methods, Measures (Individuals)
Peer reviewedSlate, John R.; And Others – Learning Disability Quarterly, 1991
Investigation of the stability of Wechsler Adult Intelligence Scale Revised scores of 25 college students over a 4-year period found that global and subtest scores were highly stable. Subtest scores tended to be higher on the retest, but global scores were not despite four years of educational experiences between test administrations. (Author/DB)
Descriptors: College Students, Higher Education, Intelligence Tests, Learning Disabilities
Peer reviewedRobbins, Rosemary A. – Omega: Journal of Death and Dying, 1991
Tested Bugen's Coping with Death Scale. Individuals who had written wills, planned estates and funerals, and signed organ donor cards scored higher on the Coping with Death Scale. Because Coping with Death scores were more consistently different in those who prepared for death, this scale may help in efforts to predict those who will engage in…
Descriptors: College Students, Coping, Death, Higher Education
Peer reviewedColliver, Jerry A.; And Others – Teaching and Learning in Medicine, 1990
Studies in five senior medical school classes at Southern Illinois University investigated whether using multiple standardized patients to simulate the same case in postclerkship medical student evaluation affects the measure's reliability. Results of three studies show little or no effect on reliability of total, checklist, or written test…
Descriptors: Clinical Experience, Higher Education, Medical Education, Patients
Peer reviewedWard, Sandra B.; And Others – Journal of School Psychology, 1991
Investigated referral question bias on school psychologists' classification decisions across different types of cases. Findings from 175 school psychologists who classified 5 case studies on basis of scores from intelligence, achievement, and behavioral measures revealed lack of congruence among respondents' classification decisions that was more…
Descriptors: Classification, Congruence (Psychology), Educational Diagnosis, Elementary Secondary Education
Peer reviewedWinston, Roger B., Jr.; And Others – Journal of College Student Development, 1994
Created College Classroom Environment Scales (CCES), instrument with six subscales (Cathectic Learning Climate, Professorial Concern, Inimical Ambiance, Academic Rigor, Affiliation, and Structure) to assess social climate of college classrooms. Findings from four studies estimating CCES' reliability and validity suggest it is sufficiently reliable…
Descriptors: Classroom Environment, Higher Education, Psychometrics, Social Environment
Peer reviewedPlante, Elena; Vance, Rebecca – Language, Speech, and Hearing Services in Schools, 1994
Twenty-one language tests that included norms for children ages 4 and 5 were reviewed for information on 10 psychometric criteria. Administration of the 4 tests meeting the most criteria to 20 preschool children with specific language impairments and 20 controls found that only 1 (the Structured Photographic Expressive Language Test) provided…
Descriptors: Language Impairments, Language Tests, Preschool Education, Psychometrics
Peer reviewedKoeske, Gary F.; And Others – Social Work Research, 1994
Developed and validated Job Satisfaction Scale (JSS) in series of studies from 1980 to 1991 involving over 600 helping professionals. Across administrations, alpha reliabilities ranged between 0.83 and 0.91, and reliabilities of intrinsic and organizational satisfaction subscales ranged from 0.85 to 0.90 and 0.78 to 0.90, respectively. (Author/NB)
Descriptors: Employee Attitudes, Evaluation Methods, Human Services, Job Satisfaction
Peer reviewedSelby-Harrington, Maija L.; And Others – Journal of Professional Nursing, 1994
Summarizes the principles of instrument validity and reliability and identifies deviations from these principles in a random sample of 55 research studies published in 1989 in 5 refereed nursing journals targeted toward practicing clinicians. Provides documentation, justification, and suggestions for nursing educators, journal editors, and…
Descriptors: Clinical Experience, Measures (Individuals), Nursing, Research Reports


