Peer reviewed: Wolery, Mark; And Others – Journal of Early Intervention, 1993
A progressive time delay procedure in individual instructional sessions using massed-trial presentation was compared to distributed trials presented during transitions from one activity to another. Results with four preschool children with developmental delays indicated that both procedures were implemented reliably, were comparable in efficiency,…
Descriptors: Developmental Disabilities, Efficiency, Individual Instruction, Instructional Design
Kearney, Christopher A. – Journal of the Association for Persons with Severe Handicaps (JASH), 1994
In this study, interrater reliability of the Motivation Assessment Scale (MAS) was investigated utilizing 84 direct-care staff members familiar with 42 adults with moderate to profound mental retardation. Problematic overt behaviors were evaluated using the MAS. MAS items were found to be moderately but significantly reliable across raters.…
Descriptors: Adults, Attendants, Behavior Problems, Evaluation Methods
Peer reviewed: Matson, Johnny L.; Russell, Deirdre – Research in Developmental Disabilities, 1994
The Psychopathology Instrument for Mentally Retarded Adults--Sexuality Scale was developed and used to study 86 adults with mild/moderate mental retardation. Psychometric characteristics were favorable. Differences in the rate of sexually aberrant behavior are addressed as a function of living and work placement, positive history of sexual abuse,…
Descriptors: Adults, Mild Mental Retardation, Moderate Mental Retardation, Psychometrics
Peer reviewed: Colliver, Jerry R.; And Others – Journal of Academic Medicine, 1991
Case means and case failures in performance-based medical student evaluations were examined to evaluate the consistency of ratings made by two or more standardized patients (SPs) simulating the same case. Results demonstrate a need for caution in interpreting scores obtained from a case checklist completed by multiple SPs. (Author/MSE)
Descriptors: Evaluation Methods, Higher Education, Interrater Reliability, Medical Education
Harrington-Lueker, Donna – Executive Educator, 1991
Performance-based assessment, the umbrella term for various measures to test higher order thinking skills beyond the reach of multiple-choice tests, is the front runner in the race to provide testing alternatives. Problems concerning training, reliability, and cost remain unresolved. A sidebar summarizes testing developments in Arizona,…
Descriptors: Accountability, Costs, Elementary Secondary Education, Performance Based Assessment
Peer reviewed: LoBello, Steven G. – Journal of School Psychology, 1991
Data from standardization sample (n=1,700) of Wechsler Preschool and Primary Scale of Intelligence-Revised (WPPSI-R) were used to develop table that gives Full Scale intelligence quotients (IQs) for four-subtest (Comprehension, Arithmetic, Picture Completion, Block Design) abbreviated form of scale. Reports reliability and validity coefficients…
Descriptors: Intelligence Tests, Preschool Children, Preschool Education, Primary Education
Peer reviewed: Bradley, Clare – Assessment and Evaluation in Higher Education, 1993
Analysis of a study of sex bias in undergraduate student project evaluations revealed evidence of bias that was overlooked by the researchers. Research methodology and interpretation are discussed further. (MSE)
Descriptors: College Students, Higher Education, Interrater Reliability, Research Methodology
Macmann, Gregg M.; Barnett, David W. – American Journal on Mental Retardation, 1993
The reliability of diagnoses of mental retardation severity was examined by comparing psychiatric and psychological case records of 126 dually diagnosed (mental retardation and psychiatric disorder) clients. Overall chance-corrected agreement was 0.47. Results suggested that the reliability of diagnostic decisions may be best evaluated by analysis…
Descriptors: Clinical Diagnosis, Decision Making, Mental Disorders, Mental Retardation
Peer reviewed: Yeaton, William H.; Wortman, Paul M. – Evaluation Review, 1993
The current practice of reporting a single mean intercoder agreement in meta-analysis leads to systematic bias and overestimates reliability. An alternative is recommended in which average intercoder agreement statistics are calculated within clusters of coded variables. Two studies of intercoder agreement illustrate the model. (SLD)
Descriptors: Coding, Decision Making, Estimation (Mathematics), Interrater Reliability
Peer reviewed: Calsyn, Robert J.; And Others – Evaluation Review, 1993
Reliability and validity of self-report data provided by 178 mentally ill homeless persons were generally favorable. Self-reports of service use also generally agreed with treatment staff estimates, providing further validity evidence. Researchers and administrators can be relatively confident in using such data. (SLD)
Descriptors: Adults, Data Collection, Estimation (Mathematics), Homeless People
Peer reviewed: Lawson, Loralie – Journal of Vocational Behavior, 1993
To measure Theory of Work Adjustment personality and adjustment style dimensions, content-based scales were analyzed for homogeneity and successively reanalyzed for reliability improvement. Three sound scales were developed: inflexibility, activeness, and reactiveness. (SK)
Descriptors: Behavior Theories, Construct Validity, Measures (Individuals), Personality Traits
Peer reviewed: Smith, Douglas C.; Nelson, Sandra J. – Journal of Studies in Technical Careers, 1992
This critical review of the Personal Report of Communication Apprehension considers construct validity, cross-situational consistency, replicability, and applicability to non-English cultures. (SK)
Descriptors: Communication Apprehension, Communication Skills, Construct Validity, Higher Education
Peer reviewed: Patton, Wendy; Burnett, Paul C. – Adolescence, 1993
Investigated reliability and construct validity of Children's Depression Scale. Data from Australian sample of 202 adolescents revealed that four factors met stipulated criteria and accounted for 54% of variance. Revised subscales, three with five items and one with four items, had strong construct and face validity and high reliability.…
Descriptors: Adolescents, Construct Validity, Depression (Psychology), Foreign Countries
Peer reviewed: Frank, Gail C. – Journal of School Health, 1991
Discusses the validity and reliability of using food records and recalls when measuring dietary intake and eating behaviors of children, noting the advantages of collecting dietary data in a school environment. The article explains the appropriate use of food records and describes how to improve the method. (SM)
Descriptors: Child Health, Data Collection, Dietetics, Eating Habits
Peer reviewed: Lee, Lucienne A.; Heppner, P. Paul – Journal of Counseling and Development, 1991
Describes development of Harassment Sensitivity Inventory (HSI), an 18-item inventory developed to assess sensitivity to the negative effects of male to female sexual and nonsexual harassment in a work setting. Discusses initial psychometric data collected with sample of managers and supervisors (n=133). Claims HSI holds promise, although further…
Descriptors: Administrators, Measures (Individuals), Sexual Harassment, Test Construction


