Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
[Peer reviewed] Fitzpatrick, Anne R.; Ercikan, Kadriye; Yen, Wendy M.; Ferrara, Steven – Applied Measurement in Education, 1998
The consistency between raters over three years of a high-stakes performance assessment was examined in two studies involving a total of approximately 3,000 students in grades three, five, and eight. Results show that raters in different years differ in severity, with raters in mathematics most consistent, and those in language arts least…
Descriptors: Elementary Education, Elementary School Students, High Stakes Tests, Interrater Reliability
[Peer reviewed] Christensen, Rhonda; Knezek, Gerald – Journal of Technology and Teacher Education, 2000
Examines the internal consistency reliabilities for 14 previously published computer attitude scales based on responses from preservice teachers, practicing K-12 teachers, and teacher educators. Describes the Teachers' Attitude toward Computers Questionnaire (TAC) that included 32 subscales. (LRW)
Descriptors: Computer Attitudes, Elementary School Teachers, Preservice Teachers, Questionnaires
[Peer reviewed] Mantzicopoulos, Panayota – Early Childhood Research Quarterly, 1999
Examined differences in performance as well as reliability and validity indices for 256 Head Start children screened with the Brigance K&1 Screen. Found high overall test consistency, but considerable variability across subscales. Classification analyses established that the Brigance was not completely accurate in predicting early school…
Descriptors: Academic Achievement, Age Differences, Kindergarten, Kindergarten Children
[Peer reviewed] Charman, Tony; Pervova, Irina – Journal of Outcome Measurement, 2001
Studied the internal structure of a self-report measure of depressed mood in school children, the Child Depression Inventory (M. Kovacs and A. Beck, 1977) with 92 Russian and 139 English children (mean age: 12 years, 10 months). Internal reliability and consistency results and factor analysis support the use of the scale with non-Western samples…
Descriptors: Depression (Psychology), Elementary Education, Elementary School Students, Factor Analysis
[Peer reviewed] Loo, Robert – Measurement and Evaluation in Counseling and Development, 2001
Examines the psychometric properties of scores on the Work Preference Inventory (WPI), particularly its factor structure and scale reliabilities. Analyses of data from 200 undergraduates showed qualified support for the inventory and its use for developmental purposes. States that the WPI may be a useful tool in stimulating students to examine…
Descriptors: Factor Structure, Higher Education, Psychometrics, Scaling
[Peer reviewed] Gentile, J. Ronald – Teaching of Psychology, 2000
Describes a classroom activity, listing step-by-step directions, that demonstrates the unreliability of essay scoring. Explains that after the exercise the class discussion should address the problematic factors in scoring essays. Lists recommendations for improving reliability and validity of essay scoring. (CMK)
Descriptors: Class Activities, Discussion (Teaching Technique), Educational Strategies, Essays
[Peer reviewed] Williams, Janet L. – RSR: Reference Services Review, 2000
Discusses the basic concepts of testing and item development and the application of alternative assessments to information literacy content for library instruction. Topics include reliability; validity; statistical analysis; selected response, including checklists, rank order, or simple match; constructed response; essays; and complex assessments.…
Descriptors: Essays, Evaluation Methods, Information Literacy, Library Instruction
[Peer reviewed] Melby, Janet M.; And Others – Journal of Marriage and the Family, 1995
Multiple observer ratings of 424 families were obtained across 2 observational task situations using the Iowa Family Interaction Rating Scales. Observer ratings and family member reports were assessed simultaneously through structural equation modeling. Findings support the reliability of the global assessments of warm/supportive marital…
Descriptors: Affective Behavior, Higher Education, Interpersonal Relationship, Interrater Reliability
[Peer reviewed] Ellis, David; And Others – Journal of the American Society for Information Science, 1996
Describes an investigation of the relationship between the levels of interlinker consistency obtained among a group of full-text databases in which internodal links were inserted, and the effectiveness of searches carried out in those databases. Topics include interindexer consistency and retrieval system evaluation. (Author/LRW)
Descriptors: Correlation, Evaluation Methods, Full Text Databases, Hypermedia
[Peer reviewed] Ramsey, Paul G.; And Others – Academic Medicine, 1996
A study of 187 internists, evaluated by peers they recommended, found that the highest rating was for integrity and the lowest was for psychosocial aspects of patient care. Peer raters' response rate and analysis of the ratings suggest this rating process is acceptable to physicians and that it is feasible to obtain reliable, multidimensional peer…
Descriptors: Evaluation Methods, Hospitals, Internal Medicine, Job Performance
[Peer reviewed] Swanson, H. Lee – Journal of Special Education, 1996
Evaluation of two meta-analyses of sociometric research on children with learning disabilities notes differences in their findings. Differences are attributed to effects of gender, ethnicity, and type of measurement on effect size; inadequate reporting of coding reliability; failure to include similar articles for analysis; and poor…
Descriptors: Elementary Secondary Education, Interpersonal Competence, Learning Disabilities, Meta Analysis
[Peer reviewed] Reynolds, William M.; Kobak, Kenneth A. – Psychological Assessment, 1995
A self-report, paper-and-pencil version of the Hamilton Depression Rating Scale, the Hamilton Depression Inventory, was developed and tested with 140 depressed adults, 99 adults with anxiety disorders, and 118 nonreferred adults. Overall, data support the reliability and validity of the new measure. (SLD)
Descriptors: Anxiety, Clinical Diagnosis, Depression (Psychology), Diagnostic Tests
[Peer reviewed] Stock, William A.; And Others – Evaluation and the Health Professions, 1996
Guidelines are offered that make it more likely that high-quality information will be extracted and coded from primary research reports in meta-analyses. It is also noted that the methodology of meta-analysis results in pressure to change the type of information that appears in primary research reports. (SLD)
Descriptors: Coding, Data Analysis, Data Collection, Information Needs
[Peer reviewed] Kloseck, Marita; And Others – Therapeutic Recreation Journal, 1996
Outlines the conceptual background and development of the Leisure Competence Measure, a behavior-anchored rating scale that documents current levels of leisure functioning and change over time. Validation with a geriatric rehabilitation population supports it as an instrument from which researchers can draw dependable inferences regarding leisure…
Descriptors: Behavior Rating Scales, Leisure Time, Older Adults, Rehabilitation
[Peer reviewed] Gilmer, Mary Jo; And Others – Journal of School Health, 1996
This paper reports the psychometric properties of each of the subscales of the Youth Health Survey and results of a pilot study with 205 sixth-, seventh-, and eighth-grade students. The eight subscales assess health habits and attitudes of the adolescent as well as peer and family influences. Data for internal consistencies, test-retest…
Descriptors: Adolescents, Cardiovascular System, Health Activities, Psychometrics


