Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewed: Allison, S. C.; And Others – International Journal of Rehabilitation Research, 1996
This attempt to determine the reliability of the Modified Ashworth Scale for assessing the severity of muscle spasticity for ankle plantarflexors in 30 patients with traumatic brain injury concluded that the reliability was minimally adequate to support the scale's continued use. Interrater reliability was less than that previously reported for…
Descriptors: Evaluation Methods, Head Injuries, Measures (Individuals), Motor Reactions
Peer reviewed: Baker, Eva L. – Journal of Educational Research, 1996
Paper introduces a collection of papers on educational assessment, noting controversies surrounding both old and new assessment methods, discussing the persistent set of technical issues that must be addressed by the research community in measurement, and examining work by the National Center for Research on Evaluation, Standards, and Student…
Descriptors: Alternative Assessment, Educational Assessment, Elementary Secondary Education, Evaluation Methods
Peer reviewed: Vandenplas-Holper, Christiane – Scientia Paedagogica Experimentalis, 1995
College psychology students rated 18 books on children's fears according to their potential impact on children's socioemotional growth. During a training program, they coded the books based on story grammar and specific categories of children's fears. Students then rerated the stories. Interrater reliability increased significantly from initial to…
Descriptors: Child Development, Child Psychology, Children, Childrens Literature
Peer reviewed: Jacobson, Joseph L.; Jacobson, Sandra W. – Developmental Psychology, 1996
Examined methodological issues related to the detection and evaluation of behavioral toxicity in infants and children, focusing on the selection of appropriate variables and strategies to control for confounding, sampling strategies and the problem of "overcontrol" for confounding; and the evaluation of dose-response relations and…
Descriptors: Children, Developmental Psychology, Error of Measurement, Fetal Alcohol Syndrome
Stirling, Keith – Proceedings of the ASIS Annual Meeting, 2000
Describes a session on information retrieval systems that planned to discuss relevance measures with Web-based information retrieval; retrieval system performance and evaluation; probabilistic independence of index terms; vector-based models; metalanguages and digital objects; how users assess the reliability, timeliness and bias of information;…
Descriptors: Bias, Electronic Libraries, Evaluation Methods, Information Retrieval
Peer reviewed: Harkin, Joe; Davis, Pauline; Turner, Gill – Westminster Studies in Education, 1999
Describes the process of producing a valid and reliable Communication Styles Questionnaire (CSQ) that teachers of 16-19 year old students in English post-compulsory education may use to monitor and evaluate how they tend to interact with students. Considers the development, validity, and reliability of the CSQ. (CMK)
Descriptors: Communication Skills, Dutch, Foreign Countries, Professional Development
Peer reviewed: Cline, Nancy M. – EDUCAUSE Review, 2000
Discusses issues facing research libraries relating to information technology and access to information. Considers the proliferation of digital information; the reliability of digital information; traditional library roles in the support of scholarship and research; future issues; and current projects relating to digital information. (LRW)
Descriptors: Academic Libraries, Access to Information, Electronic Libraries, Futures (of Society)
Fernhall, Bo; Pitetti, Kenneth H.; Vukovich, Matthew D.; Stubbs, Nancy; Hensen, Terri; Winnick, Joseph P.; Short, Francis X. – American Journal on Mental Retardation, 1998
The validity of the 600-yard walk/run, the 20-meter shuttle run, and a modified 16-meter shuttle run as measures of aerobic capacity (VO2peak) was evaluated in 34 children with mental retardation (ages 10-17). All field tests were found to be very reliable, and VO2peak was significantly related to them all. (Author/CR)
Descriptors: Aerobics, Child Health, Elementary Secondary Education, Mental Retardation
Peer reviewed: Arnold, Karl-Heinz – Zeitschrift fur Padagogik, 2001
Demonstrates that a high degree of fairness may be achieved in international comparative research on school achievement, using the Third International Mathematics and Science Study (TIMSS) as an example and employing the methods of advanced pedagogical-psychological diagnosis. Includes references. (CMK)
Descriptors: Comparative Analysis, Educational Quality, Educational Research, Elementary Secondary Education
Peer reviewed: Brennan, Robert L. – Educational Measurement: Issues and Practice, 2001
Discusses some problems, pitfalls, and paradoxes that challenge measurement theory and practice, especially for K-12 achievement testing. Considers a number of technical issues, especially some related to reliability. Also discusses a number of practical or political issues related to validation and accountability. (SLD)
Descriptors: Accountability, Achievement Tests, Educational Testing, Educational Theories
Peer reviewed: Gibbs, William; Graves, Pat R.; Bernas, Ronan S. – Journal of Research on Technology in Education, 2001
Describes a study that used a Web-based survey and a modified Delphi technique to identify criteria important to multimedia instructional courseware evaluation, validate them with a panel of instructional technology experts, and examine the effect of conducting panel discussions online. Shows information accuracy and reliability as the most…
Descriptors: Computer Software Evaluation, Courseware, Delphi Technique, Discussion
Peer reviewed: Sarafino, Edward P.; Ewing, Maureen – Journal of American College Health, 1999
Describes the development of the Hassles Assessment Scale for Students in College, which measured student stress. Development involved item generation, psychometric evaluation, and revision. Separate student samples participated in each phase. Results found very high levels of internal consistency for the frequency, unpleasantness, and dwelling…
Descriptors: College Students, Coping, Higher Education, Stress Management
VanSciver, James H. – High School Magazine, 1999
Teachers deserve objective evaluations that will help them to improve. Delaware uses a four-point rubric ranging from "unsatisfactory" to "needs improvement," "effective," and "exemplary." Teachers' placement depends on frequency of demonstrated behaviors identified in each rubric. For lower ratings, the…
Descriptors: Evaluation Criteria, Feedback, Interrater Reliability, Program Descriptions
Peer reviewed: Riddle, Kathryn P.; Aponte, Joseph F. – Child Abuse & Neglect: The International Journal, 1999
Data collected from 95 college students found the Comprehensive Childhood Maltreatment Inventory to have excellent test-retest reliability and 2 of the 4 subscales (Psychological Maltreatment and Neglect and Sexual Abuse) to possess adequate internal consistency. Reasons for low internal consistency for the Physical Maltreatment and Physical…
Descriptors: Child Abuse, Child Neglect, College Students, Evaluation Methods
Peer reviewed: Epstein, Michael H.; Hertzog, Melody A.; Reid, Robert – Behavioral Disorders, 2001
Data are reported on the long-term (6-month) test-retest reliability of the Behavioral and Emotional Rating Scale (BERS), a strength-based assessment instrument. Participants included 95 typical elementary students and 26 children with, or at risk of, emotional or behavioral disorders. Moderate to high test-retest correlations were…
Descriptors: Behavior Disorders, Behavior Rating Scales, Clinical Diagnosis, Disability Identification


