Publication Date
| In 2026 | 12 |
| Since 2025 | 958 |
| Since 2022 (last 5 years) | 4567 |
| Since 2017 (last 10 years) | 10500 |
| Since 2007 (last 20 years) | 21963 |
Descriptor
| Test Validity | 21786 |
| Validity | 13791 |
| Test Reliability | 10864 |
| Foreign Countries | 9887 |
| Test Construction | 6897 |
| Factor Analysis | 5761 |
| Measures (Individuals) | 5633 |
| Predictive Validity | 5022 |
| Psychometrics | 4820 |
| Reliability | 4635 |
| Correlation | 4376 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 1169 |
| Practitioners | 629 |
| Teachers | 336 |
| Administrators | 165 |
| Policymakers | 110 |
| Counselors | 63 |
| Students | 63 |
| Parents | 15 |
| Community | 12 |
| Media Staff | 10 |
| Support Staff | 8 |
| More ▼ | |
Location
| Turkey | 1397 |
| Australia | 705 |
| Canada | 626 |
| China | 528 |
| United States | 439 |
| Indonesia | 389 |
| United Kingdom | 363 |
| Germany | 340 |
| California | 338 |
| Netherlands | 336 |
| Spain | 311 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 7 |
| Meets WWC Standards with or without Reservations | 12 |
| Does not meet standards | 10 |
Parker, Richard; And Others – Diagnostique, 1996
Describes development and testing of maze-like semantic maps for assessing reading comprehension of content-area information. Results involving 144 students (38 with learning disabilities) showed that maps could be produced and scored with high reliability. Criterion-related validity based on a standardized test was weak, but moderate-to-strong…
Descriptors: Content Area Reading, Junior High Schools, Learning Disabilities, Map Skills
Peer reviewedBarthelemy, C.; And Others – Journal of Autism and Developmental Disorders, 1997
A French study of 136 children (ages 20-139 months) with developmental disabilities investigated the reliability and validity of the Revised Behavior Summarized Evaluation Scale (BSE-R) in evaluating autistic behavior in children with developmental delays. The BSE-R was found to be useful for progressive recording of the evolution of patients…
Descriptors: Autism, Children, Developmental Delays, Disability Identification
Peer reviewedSmith, P. Hull; And Others – Merrill-Palmer Quarterly, 1997
Examined predictive validity of measures of infant habituation and later aspects of temperament. Found babies who habituated sooner (fewer trials to criterion) at five months of age and had fewer peak fixations during habituation were rated by mothers as more active, intense, and negative in mood, and less persistent and adaptable. Age differences…
Descriptors: Age Differences, Habituation, Infant Behavior, Infants
Peer reviewedErford, Bradley T. – Educational and Psychological Measurement, 1997
Reliability and construct and criterion-related validity of scores on the Disruptive Behavior Rating Scale--Teacher Version were studied with 151 teachers of 1,766 elementary school students and 30 teachers of 131 students. Factor analysis confirmed a four-factor structure. Results support the measure's internal consistency and convergent…
Descriptors: Behavior Problems, Elementary Education, Elementary School Students, Elementary School Teachers
Peer reviewedMcPherson, K. M.; Pentland, B. – International Journal of Rehabilitation Research, 1997
This study of 54 individuals with head injuries compares a commonly used measure of physical disability, the Barthel Index, with three measures designed to assess intellectual functioning, communication, behavior, and mobility. The results indicate support for using scales other than the Barthel Index when describing disability following traumatic…
Descriptors: Adults, Communication Skills, Evaluation Methods, Foreign Countries
Peer reviewedEngdahl, Brian E.; And Others – Psychological Assessment, 1996
Four posttraumatic stress disorder (PTSD) scales were compared in a community sample of 330 former prisoners of war and World War II combat veterans. The Mississippi Scale for Combat-Related PTSD, the Minnesota Multiphasic Personality Inventory-2, and the Impact of Event Scale demonstrated moderate relationships with PTSD. (SLD)
Descriptors: Emotional Problems, Evaluation Methods, Personality Assessment, Personality Measures
Peer reviewedKooiman, C. G.; Ouwehand, A. W.; ter Kuile, M. M. – Child Abuse & Neglect: The International Journal, 2002
Criterion validity of the Sexual and Physical Abuse Questionnaire was investigated in 134 psychiatric patients using the Structured Trauma Interview. The measures of agreement and the predictive measures of the questionnaire were satisfactory, particularly with sexual abuse. Positive answering increased odds for sexual abuse by a factor of…
Descriptors: Child Abuse, Elementary Secondary Education, Evaluation Criteria, Evaluation Methods
Peer reviewedChapelle, Carol A.; Jamieson, Joan; Hegelheimer, Volker – Language Testing, 2003
Presents the design and validation of an English-as-a-Second-Language (ESL) test for a commercial publisher. (Author/VWL)
Descriptors: English (Second Language), Language Tests, Second Language Instruction, Second Language Learning
Peer reviewedDenner, Peter R.; Salzman, Stephanie A.; Bangert, Arthur W. – Journal of Personnel Evaluation in Education, 2001
Examined the validity and generalizability of the use of Teacher Work Samples to assess the ability of preservice teachers and inservice teachers to meet national and state teaching standards and to make an impact on the learning of their students. Results of the study, which involved 132 work samples, show initial support for teacher work sample…
Descriptors: Academic Achievement, Elementary Secondary Education, Generalization, Preservice Teachers
Peer reviewedConroy, Maureen A.; Stichter, Janine Peck – Journal of Special Education, 2003
This article critically analyzes research investigating the use of antecedent events in the functional assessment process, with an emphasis on interventions conducted in natural settings. An analysis of 17 articles found that research on antecedent-based interventions lacks a consistent conceptual framework. Suggestions for future research are…
Descriptors: Behavior Disorders, Elementary Secondary Education, Environmental Influences, Evaluation Methods
Peer reviewedHaertel, Edward H. – Educational Measurement: Issues and Practice, 2002
Outlines a framework for considering the validity of standards-based score interpretations and then considers the potential roles of different stakeholder groups and other participants in that process. Suggests study of a new standard-setting method, the "briefing book," which would describe alternative cut scores. (SLD)
Descriptors: Accountability, Cutting Scores, Elementary Secondary Education, High Stakes Tests
Peer reviewedCastro, Marcelo; Mendez, Julia L.; Fantuzzo, John – School Psychology Quarterly, 2002
Investigates the psychometric properties of a Spanish and English version of the Penn Interactive Peer Play Scale (PIPPS) when employed with Spanish- and English-speaking teachers and students. The independent emergence of comparable Spanish and English PIPPS factor structures provides initial support for use of this measure in research with…
Descriptors: Black Students, Factor Structure, Hispanic American Students, Peer Relationship
Peer reviewedWhittaker, Andrea; Young, Viki M. – Teacher Education Quarterly, 2002
Examines what teachers learn about the role of assessments in instruction through designing and using their own curriculum- embedded assessment, noting how the development of teachers' understanding of assessment for instructional purposes supports and conflicts with use of assessment data for reporting and accountability purposes. The paper…
Descriptors: Accountability, Elementary Secondary Education, Faculty Development, High Stakes Tests
Peer reviewedOliver, Bonamy; Dale, Philip S.; Saudino, Kimberly J.; Petrill, Stephen A.; Pike, Alison; Plomin, Robert – Early Child Development and Care, 2002
Validated a parent-based assessment of cognitive abilities of 3-year-olds, the Parent Report of Children's Abilities for 3s (PARCA3), against a standard tester-administered measure, the McCarthy Scales of Children's Abilities, and a vocabulary checklist. Found that PARCA3 parent report and parent-administered components significantly related to…
Descriptors: Cognitive Ability, Cognitive Development, Cognitive Measurement, Measures (Individuals)
Peer reviewedSouth, Mikle; Williams, Brenda J.; McMahon, William M.; Owley, Thomas; Filipek, Pauline A.; Shernoff, E.; Corsello, Christina; Lainhart, Janet E.; Landa, Rebecca; Ozonoff, Sally – Journal of Autism and Developmental Disorders, 2002
A study examined the validity of the Gilliam Autism Rating Scale (GARS) with 119 children (ages 3-10) with strict DSM-IV (Diagnostic Statistical Manual of Mental Disorders-IV) diagnoses of autism. The GARS consistently underestimated the likelihood that children would be classified as having autism. Limitations of ratings scales and of the GARS…
Descriptors: Autism, Behavior Rating Scales, Classification, Clinical Diagnosis


