Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Cronin-Jones, Linda L. – Canadian Journal of Environmental Education, 2005
This case study describes the development and field-testing of a research-based scoring rubric for analyzing elementary students' schoolyard habitat drawings. To justify schoolyard learning experiences in U.S. schools, teachers, program evaluators, and others need valid, reliable, and objective assessment tools for determining if, and how, these…
Descriptors: Playgrounds, Ecology, Freehand Drawing, Measures (Individuals)
Reath, Anne – Language Assessment Quarterly, 2004
In 1993, the language section of the Swedish Migration Board initiated the production of documents they called "language analyses" to aid in the processing of asylum seekers. Today, 11 years later, 2 privately owned companies in Stockholm produce these documents. These companies have produced language analyses not only for the Swedish…
Descriptors: Language Tests, Police, Foreign Countries, Immigration
Miller, Michael L.; Fee, Virginia E.; Netterville, Amanda K. – Research in Developmental Disabilities: A Multidisciplinary Journal, 2004
The reliability of Attention-Deficit/Hyperactivity Disorder (ADHD) rating scales in children with mental retardation was assessed. Parents, teachers, and teaching assistants completed ADHD rating scales on 48 children aged 5-12 diagnosed with mental retardation. Measures included the Child Behavior Checklist (CBCL), Conners Rating Scales, the…
Descriptors: Psychometrics, Behavior Rating Scales, Measures (Individuals), Attention Deficit Disorders
Eckert, Eileen; Bell, Alexandra – Adult Basic Education: An Interdisciplinary Journal for Adult Literacy Educational Planning, 2004
Current accountability policy that bases assessment on validity and reliability criteria in the positivist research tradition is counterproductive to serving adult learners and their communities. In this article, we outline a framework for accountability that allows for the emergence and demonstration of the full range of program outcomes and…
Descriptors: Accountability, Validity, Reliability, Adult Learning
Stone, Wendy L.; Coonrod, Elaine E.; Turner, Lauren M.; Pozdol, Stacie L. – Journal of Autism and Developmental Disorders, 2004
The STAT is an interactive screening measure for autism that assesses behaviors in the areas of play, communication, and imitation skills. In Study 1, signal detection procedures were employed to identify a cutoff score for the STAT using developmentally matched groups of 2-year-old children with autism and with nonspectrum disorders. The…
Descriptors: Psychometrics, Autism, Screening Tests, Measures (Individuals)
MacDonald, Rebecca; Anderson, Jennifer; Dube, William V.; Geckeler, Amy; Green, Gina; Holcomb, William; Mansfield, Renee; Sanchez, June – Research in Developmental Disabilities: A Multidisciplinary Journal, 2006
This paper describes a highly structured assessment protocol with objective behavioral measures for joint attention responding and initiation. The assessment was given to 26 children diagnosed with autism spectrum disorders and 21 typically developing children, aged two to four years. Interobserver agreement was high for all behavioral measures.…
Descriptors: Pervasive Developmental Disorders, Autism, Attention, Measurement Techniques
Riffert, Franz – Interchange: A Quarterly Review of Education, 2005
First an overview is given about the actual national and international situation concerning standardized testing. Then two major reasons are presented why accountability systems based on standardized testing have become so widespread: (a) the missing validity and reliability of teachers' assessment of students' achievement, and (b) the important…
Descriptors: Standardized Tests, Accountability, Reliability, Validity
Costa, Paul T.; McCrae, Robert R. – Psychological Bulletin, 2006
This article presents comments on the original article "Patterns of Mean-Level Change in Personality Traits Across the Life Course: A Meta-Analysis of Longitudinal Studies," by B. W. Roberts, K. W. Walton, and W. Viechtbauer. Although Roberts et al. depicted the present authors as proponents of the immutability of traits, in fact we have…
Descriptors: Longitudinal Studies, Age Differences, Personality Traits, Meta Analysis
Preacher, Kristopher J.; Rucker, Derek D.; MacCallum, Robert C.; Nicewander, W. Alan – Psychological Methods, 2005
Analysis of continuous variables sometimes proceeds by selecting individuals on the basis of extreme scores of a sample distribution and submitting only those extreme scores to further analysis. This sampling method is known as the extreme groups approach (EGA). EGA is often used to achieve greater statistical power in subsequent hypothesis tests.…
Descriptors: Sampling, Statistical Analysis, Reliability, Measures (Individuals)
van der Kloot, Willem A.; Spaans, Alexander M. J.; Heiser, Willem J. – Psychological Methods, 2005
Hierarchical agglomerative cluster analysis (HACA) may yield different solutions under permutations of the input order of the data. This instability is caused by ties, either in the initial proximity matrix or arising during agglomeration. The authors recommend repeating the analysis on a large number of random permutations of the rows and columns…
Descriptors: Multivariate Analysis, Reliability, Goodness of Fit, Data Analysis
Kuo, Elena S.; Stoep, Ann Vander; Stewart, David G. – Assessment, 2005
The Mood and Feelings Questionnaire (MFQ) is examined for its utility in screening youth in juvenile justice settings for depression. In a cross-sectional study conducted at King County Juvenile Detention Center, a representative sample of 228 detained adolescents completed structured assessments, including the MFQ and the Massachusetts Youth…
Descriptors: Measures (Individuals), Adolescents, Reliability, Juvenile Justice
Lock, Timothy G.; Levis, Donald J.; Rourke, Patricia A. – Journal of Child Sexual Abuse, 2005
This paper provides the results of two studies designed to evaluate a newly constructed self-report instrument, the Sexual Abuse Questionnaire (SAQ). The SAQ was designed as a brief screening device to aid in the identification of a childhood sexual abuse history. A "unique" feature of the SAQ is the inclusion of a number of non-face…
Descriptors: Sexual Abuse, Child Abuse, Questionnaires, Adults
Pelegrina, Santiago; Garcia-Linares, M. Cruz; Casanova, Pedro F. – Journal of Adolescence, 2003
This study examined family factors reported by parents and their children in relation to children's academic competence. Adolescents and their parents (N=323) reported about the same family characteristics: parental acceptance and involvement in the children's education. Measures related to children's academic competence were: academic competence…
Descriptors: Family Characteristics, Academic Achievement, Child Rearing, Interrater Reliability
Perez, Christina – Journal of College Admission, 2002
Spurred in part by University of California (UC) President Richard Atkinson's February 2001 proposal to drop the SAT I for UC applicants, more attention is being paid to other tests such as the SAT II and ACT. Proponents of these alternative exams argue that the SAT I is primarily an aptitude test measuring some vague concept of "inherent…
Descriptors: College Entrance Examinations, Test Reliability, Academic Achievement, Prediction
Urban, Klaus K. – International Education Journal, 2005
The Test for Creative Thinking-Drawing Production (TCT-DP), its design, concept and evaluation scheme as well as experiences and results of application are described. The test was designed to mirror a more holistic concept of creativity than the mere quantitatively oriented, traditional divergent thinking tests. The specific design using figural…
Descriptors: Creativity, Ability Grouping, Creative Thinking, Tests

