Publication Date
| In 2026 | 10 |
| Since 2025 | 642 |
| Since 2022 (last 5 years) | 2579 |
| Since 2017 (last 10 years) | 5614 |
| Since 2007 (last 20 years) | 9210 |
Descriptor
| Test Validity | 21786 |
| Test Reliability | 10022 |
| Test Construction | 5897 |
| Foreign Countries | 4963 |
| Psychometrics | 2969 |
| Factor Analysis | 2942 |
| Measures (Individuals) | 2382 |
| Higher Education | 2250 |
| Evaluation Methods | 2085 |
| College Students | 1813 |
| Correlation | 1724 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 728 |
| Practitioners | 429 |
| Teachers | 142 |
| Administrators | 96 |
| Policymakers | 57 |
| Counselors | 36 |
| Students | 20 |
| Parents | 13 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 2 |
| More ▼ | |
Location
| Turkey | 808 |
| Australia | 347 |
| Canada | 324 |
| China | 300 |
| United States | 188 |
| Indonesia | 172 |
| Spain | 170 |
| Netherlands | 160 |
| United Kingdom | 160 |
| California | 156 |
| Germany | 154 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 1 |
| Meets WWC Standards with or without Reservations | 3 |
| Does not meet standards | 1 |
Howell, Scott L. – New Directions for Teaching and Learning, 2004
Although instructional methods are moving in ever greater number to a multimedia base, testing is not. What principles should be considered in correcting this misalignment?
Descriptors: Multimedia Instruction, Teaching Methods, Test Validity, Test Reliability
Simner, Marvin L.; Goffin, Richard D. – International Journal of Testing, 2003
Among the various tests employed in personnel selection, handwriting analysis, or graphology, has enjoyed long-standing international popularity despite being highly contentious. This report contains not only an evaluation of the current published scientific reviews on the use of graphology in personnel selection, but also an evaluation of several…
Descriptors: Predictive Validity, Handwriting, Personnel Selection, Position Papers
Hewitt, Margaret A.; Homan, Susan P. – Reading Research and Instruction, 2004
Test validity issues considered by test developers and school districts rarely include individual item readability levels. In this study, items from a major standardized test were examined for individual item readability level and item difficulty. The Homan-Hewitt Readability Formula was applied to items across three grade levels. Results of…
Descriptors: Test Validity, Test Items, Standardized Tests, Readability Formulas
Franklin, Anna – Journal of Experimental Child Psychology, 2006
Kowalski and Zimiles (2006) and O'Hanlon and Roberson (2006) address an age-old question: Why do children find it difficult to learn color terms? Here these articles are reflected on, providing a focused examination of the issues central to this question. First, the criteria by which children are said to find color naming difficult are considered.…
Descriptors: Children, Color, Test Validity, Test Reliability
Naglieri, Jack A.; De Lauder, Brianna Y.; Goldstein, Sam; Schwebech, Adam – School Psychology Quarterly, 2006
The relationships between Wechsler Intelligence Scale for Children-Third Edition (WISC-III) and the Cognitive Assessment System (CAS) with the Woodcock-Johnson Tests of Achievement (WJ-III) were examined for a sample of 119 children (87 males and 32 females) ages 6 to 16. The sample was comprised of children who were referred to a specialty clinic…
Descriptors: Measures (Individuals), Intelligence Tests, Comparative Analysis, Correlation
Haladyna, Thomas M.; Downing, Steven M. – Educational Measurement: Issues and Practice, 2004
There are many threats to validity in high-stakes achievement testing. One major threat is construct-irrelevant variance (CIV). This article defines CIV in the context of the contemporary, unitary view of validity and presents logical arguments, hypotheses, and documentation for a variety of CIV sources that commonly threaten interpretations of…
Descriptors: Student Evaluation, Evaluation Methods, High Stakes Tests, Construct Validity
Walters, Glenn D. – Assessment, 2005
Postrelease recidivism data were collected on 137 male inmates released 1 to 72 months after completing the Psychological Inventory of Criminal Thinking Styles (PICTS) in follow-ups lasting 6 to 55 months. When a dichotomous measure of recidivism (0, 1+ arrests) was employed, Entitlement (En) was the only PICTS thinking-style scale to achieve…
Descriptors: Measures (Individuals), Criminals, Psychometrics, Test Validity
Elhai, Jon D.; Naifeh, James A.; Zucker, Irene S.; Gold, Steven N.; Deitsch, Sarah E.; Frueh, B. Christopher – Assessment, 2004
The Infrequency-Posttraumatic Stress Disorder scale (Fptsd), recently created for the Minnesota Multiphasic Personality Inventory-2 (MMPI-2), has demonstrated incremental validity over other MMPI-2 scales in malingered posttraumatic stress disorder (PTSD) detection. Fptsd was developed with combat-exposed PTSD patients, potentially limiting its…
Descriptors: Measures (Individuals), Personality Measures, Test Validity, Sexual Abuse
Rydberg, Agneta; Ericson, Birgit; Lindstedt, Eva – Journal of Visual Impairment and Blindness, 2004
When assessing the visual function of young children, it is important to use a variety of tests. It is essential to have a structured observation method when it is not possible to use ordinary acuity tests. A structured observation method can be created by using a checklist. An ideal checklist should be handy and reliable and include a minimum of…
Descriptors: Observation, Check Lists, Young Children, Vision
Thrane, Lisa E.; Whitbeck, Les B.; Hoyt, Danny R.; Shelley, Mack C. – American Indian and Alaska Native Mental Health Research The Journal of the National Center, 2004
This study examined the measurement of depressive symptoms among American Indian adolescents as assessed by the Center for Epidemiologic Studies Depression Scale (CES-D), Youth Self Report (YSR), and the Tri-Ethnic Center's for Prevention Research Depression Scale (TEDS). This analysis demonstrated that the TEDS had good internal consistency,…
Descriptors: Measures (Individuals), Adolescents, Predictive Validity, American Indians
Gallagher, H. Alix – Peabody Journal of Education, 2004
In this study, I examined the validity of a performance-based, subject-specific teacher evaluation system by analyzing the relationship between teacher evaluation scores and student achievement. From a policy perspective, establishing validity was important because it is embedded in a knowledge-and skills-based pay system, which attached high…
Descriptors: Test Validity, Pedagogical Content Knowledge, Academic Achievement, Teacher Evaluation
Cole, Jason C.; Rabin, Adele S.; Smith, Tom L.; Kaufman, Alan S. – Psychological Assessment, 2004
The current study presents a Rasch-derived short form of the Center for Epidemiologic Studies-Depression scale (CES-D) for use as a depression screening tool in the general population. In contrast to short forms developed with reliance on classical measurement techniques, those developed using techniques based on item response theory produce a…
Descriptors: Measurement Techniques, Test Validity, Depression (Psychology), Item Response Theory
Schreck, Kimberly A.; Mulick, James A.; Rojahn, Johannes – Journal of Child and Family Studies, 2003
We describe the development, preliminary psychometric properties, and cross-validation of the Behavioral Evaluation of Disorders of Sleep (BEDS: Schreck 1997/1998). Parental reports of problem sleep behavior in elementary school aged children 5 years to 12 years were collected for two samples. With the first sample, an exploratory factor analysis…
Descriptors: Behavior Problems, Sleep, Factor Analysis, Psychometrics
Haertel, Edward H.; Lorie, William A. – Measurement: Interdisciplinary Research and Perspectives, 2004
Standards-based score reports interpret test performance with reference to cut scores defining categories like "below basic," "proficient," or "master." This article first develops a conceptual framework for validity arguments supporting such interpretations, then presents three applications. Two of these serve to introduce new standard-setting…
Descriptors: Scores, Test Interpretation, Test Validity, Standard Setting (Scoring)
Woodbury-Smith, M. R.; Robinson, J.; Wheelwright, S.; Baron-Cohen, S. – Journal of Autism and Developmental Disorders, 2005
The Autism Spectrum Quotient (AQ) has been developed to measure the degree to which an adult with normal intelligence has autistic traits. In this paper it is evaluated for its potential as a screening questionnaire in clinical practice on one hundred consecutive referrals to a diagnostic clinic for adults suspected of having Asperger Syndrome or…
Descriptors: Adults, Asperger Syndrome, Psychometrics, Test Validity

Peer reviewed
Direct link
