Larzelere, Robert E.; Andersen, Jamie J.; Ringle, Jay L.; Jorgensen, Dan D. – Death Studies, 2004
This study documents the initial reliability and validity of the Child Suicide Risk Assessment (CSRA) for children under the age of 13. The revised CSRA retained 18 of 20 original items based on item-specific psychometric data from 140 pre-adolescents in out-of-home treatment programs. The CSRA demonstrated adequate internal consistency (alpha =…
Descriptors: Psychometrics, Depression (Psychology), Suicide, Predictor Variables
Meier, Scott T. – American Journal of Evaluation, 2004
Despite evidence that the choice of dependent measures can significantly influence design sensitivity, many evaluators default to traditional measures that may be insensitive to intervention effects. This paper describes an innovative set of test development guidelines designed to select items and create aggregate scales that are better able to…
Descriptors: Psychometrics, Item Analysis, Test Construction, Measures (Individuals)
Lopez, Michael N.; Charter, Richard A.; Mostafavi, Beeta; Nibut, Lorraine P.; Smith, Whitney E. – Assessment, 2005
Criterion-referenced (Livingston) and norm-referenced (Gilmer-Feldt) techniques were used to measure the internal consistency reliability of Folstein's Mini-Mental State Examination (MMSE) on a large sample (N = 418) of elderly medical patients. Two administration and scoring variants of the MMSE Attention and Calculation section (Serial 7s only…
Descriptors: Psychometrics, Test Reliability, Test Validity, Older Adults
Mazor, Kathleen M.; Schwartz, Carolyn E.; Rogers, H. Jane – Assessment, 2004
A new measure of concerns about dying was investigated in this psychometric study. The Concerns About Dying instrument (CAD) was administered to medical students, nursing students, hospice nurses, and life sciences graduate students (N = 207) on two occasions; on one occasion they also completed three related measures. Analyses included…
Descriptors: Psychometrics, Patients, Nursing Students, Nurses
Uys, Kitty; Alant, Erna – Perspectives in Education, 2004
The purpose of this article is to describe the processes followed to develop an authentic, reliable and valid play-based assessment of communication-related behaviours in young children with severe disabilities. The Daily Multiple Measurement Instrument (DMMI) was developed to be used based on an intervention package of play activities. The…
Descriptors: Measurement, Disabilities, Test Validity, Young Children
Cooke, David J.; Hart, Stephen D.; Michie, Christine – Psychological Assessment, 2004
Cross-national differences in the prevalence of psychopathy have been reported. This study examined whether rater effects could account for these differences. Psychopathy was assessed with the Psychopathy Checklist-Revised (PCL-R; R. D. Hare, 1991). Videotapes of 6 Scottish prisoners and 6 Canadian prisoners were rated by 10 Scottish and 10…
Descriptors: Foreign Countries, Check Lists, Interrater Reliability, Factor Analysis
Downs, Danielle Symons; Hausenblas, Heather A.; Nigg, Claudio R. – Measurement in Physical Education and Exercise Science, 2004
The research purposes were to examine the factorial and convergent validity, internal consistency, and test-retest reliability of the Exercise Dependence Scale (EDS). Two separate studies, containing a total of 1,263 college students, were undertaken to accomplish these purposes. Participants completed the EDS and measures of exercise behavior and…
Descriptors: Evidence, Test Validity, Measures (Individuals), Factor Analysis
Village, Andrew – Journal of Beliefs & Values, 2005
Biblical literalism was assessed among 404 adult Anglicans from a variety of church traditions using a summated rating scale based on 10 items referring to events in the Bible. The literalism scale showed high internal reliability (α = 0.92) and scores were highest (i.e. most literal) in Evangelical churches, intermediate in Broad churches…
Descriptors: Biblical Literature, Catholics, Churches, Rating Scales
Goreczny, Anthony J.; Miller, Bree; Dunmire, Brenda; Tolge, Geoffrey J. – Research in Developmental Disabilities: A Multidisciplinary Journal, 2005
The purpose of the independent monitoring for quality (IM4Q) program is to bridge communication between individuals with mental retardation and the service providers on whom they rely. The IM4Q program uses an interview (essential data elements survey) to gather information about the lives of individuals with mental retardation. Collaboration…
Descriptors: Interrater Reliability, Mental Retardation, Program Effectiveness, Community Surveys
DeMars, Christine E. – Journal of Educational Measurement, 2006
Four item response theory (IRT) models were compared using data from tests where multiple items were grouped into testlets focused on a common stimulus. In the bi-factor model each item was treated as a function of a primary trait plus a nuisance trait due to the testlet; in the testlet-effects model the slopes in the direction of the testlet…
Descriptors: Item Response Theory, Reliability, Item Analysis, Factor Analysis
Epstein, Monica K.; Poythress, Norman G.; Brandon, Karen O. – Assessment, 2006
The reliability and validity of the Self-Report Psychopathy Scale (SRPS) were examined in a noninstitutionalized offender sample of mixed gender and race. Adequate alpha coefficients were obtained for the total sample and across gender and race. The SRPS was compared to measures of trait anxiety and passive avoidance errors. SRPS total, primary,…
Descriptors: Self Evaluation (Individuals), Race, Sex, Psychopathology
de Bildt, Annelies; Sytema, Sjoerd; Ketelaars, Cees; Kraijer, Dirk; Mulder, Erik; Volkmar, Fred; Minderaa, Ruud – Journal of Autism and Developmental Disorders, 2004
The interrelationship between the Autism Diagnostic Interview-Revised (ADI-R), Autism Diagnostic Observation Schedule-Generic (ADOS-G) and clinical classification was studied in 184 children and adolescents with Mental Retardation (MR). The agreement between the ADI-R and ADOS-G was fair, with a substantial difference between younger and older…
Descriptors: Autism, Classification, Children, Adolescents
Weekes, Brendan S.; Castles, Anne E.; Davies, Robert A. – Reading and Writing: An Interdisciplinary Journal, 2006
Three experiments investigated the effects of rime consistency on reading and spelling among developing readers ranging in age from 7 to 11 years. Experiment 1 found that children read words with inconsistent feedforward mappings between orthography and phonology (O → P) less accurately than consistent words. OP consistency interacted…
Descriptors: Reliability, Reading, Spelling, Children
Hockey, A.; Geffen, G. – Intelligence, 2004
To determine whether the visuospatial n-back working memory task is a reliable and valid measure of cognitive processes believed to underlie intelligence, this study compared the reaction times and accuracy of performance of 70 participants, with performance on the Multidimensional Aptitude Battery (MAB). Testing was conducted over two sessions…
Descriptors: Short Term Memory, Spatial Ability, Validity, Test Reliability
Watkins, J.; Espie, C. A.; Curtice, L.; Mantala, K.; Corp, A.; Foley, J. – Journal of Intellectual Disability Research, 2006
Background: Epilepsy is common in people with intellectual disability, yet clinicians and researchers seldom obtain information directly from the client. The development and preliminary validation of a novel measure for use with people with mild to moderate intellectual disabilities is described. Methods: Focus group methods (6 groups; 24…
Descriptors: Epilepsy, Mild Mental Retardation, Moderate Mental Retardation, Measures (Individuals)

