Putnam, Frank W.; And Others – Child Abuse and Neglect: The International Journal, 1993
Evaluation of the Child Dissociative Checklist found it to be a reliable and valid observer report measure of dissociation in children, including sexually abused girls and children with dissociative disorder and with multiple personality disorder. The checklist, which is appended, is intended as a clinical screening instrument and research measure…
Descriptors: Check Lists, Children, Emotional Disturbances, Psychological Evaluation
Peer reviewed: Gellman, Estelle S. – Action in Teacher Education, 1993
Portfolio assessment can be a valuable tool in assessing professional proficiency in teachers if appropriate attention is given to issues of reliability and validity. The Teaching Assessment Project at Stanford University has explored portfolios as an alternative to traditional methods of teacher evaluation. (IAH)
Descriptors: Elementary Secondary Education, Portfolios (Background Materials), Teacher Competencies, Teacher Competency Testing
Peer reviewed: Armstrong, Ronald D.; And Others – Journal of Educational Statistics, 1994
A network-flow model is formulated for constructing parallel tests based on classical test theory while using test reliability as the criterion. Practitioners can specify a test-difficulty distribution for values of item difficulties as well as test-composition requirements. An empirical study illustrates the reliability of generated tests. (SLD)
Descriptors: Algorithms, Computer Assisted Testing, Difficulty Level, Item Banks
Peer reviewed: Zimmerman, Donald W.; And Others – Applied Psychological Measurement, 1993
Some of the methods originally used to find relationships between reliability and power associated with a single measurement are extended to difference scores. Results, based on explicit power calculations, show that augmenting the reliability of measurement by reducing error score variance can make significance tests of difference more powerful.…
Descriptors: Equations (Mathematics), Error of Measurement, Individual Differences, Mathematical Models
Peer reviewed: Humphreys, Lloyd G.; And Others – Applied Psychological Measurement, 1993
Two articles discuss the controversy about the relationship between reliability and the power of significance tests in response to the discussion of Donald W. Zimmerman, Richard H. Williams, and Bruno D. Zumbo. Lloyd G. Humphreys emphasizes the differences between what statisticians can do and constraints on researchers. Zimmerman, Williams, and…
Descriptors: Error of Measurement, Individual Differences, Power (Statistics), Research Methodology
Peer reviewed: Roznowski, Mary; Smith, Marna L. – Intelligence, 1993
Measurement and psychometric quality of the Sternberg task (S. Sternberg, 1966, 1969), a memory search task, was investigated with 78 undergraduates. Individual performance was fairly homogeneous across responses, fairly unstable over time, and fairly stable across stimulus content. Implications for individual differences research are discussed.…
Descriptors: Cognitive Tests, Evaluation Methods, Higher Education, Individual Differences
Peer reviewed: Matson, Johnny L.; Smiroldo, Brandi B. – Research in Developmental Disabilities, 1997
A study tested the validity of the Diagnostic Assessment for the Severely Handicapped-II (DASH-II) for determining the presence of mania (bipolar disorder) in 22 individuals with severe mental retardation. The mania subscale was internally consistent and classified manic and control subjects accurately. (Author/CR)
Descriptors: Adults, Clinical Diagnosis, Disability Identification, Evaluation Methods
Peer reviewed: Dozois, David J. A.; Ahnberg, Jamie L.; Dobson, Keith S. – Psychological Assessment, 1998
Provides psychometric information on the second edition of the Beck Depression Inventory (BDI-II) (A. Beck, R. Steer, and G. Brown, 1996) for internal consistency, factorial validity, and gender differences. Results indicate that the BDI-II is a stronger instrument than its predecessor in terms of factor structure. (SLD)
Descriptors: Depression (Psychology), Factor Analysis, Factor Structure, Psychometrics
Peer reviewed: Scarsellone, Jana M. – Journal of Speech, Language, and Hearing Research, 1998
Hearing in Noise Test (HINT) list equivalency was examined using 24 listeners (ages 60 to 70) with sensorineural hearing impairments. Four speech conditions were tested: one quiet condition and three noise conditions. For the three noise conditions, all lists were within 2 dB of the means, indicating list equivalency.…
Descriptors: Auditory Evaluation, Auditory Perception, Communication Research, Generalization
Moss, Pamela A.; Schutz, Aaron – Phi Delta Kappan, 1999
Considers four key decision points in the National Board for Professional Teaching Standards' assessment-development process: development of content standards; development of tasks guiding candidates in providing evidence about their teaching; development of scoring rubrics and benchmarks; and determination of the performance standard that…
Descriptors: Elementary Secondary Education, Performance Based Assessment, Portfolio Assessment, Scoring Rubrics
Riccio, Cynthia A.; Boan, Candace H.; Staniszewski, Deborah; Hynd, George W. – Diagnostique, 1997
A study involving 120 school-aged children that investigated the concurrent validity of measures of written language found that the Wechsler Individual Achievement Test Written Expression subtest correlates moderately with the Written Expression subtest of the Peabody Individual Achievement Test-Revised and the Spontaneous Writing Quotient of the…
Descriptors: Elementary Secondary Education, Learning Disabilities, Test Reliability, Test Validity
Peer reviewed: Kapci, Emine G. – Early Child Development and Care, 1999
This study examined the validity and reliability of the Pre-school Behaviour Checklist (PBCL) for Turkish nursery school children. Data were obtained from 902 children, 24 to 82 months old, attending state or private nursery schools. Findings suggested that the PBCL has psychometric properties comparable to the British sample and could be used…
Descriptors: Behavior Problems, Check Lists, Child Behavior, Foreign Countries
Peer reviewed: Kaminski, Ruth A.; Good, Roland H., III – School Psychology Review, 1996
Examines the reliability, validity, and sensitivity of experimental measures developed to assess three areas of early literacy: phonological awareness, vocabulary development, and fluency in letter naming. Results indicate which measures display adequate psychometric properties for kindergartners not yet reading. Experimental measures were less…
Descriptors: Emergent Literacy, Grade 1, Kindergarten Children, Language Fluency
Owen, T. Ross – Journal of Educational Opportunity, 1997
A study investigated the validity and reliability of a new instrument for assessing the wellness lifestyles of Upward Bound students. Subjects were 42 students from five high schools using the program. The study examined 14 variables, including total scores, 10 subscales, and three demographic variables (age, race, gender), and concluded that the…
Descriptors: College Students, High School Students, High Schools, Measurement Techniques
Peer reviewed: Maes, B.; Fryns, J. P.; Ghesquiere, P.; Borghgraef, M. – Mental Retardation, 2000
A study investigated the effectiveness of a phenotypic checklist for identifying 110 males with fragile X syndrome and 79 controls, matched for age, level of cognitive development, and social adaptation. Results indicated that those boys who are likely to be diagnosed as having fragile X syndrome can be identified. (Contains references.)…
Descriptors: Adults, Check Lists, Children, Clinical Diagnosis


