Publication Date
| In 2026 | 2 |
| Since 2025 | 462 |
| Since 2022 (last 5 years) | 1941 |
| Since 2017 (last 10 years) | 4513 |
| Since 2007 (last 20 years) | 6998 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10004 |
| Test Construction | 4369 |
| Foreign Countries | 3831 |
| Psychometrics | 2428 |
| Factor Analysis | 2301 |
| Measures (Individuals) | 1785 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1261 |
| Factor Structure | 1248 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 838 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 162 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Peer reviewedBradley, John M.; And Others – Journal of Reading Behavior, 1978
The present study was designed to determine if maze tests constructed over the same passages by different teachers were comparable. In addition, maze test parallel form reliability was investigated. (HOD)
Descriptors: Educational Research, Reading Comprehension, Reading Tests, Test Reliability
Peer reviewedHenggeler, Scott W.; Tavormina, Joseph B. – Hispanic Journal of Behavioral Sciences, 1979
The one-year stabilities of several well-standardized intellectual, educational, and personality tests were evaluated for 15 children of Mexican American migrant workers. Most of the stability coefficients observed for these tests were statistically significant and similar to those reported for their normative samples. However, the stability…
Descriptors: Mexican Americans, Migrant Children, Psychological Testing, Test Reliability
Peer reviewedJackson, Paul H. – Psychometrika, 1979
Use of the same term "split-half" for division of an n-item test into two subtests containing equal (Cronbach), and possibly unequal (Guttman), numbers of items sometimes leads to a misunderstanding about the relation between Guttman's maximum split-half bound and Cronbach's coefficient alpha. This distinction is clarified. (Author/JKS)
Descriptors: Item Analysis, Mathematical Formulas, Technical Reports, Test Reliability
Peer reviewedReynolds, Cecil R. – Psychology in the Schools, 1979
Two doctoral level school psychologists independently scored 50 McCarthy drawing booklets. Children producing the drawings ranged from 5-11. Interscorer reliability for Draw-A-Design was .93 and for Draw-A-Child was .96. No significant differences occurred in the mean score for either test across scores. (Author)
Descriptors: Children, Elementary Education, Scoring, Test Reliability
Peer reviewedProsnick, Kevin P.; Evans, William J.; Farris, Jaelyn R. – Measurement and Evaluation in Counseling and Development, 2003
This research reports the development and psychometric properties of scores from the 10-item Short Index of Self-Directedness (SISD), drawn from the Temperament and Character Inventory (TCI; C. R. Cloninger, 1987/1992a) and the TCI-125 (C. R. Cloninger, 1992b). Factor structure, construct validity, internal consistency, and test-retest reliability…
Descriptors: Factor Structure, Measures (Individuals), Personality, Psychometrics
Peer reviewedSnow, Mark; Thurber, Steven; Hodgson, Joele M. – Adolescence, 2002
Item content of the Michigan Alcoholism Screening Test (MAST) was modified to make it more appropriate for young persons. The resulting test was found to have lower internal consistency than the adult MAST, but the elimination of five items with comparatively poor psychometric properties yielded an acceptable alpha coefficient. (Contains 10…
Descriptors: Adolescents, Alcoholism, Drinking, Psychometrics
Peer reviewedDoble, Susan E. – Occupational Therapy Journal of Research, 1991
Process Skills Assessment is an observational assessment designed to evaluate process skills as demonstrated during the performance of a self-selected task. Retest reliability results indicated that assessment can be used to measure changes in clients' process skills after occupational therapy if the same task is used upon readministration.…
Descriptors: Interrater Reliability, Occupational Therapy, Skill Analysis, Test Reliability
Peer reviewedShields, Cleveland G.; And Others – Journal of Marital and Family Therapy, 1992
Developed Family Emotional Involvement and Criticism Scale (FEICS), self-report scale assessing perceived criticism and intensity of emotional involvement. Adult respondents (n=83) completed FEICS. Cronbach's alpha was 0.82 for Perceived Criticism subscale and 0.74 for Emotional Involvement subscale. Findings suggest that FEICS is reliable…
Descriptors: Family Relationship, Test Construction, Test Reliability, Test Validity
Peer reviewedFodness, Ruth Wochnick; And Others – Journal of School Psychology, 1991
Examined test-retest reliability for Test of Language Development-2: Primary (TOLD-2 P) and Intermediate (TOLD-2 I). Findings from 60 children revealed that, with few exceptions, both tests had satisfactory reliability over 2-week interval. Less satisfactory reliability was found for TOLD-2 P Semantics Composite (ages 4, 6 ,and 8); Phonology…
Descriptors: Age Differences, Language Acquisition, Test Reliability, Young Children
Peer reviewedLivingston, Ronald B.; Gray, Robert M.; Haak, Ruth A. – Assessment, 1999
Examined the internal consistency of three tests from the Halstead-Reitan Neuropsychological Battery (R. Reitan and D. Wolfson, 1992) with a sample of 334 children, 9 to 14 years of age. Gives reliability coefficients for the Seashore Rhythm Test, two forms of the Speech Sounds Perception Test, and the Aphasia Screening Test. (SLD)
Descriptors: Children, Early Adolescents, Neuropsychology, Test Reliability
Peer reviewedQuilter, Shawn M.; Band, Jennie P.; Miller, Gary M. – Journal of Mental Health Counseling, 1999
Investigates some of the psychometric characteristics of the results from visual-analogue scales used to measure mental imagery. Reports that the scores from visual-analogue scales are positively related to scores from longer pencil-and-paper measures of mental imagery. Implications and limitations for the use of visual-analogue scales to measure…
Descriptors: Counseling Techniques, Instrumentation, Psychometrics, Test Reliability
Hoachlander, E. Gareth – Techniques: Making Education and Career Connections, 1998
Discusses state testing, various types of tests, and whether the increased attention to assessment is contributing to improved student learning. Describes uses of standardized multiple-choice, open-ended constructed response, essay, performance event, and portfolio methods. (JOW)
Descriptors: Academic Achievement, Student Evaluation, Test Format, Test Reliability
Peer reviewedLawrence, John W.; Heinberg, Leslie J.; Roca, Robert; Munster, Andrew; Spence, Robert; Fauerbach, James A. – Psychological Assessment, 1998
The Satisfaction with Appearance Scale (SWAP) was administered to 165 burn victims. SWAP showed a high level of internal consistency (Cronbach's alpha, r(a)=0.87); an 84-subject retest measured reliability (r(tt)=0.59). SWAP is both a reliable and valid measure of body image for a burn-injured population. (Author/MAK)
Descriptors: Body Image, Test Construction, Test Reliability, Test Validity
Peer reviewedBurton, Richard F. – Assessment & Evaluation in Higher Education, 2001
Item-discrimination indices are numbers calculated from test data that are used in assessing the effectiveness of individual test questions. This article asserts that the indices are so unreliable as to suggest that countless good questions may have been discarded over the years. It considers how the indices, and hence overall test reliability,…
Descriptors: Guessing (Tests), Item Analysis, Test Reliability, Testing Problems
Berge, Jos M. F. Ten; Socan, Gregor – Psychometrika, 2004
To assess the reliability of congeneric tests, specifically designed reliability measures have been proposed. This paper emphasizes that such measures rely on a unidimensionality hypothesis, which can neither be confirmed nor rejected when there are only three test parts, and will invariably be rejected when there are more than three test parts.…
Descriptors: Test Reliability, Sampling, Psychometrics, Test Bias

Direct link
