Publication Date
| In 2026 | 2 |
| Since 2025 | 462 |
| Since 2022 (last 5 years) | 1941 |
| Since 2017 (last 10 years) | 4513 |
| Since 2007 (last 20 years) | 6998 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10004 |
| Test Construction | 4369 |
| Foreign Countries | 3831 |
| Psychometrics | 2428 |
| Factor Analysis | 2301 |
| Measures (Individuals) | 1785 |
| Evaluation Methods | 1410 |
| Higher Education | 1391 |
| Questionnaires | 1261 |
| Factor Structure | 1248 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 838 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 162 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 112 |
| Taiwan | 108 |
| Netherlands | 102 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Peer reviewedCronbach, Lee J. – Psychometrika, 1988
A coefficient derived from communalities of test parts represents greatest lower bound to Guttman's "immediate retest reliability." Constrained minimum trace factor analysis allows a consistent estimate of the greatest defensible internal-consistency coefficient. In modest size samples, this analysis capitalizes on chance, suggesting an…
Descriptors: Estimation (Mathematics), Evaluation Methods, Factor Analysis, Psychometrics
Canady, Robert Lynn; Hotchkiss, Phyllis Riley – Phi Delta Kappan, 1989
Identifies counterproductive grading policies and practices, such as varying grading scales; worshipping averages; using zeros indiscriminantly; following the assign, test, grade, and teach pattern; failing to match testing to teaching; ambushing students; grading first efforts; establishing inconsistent criteria; and failing to recognize…
Descriptors: Elementary Secondary Education, Evaluation Criteria, Failure, Grading
Peer reviewedKuder, Frederic – Educational and Psychological Measurement, 1991
Recommendations are made for the appropriate use and identification of traditional Kuder-Richardson formulas for the estimation of reliability. "Alpha" should be used for reliabilities estimated for tests or scales composed of items yielding scores distributed on more than two points. (SLD)
Descriptors: Estimation (Mathematics), Evaluation Methods, Mathematical Formulas, Scores
Peer reviewedRogers, James R.; DeShon, Richard P. – Suicide and Life-Threatening Behavior, 1992
Presents psychometric investigation of the eight-factor clinical model of the Suicide Opinion Questionnaire (SOQ) as representing the most appropriate interpretive model for the SOQ. Notes that factor-analytic and internal consistency reliability results failed to support hypothesized eight-factor model. Discusses alternative factor scheme and…
Descriptors: Factor Structure, Models, Opinions, Suicide
Peer reviewedDoble, Susan E.; Fisk, John D.; Lewis, Norma; Rockwood, Kenneth – Occupational Therapy Journal of Research, 1999
The findings of a study of 55 elderly adults support the test-retest reliability of the Assessment of Motor and Process Skills, illustrate the utility of alternative methods for examining the reliability of individual subjects' measures, and indicate that not all test-retest differences represent measurement error. (Author/JOW)
Descriptors: Error of Measurement, Older Adults, Psychomotor Skills, Test Reliability
Peer reviewedKutlesic, Vesna; Williamson, Donald A.; Gleaves, David H.; Barbin, Jane M.; Murphy-Eberenz, Kathleen P. – Psychological Assessment, 1998
Describes psychometric development of the fourth revision of the Interview for Diagnosis of Eating Disorders (IDED-IV). IDED-IV internal consistency and item-total correlations were assessed. IDED-IV yields sufficiently reliable and valid data for determining diagnoses in research studies and clinics specializing in the treatment of eating…
Descriptors: Diagnostic Tests, Eating Disorders, Psychometrics, Test Reliability
Peer reviewedCanivez, Gary L.; Watkins, Marley W. – Psychological Assessment, 1998
The long-term stability of the Wechsler Intelligence Scale for Children-Third Edition (WISC-III) (D. Wechsler, 1991) was studied with 667 children twice evaluated for special education consideration. Test-retest reliability coefficients are reported, providing the highest estimates of WISC-III stability yet reported. (SLD)
Descriptors: Children, Intelligence Tests, Longitudinal Studies, Special Education
Peer reviewedKlein, Sheryl; Magill-Evans, Joyce – Canadian Journal of Occupational Therapy, 1998
In a sample of 24 children with motor/language delays, the Pictorial Scale of Perceived Competence and Social Acceptance for Young Children (PS) and All about Me (AAM) had moderate to good reliability in measuring self-perceptions of competence. PS subscales other than cognitive competence and competence factor had lower reliability. (SK)
Descriptors: Childhood Attitudes, Competence, Self Concept, Test Reliability
Peer reviewedFuller, Gerald B.; Vance, Booney – Psychology in the Schools, 1995
Extends the reliability research of the Qualitative Scoring System for the Modified Version of the Bender-Gestalt Test. The test was administered to 48 children, and the 48 test protocols were scored independently by two psychologists using the Qualitative Scoring System. Results indicated that the scoring system is highly reliable. (JPS)
Descriptors: Interrater Reliability, Preschool Education, Primary Education, Psychologists
Peer reviewedPritchard, David A.; Livingston, Ronald B.; Reynolds, Cecil R.; Moses, James A., Jr. – School Psychology Quarterly, 2000
Presents a normative typology for classifying the Wechsler Intelligence Scale for Children-Third Edition (WISC-III) factor index profiles according to profile shape. Current analyses indicate that overall profile level accounted for a majority of the variance in WISC-III index scores, but a considerable proportion of the variance was because of…
Descriptors: Children, Classification, Profiles, Psychological Testing
Peer reviewedBird, Hector R.; Canino, Glorisa J.; Davies, Mark; Ramirez, Rafael; Chavez, Ligia; Duarte, Cristiane; Shen, Sa – Journal of the American Academy of Child and Adolescent Psychiatry, 2005
Objective: This article provides the results of the psychometric testing of the Brief Impairment Scale (BIS). The BIS is a 23-item instrument that evaluates three domains of functioning: interpersonal relations, school/work functioning, and self-care/self-fulfilment. It capitalizes on the strengths of existing global measures while addressing some…
Descriptors: Measures (Individuals), Psychiatry, Psychometrics, Validity
Omichinski, Donna Riccio; Van Tubbergen, Marie; Warschausky, Seth – Journal of the American Academy of Special Education Professionals, 2008
A component of a school assessment plan includes traditional IQ testing, often referred to as psychological or psycho-educational testing. Psycho-educational testing can yield information about how a student compares to others in her grade or age group, individual strengths and needs, and recommendations to improve instruction. The intended…
Descriptors: Intelligence Quotient, Intelligence Tests, Psychological Testing, Educational Testing
Pell, Godfrey; Homer, Matthew S.; Roberts, Trudie E. – International Journal of Research & Method in Education, 2008
Increasingly, academic institutions are being required to improve the validity of the assessment process; unfortunately, often this is at the expense of reliability. In medical schools (such as Leeds), standardized tests of clinical skills, such as "Objective Structured Clinical Examinations" (OSCEs) are widely used to assess clinical…
Descriptors: Medical Education, Standardized Tests, Clinical Experience, Criterion Referenced Tests
VanDerHeyden, Amanda M.; Burns, Matthew K. – Assessment for Effective Intervention, 2008
This article investigates the utility of various estimates of mathematics proficiency. The participants were 432 students in Grades 2 through 5. The delayed alternate form reliability of multiskill probes, retention probes, slopes of student growth, and trials to criterion were computed. The fluency probes were found to be both sufficiently…
Descriptors: Grades (Scholastic), Scores, Grade 5, Grade 2
Hall, John D.; Howerton, D. Lynn; Jones, Craig H. – Research in the Schools, 2008
The No Child Left Behind Act and the accountability movement in public education caused many states to develop criterion-referenced academic achievement tests. Scores from these tests are often used to make high stakes decisions. Even so, these tests typically do not receive independent psychometric scrutiny. We evaluated the 2005 Arkansas…
Descriptors: Criterion Referenced Tests, Achievement Tests, High Stakes Tests, Public Education

Direct link
