[Peer reviewed] McColl, Mary Ann; Friedland, Judith – Occupational Therapy Journal of Research, 1989
Discusses the development and psychometric evaluation of the Social Support Inventory for Stroke Survivors, a multidimensional instrument for measuring social support and its influence on the rehabilitation of stroke patients. Examines the test-retest reliability and internal consistency of the instrument and suggests modifications and clinical…
Descriptors: Measures (Individuals), Multidimensional Scaling, Occupational Therapy, Physical Disabilities
[Peer reviewed] Rubin, Rebecca B.; And Others – Communication Education, 1995
Examines the role of standardized, performance-based assessment measures in the high school context. Reports validity and reliability information on the "Communication Competency Assessment Instrument--High School Edition," which was used to assess student speaking performance and to gauge the level of improvement as a result of instruction. (SR)
Descriptors: High Schools, Instructional Effectiveness, Performance Based Assessment, Speech Communication
[Peer reviewed] Gardner, Donald G.; And Others – Journal of Educational Computing Research, 1993
This empirical study of undergraduates compared the psychometric properties (reliability and validity) of four computer attitude measures and their subscales. Results indicate that all measures tested were essentially equal in reliability and validity, and attempts to empirically derive improved scales were…
Descriptors: Attitude Measures, Comparative Analysis, Computer Attitudes, Higher Education
[Peer reviewed] Yen, Wendy M.; Candell, Gregory L. – Applied Measurement in Education, 1991
Empirical reliabilities of scores based on item-pattern scoring (using 3-parameter item response theory) and on number-correct scoring were compared within each of 5 score metrics for at least 900 elementary school students in 5 content areas. On average, item-pattern scoring produced higher reliability. (SLD)
Descriptors: Elementary Education, Elementary School Students, Grade Equivalent Scores, Item Response Theory
[Peer reviewed] Lunz, Mary E.; And Others – Applied Measurement in Education, 1990
An extension of the Rasch model is used to obtain objective measurements for examinations graded by judges. The model calibrates elements of each facet of the examination on a common log-linear scale. Real examination data illustrate the way correcting for judge severity improves fairness of examinee measures. (SLD)
Descriptors: Certification, Difficulty Level, Interrater Reliability, Judges
[Peer reviewed] Evans, Brian – Canadian Journal of Program Evaluation/La Revue canadienne d'evaluation de programme, 1995
The distinction between two models of reliability is clarified. Reliability may be conceived of and estimated from a true score model or from the perspective of sampling precision. Basic models are developed and illustrated for each approach using data from the author's work on measuring organizational climate. (SLD)
Descriptors: Data Analysis, Error of Measurement, Evaluators, Models
[Peer reviewed] Birchler, Gary R.; Fals-Stewart, William – Assessment, 1994
The Response to Conflict Scale, a 24-item measure of maladaptive responses to marital conflict, was evaluated psychometrically with 420 couples. The inventory showed high internal consistency, test-retest reliability, construct and discriminant validity, and classification efficiency. Clinical utility is discussed. (SLD)
Descriptors: Classification, Conflict, Construct Validity, Marital Instability
[Peer reviewed] Dowling-Guyer, Seana; And Others – Assessment, 1994
Reliability and validity of the Risk Behavior Assessment, a self-report questionnaire evaluating drug use and sexual risk behavior for human immunodeficiency virus (HIV), were studied with 218 drug users who also provided urine samples. Overall, self-reports of drug use and sexual behavior were reliable. (SLD)
Descriptors: Acquired Immune Deficiency Syndrome, Adults, Behavior Patterns, Drug Use
[Peer reviewed] Lunz, Mary E.; And Others – Educational and Psychological Measurement, 1994
In a study involving eight judges, analysis with the FACETS model provides evidence that judges grade differently, whether or not scores correlate well. This outcome suggests that adjustments for differences among judges should be made before student measures are estimated to produce reproducible decisions. (SLD)
Descriptors: Correlation, Decision Making, Evaluation Methods, Evaluators
[Peer reviewed] Eisenstadt, Toni Hembree; And Others – Child & Family Behavior Therapy, 1994
This study investigated interparental agreement on the Eyberg Child Behavior Inventory for 44 clinic-referred families. Mothers rated their children's disruptive behavior as more frequent and more problematic than fathers did. However, strong evidence for cross-informant reliability was obtained. For maternal vs. paternal reports, classification…
Descriptors: Behavior Problems, Behavior Rating Scales, Child Behavior, Fathers
[Peer reviewed] Huerta-Macias, Ana – TESOL Journal, 1995
Discusses the use of alternative assessment procedures in English-as-a-Second-Language classrooms, focusing on three issues: (1) definitions of alternative assessment; (2) issues related to validity, reliability, and objectivity that are often raised as objections to alternative assessment; and (3) the power of alternative assessment to provide…
Descriptors: Alternative Assessment, Definitions, English (Second Language), Evaluation Methods
[Peer reviewed] Smith, Gregory T.; McCarthy, Denis M. – Psychological Assessment, 1995
Instrument refinement refers to any set of procedures designed to improve an instrument's representation of a construct. Five objectives of instrument refinement are discussed, and instrument refinement practices are reviewed in a discussion of its role in the process of developing theory and sharpening construct definition. (SLD)
Descriptors: Clinical Diagnosis, Construct Validity, Definitions, Measures (Individuals)
[Peer reviewed] Wehmeyer, Michael L.; Kelchner, Kathy – Career Development for Exceptional Individuals, 1995
This study assessed the validity and reliability of a modified, self-report version of the Autonomous Functioning Checklist for use with adults with mental retardation. Adolescents and adults (n=409) with mental retardation were interviewed. Results generally supported the instrument's validity and reliability and previous studies' findings that…
Descriptors: Adolescents, Adults, Check Lists, Mental Retardation
[Peer reviewed] Endler, Norman S.; Parker, James D. A. – Psychological Assessment, 1994
Four studies on the psychometric properties of the Coping Inventory for Stressful Situations (CISS), involving 682 adults and 1,592 college students, investigated factor structure and construct and content validities. Overall, results suggest that the CISS is a valid and reliable measure of basic coping styles. (SLD)
Descriptors: Adults, College Students, Construct Validity, Content Validity
[Peer reviewed] Paratore, Jeanne R. – Topics in Language Disorders, 1995
This article provides a framework for portfolio assessment in which common benchmarks and rubrics provide explicit and shared criteria for judging both the collection of work in the portfolio and individual performance samples. Also addressed are efforts to achieve validity and reliability in teacher, student, and parent judgments while…
Descriptors: Elementary Secondary Education, Evaluation Criteria, Individualized Programs, Literacy


