[Peer reviewed] Coenders, Germa; Saris, Willem E.; Batista-Foguet, Joan M.; Andreenkova, Anna – Structural Equation Modeling, 1999
Illustrates that sampling variance can be very large when a three-wave quasi simplex model is used to obtain reliability estimates. Also shows that, for the reliability parameter to be identified, the model assumes a Markov process. These problems are evaluated with both real and Monte Carlo data. (SLD)
Descriptors: Estimation (Mathematics), Markov Processes, Monte Carlo Methods, Reliability
[Peer reviewed] Snowden, Lonnie R.; Hines, Alice M. – Journal of Black Psychology, 1999
Investigated an acculturation scale designed for use in the African-American population. Responses from more than 900 African Americans generally indicate an African-American orientation within the sample, although there are notable variations on all 10 scale items. Discusses evidence for scale reliability and validity. (SLD)
Descriptors: Acculturation, Adults, Blacks, Psychological Characteristics
[Peer reviewed] Lee, Guemin; Frisbie, David A. – Applied Measurement in Education, 1999
Studied the appropriateness and implications of using a generalizability theory approach to estimating the reliability of scores from tests composed of testlets. Analyses of data from two national standardization samples suggest that manipulating the number of passages is a more productive way to obtain efficient measurement than manipulating the…
Descriptors: Generalizability Theory, Models, National Surveys, Reliability
[Peer reviewed] Theron, C. C. – Educational and Psychological Measurement, 1999
Studies the effects of a joint correction for criterion unreliability and Case 1 selection on the parameters of the decision function in personnel selection. Illustrates the joint correction equations. (SLD)
Descriptors: Criteria, Decision Making, Equations (Mathematics), Personnel Selection
[Peer reviewed] Jarjoura, David; Hartman-Stein, Paula; Speight, Joan; Reuter, Jeanette – Educational and Psychological Measurement, 1999
Examined the reliability and construct validity in an older adult population (n=149 older adults and their informants) of scores on the Behavioral Competence Inventory (BCI) (P. Hartman-Stein). Results indicate that scores on the BCI's seven scales show adequate internal consistencies and represent seven overlapping but distinct constructs in this…
Descriptors: Behavior Patterns, Competence, Construct Validity, Older Adults
[Peer reviewed] Sanders, Piet F.; Verschoor, Alfred J. – Applied Psychological Measurement, 1998
Presents minimization and maximization models for parallel test construction under constraints. The minimization model constructs weakly and strongly parallel tests of minimum length, while the maximization model constructs weakly and strongly parallel tests with maximum test reliability. (Author/SLD)
Descriptors: Algorithms, Models, Reliability, Test Construction
[Peer reviewed] Doble, Susan E.; Fisk, John D.; Lewis, Norma; Rockwood, Kenneth – Occupational Therapy Journal of Research, 1999
The findings of a study of 55 elderly adults support the test-retest reliability of the Assessment of Motor and Process Skills, illustrate the utility of alternative methods for examining the reliability of individual subjects' measures, and indicate that not all test-retest differences represent measurement error. (Author/JOW)
Descriptors: Error of Measurement, Older Adults, Psychomotor Skills, Test Reliability
[Peer reviewed] Voorhees, Ellen M. – Information Processing & Management, 2000
Discusses the test collections developed in the TREC (Text REtrieval Conference) workshops for information retrieval research and describes a study by NIST (National Institute of Standards and Technology) that verified their reliability by investigating the effect changes in the relevance assessments have on the evaluation of retrieval results.…
Descriptors: Correlation, Information Retrieval, Relevance (Information Retrieval), Reliability
[Peer reviewed] Caruso, John C. – Educational and Psychological Measurement, 2000
Performed a reliability generalization using 244 studies that used NEO personality scales. Reliability estimates were given in only 15% of these studies, and 44% made no mention of reliability at all. Results suggest that many researchers have an inadequate understanding of concepts of reliability. Results also suggest that NEO personality scales…
Descriptors: Estimation (Mathematics), Generalization, Personality Assessment, Personality Measures
[Peer reviewed] Armstrong, Ronald D.; Jones, Douglas H.; Wang, Zhaobo – Journal of Educational and Behavioral Statistics, 1998
Generating a test from an item bank using a criterion based on classical test theory parameters poses considerable problems. A mathematical model is formulated that maximizes the reliability coefficient alpha, subject to logical constraints on the choice of items. Theorems ensuring appropriate application of the Lagrangian relaxation techniques are…
Descriptors: Item Banks, Mathematical Models, Reliability, Test Construction
[Peer reviewed] Kutlesic, Vesna; Williamson, Donald A.; Gleaves, David H.; Barbin, Jane M.; Murphy-Eberenz, Kathleen P. – Psychological Assessment, 1998
Describes psychometric development of the fourth revision of the Interview for Diagnosis of Eating Disorders (IDED-IV). IDED-IV internal consistency and item-total correlations were assessed. IDED-IV yields sufficiently reliable and valid data for determining diagnoses in research studies and clinics specializing in the treatment of eating…
Descriptors: Diagnostic Tests, Eating Disorders, Psychometrics, Test Reliability
[Peer reviewed] Downie, Michele Sebastian; Robbins, Steven B. – Counseling Psychologist, 1998
Highlights the use of semistructured interviews to explore essential positive and negative qualities of significant relationships. This approach allows for identifying who (or what) comprises respondents' significant social networks, and for conducting a qualitative analysis of those positive and negative qualities. Clinical individuals had…
Descriptors: Evaluation, Friendship, Interpersonal Relationship, Interviews
[Peer reviewed] Canivez, Gary L.; Watkins, Marley W. – Psychological Assessment, 1998
The long-term stability of the Wechsler Intelligence Scale for Children-Third Edition (WISC-III) (D. Wechsler, 1991) was studied with 667 children twice evaluated for special education consideration. Test-retest reliability coefficients are reported, providing the highest estimates of WISC-III stability yet reported. (SLD)
Descriptors: Children, Intelligence Tests, Longitudinal Studies, Special Education
[Peer reviewed] Kier, Frederick J.; Melancon, Janet G.; Thompson, Bruce – Educational and Psychological Measurement, 1998
The reliability and construct validity of scores on the Personal Preferences Self-Description Questionnaire (PPSDQ) (B. Thompson) were studied. Classical psychometric analysis of data from 641 participants (item and alpha analyses and both oblique and orthogonal exploratory factor analyses) indicates that the PPSDQ has reasonable properties. (SLD)
Descriptors: College Students, Construct Validity, Higher Education, Psychometrics
[Peer reviewed] Ward, Tony; Fon, Christina; Hudson, Stephen M.; McCormack, Julie – Journal of Interpersonal Violence, 1998
Develops a descriptive model utilizing grounded theory to classify sex offenders' cognitions concerning their offending behavior. The model consists of four categories that were tested to determine their content validity and reliability. Results suggest that the model has provisional validity and adequate interrater reliability. Discusses the…
Descriptors: Child Abuse, Classification, Cognitive Processes, Reliability


