King, Harry A. – Research Quarterly, 1978
Some statistical considerations in applying survey sampling methods to small populations are explored. (DS)
Descriptors: Error of Measurement, Program Development, Reliability, Sampling
Peer reviewed: Schell, Leo – Reading World, 1979
Suggests that there are major issues involved in using criterion-referenced reading tests that are unclear or unresolved, including their place in the system of teaching reading, their validity, their reliability, and how cut-off scores are set and evaluated. (TJ)
Descriptors: Criterion Referenced Tests, Elementary Education, Reading Tests, Test Reliability
Peer reviewed: Hoffman, Kaaren I.; Lundberg, George D. – Educational and Psychological Measurement, 1976
Conventional paper-and-pencil testing was compared to computer-assisted testing on a pharmacy school pathology test. Individual items were speeded in the computer-assisted mode. In addition to responses, the number and pattern of changes in responses were analyzed. True-false, multiple-choice and matching items were used. (JKS)
Descriptors: Analysis of Variance, Computer Assisted Instruction, Measurement Techniques, Medical Education
Peer reviewed: Booth, Richard F.; Norton, Richard S. – Educational and Psychological Measurement, 1976
Usefulness of the Comrey Validity Check scale score as an index of random or indiscriminate responding to the Comrey Personality Scales was evaluated on a large sample of Navy paramedical personnel. Tables are presented and implications are discussed. (Author/JKS)
Descriptors: Personality Assessment, Predictor Variables, Response Style (Tests), Test Reliability
Peer reviewed: Blankstein, Kirk R. – Journal of Clinical Psychology, 1976
To determine the relationship between Spielberger's measure of trait anxiety and social-interpersonal vs. physical danger trait anxiety, Ss were administered the trait scale of the State-Trait Anxiety Inventory (STAI) and Lykken's Activity Preference Questionnaire (APQ). (Editor)
Descriptors: Anxiety, Measurement Instruments, Personality Assessment, Psychological Studies
Peer reviewed: Shavelson, Richard; Dempsey, Nancy – Review of Educational Research, 1976
Evidence is gathered on the generalizability of measures of teacher behavior to help resolve the issue of whether the absence of clear, replicable relationships between teacher behavior and student outcomes was due to measurement problems or problems in conceptualization. Generalizability theory seemed particularly well suited but problems in…
Descriptors: Academic Achievement, Literature Reviews, Measurement, Observation
Peer reviewed: Singleton, Royce, Jr.; Christiansen, John B. – Sociology and Social Research, 1977
The FEM Scale, a Likert-type measure of attitudes toward feminism, was validated with data from a heterogeneous sample, which indicated that the FEM Scale is highly reliable, contains a single factor accounting for 38 percent of total variance, and correlates with measures of anti-black prejudice, dogmatism, and identification with the Women's Movement.…
Descriptors: Attitudes, Evaluation, Females, Measurement Instruments
Peer reviewed: Campbell, John B.; Chun, Ki-Taek – Applied Psychological Measurement, 1977
A multiple regression approach is used to assess the feasibility of reciprocal prediction between the Sixteen Personality Factor Questionnaire scales and the California Psychological Inventory scales (i.e., the prediction of each 16PF scale from the CPI scales and of each CPI scale from the 16PF scales). (RC)
Descriptors: Correlation, Multiple Regression Analysis, Personality Measures, Prediction
Peer reviewed: Moreland, John R.; Liss-Levinson, Nechama – Applied Psychological Measurement, 1977
One reason the fear of success (FOS) literature is confusing and contradictory is that the FOS construct is not being reliably measured across studies. Current scoring guidelines are insufficient for ensuring the reliable measurement of this construct. (RC)
Descriptors: Fear, Fear of Success, Females, Imagery
Peer reviewed: Carey, Tracy; Reid, Graham; Ruggiero, Laurie; Horner, James; Dubow, Eric – Assessment, 1997
The internal consistency and validity of the Diabetes Regimen Responsibility Scale (DRRS) (L. Ruggiero and others, 1991) were examined in a sample of 49 youths. The DRRS demonstrated adequate internal consistency, and most subscales correlated significantly with diabetes knowledge (health education issue). Only two reports correlated with…
Descriptors: Diabetes, Health Education, Knowledge Level, Physical Health
Peer reviewed: Parsons, Elizabeth; Betz, Nancy E. – Journal of Career Assessment, 1998
One group of 113 college students took the Skills Confidence Inventory twice in three weeks; 218 took it once. Test-retest reliability and content validity were supported by the results. Confirmatory factor analyses suggested the inventory's fit with Holland's six-factor structure. (SK)
Descriptors: College Students, Content Validity, Expectation, Higher Education
Peer reviewed: Chang, Hua-Hua – Psychometrika, 1996
H. H. Chang and W. F. Stout (1993) presented a derivation of the asymptotic posterior normality of the latent trait given examinee responses under nonrestrictive nonparametric assumptions for dichotomous item response theory (IRT) models. This paper presents an extension of their results to polytomous IRT models and defines a global information…
Descriptors: Classification, Equations (Mathematics), Item Response Theory, Mathematical Models
Peer reviewed: Perry, Cheryl L.; Komro, Kelli A.; Jones, Resa M.; Munson, Karen; Williams, Carolyn L.; Jason, Leonard – Journal of Child and Adolescent Substance Abuse, 2002
The objective of this study was to create an Adolescent Wisdom Scale, based on Jason et al.'s Functional Value Scale. The scale was found to have high internal consistency and three subscales which were significantly associated with less involvement with alcohol use, cigarette use, and violent behaviors. (Contains 27 references and 6 tables.) (GCP)
Descriptors: Adolescents, Behavior Problems, Measures (Individuals), Psychometrics
Peer reviewed: Teesson, Kathryn; Packman, Ann; Onslow, Mark – Journal of Speech, Language, and Hearing Research, 2003
This study examined intrajudge and interjudge agreement for the Lidcombe Behavioral Data Language (LBDL), a behaviorally based stuttering taxonomy. Ten experienced speech language pathologists and 10 undergraduates applied the LBDL to stuttered speech on two occasions. Intrajudge agreement was high for both groups, but only the experienced judges…
Descriptors: Adults, Classification, Reliability, Speech Evaluation
Peer reviewed: Feldt, Leonard S. – Applied Measurement in Education, 2002
Considers the situation in which content or administrative considerations limit the way in which a test can be partitioned to estimate the internal consistency reliability of the total test score. Demonstrates that a single-valued estimate of the total score reliability is possible only if an assumption is made about the comparative size of the…
Descriptors: Error of Measurement, Reliability, Scores, Test Construction


