Publication Date
| In 2026 | 3 |
| Since 2025 | 675 |
| Since 2022 (last 5 years) | 3176 |
| Since 2017 (last 10 years) | 7417 |
| Since 2007 (last 20 years) | 15055 |
Descriptor
| Test Reliability | 15043 |
| Test Validity | 10279 |
| Reliability | 9761 |
| Foreign Countries | 7144 |
| Test Construction | 4825 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3526 |
| Interrater Reliability | 3124 |
| Correlation | 3040 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1328 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 253 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 217 |
| California | 215 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedBachman, Lyle F. – Annual Review of Applied Linguistics, 1989
Applied linguistics and psychometrics have influenced language testing, providing additional tools for investigating factors affecting language test performance and assuring measurement reliability. An examination is presented of language testing, including the theoretical issues involved, the methodological advances, language test development,…
Descriptors: Applied Linguistics, Evaluation Methods, Language Proficiency, Language Tests
Peer reviewedWatson, Jane M. – Journal of Educational Psychology, 1988
The Achievement Anxiety Test's dimensionality was assessed using data from 378 university students. Analyses suggest the viability of a unidimensional construct, whose ability to provide extreme subject groups showing differences on other characteristics of academic achievement was assessed. Such a scale has potential for separating…
Descriptors: Academic Achievement, College Students, Factor Analysis, Higher Education
Peer reviewedSwanson, Jane L.; Hansen, Jo-Ida C. – Journal of Vocational Behavior, 1988
Investigated long-term stability of vocational interests in 409 college freshmen tested with Strong-Campbell Interest Inventory as freshmen in 1974, four years later (N=204), and in 1986. Results revealed remarkable degree of interest stability over all three time intervals and individual differences in stability over time. (Author/NB)
Descriptors: Adults, College Freshmen, Followup Studies, Higher Education
Peer reviewedEdgeman, Rick L. – Innovative Higher Education, 1988
Recent establishment of degree programs in quality and reliability at several leading American academic institutions provides evidence that quality is making ivory tower in-roads. One critical difference in the orientations of U.S. and Japanese firms is in the shouldering of responsibility for product/service quality and reliability. (MLW)
Descriptors: Business Administration, Competition, Economics, Engineering
Peer reviewedGeruschat, D. R.; De l'Aune, W. – Journal of Visual Impairment and Blindness, 1989
The study assessed the efficacy of a method of quantifying observations of blind clients made by orientation and mobility instructors. Client problems were observed for street crossings, bumps, stumbles, orientation, and drop-offs. (DB)
Descriptors: Blindness, Evaluation Methods, Naturalistic Observation, Rehabilitation
Peer reviewedPease, Damaris; And Others – Educational and Psychological Measurement, 1989
Test-retest reliability of the Q-Sort Inventory of Parenting Behaviors was obtained with a sample of 30 mothers of 3-year-olds. The within-subject correlation, with 2 weeks between test administrations, for sorting the test statements was 0.72. Reliable differences among the respondents were responsible for 35% of the variance. (SLD)
Descriptors: Adults, Behavior Patterns, Measures (Individuals), Mothers
Peer reviewedWilliams, Steven L.; And Others – Remedial and Special Education (RASE), 1989
General/special education secondary teachers (N=183) and nonhandicapped and handicapped students (N=437) were surveyed on perceptions of the importance of social skills in relating to others, relating to adults, and relating to oneself. Discussed are the survey instrument's reliability, importance ratings for skills, and levels of agreement…
Descriptors: Adolescents, Disabilities, Interpersonal Competence, Secondary Education
Peer reviewedHaladyna, Thomas M.; Downing, Steven M. – Applied Measurement in Education, 1989
A taxonomy of 43 rules for writing multiple-choice test items is presented, based on a consensus of 46 textbooks. These guidelines are presented as complete and authoritative, with solid consensus apparent for 33 of the rules. Four rules lack consensus, and 5 rules were cited fewer than 10 times. (SLD)
Descriptors: Classification, Interrater Reliability, Multiple Choice Tests, Objective Tests
Peer reviewedGuglielmino, Lucy M.; And Others – Adult Education Quarterly, 1989
Responding to Field's criticism of the Self-Directed Learning Readiness Scale (SDLRS), Guglielmino cites evidence supporting the scale's validity and reliability, Long notes gaps in Field's literature review, and McCune critiques Field's approach to factor analysis in testing the scale. (SK)
Descriptors: Adult Learning, Evaluation Methods, Evaluation Problems, Factor Analysis
Peer reviewedMelancon, Janet G.; Thompson, Bruce – Psychology in the Schools, 1989
Investigated measurement characteristics of both forms of Finding Embedded Figures Test (FEFT). College students (N=302) completed both forms of FEFT or one form of FEFT and Group Embedded Figures Test. Results suggest that FEFT forms provide reasonable reliable and valid data. (Author/NB)
Descriptors: College Students, Field Dependence Independence, Higher Education, Multiple Choice Tests
Peer reviewedMunger, Gail F.; And Others – Research in Developmental Disabilities, 1989
A study of the reliability of teacher's interpretations of graphed performance data on students with moderate to profound mental retardation revealed that teacher judgments are consistent and accurate for continuous improvement in performance, but less consistent for variable performance. (MSE)
Descriptors: Data Interpretation, Decision Making, Graphs, Mental Retardation
Peer reviewedHolden, Ronald R.; And Others – Journal of Consulting and Clinical Psychology, 1988
Used 112 adult patients from three psychiatric facilities to examine psychometric properties of Basic Personality Inventory scales. Found no differences across facilities; sex differences on some scales. Scales appeared to be internally consistent. Using clinical staff ratings as criteria, scales were found to possess both convergent and…
Descriptors: Adults, Foreign Countries, Mental Disorders, Personality Measures
Peer reviewedHill, Clara E.; And Others – Journal of Counseling Psychology, 1988
Outlined method for studying rater bias in counseling and psychotherapy. Used method to study three potential sources of rater bias concerning characteristics of rater, client, and therapist. Examined ratings on Collaborative Study Psychotherapy Rating Scale for 826 sessions of psychotherapy in Treatment of Depression Collaborative Research…
Descriptors: Bias, Client Characteristics (Human Services), Congruence (Psychology), Counselor Characteristics
Menchetti, Bruce M.; Rusch, Frank R. – American Journal on Mental Retardation, 1988
Test-retest reliability, internal consistency, and validity of the Vocational Assessment and Curriculum Guide with mentally retarded and nonretarded subjects having different employment characteristics was investigated. Empirical validation results suggested that domain scores differentiated between retarded subjects with only sheltered work…
Descriptors: Behavior Rating Scales, Curriculum, Employment Experience, Mental Retardation
Peer reviewedSegall, Daniel O. – Psychometrika, 1994
An asymptotic expression for the reliability of a linearly equated test is developed using normal theory. Reliability is expressed as the product of test reliability before equating and an adjustment term that is a function of the sample sizes used to estimate the linear equating transformation. The approach is illustrated. (SLD)
Descriptors: Equated Scores, Error of Measurement, Estimation (Mathematics), Sample Size


