Hubbard, J. I.; Seddon, G. M. – British Educational Research Journal, 1989 (peer reviewed)
Reports on a case study designed to determine whether marking standards and reliability of assessments change significantly from one occasion to another. Finds a sex bias occurs with both men and women judges, leading to significant changes in the measures for girls, but not for boys. (KO)
Descriptors: Case Studies, Educational Research, Females, Grading
Sawyer, Richard; And Others – College and University, 1989 (peer reviewed)
A study attempted to verify earlier research results suggesting that college-bound students' self-reporting of coursework and grades is relatively accurate. It also examined whether different subgroups of students (categorized by sex, racial-ethnic group, ability level, educational level, and date tested) varied significantly in their reporting…
Descriptors: Academic Records, College Admission, College Bound Students, College Entrance Examinations
Cizek, Gregory J. – Educational Measurement: Issues and Practice, 1988 (peer reviewed)
Sources of current misuse of standardized tests in assessing the quality of home-based educational programs are identified. Development of new instruments and cooperation of concerned groups are suggested as a means of increasing educational alternatives, excellence, and accountability. (Author/TJH)
Descriptors: Educational Legislation, Educational Quality, Elementary Secondary Education, Home Schooling
Shaw, Brian F.; Dobson, Keith S. – Journal of Consulting and Clinical Psychology, 1988 (peer reviewed)
Reviews several scales used to evaluate competency of psychotherapists. Discusses concerns about interrater reliability and predictive validity of scales. Considers competency a state-like variable, with therapists demonstrating higher competence when they skillfully treat patients across range of difficulty levels. Contends that development of…
Descriptors: Competence, Counselor Evaluation, Counselor Qualifications, Evaluation Criteria
Ross, Thomas J.; Spencer, Farida – Career Development Quarterly, 1988 (peer reviewed)
Examined utility of My Vocational Situation (MVS) in identifying career decision difficulties within population of adult psychiatric patients (N=300). Found that MVS could successfully identify those patients in need of vocational training and counseling. (Author/NB)
Descriptors: Adults, Career Choice, Career Counseling, Client Characteristics (Human Services)
Fuchs, Douglas; Fuchs, Lynn S. – Exceptional Children, 1989 (peer reviewed)
Presented is a quantitative synthesis of examiner familiarity effects on Caucasian and minority students' test performance. Fourteen controlled studies were coded in terms of methodological quality and race-ethnicity. Caucasian students performed similarly in both familiar and unfamiliar examiner conditions, while Black and Hispanic children…
Descriptors: Blacks, Comparative Analysis, Elementary Secondary Education, Examiners
Buck, Gary – ELT Journal, 1989 (peer reviewed)
Examination of the reliability and validity of paper-and-pencil pronunciation tests of English as a second language in Osaka (Japan) showed very low reliability. Correlations with more direct measures of pronunciation showed very low validity of written pronunciation tests. Sample tests are appended. (Author/CB)
Descriptors: English (Second Language), Foreign Countries, Higher Education, Language Tests
Feletti, Grahame; Ryan, Greg – Assessment & Evaluation in Higher Education, 1994 (peer reviewed)
The Triple Jump, a procedure for assessing students' problem-based learning, is applied to assessment of inquiry-based learning in a graduate course. Results suggest the need for more research into interrater reliability and other characteristics of the exercise. Some simple strategies for making the instrument cost effective are offered. (MSE)
Descriptors: Evaluation Methods, Graduate Study, Higher Education, Independent Study
Fishkin, Anne S.; And Others – Roeper Review, 1996 (peer reviewed)
This study investigated patterns of Wechsler Intelligence Scale for Children (WISC) Third Edition subtest scores for 42 gifted children in grades 4-8. Variability from subtest means was highest on Similarities, Comprehension, Coding, and Symbol Search subtests. Significant weaknesses were found on the Block Design subtest, seen as a peak subtest…
Descriptors: Ability Identification, Cluster Analysis, Elementary Secondary Education, Gifted
McAlpine, Lynn; Weston, Cynthia – Performance Improvement Quarterly, 1994
Describes a list of attributes of instructional materials based on a review of instructional design literature that was validated and found reliable through a series of studies. Four categories of attributes are presented: (1) instructional design; (2) language, including semantic and syntactic structures; (3) presentation, including layout; and…
Descriptors: Content Analysis, Instructional Design, Instructional Material Evaluation, Instructional Materials
Swiezy, Naomi B.; And Others – Research in Developmental Disabilities, 1995 (peer reviewed)
The validity of the schizophrenia subscale of the Psychopathology Instrument for Mentally Retarded Adults (PIMRA) was evaluated with 65 adults having mild to moderate mental retardation as well as either schizophrenia, depression, or no psychopathology. Univariate and multivariate analyses were conducted, as were interrater reliability analyses.…
Descriptors: Adults, Clinical Diagnosis, Depression (Psychology), Disability Identification
Reid, Denise T.; Renwick, Rebecca M. – International Journal of Rehabilitation Research, 1994 (peer reviewed)
A new questionnaire instrument, the Life Satisfaction Index for Adolescents (LSIA), has been developed for adolescents with Duchenne muscular dystrophy (DMD). This article reviews the conceptual basis of the LSIA, its development, and its reliability and validity (established with 15 male adolescents with DMD). (DB)
Descriptors: Adolescents, Attitudes, Life Satisfaction, Males
Villar, Irene de Aquino; And Others – Educational and Psychological Measurement, 1995 (peer reviewed)
Exploratory and confirmatory factor analyses provide evidence for the construct validity of the five subscales of a parallel Portuguese language version of the Dimensions of Self-Concept scale Form H. Subjects were 159 female and 236 male Brazilian college students. (SLD)
Descriptors: College Students, Construct Validity, Females, Foreign Countries
Records, Nancy L.; Tomblin, J. Bruce – Journal of Speech and Hearing Research, 1994 (peer reviewed)
The diagnostic decision-making standards used by practicing clinicians to determine language impairment were investigated. Results showed significant interrater agreement among the 27 clinicians' decisions and moderate intrarater reliability within each clinician's decisions. Most of the clinicians' diagnostic decision-making standards could be modeled…
Descriptors: Clinical Diagnosis, Decision Making, Disability Identification, Evaluation Methods
El-Khawas, Elaine – Higher Education Management, 1995 (peer reviewed)
A discussion of external evaluation of a college or university program or unit looks at two different approaches used in the United States for accreditation purposes--standards of good practice and competency-based learning and review. Methods of maintaining consistency and eliminating bias are examined briefly. (MSE)
Descriptors: Bias, College Administration, Educational Quality, Evaluation Criteria


