Publication Date
| In 2026 | 3 |
| Since 2025 | 666 |
| Since 2022 (last 5 years) | 3167 |
| Since 2017 (last 10 years) | 7408 |
| Since 2007 (last 20 years) | 15046 |
Descriptor
| Test Reliability | 15036 |
| Test Validity | 10272 |
| Reliability | 9759 |
| Foreign Countries | 7141 |
| Test Construction | 4823 |
| Validity | 4191 |
| Measures (Individuals) | 3877 |
| Factor Analysis | 3825 |
| Psychometrics | 3525 |
| Interrater Reliability | 3124 |
| Correlation | 3039 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 709 |
| Practitioners | 451 |
| Teachers | 208 |
| Administrators | 122 |
| Policymakers | 66 |
| Counselors | 42 |
| Students | 38 |
| Parents | 11 |
| Community | 7 |
| Support Staff | 6 |
| Media Staff | 5 |
| More ▼ | |
Location
| Turkey | 1327 |
| Australia | 436 |
| Canada | 379 |
| China | 368 |
| United States | 271 |
| United Kingdom | 256 |
| Indonesia | 252 |
| Taiwan | 234 |
| Netherlands | 223 |
| Spain | 216 |
| California | 214 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 8 |
| Meets WWC Standards with or without Reservations | 9 |
| Does not meet standards | 6 |
Peer reviewedPutnam, Steven H.; And Others – Psychological Assessment, 1992
Difficulties inherent in differentiating practice effects from meaningful change in neuropsychological retest data are illustrated in this case study of a personal injury case. Although the patient demonstrated substantial gains on the Wechsler Adult Intelligence Scale Revised, most of the tests given on successive days did demonstrate acceptable…
Descriptors: Case Studies, Change, Court Litigation, Intelligence Tests
Peer reviewedRieber, Lloyd – College Teaching, 1993
The use of paraprofessional editors to evaluate student writing, particularly in large college classes, allows teachers to give students more writing practice, provides more individual assistance for students, and helps teachers gain insight into student needs. Adoption of uniform criteria for evaluation also provides consistency and objectivity.…
Descriptors: Classroom Techniques, College Instruction, Editors, Evaluation Criteria
Peer reviewedThomas, Volker; Olson, David H. – Journal of Marital and Family Therapy, 1993
Examined validity, reliability, and curvilinearity of Clinical Rating Scale and tested scale's ability to discriminate between clinical families and nonclinical families on family cohesion, adaptability, and communication. Data from two groups of problem families and two control groups supported curvilinear hypothesis that problem families are…
Descriptors: Adjustment (to Environment), Family Counseling, Family Problems, Family Relationship
Peer reviewedAnglin, M. Douglas; And Others – Evaluation Review, 1993
Reliability and validity of self-reported behavior within a deviant population are examined using data from 2 interviews with 323 narcotics addicts conducted 10 years apart (1974-75 and 1985-86). Results complement existing reliability and validity studies of alcohol use, and suggest that quality information can be obtained from heroin users. (SLD)
Descriptors: Comparative Testing, Drinking, Drug Addiction, Evaluation Methods
Peer reviewedMarsh, Herbert W.; Roche, Lawrence A. – Australian Journal of Education, 1992
In five studies, the applicability of two North American instruments for student evaluation of teacher performance was assessed at the University of Western Sydney (Macarthur, Australia). Results of a multitrait-multimethod analysis indicate the instruments are applicable at this institution and across diverse educational settings. (Author/MSE)
Descriptors: Cultural Context, Foreign Countries, Higher Education, Organizational Climate
Peer reviewedFletcher, Jack M.; And Others – Journal of Learning Disabilities, 1991
For successful classification of children with attention deficit-hyperactivity disorder, major issues include (1) the need for explicit studies of identification criteria; (2) the need for systematic sampling strategies; (3) development of hypothetical classifications; and (4) systematic assessment of reliability and validity of hypothetical…
Descriptors: Attention Deficit Disorders, Classification, Elementary Secondary Education, Handicap Identification
Rolfe, John – Simulation/Games for Learning, 1991
Discusses the need to evaluate the effectiveness of games and simulations that are used for training and educational purposes. Evaluation criteria are described, including validity, reliability, and utility; methods of measuring training device effectiveness are explained; and problems encountered with evaluations are discussed. (30 references)…
Descriptors: Educational Games, Evaluation Criteria, Evaluation Needs, Evaluation Problems
Peer reviewedRothman, A. I.; And Others – Academic Medicine, 1991
A 1990 study of domain-referenced scores from a multiple-station clinical examination for foreign medical graduates investigated identification of essential checklist items, setting of minimum passing scores, consistency of candidate classification, and perceived appropriateness of the number of candidates classified as competent. Results and…
Descriptors: Foreign Medical Graduates, Higher Education, Medical Education, Medical Evaluation
Peer reviewedHenry, Rachael M. – Educational and Psychological Measurement, 1991
Logical difficulties with existing measures of construct implications are examined, and a new instrument that partially overcomes them--the Logical Relations Grid--is described. Empirical data from a study of 28 children and 47 parents in Australia are given in support of instrument reliability and validity. (SLD)
Descriptors: Cognitive Processes, Construct Validity, Elementary School Students, Foreign Countries
Cizek, Gregory J. – Phi Delta Kappan, 1991
This rejoinder to Grant Wiggins on performance assessment suggests that true educational reform will undoubtedly be evidenced by something more substantial than pocket folders bulging with student work. Labeling performance tests "authentic" does not ensure their validity, reliability, or incorruptibility. Such tests are neither replacements nor…
Descriptors: Elementary Secondary Education, Multiple Choice Tests, Performance Based Assessment, Pilot Projects
Peer reviewedCohen, Robert; And Others – Academic Medicine, 1991
The performance of foreign medical school graduates on multistation standardized patient-based tests was used to determine the validity and generalizability of global ratings of their clinical competence made by expert examiners. Results suggest that these ratings can be used as an effective form of assessment in this context. (Author/MSE)
Descriptors: Foreign Medical Graduates, Higher Education, Holistic Approach, Medical Education
Peer reviewedLittlefield, John H.; And Others – Academic Medicine, 1991
Interrater reliability in numerical ratings of clerkship performance (n=1,482 students) in five surgery programs was studied. Raters were classified as accurate or moderately or significantly stringent or lenient. Results indicate that increasing the proportion of accurate raters would substantially improve the precision of class rankings. (MSE)
Descriptors: Academic Achievement, Clinical Experience, Evaluation Criteria, Higher Education
Peer reviewedFeil, Edward G.; Becker, Wesley C. – Behavioral Disorders, 1993
The Walker/Severson Systematic Screening for Behavior Disorders measure was revised for use with preschool children. The revision consists of three hierarchical stages of increasingly time-consuming methodologies: (1) teacher rankings, (2) teacher ratings, and (3) direct behavioral observations. Testing with 121 children demonstrated significant…
Descriptors: Behavior Disorders, Behavior Rating Scales, Preschool Children, Preschool Education
Putnam, Frank W.; And Others – Child Abuse and Neglect: The International Journal, 1993
Evaluation of the Child Dissociative Checklist found it to be a reliable and valid observer report measure of dissociation in children, including sexually abused girls and children with dissociative disorder and with multiple personality disorder. The checklist, which is appended, is intended as a clinical screening instrument and research measure…
Descriptors: Check Lists, Children, Emotional Disturbances, Psychological Evaluation
Peer reviewedZhi-Cheng, Dong; Collis, Betty – Journal of Educational Technology Systems, 1994
Discusses the portability of a Canadian-made educational simulation software package, "The Electronics Workbench," to China that was part of a larger study conducted at the University of Twente (The Netherlands). Evaluation results of the software use in China are presented, including functionality for electronics education, ease of use,…
Descriptors: Computer Assisted Instruction, Computer Simulation, Courseware, Efficiency


