Peer reviewed: Chang, Lei – Applied Psychological Measurement, 1994
Reliability and validity of 4-point and 6-point scales were assessed using a new model-based approach to fit empirical data from 165 graduate students completing an attitude measure. Results suggest that the issue of four- versus six-point scales may depend on the empirical setting. (SLD)
Descriptors: Attitude Measures, Goodness of Fit, Graduate Students, Graduate Study
Peer reviewed: Grant, Carolyn D.; Nash, Michael R. – Psychological Assessment, 1995
In a counterbalanced, within-subjects, repeated-measures design, 130 undergraduates were administered the Computer-Assisted Hypnosis Scale (CAHS) and the Stanford Hypnotic Susceptibility Scale and were hypnotized. The CAHS was shown to be a psychometrically sound instrument for measuring hypnotic ability. (SLD)
Descriptors: Ability, Clinical Diagnosis, Computer Assisted Testing, Diagnostic Tests
Peer reviewed: Beidel, Deborah C.; And Others – Psychological Assessment, 1995
A new instrument, the Social Phobia and Anxiety Inventory for Children (SPAI-C), was developed. Results from 6 studies with nearly 600 children indicate that the SPAI-C is a reliable and valid measure for childhood social anxiety and fear. It may be useful for improving clinical assessment and documenting treatment outcomes. (SLD)
Descriptors: Anxiety, Children, Clinical Diagnosis, Diagnostic Tests
Peer reviewed: Wrobel, Nancy Howells; Lachar, David – Psychological Assessment, 1995
Minnesota Multiphasic Personality Inventories (MMPIs) were administered to an urban mixed-race sample of 218 adolescent psychiatric patients. Wiggins scale elevations for African Americans and whites were compared, and the validity of the scales was assessed through comparison with parent observations. Implications for the MMPI-Adolescent content…
Descriptors: Adolescents, Blacks, Measurement Techniques, Multiracial Persons
Peer reviewed: Balk, David E. – Death Studies, 1995
Discusses an ethical dilemma that emerged in a study with bereaved college students. The instruments used to gather data clearly elicited grief-related distress, and more bereaved students in control groups left the study than did participants in social support groups. Three alternatives to a traditional control-group design are discussed for…
Descriptors: Bereavement, Case Studies, Control Groups, Death
Peer reviewed: Kamphaus, Randy W.; And Others – Journal of Learning Disabilities, 1991
This study investigated diagnosis of learning problems in 177 boys (ages 6-13) with behavior problems. The standard score discrepancy method and standard score plus low achievement method were more likely to identify children with above-average intelligence quotients as learning disabled, whereas a regression approach identified learning…
Descriptors: Behavior Problems, Educational Diagnosis, Elementary Education, Handicap Identification
Peer reviewed: Bates, John E. – Monographs of the Society for Research in Child Development, 1991
Reactions to the national study of children's problems and competence by Achenbach et al. are offered, with focus on the development of a new survey instrument, the ACQ Behavior Check List. A reply from the researchers is included. (12 references) (LB)
Descriptors: Adolescents, Behavior Problems, Check Lists, Children
Peer reviewed: Cunningham, George K. – Contemporary Education, 1991
Educational reform typically includes modifications in testing. The article discusses misconceptions about testing and reform, noting reform involves changing test structure or increasing amounts of testing. To provide accountability within educational reform, curriculum must be unified nationally, statewide, or locally because students cannot be…
Descriptors: Academic Achievement, Accountability, Achievement Tests, Change Strategies
Peer reviewed: Kunnan, Antony John – Language Testing, 1992
Three analysis procedures were used to study the dependability and validity of ESLPE, a criterion-referenced English-as-a-Second-Language placement test developed at the University of California at Los Angeles in 1989. Findings led to the suggestion that some students might have been differently placed if subtest scores were used for placement. (38…
Descriptors: Cluster Analysis, Comparative Analysis, Criterion Referenced Tests, English (Second Language)
Peer reviewed: Forsyth, Robert A.; And Others – Applied Measurement in Education, 1992
Two criteria defined in previous research that can be used to evaluate the validity of normative data provided for customized tests are discussed. Results of an exploratory investigation of the validity of such data for about 2,500 fifth graders in a 1989 study are reported. (SLD)
Descriptors: Adaptive Testing, Elementary School Students, Evaluation Criteria, Evaluation Methods
Alessi, Stephen M.; Johnson, Lynn A. – Simulation/Games for Learning, 1992
Discussion of the use of simulations for licensure testing highlights the Dental Interactive Simulations Corporation (DISC) project that uses interactive video patient simulations for dental education and licensure. Topics addressed include reliability, validity, test administration issues, effects of fidelity on reliability and validity, and…
Descriptors: Computer Assisted Instruction, Computer Simulation, Dental Students, Dentistry
Peer reviewed: Tirre, William C.; Pena, Carmen M. – Journal of Educational Psychology, 1992
Two experiments with approximately 377 newly enlisted Air Force personnel and 182 college students investigated the validity of a reading span test combining a knowledge verification task with a word memorization task. Results support the hypothesis that word recall reflects the amount of working memory functional in reading. (SLD)
Descriptors: College Students, Comparative Testing, Higher Education, Knowledge Level
Worthen, Blaine R. – Phi Delta Kappan, 1993
Describes how alternative assessment differs from more traditional forms and outlines the forces causing the recent fascination with alternative assessment (demands for accountability, negative consequences of high-stakes testing, and increasing criticisms of standardized tests). Identifies some major issues involving alternative assessment,…
Descriptors: Accountability, Alternative Assessment, Competency Based Education, Elementary Secondary Education
Peer reviewed: Fedoruk, Genevieve M.; Norman, Charles A. – Exceptional Children, 1991
The study evaluated how 21 first grade teachers differed in preferences, requirements, and expectations of students. Teachers ranked 86 student descriptors on a continuum of contributing to either student success or failure. Teachers were found to vary considerably in descriptor rankings, suggesting that teacher variations may be a factor in the…
Descriptors: Grade 1, Individual Differences, Kindergarten, Predictive Measurement
Peer reviewed: Albanese, Mark A. – Academic Medicine, 1991
A study compared student and trained observer ratings of 15 high-rated and 15 low-rated lecturers in a multi-instructor medical course to identify distinguishing delivery characteristics. Student ratings were stable over three years; trained observers discriminated between students' highest- and lowest-rated lecturers. Voice presentation was the…
Descriptors: Faculty Evaluation, Higher Education, Interrater Reliability, Medical Education


