Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
[Peer reviewed] Kovac, Ron J. – Educational Research Quarterly, 1989
The construct validity of three spatial ability tests was assessed by determining the correlation between the test results and student class grades in various subjects for 58 eighth graders enrolled at the Burris Laboratory School, Ball State University (Muncie, Indiana). Results do not strongly support any of the tests evaluated. (SLD)
Descriptors: Academic Achievement, Comparative Testing, Construct Validity, Correlation
[Peer reviewed] Radocy, Rudolf E. – Music Educators Journal, 1989
Identifies the underlying concepts of student evaluation. Offers suggestions for evaluating musical achievement. Maintains that all evaluations are subjective, and suggests techniques for minimizing subjectivity. Considers various test formats, and discusses objectives for both classroom and performance achievement. (RW)
Descriptors: Academic Achievement, Elementary Secondary Education, Evaluation Criteria, Evaluation Problems
[Peer reviewed] Benson, Jeri; Rentsch, Joan – Educational and Psychological Measurement, 1988
Confirmatory factor analysis techniques assessed several structural models that have been reported regarding the construct validity of the Piers-Harris Children's Self-Concept Scale. Responses of 885 Black, White, and Hispanic students in grades three-six suggest that the scale's construct validity is a function of content and manner of phrasing.…
Descriptors: Black Students, Child Development, Construct Validity, Elementary Education
[Peer reviewed] Channell, Ron W.; Peek, Michelle S. – Language, Speech, and Hearing Services in Schools, 1989
Thirty-six children, aged four-five, completed four vocabulary measures: Peabody Picture Vocabulary Test-Revised, Picture Vocabulary subtest of the Test of Oral Language Development, Expressive One-Word Picture Vocabulary Test, and Receptive One-Word Picture Vocabulary Test. Only moderate correlations were found among these tests, implying that a…
Descriptors: Correlation, Expressive Language, Handicap Identification, Learning Disabilities
[Peer reviewed] Burston, Jack – Australian Review of Applied Linguistics, 1995
Investigates the empirical validity of the Monash-Melbourne computer adaptive test for French (French CAT). The article focuses on the accuracy of the French CAT as a tool for streaming incoming university students into three levels of a first-year (post-high school) French course. The test is demonstrated to be a good predictor of short-term…
Descriptors: College Students, Comparative Analysis, Computer Assisted Testing, Correlation
[Peer reviewed] Stuart, I. – Journal of Visual Impairment & Blindness, 1995
Tests of a neuropsychological model for spatial orientation in the absence of vision were developed and administered to 31 children with congenital blindness. Results support the neuropsychological model and indicate that some congenitally blind subjects had focal brain damage, sufficient to impair their capacity to be accurately oriented in…
Descriptors: Blindness, Brain Hemisphere Functions, Children, Clinical Diagnosis
[Peer reviewed] Braswell, James S. – Mathematics Teacher, 1992
Described are important changes that will be introduced in the mathematics sections of the new Scholastic Aptitude Test (SAT). The three main changes are (1) permission to use calculators; (2) inclusion of open-ended questions; and (3) content revisions consistent with the National Council of Teachers of Mathematics "Curriculum and Evaluation…
Descriptors: Calculators, Mathematics Achievement, Mathematics Education, Mathematics Skills
[Peer reviewed] Alexander, Patricia A.; Parsons, James L. – Contemporary Education, 1991
Misconceptions about educational testing and school assessment are ingrained in U.S. society and in the knowledge structure of educational professionals. Tests and assessments are highly ethnocentric, ignoring knowledge valued by many cultures. The article examines misconceptions and makes recommendations for an informed discourse on using testing…
Descriptors: Academic Achievement, Achievement Tests, Change Strategies, Cultural Influences
[Peer reviewed] Tombokan-Runtukahu, Juliana; Nitko, Anthony – Research in Developmental Disabilities, 1992
This study delineated procedures for cross-cultural adaptation and operationalization of adaptive behavior in individuals with mental retardation, culturally adapted the Vineland Adaptive Behavior Scale, and investigated the validity of the resulting instrument. The study concluded that the domain of adaptive behavior can be successfully applied…
Descriptors: Adaptive Behavior (of Disabled), Behavior Rating Scales, Cross Cultural Studies, Cultural Influences
[Peer reviewed] Lindblad, Torsten – System, 1992
Looks at the large-scale experiments on the testing of oral proficiency in English, French, and German that have been carried out over the last five years in the Swedish gymnasium. Various kinds of tasks and different grading criteria have been used, and the practical problems of scheduling and of teacher training have been discussed. (nine…
Descriptors: English (Second Language), Foreign Countries, French, German
[Peer reviewed] Isaacson, Stephen L. – Learning Disabilities Research and Practice, 1992
This review of the Test of Early Written Language concludes that the test succeeds in identifying students who are below their peers in writing and in measuring long-term gains in written language achievement, but its format makes it difficult to document specific strengths and weaknesses, and its reliability and validity have not been…
Descriptors: Early Childhood Education, Evaluation Methods, Student Evaluation, Test Reliability
[Peer reviewed] Smith-Sebasto, N. J. – Journal of Environmental Education, 1992
A study reveals the need for extensive refinement of the Revised Perceived Environmental Control Measure, previously purported to be a reliable and valid instrument for measuring the relationship between the psychological construct "locus of control" and environmental action or environmentally responsible behavior. (MCO)
Descriptors: Behavior, Behavioral Science Research, Concurrent Validity, Construct Validity
[Peer reviewed] Groenveld, M.; Jan, J. E. – Journal of Visual Impairment and Blindness, 1992
Analysis of scores of 118 visually impaired children on the Wechsler Intelligence Scale for Children (Revised) and the Wechsler Preschool and Primary Scales of Intelligence (Revised) found a consistent response pattern suggesting that the verbal as well as the performance tests provide useful assessment information. (Author/DB)
Descriptors: Blindness, Cognitive Development, Evaluation Methods, Intelligence
[Peer reviewed] Fidler, James R. – Evaluation and the Health Professions, 1993
Criterion-related validities of 2 laboratory practitioner certification examinations for medical technologists (MTs) and medical laboratory technicians (MLTs) were assessed for 81 MT and 70 MLT examinees. Validity coefficients are presented for both measures. Overall, summative ratings yielded stronger validity coefficients than ratings based on…
Descriptors: Achievement Rating, Certification, Comparative Testing, Credentials
Stake, Robert – Phi Delta Kappan, 1999
Measuring achievement is not the same as measuring the quality of teaching and learning conditions; the validity of the former is not related to the latter. As measures of school improvement, achievement test scores have not been validated. No accumulation of evidence shows assessment to be an indicator of good schooling. (MLH)
Descriptors: Accountability, Achievement Tests, Educational Environment, Educational Quality


