Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Rippey, Robert M.; And Others – Evaluation Quarterly, 1978
Retrospective pretesting is a method for inferring learning by using the students' recollections of their preinstruction knowledge in lieu of actual pretesting. The most significant problem encountered in such a procedure is establishing the validity of the inferred precourse level of information; attempts to deal with that issue are presented.…
Descriptors: Achievement Tests, Cognitive Tests, Higher Education, Knowledge Level
Peer reviewedCohen, Arie; Farley, Frank H. – Educational and Psychological Measurement, 1977
Cross-cultural validity studies for psychological instruments may result in overestimation of structure invariance due to some items being scored on more than one scale. This problem, called the common-item effect, is investigated with some data from the literature. (JKS)
Descriptors: Cross Cultural Studies, Factor Analysis, Item Sampling, Multidimensional Scaling
Peer reviewedStulman, David A.; Dawis, Rene V. – Journal of Vocational Behavior, 1976
Two Minnesota Importance Questionnaire (MIQ) scales, Creativity and Independence were validated by experiment. Subjects (N=68) were exposed to four task conditions representing joint combinations of high or low levels of Creativity and Independence. The behavioral results were consistent with the subjects' MIQ score levels on the two scales,…
Descriptors: Behavior Patterns, College Students, Creativity, Predictive Validity
Halpin, Gerald; And Others – Measurement and Evaluation in Guidance, 1978
Super's Work Values Inventory is utilized in making interindividual and intraindividual comparative interpretations of work values. Internal consistency reliability coefficients for 15 scales and reliabilities of differences between scores on scales were of such a weak magnitude that caution in making interindividual and intrindividual comparisons…
Descriptors: Comparative Analysis, High School Students, Research Projects, Test Reliability
Peer reviewedHilliard, Asa G., III – Negro Educational Review, 1977
Notes that there is no pedagogical or psychological research or evaluation to date to justify the use of norm-referenced standardized tests as precision tools. At best, they are experimental instruments, yet they are used as if they are already proven to be valid. (Author/AM)
Descriptors: Blacks, Educational Practices, Intelligence Tests, Norm Referenced Tests
Peer reviewedKato, Hiroki – System, 1977
The difficulties of assessing and grading fluency in Japanese for advanced students on oral examinations is discussed. Factors of accuracy of grammar and vocabulary, pronunciation, and ease of expression are discussed. (CHK)
Descriptors: Japanese, Language Fluency, Language Tests, Second Language Learning
Peer reviewedWallbrown, Jane D.; And Others – Journal of Clinical Psychology, 1977
The intent of this study was to determine whether the Minnesota Percepto-Diagnostic Test (Fuller, 1969; Fuller & Laird, 1963) is more effective than the Bender-Gestalt (Bender, 1937) with respect to identifying achievement-related errors in visual-motor perception. (Author/RK)
Descriptors: Evaluation Criteria, Hypothesis Testing, Measurement Instruments, Psychological Studies
Peer reviewedEbel, Robert L. – Personnel Psychology, 1977
More precise conception of the functions and limitations of prediction, validation, construct validity, and criterion referenced tests will not solve the problems which must be faced in using tests. It will not repel many of the current attacks on testing. But it should enable test specialists to think more clearly about some of those problems.…
Descriptors: Concept Formation, Criterion Referenced Tests, Employment Qualifications, Predictive Validity
Peer reviewedWitkin, Belle Ruth; And Others – Language, Speech, and Hearing Services in Schools, 1977
Descriptors: Auditory Perception, Group Testing, Language Acquisition, Learning Disabilities
Tonn, Sue; van Kleeck, Anne – Journal of Childhood Communication Disorders, 1986
In order to determine effects of different sequential placement of the expressive language sample during evaluation of young children referred for speech or languge handicap, 27 normal 3-year-olds were evaluated. Length, complexity, or spontaneity were not affected even when the sample was elicited immediately after formal tests requiring little…
Descriptors: Delayed Speech, Language Handicaps, Language Tests, Preschool Children
Ewoldt, Carolyn – Perspectives for Teachers of the Hearing Impaired, 1987
Standardized reading tests are likely to provide an inaccurate assessment of reading comprehension for deaf students due to the lack of test coaching and test taking skills; item irrelevancy; and the difficulty of test directions. Testing alternatives include parent and teacher observation of students and qualitative evaluations of reading skills…
Descriptors: Deafness, Elementary Secondary Education, Reading Comprehension, Reading Tests
Peer reviewedWestbrook, Bert W.; And Others – Measurement and Evaluation in Counseling and Development, 1987
Investigated the reliability and validity of the Revised Research Edition of the Career Planning Questionnaire, an instrument designed to measure six theoretically different aspects of career maturity. Provided a more definitive criterion measure than those used previously, and addressed the issue of alternate forms reliability for two separate…
Descriptors: Career Planning, High Schools, Psychological Testing, Questionnaires
Peer reviewedBarrett, Edwin T., Jr.; Gleser, Goldine C. – Journal of Consulting and Clinical Psychology, 1987
Evaluation of patients on the Cognitive Status Examination (CSE) significantly discriminated the brain-damaged from the psychiatric and medical groups. Examined the relationship of scores to age and education, as well as the effect of demographics on group discrimination. Presents a distribution of cutting scores for screening purposes. (Author/KS)
Descriptors: Cognitive Tests, Emotional Disturbances, Neurological Impairments, Patients
Peer reviewedMealey, Donna L. – Journal of Reading, 1988
Reviews the Learning and Study Strategies Inventory (LASSI), a test designed to measure college students' learning and study strategies through self report. Advises that the test is still in experimental form, with a questionable norm group, and thus must be used with caution, especially in interpreting percentile ranks. (SKC)
Descriptors: Educational Testing, Higher Education, Learning Strategies, Standardized Tests
Peer reviewedCaldwell, JoAnne – Journal of Reading, 1987
Concludes that the test has basic problems in construction, interpretation, validity, and reliability. (FL)
Descriptors: Cognitive Style, Individual Testing, Reading Instruction, Reading Tests


