Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedHonaker, L. Michael; And Others – Computers in Human Behavior, 1988
Describes study that compared the administration of the Microtest computer version of the Minnesota Multiphasic Personality Inventory (MMPI) with administration procedures for the traditional paper and pencil version. Highlights include an anxiety inventory; attitude scales; rank order and reliability; correlation analyses; power analyses; and…
Descriptors: Attitude Measures, Comparative Analysis, Computer Assisted Testing, Correlation
Peer reviewedThurlow, Martha L.; And Others – Remedial and Special Education, 1995
This article addresses issues concerned with providing testing accommodations for students with disabilities. It discusses policy and legal considerations, existing standards, research on current practice, and research on technical concerns. It recommends a research program on testing accommodations for students with disabilities. (Author/DB)
Descriptors: Disabilities, Educational Policy, Educational Practices, Elementary Secondary Education
Peer reviewedAustin, J. Sue – Educational and Psychological Measurement, 1992
The efficacy of 5 scales of the Minnesota Multiphasic Personality Inventory-2 (MMPI-2) in detecting fake good, fake bad, and honest profiles was investigated for 110 undergraduate students instructed to fake good, fake bad, or respond honestly. An analysis of variance suggests that these validity scales are useful. (SLD)
Descriptors: Analysis of Variance, Diagnostic Tests, Higher Education, Lying
Peer reviewedForsyth, Robert A. – Educational Measurement: Issues and Practice, 1991
The scales of the National Assessment of Educational Progress (NAEP), as constructed, do not yield meaningful criterion-referenced interpretations. Poorly defined NAEP goals and the present knowledge base do not allow the measurement of what examinees can and cannot do. Inappropriate interpretations of NAEP data are discussed, with specific…
Descriptors: Achievement Tests, Criterion Referenced Tests, Educational Assessment, Item Response Theory
Bilsky-Torna, Zelda – English Teachers' Journal (Israel), 1993
Ten groups (2-5 members each) of tenth-grade Israeli English-as-a-Second-Language students took first a group quiz and then individual quizzes on the same material. Comparison of the results showed that, especially for weaker students, group work and group grades offered some advantages over individual work and assessment. (CNP)
Descriptors: Achievement Tests, Comparative Analysis, English (Second Language), Foreign Countries
Squires, David; Trevisan, Michael S.; Canney, George F. – Studies in Educational Evaluation, 2006
The Idaho Comprehensive Literacy Assessment (ICLA) is a faculty-developed, state-wide, high-stakes assessment of pre-service teachers' knowledge and application of research based literacy practices. The literacy faculty control all aspects of the test, including construction, refinement, administration, scoring and reporting. The test development…
Descriptors: Test Construction, Comparative Testing, Investigations, Test Reliability
Linn, Robert L.; and others – Educ Psychol Meas, 1969
Descriptors: Measurement Techniques, Programing, Test Construction, Test Validity
Schwartz, Mark S.; Ewert, Josephine C. – J Clin Psychol, 1969
Descriptors: Diagnostic Tests, Projective Measures, Psychological Testing, Schizophrenia
Wober, Mallory – Percept Mot Skills, 1969
Descriptors: Ability, Cross Cultural Training, Psychological Testing, Research
BARRITT, LOREN S. – 1967
THE RELEVANCE OF INTELLIGENCE TESTS FOR EDUCATIONAL USES IS CHALLENGED ON TWO GROUNDS--(1) TESTS WHICH MERELY PREDICT THE LIKELIHOOD OF FUTURE SUCCESS DO NOT PROVIDE USEFUL INFORMATION FOR THOSE WHO WISH TO PRESCRIBE TREATMENTS TO ENHANCE PERFORMANCE, AND (2) INTELLIGENCE IS NOT DEFINED AND HENCE THE INTERPRETATION OF SCORES IS MISLEADING. IT IS…
Descriptors: Intelligence, Intelligence Tests, Measurement Objectives, Test Validity
Peer reviewedLukens, John – Journal of School Psychology, 1988
Administered the Stanford-Binet, Fourth Edition, to 31 mentally retarded adolescents who had previously been tested with the Stanford-Binet, L-M, with a mean interval between testings of 17.3 months. Found an intertest correlation of .86 and a median intelligence quotient change of three points in either direction. Compatability of scores supports…
Descriptors: Adolescents, Comparative Testing, Intelligence Tests, Mental Retardation
Peer reviewedAnastasi, Anne – Journal of Counseling & Development, 1985
Describes the role of information on score reliabilities, significance of score differences, intercorrelations of scores, and differential validity of score patterns on the interpretation of results from multiscore batteries. (Author)
Descriptors: Psychological Testing, Scoring, Test Interpretation, Test Reliability
Peer reviewedGutterman, Jo Ellin; And Others – Journal of Visual Impairment and Blindness, 1985
The Perkins-Binet Test of Intelligence for the Blind, Form U; the Wechsler Intelligence Scale for Children-Revised (WISC-R), Verbal Scale; and the Wide Range Achievement Test (WRAT) were administered to 52 low-vision children in the third, fifth, seventh, and ninth grades. Results indicated that the mean ten scores on the two tests of intelligence…
Descriptors: Elementary Education, Intelligence Tests, Partial Vision, Test Validity
Peer reviewedLieberman, R. Jane; Michael, Ann – Journal of Speech and Hearing Disorders, 1986
Three tests of grammatical ability (Carrow Elicited Language Inventory, Test of Language Development, and Clinical Evaluation of Language Functions) were evaluated for content-oriented test construction. Content domains were found deficient when judged against an external standard as well as when examined according to their own content…
Descriptors: Grammar, Language Handicaps, Language Tests, Test Validity
Peer reviewedBolter, John F.; And Others – Journal of Consulting and Clinical Psychology, 1984
Contends that the Speech Sounds Perception Test form (Adult and Midrange versions) is structured such that correct responses can be determined rationally. If a patient identifies and responds according to that structure, the validity of the test is compromised. Posttest interview is suggested as a simple solution. (Author/JAC)
Descriptors: Response Style (Tests), Test Format, Test Validity, Testing Problems

Direct link
