Peer reviewed: Shapiro, Alvin H. – Reading Improvement, 1983
Provides a description of and a rationale for the Shapiro Dyslexia Test, designed to measure poor readers' and nonreaders' alphabet knowledge, letter-sound abilities, letter sound/sequence memory, blending/syllabication prowess, and position-in-space skills. (FL)
Descriptors: Dyslexia, Elementary Education, Reading Diagnosis, Reading Instruction
Peer reviewed: Parish, Thomas S.; Rankin, Charles I. – Educational and Psychological Measurement, 1982
The Nonsexist Personal Attribute Inventory for Children (NPAIC) was administered along with the Piers-Harris scale to children in fifth through eighth grade. A correlation of .49 was found between the two scales. The NPAIC was found to be a reliable, valid self-concept scale for females and males. (Author/GK)
Descriptors: Elementary Secondary Education, Self Concept Measures, Sex Bias, Test Reliability
Peer reviewed: Weldhen, Margaret – Assessment and Evaluation in Higher Education, 1981
Despite the amount of work done in evaluation in the humanities, there has not been enough clarification of the kind of knowledge or experience that the humanities can yield. More of this clarification must be done before developing even more new evaluation techniques. (MSE)
Descriptors: Aesthetic Education, Creativity, Higher Education, Humanities Instruction
Peer reviewed: Weinrott, Mark R.; And Others – Journal of Educational Psychology, 1981
A secondary analysis was conducted to test the validity of five behaviors from five classroom observation systems: approves/praises; asks questions; criticizes/disapproves; gives directions; and presents facts or judgments. Scores for each were intercorrelated and arranged in a multitrait-multimethod matrix. Evidence was found of construct…
Descriptors: Classroom Observation Techniques, Factor Analysis, Junior High Schools, Teacher Behavior
Peer reviewed: Dodrill, Carl B. – Journal of Consulting and Clinical Psychology, 1981
Evaluated the ability of the Wonderlic Personnel Test to replicate the Wechsler Adult Intelligence Scale (WAIS) in a sample of normal persons (N=120) divided into principal and cross-validation groups. Correlations between the Wonderlic IQs and the WAIS Full Scale IQs were .93 for the main group and .91 for the cross-validation group. (Author)
Descriptors: Adults, Age Differences, Comparative Analysis, Intelligence Quotient
Peer reviewed: Ferrari, Michael – Psychology in the Schools, 1980
The Peabody correlated significantly with the McCarthy General Cognitive Index, Verbal Scale, Perceptual Scale, and Memory Scale. A significant difference between the means of the two tests was found, with the Peabody yielding lower scores. (Author)
Descriptors: Autism, Children, Cognitive Measurement, Comparative Testing
Peer reviewed: Frary, Robert B. – Applied Psychological Measurement, 1980
Six scoring methods for assigning weights to right or wrong responses according to various instructions given to test takers are analyzed with respect to expected change scores and the effect of various levels of information and misinformation. Three of the methods provide feedback to the test taker. (Author/CTM)
Descriptors: Guessing (Tests), Knowledge Level, Multiple Choice Tests, Scores
Peer reviewed: Smith, Cyrus F.; Western, Richard D. – Reading World, 1980
Reports that tenth-grade students far exceeded chance mean scores on the Stanford Test of Academic Skills when given passage-out components (the publisher's questions without the accompanying reading passages); concludes that the results lend support to concerns regarding the valid measurement of reading comprehension. (GT)
Descriptors: Reading Comprehension, Reading Research, Reading Tests, Secondary Education
Peer reviewed: Rimm, Sylvia; Davis, Gary A. – Journal of Creative Behavior, 1976
Findings indicate that an inventory that predicts creativity can be constructed for a wide age range of elementary school children. GIFT is easy to administer and score, and correlates with a composite creativity criterion, art teacher nominations, and scores on a Uses test. (Author/RK)
Descriptors: Correlation, Creativity, Educational Testing, Measurement Instruments
Peer reviewed: Mathewson, Peter D. – Journal of Consulting and Clinical Psychology, 1977
Navy enlisted personnel (N=60) were administered the Recall scale of the Kahn Intelligence Test (Experimental Form; KIT) and the Digit Span subtest of the Wechsler Adult Intelligence Scale (WAIS). Scores for the KIT tasks indicate a significant transfer of data to long-term memory. (Author)
Descriptors: Comparative Analysis, Intelligence Tests, Psychological Testing, Research Projects
Peer reviewed: Macmann, Gregg M.; Barnett, David W. – School Psychology Quarterly, 1997
Used computer simulation to examine the reliability of interpretations for Kaufman's "intelligent testing" approach to the Wechsler Intelligence Scale for Children (3rd ed.) (WISC-III). Findings indicate that factor index-score differences and other measures could not be interpreted with confidence. Argues that limitations of IQ testing…
Descriptors: Elementary Secondary Education, Evaluation Problems, Intelligence, Intelligence Quotient
Peer reviewed: Levinson, Edward M.; Zeman, Heather L.; Ohler, Denise L. – Career Development Quarterly, 2002
Assesses the reliability and validity of the Web-based version of the Career Key. Participants completed the Web-based version of the Career Key and the Self-Directed Search-Form R and completed a second Career Key administration 2 weeks later. Test-retest reliability ranged between .75 and .84. With the exception of the conventional scale, all…
Descriptors: Career Counseling, Computer Assisted Testing, Concurrent Validity, Test Reliability
Peer reviewed: Frederiksen, John R.; Collins, Allan – Educational Researcher, 1989
Proposes a systemically valid testing system that induces curricular and instructional changes in education systems to foster the development of the cognitive traits that tests are designed to measure. Analyzes test characteristics and outlines the principles of a systemically valid testing system. (FMW)
Descriptors: Cognitive Tests, Educational Change, Outcomes of Education, Systems Approach
Peer reviewed: Sturmey, Peter – Journal of Autism and Developmental Disorders, 1994
This paper reviews the psychometric properties, treatment utility, and conceptual basis of instruments used to identify the functions of aberrant behaviors in people with developmental disabilities. Instruments include the Motivational Assessment Scale, Motivation Analysis Rating Scale, Functional Analysis Interview Form, and Functional Analysis…
Descriptors: Behavior Problems, Developmental Disabilities, Evaluation Methods, Motivation
Peer reviewed: McKendy, Thomas – Research in the Teaching of English, 1992
Offers a brief history of an attempt to establish predictive test validity with holistic scoring of writing tests. Asserts that holistically scored writing samples may be practical for some local purposes, but they must be used judiciously. Suggests monitoring the validity of local tests with statistical packages and using teacher ratings as…
Descriptors: Higher Education, Holistic Approach, Holistic Evaluation, Test Validity


