Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
Peer reviewed: Milner, Joel S. – Early Child Development and Care, 1989
Describes the Child Abuse Potential Inventory and associated psychometric data. Discusses uses and misuses of the inventory, general test limitations, and screening instruments available to professionals concerned with child maltreatment. (RJC)
Descriptors: Child Abuse, Evaluation Methods, Measurement Techniques, Test Interpretation
Peer reviewed: Seymour, Richard T. – Journal of Vocational Behavior, 1988
Argues that occupational tests can exclude racial minorities and that many industrial psychologists have overlooked evidence that many tests are biased and that some claims for validity generalization are based on faulty science. Outlines what plaintiff's counsel looks for in deciding to try a testing case, and provides primer on how to challenge…
Descriptors: Court Litigation, Employment Practices, Generalization, Minority Groups
Peer reviewed: Brandt, Ron – Educational Leadership, 1992
Assessment reforms might fail because high stakes will be attached to them too soon, judgments will be unreliable, and lawsuits will occur. Educators need to consider technical issues including score reliability, task validity, portfolio sampling, and generalizability problems. U.S. schools would improve considerably if national Advanced Placement…
Descriptors: Advanced Placement, Elementary Secondary Education, Generalization, Performance Based Assessment
Peer reviewed: Leki, Ilona – WPA: Writing Program Administration, 1991
Discusses one attempt to find innovative answers for questions regarding advanced English-as-a-Second-Language writing-placement testing. (MG)
Descriptors: English (Second Language), Higher Education, Test Validity, Testing Problems
Peer reviewed: Janikowski, Timothy P.; And Others – Rehabilitation Counseling Bulletin, 1991
Examined validity of Microcomputer Evaluation Screening and Assessment (MESA) aptitude scores relative to General Aptitude Test Battery (GATB) using multitrait-multimethod correlational analyses. Findings from 54 rehabilitation clients and 29 displaced workers revealed no evidence to support the construct validity of the MESA. (Author/NB)
Descriptors: Aptitude Tests, Computer Assisted Testing, Construct Validity, Dislocated Workers
Peer reviewed: Hendel, Darwin D. – Educational and Psychological Measurement, 1991
Correlations among measures of college outcomes were examined using 118 college students. Instruments studied were the College Outcome Measures Program; the Educational Testing Service Academic Profile; the Defining Issues Test; and the author-developed Sophomore Assessment Project Questionnaire. Commonly used measures tap similar outcome…
Descriptors: Academic Achievement, College Students, Comparative Testing, Higher Education
Peer reviewed: Robinson, Nancy M.; And Others – Intelligence, 1990
The validity of the fourth edition of the Stanford-Binet (S-B IV) test was studied with 30 linguistically precocious children at ages 20, 24, and 30 months. Validity at 24 months was questionable. Problems in using the test with very young children are discussed. (SLD)
Descriptors: Age Differences, Child Development, Cognitive Processes, Intelligence Tests
Peer reviewed: Farr, Roger; Greene, Beth – Educational Horizons, 1993
A review of public demand for accountability uncovers three types of educational assessment problems: demand for valid reading measures, need for a broader range of assessments, and value of assessments for various audiences. Integration of the various types of assessments is recommended. (SK)
Descriptors: Accountability, Educational Assessment, Political Influences, Reading Tests
Peer reviewed: Watkins, Marley W. – School Psychology Quarterly, 2000
Reviews the results of four studies included in this issue of "School Psychology Quarterly" which found all four cognitive profile reports lacking reliability, validity, or diagnostic utility. Argues that ipsative methods are inferior to normative methods in cognitive assessment. Recommends that psychologists eschew the application of…
Descriptors: Clinical Diagnosis, Cognitive Measurement, Intelligence Tests, Profiles
Olson, Allan – American School Board Journal, 2000
The Northwest Evaluation Association, serving over 300 U.S. school districts, is developing an Internet-enabled assessment system that adapts questions to each student's performance. Shorter, adaptive tests help students avoid frustrations or boredom caused by too-difficult or -easy questions. Scores are as valid as traditional test scores. (MLH)
Descriptors: Accountability, Achievement Tests, Computer Assisted Testing, Elementary Secondary Education
Williams, John E.; Weed, Nathan C. – Assessment, 2004
There are eight commercially available computer-based test interpretations (CBTIs) for the Minnesota Multiphasic Personality Inventory-2 (MMPI-2), of which few have been empirically evaluated. Prospective users of these programs have little scientific data to guide choice of a program. This study compared ratings of these eight CBTIs. Test users…
Descriptors: Rating Scales, Measurement Techniques, Test Interpretation, Personality Measures
Peer reviewed: Trevlas, Efthimios; Grammatikopoulos, Vasilios; Tsigilis, Nikolaos; Zachopoulou, Evridiki – Early Childhood Education Journal, 2003
Examined the underlying structure and factorial validity of the Children's Playfulness Scale in evaluating preschool children's behavior. Found that factor loadings, factor variances/covariances, and error variances/covariances are invariant across calibration and validation groups, indicating the good cross-generalizability of the scale. (JPB)
Descriptors: Behavior Development, Child Behavior, Personality Traits, Play
Tomlinson, Brian – ELT Journal, 2005
This article advocates making the provision of opportunities for learning the main objective of language testing. It recognizes the need for tests to be fair, valid, and reliable, but asserts the priority of what it calls "learning validity", in order to prevent time being wasted on language courses on tests, and the preparation for them. The…
Descriptors: Test Validity, Testing, Language Tests, Second Language Learning
Lowe, Patricia A.; Reynolds, Cecil R. – Educational and Psychological Measurement, 2006
The psychometric properties of the Adult Manifest Anxiety Scale-Elderly Version (AMAS-E) scores were evaluated in two studies. In Study 1, the temporal stability and construct validity of the AMAS-E test scores were examined in a group of 226 older adults, aged 60 years and older. Results indicated adequate to excellent temporal stability (2-week…
Descriptors: Test Validity, Psychological Patterns, Psychological Testing, Psychometrics
Dyehouse, Melissa A.; Bennett, Deborah E. – Assessment for Effective Intervention, 2006
This study investigated the validity of a statewide alternate assessment program, IASEP (Indiana Assessment System of Educational Proficiencies) by examining supporting profile patterns on the 100 core IASEP items of individuals with significant disabilities. Participants were 5,192 students ranging in age from 7-21 years with special education…
Descriptors: Alternative Assessment, Test Validity, Computer Assisted Testing, Special Needs Students

