Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedBenson, Bernard W.; Young, Linda L. – Journal of Research in Science Teaching, 1971
Describes validation procedures used with an individually administered test involving a branching program format to assess students level of cognitive achievement in biology. Reports results of comparisons of classes of teachers with BSCS or non-BSCS backgrounds. (AL)
Descriptors: Biology, Cognitive Measurement, Individual Testing, Secondary School Science
Ebel, Robert L. – Sch Rev, 1970
Many college teachers are opposed to true-false tests because they feel that such tests do not reflect the extent and depth of a student's knowledge and understanding of his subject. However, true-false tests, properly constructed, can be extremely valuable instruments in measuring academic achievement. (CK)
Descriptors: Academic Achievement, Creative Thinking, Critical Thinking, Higher Education
Peer reviewedMcCarthy, Dorothea; And Others – Journal of Consulting and Clinical Psychology, 1970
The WAIS M index did not correlate significantly with any of the other measures for either sex and thus should not be interpreted as a personality indicator of sexual inversion or homosexuality. It represents solely sex differences in certain aspects of intellectual performance. (Author)
Descriptors: Comparative Testing, Intellectual Development, Performance Factors, Personality Studies
Peer reviewedMillimet, C. Raymond – Psychological Reports, 1970
Descriptors: Anxiety, Individual Characteristics, Personality Assessment, Personality Measures
Knight, Octavia B. – Training Sch Bull, 1970
Descriptors: Exceptional Child Research, Measurement Instruments, Measurement Techniques, Mental Retardation
Peer reviewedRussell, Stephen F.; Ludenia, Krista – Journal of Clinical Psychology, 1983
Examined the psychometric properties of the Multidimensional Health Locus of Control (MHLC) Scales with alcoholic patients (N=100). The three MHLC Scales appear to measure largely independent dimensions of locus of control beliefs specifically related to health. Factor analyses suggest that the instrument is factorily valid. (Author/JAC)
Descriptors: Alcoholism, Beliefs, Factor Analysis, Locus of Control
Peer reviewedEbel, Robert L. – Educational Measurement: Issues and Practice, 1983
One major reason for the problems of test validation is an overemphasis on the need for empirical validity data, and a failure to recognize the primary importance of explicit verbal definitions of what the test is intended to measure and rational arguments in support of the means chosen for obtaining the measurement. (Author/LC)
Descriptors: Occupational Tests, Performance Tests, Standardized Tests, Statistical Data
Peer reviewedSinger, Marc G.; And Others – Journal of Clinical Psychology, 1983
Investigated the concurrent validity of the Wechsler Intelligence Scale for Children (Revised) subtests and three IQs compared to Wide Range Achievement Test standard scores, in 28 learning-disabled children. Analysis showed no significant correlations between WISC-R and WRAT scores, indicating a lack of concurrent validity on the WAIS-R. (WAS)
Descriptors: Adolescents, Children, Comparative Testing, Elementary Secondary Education
Zwarts, Michel A. – Evaluation in Education: International Progress, 1982
The increasing relation between domain-referenced measurement and its implications in instruction are discussed with the argument that construction methods available do not guarantee content validity which needs more attention to be effective. Two domain-referenced testing procedures illustrate construction and validity problems. (CM)
Descriptors: Achievement Tests, Criterion Referenced Tests, Educational Testing, Evaluation Methods
Peer reviewedPratt, Jamie H.; And Others – Educational and Psychological Measurement, 1981
Attempts are made to improve the previously used procedure for multidimensional scaling in the measure of conceptual level. Monte Carlo simulation runs and a laboratory experiment highlight potential incongruities related to the degree of error in the input data required by that process. Methodological recommendations are briefly evaluated.…
Descriptors: Cognitive Measurement, Cognitive Style, Higher Education, Hypothesis Testing
Peer reviewedBersoff, Donald N. – American Psychologist, 1981
Examines the legal issues concerning (1) cultural bias in educational tests; (2) the validity of employment tests; and (3) the disclosure of test materials. Discusses the legal interpretations of psychometric concepts that have arisen from recent litigation. (Author/GC)
Descriptors: Court Litigation, Cultural Differences, Disclosure, Educational Testing
Peer reviewedShimberg, Benjamin – American Psychologist, 1981
The purpose of licensing and certification tests is to protect the public. This purpose determines test content, difficulty of the items, test format, use of test scores in decision making, and ways of setting standards. Currently being explored are methods to ensure continued competence, and legal considerations related to antidiscrimination…
Descriptors: Certification, Court Litigation, Legal Problems, Occupational Tests
Peer reviewedBellack, Alan S.; And Others – Journal of Consulting and Clinical Psychology, 1979
Undergradautes responded to role-played scenarios involving heterosocial interactions. Later they were surreptitiously observed while they interacted with an opposite-sex student, an experimental confederate. Videotapes were rated for component responses. Role-play behavior was moderately correlated with behavior in naturalistic situations for…
Descriptors: Behavior Patterns, Interaction, Interpersonal Competence, Responses
Peer reviewedFowler, Patrick C. – Journal of Clinical Psychology, 1982
Presented for 64 subjects a replication of the Family Environment Scale's maximum likelihood factor structure for which the two-factor, Varimax-rotated solution was found to be stable when the correlations among the subscales were corrected for the effects of social desirability response bias. (Author)
Descriptors: Family Relationship, Family Structure, Interpersonal Relationship, Research Methodology
Peer reviewedHoward, George S.; And Others – Journal of Educational Measurement, 1979
Evaluations of experimental interventions which employ self-report measures are subject to contamination known as response-shift bias. Response-shift effects may be attenuated by substituting retrospective pretest ratings for the traditional self-report pretest ratings. This study indicated that the retrospective rating more accurately reflected…
Descriptors: Higher Education, Rating Scales, Response Style (Tests), Self Evaluation


