Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
| More ▼ | |
Source
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
| More ▼ | |
Publication Type
Education Level
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Diamond, Esther E. – 1981
As test standards and research literature in general indicate, definitions of test bias and item bias vary considerably, as do the results of existing methods of identifying biased items. The situation is further complicated by issues of content, context, construct, and criterion. In achievement tests, for example, content validity may impose…
Descriptors: Achievement Tests, Aptitude Tests, Psychometrics, Test Bias
McGue, Matthew; And Others – 1979
The validity of the Woodcock-Johnson Psycho-Educational Battery was examined using test results of 50 learning disabled fourth graders. The appropriateness of the developmental strategy and the evidence for the external validity of the cluster measures contained in the battery were considered. Results indicated that the factor and scholastic…
Descriptors: Elementary Education, Exceptional Child Research, Learning Disabilities, Standardized Tests
Livingston, Samuel A. – 1975
A measure of the usefulness of a pass/fail testing decision procedure is the ratio of the utility of the given procedure to the utility of a procedure based on knowledge of scores on a criterion measure. It is computed from scores for a representative sample of persons tested. Utility functions may be specified by the test user or set by…
Descriptors: Cutting Scores, Decision Making, Mathematical Models, Measurement Techniques
Bloomer, Corinne – Teacher, 1975
Article discussed the disadvantages of student testing as a means of evaluating student progress in the classroom and suggested the use of a new model of assessment. Three steps intended for classroom diagnosis of students were described. (RK)
Descriptors: Academic Achievement, Educational Testing, Models, Standardized Tests
Peer reviewedModjeski, Richard B.; Michael, William B. – Educational and Psychological Measurement, 1978
The General Education Performance Index (GEPI) is a comparatively short test covering the same content as the General Educational Development Test (GED), which takes ten hours to administer. Correlations of the subtests of the GEPI with the GED ranged from .28 to .57. (JKS)
Descriptors: Correlation, Equivalency Tests, Military Personnel, Statistical Data
Peer reviewedDeffenbacher, Jerry L.; Deitz, Sheila R. – Psychology in the Schools, 1978
Test performance and reported anxiety levels of high and low test-anxious subjects taking either a regular exam or an exam containing brief, written relaxation instructions were compared. High test-anxious subjects performed more poorly and reported greater worry and emotionality. Results provide greater external validity for Test Anxiety Scale.…
Descriptors: Anxiety, College Students, Higher Education, Research Projects
Peer reviewedHull, Marc; Halloran, William – Educational and Psychological Measurement, 1976
Results show that the mean number of Occupational Aptitude Patterns (OAP's) generated for a sample of mentally retarded and boarderline intelligence students is significantly greater for the Nonreading Aptitude Test Battery (NATB) than for the General Aptitude Test Battery (GATB). (DEP)
Descriptors: Comparative Testing, Intelligence Tests, Low Ability Students, Mental Retardation
The Invalidity of Partitioned-U Tests in Canonical Correlation and Multivariate Analysis of Variance
Peer reviewedHarris, Richard J. – Multivariate Behavioral Research, 1976
The partitioned-U procedure is outlined, a fundamental logical flaw in this procedure's avoidance of any direct test of the significance of the first discriminant function or largest coefficient of canonical correlation is pointed out, and two alternatives to the partitioned-U procedure are discussed. (Author/DEP)
Descriptors: Analysis of Variance, Correlation, Hypothesis Testing, Multivariate Analysis
Peer reviewedDudley, Harold K.; And Others – Journal of Youth and Adolescence, 1976
Indicates that IQ ranking is the most significnat factor affecting Draw A Person test performance by male subjects. IQ rankings were not found to significantly influence drawings by females. (Author/DEP)
Descriptors: Adolescents, Background, Institutionalized Persons, Intelligence
Peer reviewedMcGovern, Francis J.; Nevid, Jeffrey S. – Journal of Consulting and Clinical Psychology, 1986
Psychological inventories were administered to incarcerated offenders, either without prior cuing or following exposure to experimental cues that identified psychological health and growth with either positive or negative self-disclosure. Results showed that self-disclosure of deviant and symptomatic responses could be enhanced by associating…
Descriptors: Correctional Institutions, Personality Measures, Prisoners, Prompting
Peer reviewedHays, Ron D.; Huba, George J. – Journal of Consulting and Clinical Psychology, 1988
Considered techniques to assess self-reported drug use. Evaluated the effects of different response options on the distribution, reliability, and validity of scores on drug-use items. Suggests that more quantitative measures are not necessarily more reliable or valid than less quantitative measures of drug use. (Author/KS)
Descriptors: Drug Use, Item Analysis, Psychological Testing, Psychometrics
Peer reviewedNelson, Linda D. – Journal of Consulting and Clinical Psychology, 1987
Administered Minnesota Multiphasic Personality Inventory (MMPI) to clinically depressed and nondepressed inpatients, and compared scores from its Depression scale with scores from the Beck Depression Inventory. Demonstrated a positive linear relationship between the two measures and their ability to discriminate between depressed and nondepressed…
Descriptors: Concurrent Validity, Depression (Psychology), Item Analysis, Patients
Peer reviewedBritton, Warner H.; Eaves, Ronald C. – American Journal of Mental Deficiency, 1986
The relationship between the Vineland Adaptive Behavior Scales-Classroom Edition and its predecessor, the Vineland Social Maturity Scale was examined with 54 educable and trainable mentally retarded children. The concurrent validity of the two scales was moderate. The mean scores from the newer instrument were significantly lower. (Author/CL)
Descriptors: Elementary Secondary Education, Mild Mental Retardation, Moderate Mental Retardation, Test Validity
Peer reviewedDavis, Steven E.; Kramer, Jack J. – Psychology in the Schools, 1985
Compared scores on the Peabody Picture Vocabulary Test-Revised (PPVT-R) and Wechsler Intelligence Scale for Children-Revised (WISC-R) for 40 nonexceptional second graders. Subjects tended to score lower on PPVT-R than on WISC-R; scores from the tests were moderately correlated; and order of administration did not appear to alter scores. (NRB)
Descriptors: Grade 2, Intelligence Tests, Primary Education, Test Validity
Peer reviewedVilleme, Mel; Hall, Bruce – Action in Teacher Education, 1985
Questionnaires returned by 285 graduates were used to survey perceptions of the Florida Teacher Certification Examination in terms of adequacy to measure competence, difficulty level, need for more rigor, and usefulness in upgrading standards. The results suggest that education graduates do not necessarily dismiss the value of a certification…
Descriptors: Graduate Surveys, Minimum Competency Testing, Teacher Attitudes, Teacher Certification


