Publication Date
| In 2026 | 0 |
| Since 2025 | 49 |
| Since 2022 (last 5 years) | 211 |
| Since 2017 (last 10 years) | 492 |
| Since 2007 (last 20 years) | 984 |
Descriptor
| Test Validity | 3908 |
| Test Reliability | 1517 |
| Testing | 1090 |
| Test Construction | 1014 |
| Testing Problems | 1008 |
| Computer Assisted Testing | 616 |
| Elementary Secondary Education | 553 |
| Foreign Countries | 494 |
| Higher Education | 490 |
| Standardized Tests | 488 |
| Test Interpretation | 433 |
Author
| Ebel, Robert L. | 16 |
| Hambleton, Ronald K. | 13 |
| Green, Donald Ross | 10 |
| Popham, W. James | 10 |
| Linn, Robert L. | 9 |
| Haney, Walt | 8 |
| Koretz, Daniel | 8 |
| Sireci, Stephen G. | 8 |
| Thompson, Bruce | 8 |
| Tindal, Gerald | 8 |
| Hilliard, Asa G., III | 7 |
Audience
| Practitioners | 137 |
| Researchers | 134 |
| Teachers | 51 |
| Administrators | 34 |
| Policymakers | 18 |
| Counselors | 11 |
| Students | 8 |
| Parents | 5 |
| Support Staff | 4 |
| Community | 2 |
Location
| Canada | 57 |
| Australia | 40 |
| California | 40 |
| China | 34 |
| United Kingdom (England) | 31 |
| United Kingdom | 29 |
| New York | 28 |
| United States | 26 |
| Florida | 22 |
| Germany | 21 |
| Turkey | 20 |
Hampton, Nan Zhang – 1999
This study explores the applicability of the Career Decision-Making Self-Efficacy Scale (CDMSE) to Chinese college students. The reliability and validity of the scale are examined using two independent samples (128 students from a polytechnic institute and 157 students from a teacher's college in China). The results indicate that the CDMSE is…
Descriptors: Academic Achievement, Age Differences, Career Choice, Career Development
Merino, Barbara J.; Spencer, Mary – NABE: The Journal for the National Association for Bilingual Education, 1983 (peer reviewed)
Compares five commonly used English-Spanish language dominance instruments according to area of language measured, domain assessed, developmental comparability, and language variety or dialect. Examines the proficiency information provided and the validity, reliability, and norming of the instruments. Concludes that the tests are not comparable…
Descriptors: Bilingual Education, Bilingual Students, Comparative Analysis, Comparative Testing
McGuigan, Corrine A. – B. C. Journal of Special Education, 1982
Philosophical, technical, and practical considerations in selecting and evaluating educational tests for exceptional children are discussed. The following major technical considerations are addressed: validity, reliability, sensitivity, appropriateness, objectivity, and feasibility. (SEW)
Descriptors: Diagnostic Tests, Disabilities, Disability Identification, Educational Diagnosis
Plake, Barbara S.; Hoover, H. D. – Journal of Educational Measurement, 1979 (peer reviewed)
An experiment investigated the extent to which the results of out-of-level testing may be biased because a child given an out-of-level test may have had a significantly different curriculum than the children given in-level tests. Item analysis data suggested this was unlikely. (CTM)
Descriptors: Achievement Tests, Elementary Education, Elementary School Curriculum, Grade Equivalent Scores
Galbato, Linda; Markus, Mimi – Journal of Applied Research in the Community College, 1995 (peer reviewed)
Describes a study comparing English course-level placement based on writing samples and standardized placement scores, as well as two different methods of grading writing samples. Results support the use of writing samples to supplement standardized placement scores as well as the use of a group holistic method of grading writing samples. Includes…
Descriptors: Basic Writing, Community Colleges, English Instruction, Holistic Evaluation
Rogers, James R.; Hanlon, Peter J. – Measurement and Evaluation in Counseling and Development, 1996 (peer reviewed)
Investigated the psychometric integrity of the College Student Reasons for Living Inventory of suicidal behaviors in a sample of 511 undergraduate students from a large Midwestern university. Results tentatively support the scale's continued use and development. (KW)
Descriptors: Adjustment (to Environment), Behavior Disorders, College Students, Coping
Richards, Brian; Chambers, Francine – Language Learning Journal, 1996 (peer reviewed)
Examined open-ended production tasks in higher level testing in French-as-a-second-language courses in the United Kingdom. The article studied the effect of teachers' linguistic background, training, experience, and ability on their marking of students' oral language tests; the reliability of marking criteria; and the validity of assessing…
Descriptors: British National Curriculum, Foreign Countries, French, Language Fluency
Hood, Stafford; Parker, Laurence J. – Journal of Negro Education, 1989 (peer reviewed)
Critics charge that the use of standardized examinations for initial teacher certification fails to measure teacher competency accurately and results in high failure rates for minorities. Compares minority bias review panels in Illinois and Pennsylvania, and finds greater minority involvement and attention to minority culture in Illinois' test…
Descriptors: Comparative Analysis, Cultural Differences, Elementary Secondary Education, Ethnic Bias
Campbell, Chari A.; Ashmore, Robert J. – Measurement and Evaluation in Counseling and Development, 1995 (peer reviewed)
Critiques the 1990 revision of the Slosson Intelligence Test. The SIT-R is an untimed, individually administered screening instrument that assesses the mental ability of children and adults. Many of the problems with the original version have been addressed in the revised version, but with varying success. (LKS)
Descriptors: Adults, Children, Cognitive Ability, Intelligence Tests
Crossman, Leslie L.; And Others – Measurement and Evaluation in Counseling and Development, 1994 (peer reviewed)
Investigates the relationship of Minnesota Multiphasic Personality Inventory-2 Scales L, K, and Mf with Verbal, Performance, and Full Scale IQs from the Wechsler Adult Intelligence Scale-Revised; achievement scores in reading, spelling, and arithmetic; and total years of education as self-reported by the study research participants. (RJM)
Descriptors: Achievement, Diagnostic Tests, Educational Attainment, Intelligence
Long, Kathleen A.; And Others – Measurement and Evaluation in Counseling and Development, 1994 (peer reviewed)
Examined differences in Minnesota Multiphasic Personality Inventory-2 (MMPI-2) scores between persons of differing educational levels and family income in the MMPI-2 normative sample to determine if MMPI-2 scores are differentially accurate in predicting relevant extra-test characteristics of persons of differing socioeconomic levels. MMPI-2…
Descriptors: Diagnostic Tests, Measures (Individuals), Personality Assessment, Personality Measures
Ornstein, Allan C. – NASSP Bulletin, 1993 (peer reviewed)
Examines the differences between norm-referenced tests (standardized assessments of intelligence, aptitude, achievement, and personality) and criterion-referenced tests. Until school districts improve their potential to develop meaningful criterion-referenced tests, norm-referenced tests will be the major yardstick for measuring student…
Descriptors: Achievement Tests, Aptitude Tests, Criterion Referenced Tests, Elementary Secondary Education
Cavanaugh, Sally Hixon – Evaluation and the Health Professions, 1991 (peer reviewed)
A lawsuit involving the National Board for Respiratory Therapy illustrates that certification examinations are vulnerable to complaints of discrimination and employers' misuse of test results. The board's five-step process--position-viability study, personnel survey, job analysis, item writing/test development, and criterion-related validity…
Descriptors: Certification, Court Litigation, Culture Fair Tests, Legal Problems
Trevisan, Michael S.; And Others – Educational and Psychological Measurement, 1991 (peer reviewed)
The reliability and validity of multiple-choice tests were computed as a function of the number of options per item and student ability for 435 parochial high school juniors, who were administered the Washington Pre-College Test Battery. Results suggest the efficacy of the three-option item. (SLD)
Descriptors: Ability, Comparative Testing, Distractors (Tests), Grade Point Average
Noller, Patricia; Shum, David – Psychological Test Bulletin, 1988
The reliability and validity of the Self-Esteem Inventory developed by S. C. Coopersmith (1975) were evaluated via item-total correlation, discriminant analysis, factor analysis, and analysis of variance of data for 352 Australian adults. The instrument had high internal consistency and discriminated well between subjects with high and low…
Descriptors: Adults, Age Differences, Analysis of Variance, Comparative Testing


