Publication Date
| In 2026 | 0 |
| Since 2025 | 27 |
| Since 2022 (last 5 years) | 113 |
| Since 2017 (last 10 years) | 280 |
| Since 2007 (last 20 years) | 517 |
Descriptor
| Testing Problems | 4850 |
| Elementary Secondary Education | 1262 |
| Test Validity | 1008 |
| Test Construction | 801 |
| Standardized Tests | 790 |
| Higher Education | 658 |
| Test Reliability | 607 |
| Student Evaluation | 583 |
| Testing | 564 |
| Test Bias | 562 |
| Achievement Tests | 555 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 248 |
| Researchers | 220 |
| Teachers | 81 |
| Administrators | 35 |
| Policymakers | 34 |
| Parents | 15 |
| Counselors | 13 |
| Students | 5 |
| Community | 3 |
| Support Staff | 2 |
Location
| Canada | 52 |
| Australia | 45 |
| California | 44 |
| United Kingdom | 37 |
| United States | 36 |
| United Kingdom (England) | 31 |
| China | 29 |
| Netherlands | 26 |
| Florida | 25 |
| New York | 25 |
| United Kingdom (Great Britain) | 24 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
Peer reviewedLevine, Michael V.; Rubin, Donald B. – Journal of Educational Statistics, 1979
A student may be so unlike other students that his/her aptitude test score fails to be a completely appropriate measure. We consider the problem of using the student's pattern of multiple-choice aptitude test answers to decide whether his/her score is an appropriate ability measure. (Author/CTM)
Descriptors: Answer Sheets, College Entrance Examinations, Guessing (Tests), Latent Trait Theory
Peer reviewedAaron, Robert L.; And Others – Reading Improvement, 1980
Reports that the Estes Attitude Scale suffers from the influence of cultural biases that make it inadequate to measure attitude about reading among low-income, Black students in grades three through seven in the rural southern United States. (FL)
Descriptors: Black Youth, Cultural Influences, Elementary Secondary Education, Measurement Techniques
Peer reviewedJaeger, Richard M. – Educational Evaluation and Policy Analysis, 1979
A liberal interpretation of Section 151 of Public Law 93-380, to implement effective local evaluation of Title I programs, is supported. Weaknesses are cited: (1) nationwide impact data, (2) unsound aggregation of Title I achievement gains, and (3) lack of consideration of alternative evaluation methods. (MH)
Descriptors: Achievement Tests, Compensatory Education, Elementary Education, Evaluation Methods
Peer reviewedGresham, Frank M.; Witt, Joseph C. – School Psychology Quarterly, 1997
Maintains that intelligence tests contribute little to the planning, implementation, and evaluation of instructional interventions for children. Suggests that intelligence tests are not useful in making differential diagnostic and classification determinations for children with mild learning problems and that such testing is not a cost-beneficial…
Descriptors: Aptitude Treatment Interaction, Diagnostic Tests, Elementary Secondary Education, Evaluation Problems
Bartek, Mary M. – Understanding Our Gifted, 2003
Using a sci-fi matchmaking scenario to illustrate the fallibility of technology, this article discusses the practice of reducing a student to a series of test scores for gifted identification. The limits of testing are addressed, and student performance and behavior are urged as additional categories for identifying aptitude and achievement.…
Descriptors: Ability Identification, Academic Achievement, Classroom Observation Techniques, Data Collection
Peer reviewedDorton, Ian – Economics, 1989
Examines the organization of the extended project that is part of the General Certificate of Secondary Education (GCSE) A Level Business Studies examination. Provides a timetable for implementing the project. Includes student evaluations of the project. (LS)
Descriptors: Achievement Tests, Business Education, Economics, Economics Education
Peer reviewedChannell, Ron W.; Peek, Michelle S. – Language, Speech, and Hearing Services in Schools, 1989
Thirty-six children, aged four-five, completed four vocabulary measures: Peabody Picture Vocabulary Test-Revised, Picture Vocabulary subtest of the Test of Oral Language Development, Expressive One-Word Picture Vocabulary Test, and Receptive One-Word Picture Vocabulary Test. Only moderate correlations were found among these tests, implying that a…
Descriptors: Correlation, Expressive Language, Handicap Identification, Learning Disabilities
Peer reviewedBresnock, Anne E.; And Others – Journal of Economic Education, 1989
Investigates the effects on multiple choice test performance of altering the order and placement of questions and responses. Shows that changing the response pattern appears to alter significantly the apparent degree of difficulty. Response patterns become more dissimilar under certain types of response alterations. (LS)
Descriptors: Cheating, Economics Education, Educational Research, Grading
Peer reviewedSheehan, Robert; Sites, Jane – Topics in Early Childhood Special Education, 1989
The passage of Public Law 99-457 has both quantitative and qualitative implications for assessment. Educators working with infants and young children must become more familiar with assessment strategies and limitations including psychometrically sound assessment of complex and controversial family variables, and health and environmental risk…
Descriptors: Disabilities, Educational Legislation, Evaluation Methods, Family Involvement
Peer reviewedDunlap, William P.; And Others – Applied Psychological Measurement, 1989
The reliability of derived measures from 4 cognitive paradigms was studied using 19 Navy enlisted men (aged between 18 and 24 years). The paradigms were: graphemic and phonemic analysis; semantic memory retrieval; lexical decision making; and letter classification. Results indicate that derived scores may have low reliability. (SLD)
Descriptors: Adults, Armed Forces, Cognitive Measurement, Cognitive Processes
Peer reviewedTombokan-Runtukahu, Juliana; Nitko, Anthony – Research in Developmental Disabilities, 1992
This study delineated procedures for cross-cultural adaptation and operationalization of adaptive behavior in individuals with mental retardation, culturally adapted the Vineland Adaptive Behavior Scale, and investigated the validity of the resulting instrument. The study concluded that the domain of adaptive behavior can be successfully applied…
Descriptors: Adaptive Behavior (of Disabled), Behavior Rating Scales, Cross Cultural Studies, Cultural Influences
Peer reviewedIsaacson, Stephen L. – Learning Disabilities Research and Practice, 1992
This review of the Test of Early Written Language concludes that the test succeeds in identifying students who are below their peers in writing and in measuring long-term gains in written language achievement; but its format makes it difficult to document specific strengths and weaknesses and its reliability; and validity have not been…
Descriptors: Early Childhood Education, Evaluation Methods, Student Evaluation, Test Reliability
Peer reviewedYaple, Newell; And Others – Journal of Dental Education, 1992
The process used in Ohio to reform the state dental licensing examination and incorporate a nonpatient (simulated) clinical procedure is described and the results summarized. Findings focus on the degree to which results of the new testing procedures differentiate dental students by class rank. (MSE)
Descriptors: Academic Achievement, Clinical Experience, Dental Students, Dentistry
Peer reviewedGroenveld, M.; Jan, J. E. – Journal of Visual Impairment and Blindness, 1992
Analysis of scores of 118 visually impaired children on the Wechsler Intelligence Scale for Children (Revised) and the Wechsler Preschool and Primary Scales of Intelligence (Revised) found a consistent response pattern suggesting that the verbal as well as the performance tests provide useful assessment information. (Author/DB)
Descriptors: Blindness, Cognitive Development, Evaluation Methods, Intelligence
Peer reviewedChan, Jason C. – Educational and Psychological Measurement, 1991
A study involving 102 high school students (49 males and 53 females) from Taiwan revealed that the order of response scale labels had a primacy effect on subjects' choices of the alternatives in Likert-type attitude scales. Practical implications of the response-order effects for measurement are discussed. (SLD)
Descriptors: Correlation, Factor Analysis, Factor Structure, Foreign Countries


