Peer reviewed. Jenkins, Joseph R.; Pany, Darlene – Exceptional Children, 1978
The extent and direction of curriculum bias in standardized reading achievement tests were examined. (Author)
Descriptors: Achievement Tests, Elementary Education, Exceptional Child Research, Handicapped Children
Peer reviewed. Ebel, Robert L. – Educational and Psychological Measurement, 1978
A multiple true-false item is one where a testee has to identify statements as true or false within a cluster (of two or more) of such statements. Clusters are then scored as items. This study showed such a procedure to yield less reliable results than traditional true-false items. (JKS)
Descriptors: Guessing (Tests), Higher Education, Item Analysis, Multiple Choice Tests
Peer reviewed. Borich, Gary D.; And Others – Journal of Educational Psychology, 1978
Using five readily available classroom observation systems, three 50-minute videotapes of classroom interaction were rated for each of twelve social studies teachers. Comparisons yielded 23 categories that measured similar behaviors across systems, approximately half of which satisfied all validity tests. Implications for process-product studies…
Descriptors: Behavior Rating Scales, Classroom Observation Techniques, Junior High Schools, Secondary School Teachers
Peer reviewed. Napier, John D. – Educational and Psychological Measurement, 1977
Two types of test scores were analyzed to examine whether sixty teachers were unable to use Kohlberg's measurement system for determining stages of moral thought because they were stage scoring invalidly on the basis of content. This proved to be the case. (Author/JKS)
Descriptors: Bias, Elementary School Teachers, Graduate Students, Moral Development
Peer reviewed. Goodstadt, Michael S.; Magid, Simmie – Educational and Psychological Measurement, 1977
The ability of respondents to properly execute Thurstone scaling procedures as opposed to Likert scaling procedures was investigated in a sample of high school students. Results indicate difficulty among respondents using Thurstone-type instructions to resist giving Likert-type responses. (JKS)
Descriptors: Error Patterns, High School Students, Likert Scales, Questionnaires
Rivers, L. Wendell – Journal of Non-White Concerns in Personnel and Guidance, 1978
Presents a study of ways in which the PPVT may be modified to increase its sensitivity to culturally specific factors. The need for separate tests for Blacks and whites is not supported by the data; rather, more caution is indicated in selecting to whom standardized tests should be administered. (Author)
Descriptors: Auditory Discrimination, Black Students, Elementary Education, Elementary School Students
Smith, Paul – ADE Bulletin, 1978
Examines the way standardized test questions are developed by the Advanced Placement Development Committee; shows that the tests continue to reflect features of New Criticism and have not been changed to reflect recent developments in the discipline of English. (GW)
Descriptors: Advanced Placement, English Instruction, Essay Tests, Higher Education
Reister, Barry W.; And Others – Journal of College Student Personnel, 1977
Investigated relative effectiveness of rational behavior therapy and systematic desensitization in the treatment of state (test) anxiety and trait anxiety. There were no significant differences between the rational behavior and systematic desensitization groups in regard to test anxiety reduction, but the behavior group did have significantly…
Descriptors: Anxiety, Behavior Change, College Students, Comparative Analysis
Peer reviewed. Ratusnik, David L.; Koenigsknecht, Roy A. – Language, Speech, and Hearing Services in Schools, 1977
Descriptors: Black Youth, Disadvantaged Youth, Early Childhood Education, Examiners
Peer reviewed. Newcomer, Phyllis L. – Journal of Special Education, 1977
The author takes the position that the use of a diagnostic-remedial model to provide special education services to the "mildly handicapped" is often inappropriate. (Author)
Descriptors: Conceptual Schemes, Consultants, Diagnostic Teaching, Elementary Secondary Education
Peer reviewed. Frasier, Mary M. – Journal for the Education of the Gifted, 1987
In spite of the use of nominations, rating scales, checklists, standard measuring instruments, culture-specific models, quota system models, and identification and instructional models, the number of Black students identified as gifted remains very small. New practices for identifying this subgroup must emphasize the diversity within the Black…
Descriptors: Black Students, Culture Fair Tests, Elementary Secondary Education, Evaluation Criteria
Peer reviewed. Doolittle, Allen E.; Cleary, T. Anne – Journal of Educational Measurement, 1987
Eight randomly equivalent samples of high school seniors were each given a unique form of the ACT Assessment Mathematics Usage Test (ACTM). Signed measures of differential item performance (DIP) were obtained for each item in the eight ACTM forms. DIP estimates were analyzed and a significant item category effect was found. (Author/LMO)
Descriptors: Analysis of Variance, College Entrance Examinations, Discriminant Analysis, High School Seniors
Peer reviewed. Tye-Murray, Nancy; Tyler, Richard S. – Journal of Speech and Hearing Disorders, 1988
Continuous discourse tracking, when used as a test of the effectiveness of aural rehabilitation strategies, has numerous uncontrolled variables related to the sender, the receiver, the text materials, and repeated presentations. Tracking is inappropriate for across-subject designs, and acceptable for within-subject test designs only when stringent…
Descriptors: Auditory Perception, Auditory Training, Discourse Analysis, Evaluation Problems
Peer reviewed. Abraham, Suzanne; Stoker, Richard – Language, Speech, and Hearing Services in Schools, 1988
A survey of 182 educational programs for hearing-impaired children and youth identified those test instruments most widely used to assess language at infant, preschool, primary, and secondary levels. Also analyzed were communication modes and manual systems used in testing, difficulties encountered in assessing hearing-impaired children, and…
Descriptors: Early Childhood Education, Elementary Secondary Education, Hearing Impairments, Language Tests
Peer reviewed. Valencia, Sheila; Pearson, P. David – Reading Teacher, 1987
Argues that the tests used to measure reading achievement do not reflect recent advances in the understanding of the reading process, and that effective instruction can best be fostered by resolving the discrepancy between what is known and what is measured. (FL)
Descriptors: Elementary Education, Reading Achievement, Reading Comprehension, Reading Instruction


