Peer reviewed: Whitney, Douglas R. – Adult Learning, 1991
The General Educational Development Tests can be used to place adult basic education students for study, to measure progress achieved, to enable students to qualify for postsecondary education, and to evaluate program effectiveness. (SK)
Descriptors: Adult Basic Education, Basic Skills, High School Equivalency Programs, Program Evaluation
Peer reviewed: Johnson, Bruce R. – American Mathematical Monthly, 1991
Described is an approach that substantially reduces the annotated shortcomings of standard multiple-choice tests presented to lower-division college mathematics and statistics classes. Examples are included from each discipline. (JJK)
Descriptors: College Mathematics, Distractors (Tests), Higher Education, Mathematics Education
Peer reviewed: Messick, Samuel – Educational Researcher, 1994
Authentic and direct assessment of performance and products are examined in light of contrasting functions and purposes with implications for validation, especially those of specialized validity criteria for performance assessment. The roles of positive and negative consequences of validation are underscored, along with the need for evidence of…
Descriptors: Construct Validity, Criteria, Educational Assessment, Evaluation Methods
Peer reviewed: Grabe, Mark – Contemporary Educational Psychology, 1994
Two multiple-examination systems, in which either the best test score or the last test score counted toward the final grade, were compared with a conventional testing and grading system for 271 undergraduates. The type of examination system generated no significant effect on a cumulative final, but multiple examinations appeared to result in poorer unit…
Descriptors: Academic Achievement, Analysis of Variance, Grades (Scholastic), Higher Education
Peer reviewed: Murphy, Sandra; Bergamini, Jan; Rooney, Paul – Educational Assessment, 1997
The impact of the New Standards Project English language arts Field Trial Portfolio on curriculum and classroom portfolio assessment practice was studied in the 10th-grade classrooms of two experienced teachers. Case studies conducted over the school year illustrate ways in which teachers are mediators of standards and reveal problems teachers…
Descriptors: Case Studies, Educational Practices, Field Tests, Grade 10
Peer reviewed: Wyatt-Smith, Clair – English in Australia, 1998
Discusses the formulation of the Australian Literacy Benchmarks for Year 3 and Year 5. Suggests they (1) lay claim to designating a minimum standard; (2) represent "fuzzy" standards; and (3) are a composite based on a number of underlying criteria. Claims teachers' knowledge of student literacy achievement is a richer source of valid…
Descriptors: Academic Achievement, Benchmarking, Diversity (Student), Elementary Education
Coleman, Arthur L. – American School Board Journal, 2000
While recognizing high-stakes testing's value, both the "GI Forum" decision and the Office for Civil Rights guide raise questions that boards and educators should ask about the administration and consequences of their own testing programs. Methods for systematically collecting, analyzing, disseminating, and acting on test results are needed. (MLH)
Descriptors: Court Litigation, Elementary Secondary Education, High Stakes Tests, Measurement
Peer reviewed: Rodriguez, M. Victoria – Equity & Excellence in Education, 1998
Discusses problems encountered when identifying, assessing, and deciding the language of instruction for culturally and linguistically diverse preschool children with disabilities and suggests some directions for practice in the context of the Education of the Handicapped Act Amendments of 1986. (SLD)
Descriptors: Cultural Differences, Diversity (Student), Federal Legislation, Language Minorities
Peer reviewed: Traynelis-Yurek, Elaine; Strong, Mary W. – Journal of Reading Education, 2000
Examines the results of instruction in administering the Informal Reading Inventory (IRI) in three teacher training programs. Focuses on the examination of the scoring of the IRI in simulation exercises by preservice teachers after instruction in the administration and scoring of the IRI. Concludes that the preservice teachers did not accurately…
Descriptors: Elementary Education, Evaluation Research, Informal Reading Inventories, Miscue Analysis
Rotberg, Iris C. – School Administrator, 1996
Because educators have unrealistic expectations about tests, they use them inappropriately and draw inaccurate conclusions from results. This article debunks five myths about test-score comparisons: valid measurement of school quality; declining international competitiveness; "fixing" schools with more tests; development of new, improved…
Descriptors: Comparative Education, Competition, Elementary Secondary Education, Expenditure per Student
Peer reviewed: Carroll, John B. – Intelligence, 1995
It is argued that the statements and accusations made by Stephen Jay Gould about the use of factor analysis are incorrect and unjustified and that tests properly designed for the purpose can adequately measure a "general" or "g" factor of intelligence, particularly in view of the developments in testing since "The…
Descriptors: Factor Analysis, Intelligence Tests, Measurement Techniques, Nature Nurture Controversy
Domenech, Daniel A. – School Administrator, 2000
The question of validity, or how high-stakes tests are being used and interpreted, threatens to undermine the entire standards movement. Joint standards developed by three professional associations say decisions affecting students' life chances should not be based on test scores alone. Objectivity and teaching to tests are real concerns. (MLH)
Descriptors: Academic Standards, Data Interpretation, Elementary Secondary Education, High Stakes Tests
Stoskopf, Alan – Phi Delta Kappan, 2001
Inquiry-based teaching and assessment approaches are superior to standardized tests for measuring students' progress. Historical thinking skills employed in Leopold von Ranke's 19th-century seminars have been refined to consider point of view, credibility of evidence, historical context, causality, and multiple perspectives--benchmarks of…
Descriptors: Discovery Learning, Elementary Secondary Education, History Instruction, Inquiry
Gallagher, Chris – Phi Delta Kappan, 2000
Prospects for reforming assessment are dim. Persistence of the "education crisis" is chiefly attributable to the testing industry's profit margins and its distrust of teachers--a ploy to preserve the educational power structure. Schools should be accountable to local communities, not corporate entities, as a Nebraska initiative shows.…
Descriptors: Accountability, Elementary Secondary Education, Politics of Education, Power Structure
Peer reviewed: Smith, Tina T.; Lee, Evan; McDade, Hiram L. – Communication Disorders Quarterly, 2001
This study investigated the dialectal sensitivity of the T-unit as a nonbiased alternative for assessing the oral grammatical skills of school-age, nonstandard English speakers. Analysis of language samples from 28 9-year-old children (half African-American) revealed no significant differences between groups, suggesting that the T-unit may be a…
Descriptors: Black Dialects, Black Students, Culture Fair Tests, Elementary Education


