Publication Date
| In 2026 | 0 |
| Since 2025 | 27 |
| Since 2022 (last 5 years) | 113 |
| Since 2017 (last 10 years) | 280 |
| Since 2007 (last 20 years) | 517 |
Descriptor
| Testing Problems | 4850 |
| Elementary Secondary Education | 1262 |
| Test Validity | 1008 |
| Test Construction | 801 |
| Standardized Tests | 790 |
| Higher Education | 658 |
| Test Reliability | 607 |
| Student Evaluation | 583 |
| Testing | 564 |
| Test Bias | 562 |
| Achievement Tests | 555 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 248 |
| Researchers | 220 |
| Teachers | 81 |
| Administrators | 35 |
| Policymakers | 34 |
| Parents | 15 |
| Counselors | 13 |
| Students | 5 |
| Community | 3 |
| Support Staff | 2 |
Location
| Canada | 52 |
| Australia | 45 |
| California | 44 |
| United Kingdom | 37 |
| United States | 36 |
| United Kingdom (England) | 31 |
| China | 29 |
| Netherlands | 26 |
| Florida | 25 |
| New York | 25 |
| United Kingdom (Great Britain) | 24 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards with or without Reservations | 1 |
Peer reviewedBuchan, Anne S. – Educational Research, 1993
Procedures used to ensure compatibility of British teachers' judgments in school-based assessments were examined, revealing conflicts between examining groups and teachers that might have been diminished had procedures been more open to public scrutiny. (SK)
Descriptors: Foreign Countries, National Competency Tests, Student Evaluation, Teacher Role
Glazer, Susan Mandel – Teaching Pre K-8, 1993
Discusses steps that teachers and concerned parents can take to ensure that their students and children are not overburdened by standardized tests. Examines the significance of standardized tests and considers some alternatives to standardized tests, such as more effective parent-teacher conferences. (MDM)
Descriptors: Educational Change, Elementary Education, Parent Teacher Conferences, Standardized Tests
Peer reviewedWagner, Edwin E.; And Others – Educational and Psychological Measurement, 1990
Maximized correlation as an internal reliability estimate for tests with few items was investigated. An actual sampling distribution of maximum correlation--"r" max--was empirically derived from 100 samples of 50 cases each from Rorschach test data and compared with those of alpha and an odd/even split, using 2,020 Rorschach protocols.…
Descriptors: Comparative Analysis, Correlation, Estimation (Mathematics), Sample Size
Peer reviewedFeuerstein, Abe – Educational Forum, 2001
Hyperrationalization in educational policy creates problems in the imposition of standards by centralized authorities and pursuit of efficiency through standardized tests, based on the assumption that education can be routinized and mechanized. A humanistic alternative is to make students the focus of educational policy and practice. (Contains 33…
Descriptors: Academic Standards, Accountability, Educational Policy, Humanistic Education
Peer reviewedMorrison, Hugh G.; Wylie, E. Caroline – Evaluation and Research in Education, 1999
Makes a case that the architects of national testing posited a measuring scale in which consecutive levels were separated by 2 years of learning under the influence of the "thought disorder" described by J. Michell, who claimed that psychological measurement may be little more than numerical coding. (SLD)
Descriptors: Academic Achievement, Measurement Techniques, National Competency Tests, Test Construction
Peer reviewedMcBee, Robin Haskell – Educational Forum, 2002
Identifies drawbacks of testing, especially high-stakes, standardized tests, and acknowledges pressures on teachers to acquiesce. Encourages teachers to focus on deep learning; use of higher-level questions, tasks, and projects; a wide range of materials; transfer of learning to other contexts; and ways to diffuse tension and build enthusiasm for…
Descriptors: Educational Practices, Elementary Secondary Education, Evaluation Methods, Standardized Tests
Peer reviewedO'Rourke, Norm; Cappeliez, Philippe – Measurement and Evaluation in Counseling and Development, 2001
The Marital Aggrandizement Scale (MAS) was developed as a couples measure of biased responding. Results of the current study suggest that responses to the MAS are gender invariant. Differences emerge, however, for psychological well being and self deception. These results may explain differences in marital satisfaction between older men and women.…
Descriptors: Bias, Marital Satisfaction, Older Adults, Sex Differences
Roeber, Edward D. – 1997
Some of the reasons efforts are being made to reform education are discussed, and how these reforms are likely to affect U.S. schools is explored. Data collected annually by the Council of Chief State School Officers (CCSSO) indicate that almost all states have some form of assessment that is administered to all students at one or more grade…
Descriptors: Course Content, Curriculum Development, Educational Assessment, Educational Change
Spray, Judith A.; Miller, Timothy R. – 1992
A popular method of analyzing test items for differential item functioning (DIF) is to compute a statistic that conditions samples of examinees from different populations on an estimate of ability. This conditioning or matching by ability is intended to produce an appropriate statistic that is sensitive to true differences in item functioning,…
Descriptors: Blacks, College Entrance Examinations, Comparative Testing, Computer Simulation
PDF pending restorationNew Jersey State Office of Legislative Services, Trenton. Assembly Education Committee. – 1993
The Assembly Education Committee of the New Jersey Office of Legislative Services held a hearing pursuant to Assembly Resolution 113, a proposal directing the Committee to investigate the skills testing program developed and administered to New Jersey children by the State Department of Education. The Committee was interested in the eighth-grade…
Descriptors: Accountability, Achievement Tests, Basic Skills, Cost Effectiveness
Luijten, Anton J. M., Ed. – 1991
This collection of 18 papers (selected from a total of 57 presented at a conference of the International Association for Educational Assessment) represents efforts by examining bodies and institutes to: improve the examination system and testing techniques; develop reliable instruments; and establish standards for public examinations. The papers…
Descriptors: Educational Assessment, Educational Change, Educational Policy, Elementary Secondary Education
New York State United Teachers. – 1991
New York State United Teachers (NYSUT) established a Task Force on Student Assessment in the spring of 1990, which was designed to: review available information on testing principles and practices; make recommendations for reform of testing; and communicate its findings to NYSUT members. Ten recommendations were made, centering on the necessity…
Descriptors: Academic Achievement, Educational Assessment, Educational Change, Elementary Secondary Education
Carlson, Jerry S. – 1983
This study assessed the usefulness of the dynamic testing approach to optimize testing procedures by reducing or eliminating bias, conceived of as error in measurement attributable to factors entering into performance which were not the target of the assessment. The study examined: (1) whether the dynamic assessment approach yields information…
Descriptors: Anglo Americans, Blacks, Cognitive Ability, Cognitive Measurement
Mississippi State Univ., Mississippi State, Bureau of Educational Research. – 1982
Survey results have suggested that, while teachers like to have test information available, most do not have great skill or consistency in interpreting test score data. Teachers who consider a certain type of test very valuable or useful are less likely to question the accuracy of the scores than are teachers who consider a test to be of little…
Descriptors: Academic Standards, Attitude Change, Decision Making, Educational Research
Levine, Michael V.; Drasgow, Fritz – 1984
Some examinees' test-taking behavior may be so idiosyncratic that their scores are not comparable to the scores of more typical examinees. Appropriateness indices, which provide quantitative measures of response-pattern atypicality, can be viewed as statistics for testing a null hypothesis of normal test-taking behavior against an alternative…
Descriptors: Cheating, College Entrance Examinations, Computer Simulation, Estimation (Mathematics)


