Publication Date
| In 2026 | 0 |
| Since 2025 | 16 |
| Since 2022 (last 5 years) | 93 |
| Since 2017 (last 10 years) | 257 |
| Since 2007 (last 20 years) | 464 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 395 |
| Teachers | 190 |
| Administrators | 102 |
| Researchers | 99 |
| Policymakers | 57 |
| Students | 48 |
| Parents | 43 |
| Counselors | 19 |
| Community | 14 |
| Support Staff | 3 |
Location
| Canada | 83 |
| Australia | 65 |
| United States | 46 |
| California | 35 |
| United Kingdom (England) | 29 |
| New York | 28 |
| Texas | 27 |
| Netherlands | 26 |
| United Kingdom | 26 |
| Kentucky | 23 |
| Ohio | 22 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedHaertel, Edward H. – Educational Measurement: Issues and Practice, 1999
Discusses issues of validity in high-stakes testing, beginning with some purposes of a testing program and proceeding to some underlying assumptions about testing. Suggests four possible studies to address assumptions often ignored by asking various groups of people about testing. (SLD)
Descriptors: Elementary Secondary Education, High Stakes Tests, Research Needs, Surveys
Peer reviewedPopham, W. James – Educational Measurement: Issues and Practice, 1999
Discusses the direction large-scale educational testing is heading, pointing out pitfalls in current and future use of such tests. The large-scale assessment community seems to be unconcerned about the central mission of education, the instruction of children. (SLD)
Descriptors: Educational Testing, Futures (of Society), Role of Education, Standardized Tests
Gose, Ben; Selingo, Jeffrey – Chronicle of Higher Education, 2001
Explores how social, legal, and demographic forces threaten to dethrone the most widely used college entrance exam. New criticism focuses on the use of what is essentially an IQ test to measure students' ability to learn. (EV)
Descriptors: College Admission, College Entrance Examinations, High Stakes Tests, Higher Education
Goodman, Dean P.; Hambleton, Ronald K. – Applied Measurement in Education, 2004
A critical, but often neglected, component of any large-scale assessment program is the reporting of test results. In the past decade, a body of evidence has been compiled that raises concerns over the ways in which these results are reported to and understood by their intended audiences. In this study, current approaches for reporting…
Descriptors: Test Results, Student Evaluation, Scores, Testing Programs
Peer reviewedDorn, Sherman – Education Policy Analysis Archives, 2003
A historical perspective on high-stakes testing suggests that tests required for high school graduation will have mixed results for the putative value of high school diplomas. Graduation requirements are not likely to settle the general cultural confusion in the United States about the purpose of secondary education or a high school diploma. (SLD)
Descriptors: Educational History, Graduation Requirements, High School Graduates, High Schools
Bachman, Lyle F. – Language Assessment Quarterly, 2005
The fields of language testing and educational and psychological measurement have not, as yet, developed a set of principles and procedures for linking test scores and score-based inferences to test use and the consequences of test use. Although Messick (1989) discusses test use and consequences, his framework provides virtually no guidance on how…
Descriptors: Test Use, Testing, Language Tests, Validity
Thurlow, Martha L.; Laitusis, Cara Cahalan; Dillon, Deborah R.; Cook, Linda L.; Moen, Ross E.; Abedi, Jamal; O'Brien, David G. – National Accessible Reading Assessment Projects, 2009
Within the context of standards-based educational systems, states are using large scale reading assessments to help ensure that all children have the opportunity to learn essential knowledge and skills. The challenge for developers of accessible reading assessments is to develop assessments that measure only those student characteristics that are…
Descriptors: Reading Achievement, Measures (Individuals), Student Characteristics, Disabilities
Reeve, Charlie L.; Charles, Jennifer E. – Intelligence, 2008
The current study examines the views of experts in the science of mental abilities about the primacy and uniqueness of "g" and the social implications of ability testing, and compares their responses to the views of a group of non-expert psychologists. Results indicate expert consensus that "g" is an important, non-trivial determinant (or at least…
Descriptors: Race, Psychologists, Testing, Predictive Validity
Livingston, Samuel A.; Lewis, Charles – 1993
This paper presents a method for estimating the accuracy and consistency of classifications based on test scores. The scores can be produced by any scoring method, including the formation of a weighted composite. The estimates use data from a single form. The reliability of the score is used to estimate its effective test length in terms of…
Descriptors: Classification, Error of Measurement, Estimation (Mathematics), Reliability
Scheuneman, Janice Dowd; Slaughter, Carole – 1991
A number of explanations have been offered for the differences in test performance among various population subgroups. This paper begins with a discussion of these explanations including the psychometric explanation that group differences are due to bias in the test. An overview of bias research argues that results to date are inconclusive. A…
Descriptors: Ethnic Groups, Group Membership, Item Bias, Minority Groups
Peck, Curtiss S. – 1995
The relevance of assessing attention or concentration skills for personnel selection is discussed, and how a person's interpersonal characteristics are influenced by and influence attentional skills is explored. Scales in the Theory Attentional and Interpersonal Style (TAIS) inventory developed by Robert Nideffer are described. The interaction of…
Descriptors: Attention, Evaluation Methods, Interpersonal Relationship, Personnel Selection
Daniel, Larry G. – 1997
Statistical significance tests (SSTs) have been the object of much controversy among social scientists. Proponents have hailed SSTs as an objective means for minimizing the likelihood that chance factors have contributed to research results. Critics have both questioned the logic underlying SSTs and bemoaned the widespread misapplication and…
Descriptors: Editing, Educational Assessment, Policy, Research Problems
Smith, Dean R. – 1993
Four studies examined the validity of the "Living Word Vocabulary" (LWV), a corpus of approximately 44,000 alphabetized words with multiple meanings tested at different grade levels. Regressions were performed between the grade level p-values (percentages of students responding correctly to a vocabulary test item) reported by LWV and…
Descriptors: Elementary Secondary Education, Readability, Reading Research, Regression (Statistics)
Ackerman, Terry – 1994
The purpose of this paper is to demonstrate how graphical analyses can enhance the interpretation and understanding of multidimensional item-response theory (IRT) analyses. Conceptually many of the unidimensional IRT concepts such as item characteristic curves, information, etc., can be extended to multiple dimensions. However, as the…
Descriptors: Ability, Achievement Tests, Educational Assessment, Item Response Theory
Educational Testing Service, Princeton, NJ. Test Collection. – 1983
This annotated bibliography (1933-1982) lists currently available instruments that might be used with Spanish-speaking individuals. The bibliographic information, obtained from the holdings of the Educational Testing Service Test Collection, is not limited to any specific type of test. Thus, measures of achievement, aptitude, and attitude, etc.,…
Descriptors: Achievement Tests, Annotated Bibliographies, Aptitude Tests, Attitude Measures

Direct link
