Bulat, Jennae; Dubeck, Margaret; Green, Paula; Harden, Karon; Henny, Catherine; Mattos, Mónika; Pflepsen, Alison; Robledo, Ana; Sitabkhan, Yasmin – RTI International, 2017
Over the past decade, RTI International has pursued the goal of quality, inclusive, differentiated early grade literacy instruction in nearly 30 early grade reading or early grade literacy programs in low- and middle-income (LMI) countries. Across our diverse portfolio, we have supported Ministries of Education (Ministries) in diverse contexts in…
Descriptors: Emergent Literacy, Inclusion, Equal Education, Reading Instruction
Gergen, Kenneth J.; Dixon-Román, Ezekiel J. – Teachers College Record, 2014
In the present offering we challenge the presumption that the educational testing of students provides objective information about such students. This presumption largely rests on an empiricist account of science. In light of mounting criticism, however, empiricist foundationalism has given way to a social epistemology. From this standpoint,…
Descriptors: Epistemology, Educational Testing, Test Validity, Evaluation Utilization
Aragon, Stephanie; Rowland, Julie; Wixom, Micah Ann – Education Commission of the States, 2015
With new state assessments kicking into full swing across the country, schools are seeing more and more parents wanting to opt out their children. Determining whether states allow assessment opt-outs can be complex and is constantly evolving. In some states the answer is clear: State policies either allow or prohibit state assessment opt-outs, or…
Descriptors: Educational Policy, State Policy, Educational Testing, Parents
White, John – London Review of Education, 2013
It is time to replace the examination regime at 16 and 18 by something more appropriate. The coalition government has been solidifying its place by its Baccalaureate reforms at both ages, but this is a move in quite the wrong direction. Whatever the wider purposes that the examination system may serve, its core aim is to find out how well students…
Descriptors: Student Evaluation, Evaluation Methods, Educational Testing, Testing Programs
Quaigrain, Kennedy; Arhin, Ato Kwamina – Cogent Education, 2017
Item analysis is essential in improving items which will be used again in later tests; it can also be used to eliminate misleading items in a test. The study focused on item and test quality and explored the relationship between difficulty index (p-value) and discrimination index (DI) with distractor efficiency (DE). The study was conducted among…
Descriptors: Item Analysis, Teacher Developed Materials, Test Reliability, Educational Assessment
Cramer, Angelique O. J. – Measurement: Interdisciplinary Research and Perspectives, 2012
What is validity? A simple question but apparently one with many answers, as Paul Newton highlights in his review of the history of validity. The current definition of validity, as entertained in the 1999 "Standards for Educational and Psychological Testing" is indeed a consensus, one between the classical notion of attributes, and measures…
Descriptors: Validity, Educational Testing, Depression (Psychology), Psychology
Thompson, Greg; Mockler, Nicole – Journal of Educational Administration and History, 2016
Historically, school leaders have occupied a somewhat ambiguous position within networks of power. On the one hand, they appear to be celebrated as what Ball (2003) has termed the "new hero of educational reform"; on the other, they are often "held to account" through those same performative processes and technologies. These…
Descriptors: Foreign Countries, Principals, Administrator Role, Ethics
Ramineni, Chaitanya; Williamson, David M. – Assessing Writing, 2013
In this paper, we provide an overview of psychometric procedures and guidelines Educational Testing Service (ETS) uses to evaluate automated essay scoring for operational use. We briefly describe the e-rater system, the procedures and criteria used to evaluate e-rater, implications for a range of potential uses of e-rater, and directions for…
Descriptors: Educational Testing, Guidelines, Scoring, Psychometrics
Koretz, Daniel – Measurement: Interdisciplinary Research and Perspectives, 2015
Accountability has become a primary function of large-scale testing in the United States. The pressure on educators to raise scores is vastly greater than it was several decades ago. Research has shown that high-stakes testing can generate behavioral responses that inflate scores, often severely. I argue that because of these responses, using…
Descriptors: Accountability, Educational Testing, Test Construction, Test Validity
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2012
This focus article provided the author with an opportunity to unpack the consensus definition of validity and to explore its implications in the light of recent debates. He proposed an elaboration of the consensus definition, which was intended to express the spirit of the "Standards for Educational and Psychological Testing" with increased…
Descriptors: Validity, Educational Testing, Psychological Testing, Definitions
Murphy, Kevin R. – Measurement: Interdisciplinary Research and Perspectives, 2012
As Paul Newton so ably demonstrates, the concept of validity is both important and problematic. Over the last several decades, a consensus definition of validity has emerged; the current edition of "Standards for Educational and Psychological Testing" notes, "Validity refers to the degree to which evidence and theory support the interpretations of…
Descriptors: Evidence, Validity, Educational Testing, Psychological Testing
Fan, Jinsong; Jin, Yan – Language Testing in Asia, 2013
English language testing has been developing with great momentum in China in the past two decades. However, little research is existent as to how these English tests are developed, administered, and used. This study reported a survey of English language testing practice in the Chinese context through empirically examining the testing practice of…
Descriptors: Foreign Countries, Language Tests, Second Language Learning, English (Second Language)
Lane, Suzanne – Measurement: Interdisciplinary Research and Perspectives, 2013
In Shepard's (1997) discussion on the importance of test use and consequences in a validity argument for educational assessments, she reflected on Cronbach and Meehl's (1955) perspective on the role of test developers in providing consequential evidence. In the following year, a special issue in "Educational Measurement: Issues and Practice"…
Descriptors: Educational Testing, Test Use, Test Results, Test Validity
Educational Testing Service, 2011
Choosing whether to test via computer is the most difficult and consequential decision the designers of a testing program can make. The decision is difficult because of the wide range of choices available. Designers can choose where and how often the test is made available, how the test items look and function, how those items are combined into…
Descriptors: Test Items, Testing Programs, Testing, Computer Assisted Testing
Bond, Lloyd – Measurement: Interdisciplinary Research and Perspectives, 2014
Lloyd Bond comments here on the Focus article in this issue of "Measurement: Interdisciplinary Research and Perspectives". The Focus article is entitled: "How Task Features Impact Evidence from Assessments Embedded in Simulations and Games" (Russell G. Almond, Yoon Jeon Kim, Gertrudes Velasquez, and Valerie J. Shute). Bond…
Descriptors: Educational Assessment, Task Analysis, Models, Design

