Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 36 |
Descriptor
Educational Testing | 72 |
Validity | 72 |
Reliability | 25 |
Student Evaluation | 21 |
Evaluation Methods | 20 |
Test Use | 17 |
Educational Assessment | 15 |
Academic Achievement | 14 |
Elementary Secondary Education | 14 |
Scores | 14 |
Test Construction | 14 |
More ▼ |
Source
Author
Haberman, Shelby J. | 4 |
Baker, Eva L. | 3 |
Newton, Paul E. | 3 |
Sinharay, Sandip | 3 |
Mislevy, Robert J. | 2 |
Sireci, Stephen G. | 2 |
Allen, Rich | 1 |
Almond, Russell G. | 1 |
Arnold, Nancy | 1 |
Attali, Yigal | 1 |
Berk, Ronald A. | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 11 |
Higher Education | 5 |
Secondary Education | 4 |
High Schools | 3 |
Postsecondary Education | 3 |
Elementary Education | 2 |
Adult Education | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 8 | 1 |
Intermediate Grades | 1 |
More ▼ |
Audience
Practitioners | 3 |
Teachers | 3 |
Administrators | 1 |
Researchers | 1 |
Location
United Kingdom | 4 |
United States | 3 |
New York | 2 |
United Kingdom (England) | 2 |
United Kingdom (Wales) | 2 |
Florida | 1 |
Georgia | 1 |
Louisiana | 1 |
Minnesota | 1 |
New York (New York) | 1 |
Tennessee | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Every Student Succeeds Act… | 1 |
Individuals with Disabilities… | 1 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Newton, Paul E.; Shaw, Stuart D. – Assessment in Education: Principles, Policy & Practice, 2016
The ability to convey shared meaning with minimal ambiguity is highly desirable for technical terms within disciplines and professions. Unfortunately, there is no widespread professional consensus over the meaning of the word "validity" as it pertains to educational and psychological testing. After illustrating the nature and extent of…
Descriptors: Test Validity, Validity, Ambiguity (Semantics), Psychological Testing
Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D. – Journal of Educational and Behavioral Statistics, 2021
Linking score scales across different tests is considered speculative and fraught, even at the aggregate level. We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that aggregate linkages can be validated both…
Descriptors: Equated Scores, Validity, Methods, School Districts
Camara, Wayne J.; Mattern, Krista; Croft, Michelle; Vispoel, Sara; Nichols, Paul – Educational Measurement: Issues and Practice, 2019
In 2018, 26 states administered a college admissions test to all public school juniors. Nearly half of those states proposed to use those scores as their academic achievement indicators for federal accountability under the Every Student Succeeds Act (ESSA); many others are planning to use those scores for other accountability purposes.…
Descriptors: College Entrance Examinations, Accountability, Scores, Academic Achievement
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2012
This focus article provided the author with an opportunity to unpack the consensus definition of validity and to explore its implications in the light of recent debates. He proposed an elaboration of the consensus definition, which was intended to express the spirit of the "Standards for Educational and Psychological Testing" with increased…
Descriptors: Validity, Educational Testing, Psychological Testing, Definitions
Newton, Paul E. – Measurement: Interdisciplinary Research and Perspectives, 2012
The 1999 "Standards for Educational and Psychological Testing" defines validity as the degree to which evidence and theory support the interpretations of test scores entailed by proposed uses of tests. Although quite explicit, there are ways in which this definition lacks precision, consistency, and clarity. The history of validity has taught us…
Descriptors: Evidence, Validity, Educational Testing, Risk
Sinharay, Sandip; Haberman, Shelby J.; Wainer, Howard – Educational and Psychological Measurement, 2011
There are several techniques that increase the precision of subscores by borrowing information from other parts of the test. These techniques have been criticized on validity grounds in several of the recent publications. In this note, the authors question the argument used in these publications and suggest both inherent limits to the validity…
Descriptors: Scores, Methods, Validity, Reliability
Informing in the Information Age: How to Communicate Measurement Concepts to Education Policy Makers
Sireci, Stephen G.; Forte, Ellen – Educational Measurement: Issues and Practice, 2012
Current educational policies rely on educational assessments. However, the technical aspects of assessments are often unknown to policy makers, which is dangerous because sound assessment policy requires knowledge of the strengths and limitations of educational tests. In this article, we discuss the importance of informing policy makers of…
Descriptors: Educational Assessment, Psychometrics, Educational Policy, Educational Testing
Cramer, Angelique O. J. – Measurement: Interdisciplinary Research and Perspectives, 2012
What is validity? A simple question but apparently one with many answers, as Paul Newton highlights in his review of the history of validity. The current definition of validity, as entertained in the 1999 "Standards for Educational and Psychological Testing" is indeed a consensus, one between the classical notion of attributes, and measures…
Descriptors: Validity, Educational Testing, Depression (Psychology), Psychology
Murphy, Kevin R. – Measurement: Interdisciplinary Research and Perspectives, 2012
As Paul Newton so ably demonstrates, the concept of validity is both important and problematic. Over the last several decades, a consensus definition of validity has emerged; the current edition of "Standards for Educational and Psychological Testing" notes, "Validity refers to the degree to which evidence and theory support the interpretations of…
Descriptors: Evidence, Validity, Educational Testing, Psychological Testing
Berk, Ronald A. – Journal of Faculty Development, 2016
Recently, student outcomes have bubbled to the top of debates about how to evaluate teaching in community and liberal arts colleges, universities, and professional schools, but even more international attention has been riveted on how outcomes are being used to evaluate teachers and administrators K-12 (Harris, 2012; Rowen & Raudenbush, 2016;…
Descriptors: Value Added Models, Academic Achievement, Outcomes of Education, Teacher Evaluation
Mislevy, Robert J. – Measurement: Interdisciplinary Research and Perspectives, 2012
Paul E. Newton's "Clarifying the Consensus Definition of Validity" addresses the single most important, yet stubbornly protean, value in educational and psychological assessment. "Standards for Educational and Psychological Testing" (American Educational Research Association, American Psychological Association, & National Council on Measurement in…
Descriptors: Evidence, Validity, Educational Testing, Psychological Evaluation
American Educational Research Association (AERA), 2014
Developed jointly by the American Educational Research Association, American Psychological Association, and the National Council on Measurement in Education, "Standards for Educational and Psychological Testing" (Revised 2014) addresses professional and technical issues of test development and use in education, psychology, and…
Descriptors: Standards, Educational Testing, Psychological Testing, Test Construction
Qi, Sen; Mitchell, Ross E. – Journal of Deaf Studies and Deaf Education, 2012
The first large-scale, nationwide academic achievement testing program using Stanford Achievement Test (Stanford) for deaf and hard-of-hearing children in the United States started in 1969. Over the past three decades, the Stanford has served as a benchmark in the field of deaf education for assessing student academic achievement. However, the…
Descriptors: Testing Programs, Educational Testing, Deafness, Academic Achievement
Condon, William – Assessing Writing, 2013
Automated Essay Scoring (AES) has garnered a great deal of attention from the rhetoric and composition/writing studies community since the Educational Testing Service began using e-rater[R] and the "Criterion"[R] Online Writing Evaluation Service as products in scoring writing tests, and most of the responses have been negative. While the…
Descriptors: Measurement, Psychometrics, Evaluation Methods, Educational Testing
Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J. – Multivariate Behavioral Research, 2010
Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…
Descriptors: Educational Testing, Scores, Reports, Psychometrics