Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 24 |
Descriptor
Educational Testing | 126 |
Test Construction | 126 |
Test Validity | 102 |
Test Reliability | 56 |
Test Interpretation | 39 |
Testing Problems | 35 |
Achievement Tests | 32 |
Elementary Secondary Education | 30 |
Student Evaluation | 24 |
Standardized Tests | 21 |
Test Bias | 21 |
More ▼ |
Source
Author
Ebel, Robert L. | 3 |
Haney, Walt | 2 |
Milton, Ohmer | 2 |
Mislevy, Robert J. | 2 |
ANDRADE, MANUEL | 1 |
Ahmed, Ayesha | 1 |
Allen, R. R. | 1 |
Almond, Russell G. | 1 |
Alonzo, Julie | 1 |
BROYLES, DAVID | 1 |
Baker, Eva L. | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 15 |
Elementary Education | 2 |
Grade 2 | 2 |
Higher Education | 2 |
Grade 3 | 1 |
High Schools | 1 |
Audience
Practitioners | 10 |
Teachers | 5 |
Researchers | 3 |
Administrators | 2 |
Students | 2 |
Community | 1 |
Counselors | 1 |
Support Staff | 1 |
Location
United Kingdom | 4 |
United States | 3 |
Arizona (Phoenix) | 1 |
Australia | 1 |
California | 1 |
California (Stanford) | 1 |
Canada | 1 |
Colorado (Denver) | 1 |
Finland | 1 |
Illinois | 1 |
Michigan | 1 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 2 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Timothy Donald Folger – ProQuest LLC, 2024
This dissertation aims to bridge the gap between validity theory and the practice of validation. The dissertation employs a three-article approach. Following the introduction in Chapter I, three independent manuscripts representing three empirical studies are presented (i.e., Chapters II - IV). Each chapter is a stand-alone publishable manuscript,…
Descriptors: Educational Testing, Psychological Testing, Test Validity, Delphi Technique
W. James Popham – Pearson, 2024
"Classroom Assessment" shows pre- and in-service teachers how to use classroom testing accurately and formatively to dramatically increase their teaching effectiveness and promote student learning. In addition to clear and concise guidelines on how to develop and use quality classroom assessments, the author also focuses on the teaching…
Descriptors: Student Evaluation, Testing, Teacher Effectiveness, Test Construction
Beck, Klaus – Frontline Learning Research, 2020
Many test developers try to ensure the content validity of their tests by having external experts review the items, e.g. in terms of relevance, difficulty, or clarity. Although this approach is widely accepted, a closer look reveals several pitfalls need to be avoided if experts' advice is to be truly helpful. The purpose of this paper is to…
Descriptors: Content Validity, Psychological Testing, Educational Testing, Student Evaluation
Mislevy, Robert J.; Oliveri, Maria Elena – Educational Measurement: Issues and Practice, 2019
In this digital ITEMS module, Dr. Robert [Bob] Mislevy and Dr. Maria Elena Oliveri introduce and illustrate a sociocognitive perspective on educational measurement, which focuses on a variety of design and implementation considerations for creating fair and valid assessments for learners from diverse populations with diverse sociocultural…
Descriptors: Educational Testing, Reliability, Test Validity, Test Reliability
Koretz, Daniel – Measurement: Interdisciplinary Research and Perspectives, 2015
Accountability has become a primary function of large-scale testing in the United States. The pressure on educators to raise scores is vastly greater than it was several decades ago. Research has shown that high-stakes testing can generate behavioral responses that inflate scores, often severely. I argue that because of these responses, using…
Descriptors: Accountability, Educational Testing, Test Construction, Test Validity
Shepard, Lorrie A. – Measurement: Interdisciplinary Research and Perspectives, 2013
In his article, Haertel (this issue) asks a fundamental question about how use of a test is expected to cause improvements in the educational system and in learning. He also considers how test validity should be investigated and argues for a more expansive view of validity that does not stop with scoring or generalization (the more technical and…
Descriptors: Educational Testing, Test Validity, Test Results, Test Construction
Briggs, Derek C. – Measurement: Interdisciplinary Research and Perspectives, 2013
In his focus article "How Is Testing Supposed to Improve Schooling?" Ed Haertel distinguishes between seven uses of educational tests as a function of the intended action and what or who will be influenced by the intended action. He then applies Mike Kane's interpretive argument approach (Kane, 2006) as a basis for speculating about the validity…
Descriptors: Educational Testing, Accountability, Educational Improvement, Teacher Evaluation
American Educational Research Association (AERA), 2014
Developed jointly by the American Educational Research Association, American Psychological Association, and the National Council on Measurement in Education, "Standards for Educational and Psychological Testing" (Revised 2014) addresses professional and technical issues of test development and use in education, psychology, and…
Descriptors: Standards, Educational Testing, Psychological Testing, Test Construction
Ahmed, Ayesha; Pollitt, Alastair – Assessment in Education: Principles, Policy & Practice, 2011
At the heart of most assessments lies a set of questions, and those who write them must achieve "two" things. Not only must they ensure that each question elicits the kind of performance that shows how "good" pupils are at the subject, but they must also ensure that each mark scheme gives more marks to those who are…
Descriptors: Academic Achievement, Classification, Educational Quality, Quality Assurance
Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity
Lissitz, Robert W.; Samuelsen, Karen – Educational Researcher, 2007
This article raises a number of questions about the current unified theory of test validity that has construct validity at its center. The authors suggest a different way of conceptualizing the problem of establishing validity by considering whether the focus of the investigation of a test is internal to the test itself or focuses on constructs…
Descriptors: Vocabulary, Evaluation Research, Construct Validity, Test Validity
McCrimmon, Adam W.; Climie, Emma A. – Canadian Journal of School Psychology, 2011
This article reviews the "Wechsler Individual Achievement Test-Third Edition" (WIAT-III), a newly updated individual measure of academic achievement for students in Pre-Kindergarten through Grade 12 (age 4 years, 0 months to 19 years, 11 months). Suitable for use in educational, clinical, and research settings, the stated purposes of the WIAT-III…
Descriptors: Elementary Secondary Education, Educational Testing, Academic Achievement, Test Reviews
Walker, Michael E. – Measurement: Interdisciplinary Research and Perspectives, 2010
"Linking" is a term given to a general class of procedures by which one represents scores X on one test or measure in terms of scores Y on another test or measure. A recent taxonomy by Holland and Dorans (2006; Holland, 2007) organizes the various types of links into three broad categories: prediction, scale aligning, and equating. In…
Descriptors: Foreign Countries, Test Construction, Test Validity, Measurement Techniques
von Davier, Alina A. – Measurement: Interdisciplinary Research and Perspectives, 2010
The article "Thinking About Linking" by Newton (2010) presents a novel philosophical perspective on the way that educational assessments should be linked. Newton starts by describing the linking framework as it was characterized in various publications and identifies a cross-cultural dimension in the definitions and uses of test…
Descriptors: Foreign Countries, Educational Assessment, Student Evaluation, Evaluation Criteria
Nichols, Paul D.; Meyers, Jason L.; Burling, Kelly S. – Educational Measurement: Issues and Practice, 2009
Assessments labeled as formative have been offered as a means to improve student achievement. But labels can be a powerful way to miscommunicate. For an assessment use to be appropriately labeled "formative," both empirical evidence and reasoned arguments must be offered to support the claim that improvements in student achievement can be linked…
Descriptors: Academic Achievement, Tutoring, Student Evaluation, Evaluation Methods