Koretz, Daniel – American Educator, 2018
In "The Testing Charade: Pretending to Make Schools Better", the author's new book from which this article is drawn, the failures of test-based accountability are documented and some of the most egregious misuses and outright abuses of testing are described, along with some of the most serious negative effects. Neither good intentions…
Descriptors: Accountability, Testing, Testing Problems, Test Validity
Cizek, Gregory J. – Assessment in Education: Principles, Policy & Practice, 2016
Advances in validity theory and alacrity in validation practice have suffered because the term "validity" has been used to refer to two incompatible concerns: (1) the degree of support for specified interpretations of test scores (i.e. intended score meaning) and (2) the degree of support for specified applications (i.e. intended test…
Descriptors: Scores, Definitions, Evaluation Utilization, Data Interpretation
Gargani, John; Donaldson, Stewart I. – New Directions for Evaluation, 2011
This chapter describes a concrete process that stakeholders can use to make predictions about the future performance of programs in local contexts. Within the field of evaluation, the discussion of validity as it relates to outcome evaluation seems to be focused largely on questions of internal validity (Did it work?) with less emphasis on…
Descriptors: Validity, Prediction, Program Evaluation, Evaluation Utilization
Rogers, W. Todd – Canadian Journal of Education, 2014
Principals and teachers do not use large-scale assessment results because the lack of distinct and reliable subtests prevents identifying strengths and weaknesses of students and instruction, the results arrive too late to be used, and principals and teachers need assistance to use the results to improve instruction so as to improve student…
Descriptors: Foreign Countries, Group Testing, Multidimensional Scaling, Evaluation Utilization
Lane, Suzanne; Zumbo, Bruno D.; Abedi, Jamal; Benson, Jeri; Dossey, John; Elliott, Stephen N.; Kane, Michael; Linn, Robert; Paredes-Ziker, Cindy; Rodriguez, Michael; Schraw, Gregg; Slattery, Jean; Thomas, Veronica; Willhoft, Joe – Applied Measurement in Education, 2009
Given the changing landscape of educational accountability at the local, state, and national levels, and the changes in the uses of the National Assessment of Educational Progress (NAEP), including the evolving uses of NAEP as a policy tool to interpret state assessment and accountability systems, an explicit statement of the current and potential…
Descriptors: National Competency Tests, Academic Achievement, Accountability, Test Validity
Braden, Jeffery P.; Shaw, Steven R. – Assessment for Effective Intervention, 2009
The intervention validity of cognitive assessment batteries is considered within an historical context to identify what the evidence supports (knowns), what cannot be known (unknowables), and what is not yet known (unknowns). Two ways cognitive batteries could inform intervention are identified: a disordinal (i.e., aptitude-treatment interaction)…
Descriptors: Intervention, Validity, Cognitive Tests, Cognitive Measurement
Noell, Jay; Ginsburg, Alan – Applied Measurement in Education, 2009
The report, "Evaluation of the National Assessment of Educational Progress", provides a number of recommendations for addressing validity concerns about NAEP. This article identifies actions that could be taken by the Congress, the National Center for Education Statistics, and the National Assessment Governing Board--which share responsibility for…
Descriptors: National Competency Tests, Federal Government, Public Agencies, Test Validity
Herman, Joan L.; Osmundson, Ellen; Dietel, Ronald – Assessment and Accountability Comprehensive Center, 2010
This report describes the purposes of benchmark assessments and provides recommendations for selecting and using benchmark assessments--addressing validity, alignment, reliability, fairness and bias and accessibility, instructional sensitivity, utility, and reporting issues. We also present recommendations on building capacity to support schools'…
Descriptors: Multiple Choice Tests, Test Items, Benchmarking, Educational Assessment
Amo, Courtney; Cousins, J. Bradley – New Directions for Evaluation, 2007
The study of the consequences of evaluation, or more specifically of evaluation use or utilization, represents a significant portion of the body of research on evaluation. Much has been written on the evolution of the multidimensional concept of evaluation use, most recently the examination of consequences of evaluation that are not a function of…
Descriptors: Evaluation Methods, Research Methodology, Cognitive Processes, Evaluation Research

Moss, Pamela A. – Educational Measurement: Issues and Practice, 1998
Provides an argument for incorporating consideration of consequences into validity theory that is grounded in the reflexive nature of social knowledge. It also calls for the consideration of evidence of validity based on the actual discourse surrounding the practices and products of testing. (SLD)
Descriptors: Evaluation Methods, Evaluation Utilization, Program Evaluation, Test Construction
Dean, Linda M. – 1997
This paper proposes a model to establish which criteria are considered by stakeholders as valid for evaluating a program. The model is developed with the aim of increasing the credibility and use of evaluations. Stakeholders are involved in the identification of potential evaluation criteria, and ratings of validity and priority are used as the…
Descriptors: Criteria, Data Analysis, Data Collection, Evaluation Methods

Stufflebeam, Daniel L. – International Journal of Educational Research, 1987
This article reviews experiences in the United States over the past 15 years in developing standards to guide and assess the work of professional evaluators. It introduces the Joint Committee on Standards for Educational Evaluation's "Program Evaluation Standards" and overviews the Committee's project to develop "Educational…
Descriptors: Cost Effectiveness, Educational Assessment, Evaluation Criteria, Evaluation Utilization
de Oliveira, Terezinha Rodrigues; Elliot, Ligia Gomes – 1983
A reorganized version of standards to be utilized for those who are involved with educational evaluation in Brazil was the result of a critical review of evaluation standards which were originally elaborated by the Joint Committee on Standards for Educational Evaluation. The conceptual framework for the critical review comprised logical…
Descriptors: Educational Improvement, Evaluation Criteria, Evaluation Methods, Evaluation Needs
White, Richard T.; Arzi, Hanna J. – Research in Science Education, 2005
Aiming to encourage longitudinal studies in science education, we clarify conceptual and methodological aspects of longitudinal research. We use the studies that other articles in this issue describe to illustrate these aspects. The illustrations range from attempts to promote long-term change through experimental teaching to investigations that…
Descriptors: Longitudinal Studies, Test Validity, Science Education, Research Methodology
Raupp, Magdala – 1982
The process of developing a testing/evaluation/instruction management subsystem that will be uniquely suited to the specific situation of a given district requires a decision to use a strategy for instructional change in which data from testing and evaluation would play a major role. Considerations that might go into such a decision involve some…
Descriptors: Educational Change, Educational Strategies, Elementary Secondary Education, Evaluation Utilization