Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 30 |
Descriptor
Evaluation Utilization | 61 |
Test Validity | 31 |
Validity | 26 |
Evaluation Methods | 23 |
Test Use | 12 |
Educational Assessment | 11 |
Test Reliability | 11 |
Elementary Secondary Education | 10 |
Evaluation Problems | 10 |
Foreign Countries | 10 |
Program Evaluation | 10 |
More ▼ |
Source
Author
Abrami, Philip C. | 2 |
Cousins, J. Bradley | 2 |
Kirkhart, Karen E. | 2 |
Marsh, Herbert W. | 2 |
Moss, Pamela A. | 2 |
Wilson, Robert J. | 2 |
Abedi, Jamal | 1 |
Aiona, Shelli | 1 |
Amo, Courtney | 1 |
Arzi, Hanna J. | 1 |
Astor, Ron Avi | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 8 |
Higher Education | 6 |
Postsecondary Education | 5 |
Early Childhood Education | 2 |
Elementary Education | 1 |
Grade 1 | 1 |
Grade 12 | 1 |
Grade 4 | 1 |
Grade 8 | 1 |
Kindergarten | 1 |
Preschool Education | 1 |
More ▼ |
Audience
Administrators | 1 |
Practitioners | 1 |
Teachers | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Elementary and Secondary… | 1 |
Elementary and Secondary… | 1 |
Goals 2000 | 1 |
Assessments and Surveys
Students Evaluation of… | 2 |
Graduate Record Examinations | 1 |
Program for International… | 1 |
What Works Clearinghouse Rating
Chitra Sabapathy – Shanlax International Journal of Education, 2024
Background: Mid-semester evaluations are gaining traction as a means to gather evaluation data for formative purposes. However, it is not clear if course coordinators who conduct these evaluations are adequately equipped with evaluative knowledge and skills to guide them through their evaluative processes. Objectives: This study is a…
Descriptors: Evaluation Methods, Instructor Coordinators, Tutors, College Students
Katherine Ryker; Rachel Teasdale; Kelsey Bitting – Journal of Geoscience Education, 2025
Third-party observations using validated observation protocols (OPs) provide a reliable way of recording teacher and student behaviors across different classrooms and institutions, which can then be used to identify what pedagogical strategies geoscience faculty use and how they are tied to learning outcomes of importance to the field. We examined…
Descriptors: Classroom Observation Techniques, Earth Science, STEM Education, Higher Education
Daniella Winter; Yoram Braw – Journal of Attention Disorders, 2022
Background: The current study aimed to validate the utility of previously established validity indicators derived from MOXO-d-CPT's continuous performance test. Method: Healthy simulators feigned impairment after searching online for relevant information, an ecologically valid coaching condition (n = 39). They were compared to ADHD patients (n =…
Descriptors: Foreign Countries, Undergraduate Students, Attention Deficit Hyperactivity Disorder, Computer Simulation
Sessoms, John; Henson, Robert A. – Measurement: Interdisciplinary Research and Perspectives, 2018
Diagnostic classification models (DCMs) classify examinees based on the skills they have mastered given their test performance. This classification enables targeted feedback that can inform remedial instruction. Unfortunately, applications of DCMs have been criticized (e.g., no validity support). Generally, these evaluations have been brief and…
Descriptors: Literature Reviews, Classification, Models, Criticism
Koretz, Daniel – American Educator, 2018
In "The Testing Charade: Pretending to Make Schools Better", the author's new book from which this article is drawn, the failures of test-based accountability are documented and some of the most egregious misuses and outright abuses of testing are described, along with some of the most serious negative effects. Neither good intentions…
Descriptors: Accountability, Testing, Testing Problems, Test Validity
Sireci, Stephen G. – Assessment in Education: Principles, Policy & Practice, 2016
A misconception exists that validity may refer only to the "interpretation" of test scores and not to the "uses" of those scores. The development and evolution of validity theory illustrate test score interpretation was a primary focus in the earliest days of modern testing, and that validating interpretations derived from test…
Descriptors: Test Validity, Misconceptions, Evaluation Utilization, Data Interpretation
Levin, Henry M.; Belfield, Clive – Journal of Research on Educational Effectiveness, 2015
Cost-effectiveness analysis is rarely used in education. When it is used, it often fails to meet methodological standards, especially with regard to cost measurement. Although there are occasional criticisms of these failings, we believe that it is useful to provide a listing of the more common concerns and how they might be addressed. Based upon…
Descriptors: Cost Effectiveness, Comparative Analysis, Validity, Educational Policy
Cizek, Gregory J. – Assessment in Education: Principles, Policy & Practice, 2016
Advances in validity theory and alacrity in validation practice have suffered because the term "validity" has been used to refer to two incompatible concerns: (1) the degree of support for specified interpretations of test scores (i.e. intended score meaning) and (2) the degree of support for specified applications (i.e. intended test…
Descriptors: Scores, Definitions, Evaluation Utilization, Data Interpretation
Moss, Pamela A. – Assessment in Education: Principles, Policy & Practice, 2016
The conventional focus of validity in educational measurement has been on intended interpretations and uses of test scores. Empirical studies of test use by teachers, administrators and policy-makers show that actual interpretations and uses of test scores in context are invariably shaped by local users' questions, which frequently require…
Descriptors: Test Validity, Evaluation Utilization, Educational Assessment, Scores
Gargani, John; Donaldson, Stewart I. – New Directions for Evaluation, 2011
This chapter describes a concrete process that stakeholders can use to make predictions about the future performance of programs in local contexts. Within the field of evaluation, the discussion of validity as it relates to outcome evaluation seems to be focused largely on questions of internal validity (Did it work?) with less emphasis on…
Descriptors: Validity, Prediction, Program Evaluation, Evaluation Utilization
Klieger, David M.; Belur, Vinetha; Kotloff, Lauren J. – ETS Research Report Series, 2017
This survey study investigated how graduate school admissions committees perceive and use the "GRE"® General Test and "GRE"® Subject Tests after the launch of the "GRE"® revised General Test in August 2011. These perceptions and uses impact the validity of the tests. Prior research about the perceptions and uses of…
Descriptors: Research Reports, College Entrance Examinations, Evaluation Utilization, Graduate Study
Gorur, Radhika – European Educational Research Journal, 2016
PISA is an extremely influential large-scale assessment, and its "policy lessons" are being incorporated in a range of nations all over the world. In this paper I argue that not only is PISA influencing policies and practices, but also that "seeing like PISA" is becoming a widespread phenomenon. Globally, education…
Descriptors: International Assessment, Evaluation Utilization, Test Reliability, Test Validity
Palmer, Stuart – Assessment & Evaluation in Higher Education, 2012
Student evaluation of teaching (SET) is now commonplace in many universities internationally. While much effort has been devoted to examining the statistical validity of SET instruments, there has been limited examination of the methodological and consequential validity (together referred to as "utility") of the ways in which SET data…
Descriptors: Student Evaluation of Teacher Performance, Validity, Evaluation Utilization, Data
Gergen, Kenneth J.; Dixon-Román, Ezekiel J. – Teachers College Record, 2014
In the present offering we challenge the presumption that the educational testing of students provides objective information about such students. This presumption largely rests on an empiricist account of science. In light of mounting criticism, however, empiricist foundationalism has given way to a social epistemology. From this standpoint,…
Descriptors: Epistemology, Educational Testing, Test Validity, Evaluation Utilization
Bourgeois, Isabelle; Cousins, J. Bradley – American Journal of Evaluation, 2013
Organizational evaluation capacity building has been a topic of increasing interest in recent years. However, the actual dimensions of evaluation capacity have not been clearly articulated through empirical research. This study sought to address this gap by identifying the key dimensions of evaluation capacity in Canadian federal government…
Descriptors: Foreign Countries, Institutional Evaluation, Capacity Building, Public Agencies