Sireci, Stephen G. – Assessment in Education: Principles, Policy & Practice, 2016
A misconception exists that validity may refer only to the "interpretation" of test scores and not to the "uses" of those scores. The development and evolution of validity theory illustrate test score interpretation was a primary focus in the earliest days of modern testing, and that validating interpretations derived from test…
Descriptors: Test Validity, Misconceptions, Evaluation Utilization, Data Interpretation
Moss, Pamela A. – Assessment in Education: Principles, Policy & Practice, 2016
The conventional focus of validity in educational measurement has been on intended interpretations and uses of test scores. Empirical studies of test use by teachers, administrators and policy-makers show that actual interpretations and uses of test scores in context are invariably shaped by local users' questions, which frequently require…
Descriptors: Test Validity, Evaluation Utilization, Educational Assessment, Scores
Gorur, Radhika – European Educational Research Journal, 2016
PISA is an extremely influential large-scale assessment, and its "policy lessons" are being incorporated in a range of nations all over the world. In this paper I argue that not only is PISA influencing policies and practices, but also that "seeing like PISA" is becoming a widespread phenomenon. Globally, education…
Descriptors: International Assessment, Evaluation Utilization, Test Reliability, Test Validity
Palmer, Stuart – Assessment & Evaluation in Higher Education, 2012
Student evaluation of teaching (SET) is now commonplace in many universities internationally. While much effort has been devoted to examining the statistical validity of SET instruments, there has been limited examination of the methodological and consequential validity (together referred to as "utility") of the ways in which SET data…
Descriptors: Student Evaluation of Teacher Performance, Validity, Evaluation Utilization, Data
Gergen, Kenneth J.; Dixon-Román, Ezekiel J. – Teachers College Record, 2014
In the present offering we challenge the presumption that the educational testing of students provides objective information about such students. This presumption largely rests on an empiricist account of science. In light of mounting criticism, however, empiricist foundationalism has given way to a social epistemology. From this standpoint,…
Descriptors: Epistemology, Educational Testing, Test Validity, Evaluation Utilization
Feuer, Michael J. – Educational Testing Service, 2011
Few arguments about education are as effective at galvanizing public attention and motivating political action as those that compare the performance of students with their counterparts in other countries and that connect academic achievement to economic performance. Because data from international large-scale assessments (ILSA) have a powerful…
Descriptors: International Assessment, Test Interpretation, Testing Problems, Comparative Testing
Penfield, Randall D. – Educational Researcher, 2010
A growing body of research showing that grade retention serves as an educationally low-quality placement has raised increasing concerns about whether the use of standardized tests in making decisions concerning grade retention conforms to current standards for appropriate and nondiscriminatory test use. This article examines the extent to which…
Descriptors: Test Use, Grade Repetition, Standardized Tests, Learning Readiness
Astor, Ron Avi; Guerra, Nancy; Van Acker, Richard – Educational Researcher, 2010
The authors of this article consider how education researchers can improve school violence and school safety research by (a) examining gaps in theoretical, conceptual, and basic research on the phenomena of school violence; (b) reviewing key issues in the design and evaluation of evidence-based practices to prevent school violence; and (c)…
Descriptors: Violence, School Safety, Educational Research, Research Methodology
Beran, Tanya N.; Rokosh, Jennifer L. – Alberta Journal of Educational Research, 2009
This study investigates instructors' perceptions about strengths and weaknesses of a student ratings instrument employed in their university. The sample consisted of 357 instructors in a major Canadian university where each term students are required to complete an evaluation at the end of every course. Qualitative analyses of their written…
Descriptors: Student Evaluation of Teacher Performance, Test Validity, Foreign Countries, Teacher Attitudes
Goldhaber, Dan – Center for American Progress, 2010
The formula is simple: Highly effective teachers equal student academic success. Yet, the physics of American education is anything but. Thus, the question facing education reformers is how can teacher effectiveness be accurately measured in order to improve the teacher workforce? Given the demand for objective, quantitative measures of teacher…
Descriptors: Academic Achievement, Teacher Effectiveness, Models, Merit Pay
Nichols, Paul D.; Meyers, Jason L.; Burling, Kelly S. – Educational Measurement: Issues and Practice, 2009
Assessments labeled as formative have been offered as a means to improve student achievement. But labels can be a powerful way to miscommunicate. For an assessment use to be appropriately labeled "formative," both empirical evidence and reasoned arguments must be offered to support the claim that improvements in student achievement can be linked…
Descriptors: Academic Achievement, Tutoring, Student Evaluation, Evaluation Methods
Sinha, Atanu R.; Buchanan, Bruce S. – Psychometrika, 1995
This paper presents an analysis of the stability of principal components. Stability is measured by the expectation of the absolute inner product of the sample principal component with the corresponding population component. The usefulness and predictive validity of the model were supported through simulation. (SLD)
Descriptors: Evaluation Utilization, Factor Analysis, Models, Predictive Validity
Ory, John C.; Ryan, Katherine – New Directions for Institutional Research, 2001
Examines student ratings of teacher effectiveness within a new framework that emphasizes six distinct aspects of validity: content, substantive, structural, generalizability, external, and consequential. Concludes that greater attention should be directed toward consequential validity, particularly how ratings are used on today's campuses and what…
Descriptors: Evaluation Research, Evaluation Utilization, Student Evaluation of Teacher Performance, Test Validity
Linn, Robert L. – 1987
When the National Assessment of Educational Progress (NAEP) was designed 20 years ago, comparisons among individual states or localities were not deemed desirable. Today, this lack of information to allow comparison is judged to be a serious weakness of the NAEP, and ways to allow comparisons are actively sought. The focus of this paper is to…
Descriptors: Academic Achievement, Comparative Analysis, Content Validity, Educational Assessment
Kirkhart, Karen E. – Evaluation Practice, 1995
The construct "multicultural validity" is proposed as the vehicle for organizing concerns about pluralism and diversity in evaluation and as a way to reflect on the cultural boundness of evaluation. It should be conceptualized as a central dimension of validity and a viable focus of concern in evaluation theory. (SLD)
Descriptors: Cultural Awareness, Cultural Pluralism, Evaluation Methods, Evaluation Utilization