Yinying Wang; Joonkil Ahn – Educational Management Administration & Leadership, 2025
School leadership research literature has a large number of widely used constructs. Could fewer constructs bring more clarity? This study evaluates construct content validity, defined as the extent to which a measure's items reflect a theoretical content domain, in school leadership literature. To do so, we reviewed 29 articles that used Teaching…
Descriptors: Network Analysis, Construct Validity, Content Validity, Instructional Leadership
Alireza Akbari; Mohammadtaghi Shahnazari – Journal of Applied Research in Higher Education, 2025
Purpose: The primary objective of this research paper was to examine the objectivity of the preselected items evaluation (PIE) method, a prevalent translation scoring method deployed by international institutions such as UAntwerpen, UGent and the University of Granada. Design/methodology/approach: This research critically analyzed the scientific…
Descriptors: Evaluation Methods, Translation, Difficulty Level, Validity
Weidlich, Joshua; Gašević, Dragan; Drachsler, Hendrik – Journal of Learning Analytics, 2022
As a research field geared toward understanding and improving learning, Learning Analytics (LA) must be able to provide empirical support for causal claims. However, as a highly applied field, tightly controlled randomized experiments are not always feasible nor desirable. Instead, researchers often rely on observational data, based on which they…
Descriptors: Causal Models, Inferences, Learning Analytics, Comparative Analysis
Wing, Coady; Bello-Gomez, Ricardo A. – American Journal of Evaluation, 2018
Treatment effect estimates from a "regression discontinuity design" (RDD) have high internal validity. However, the arguments that support the design apply to a subpopulation that is narrower and usually different from the population of substantive interest in evaluation research. The disconnect between RDD population and the…
Descriptors: Regression (Statistics), Research Design, Validity, Evaluation Methods
Møller, Jørgen – Sociological Methods & Research, 2016
The use of controlled comparisons pervades comparative historical analysis. Heated debates have surrounded the methodological purchase of such comparisons. However, the quality and validity of the conceptual building blocks on which the comparisons are based have largely been ignored. This article discusses a particular problem pertaining to these…
Descriptors: Comparative Analysis, History, Evaluation Methods, Validity
Zimmer, Ron; Engberg, John – Journal of School Choice, 2016
School choice programs continue to be controversial, spurring a number of researchers into evaluating them. When possible, researchers evaluate the effect of attending a school of choice using randomized designs to eliminate possible selection bias. Randomized designs are often thought of as the gold standard for research, but many circumstances…
Descriptors: Inferences, School Choice, Educational Vouchers, Charter Schools
Nancy Koh; Vikash Reddy; Madhabi Chatterji – Quality Assurance in Education: An International Perspective, 2014
Purpose: This AERI-NEPC eBrief, the fourth in a series titled "Understanding validity issues around the world", looks closely at issues surrounding the validity of test-based actions in educational accountability and school improvement contexts. The specific discussions here focus on testing issues in the US. However, the general principles…
Descriptors: Public Education, Test Validity, Research Problems, Accountability
Dockray, Samantha; Grant, Nina; Stone, Arthur A.; Kahneman, Daniel; Wardle, Jane; Steptoe, Andrew – Social Indicators Research, 2010
Measurement of affective states in everyday life is of fundamental importance in many types of quality of life, health, and psychological research. Ecological momentary assessment (EMA) is the recognized method of choice, but the respondent burden can be high. The day reconstruction method (DRM) was developed by Kahneman and colleagues ("Science,"…
Descriptors: Employed Women, Quality of Life, Evaluation Methods, Psychological Patterns

Taylor, Erwin K.; Griess, Thomas – Personnel Psychology, 1976
In most selection validation research, only the upper and lower tails of the criterion distribution are used, often yielding misleading or incorrect results. Provides formulas and tables which enable the researcher to account more accurately for the distribution of criterion within the middle range of population. (Author/RW)
Descriptors: Evaluation Methods, Measurement Techniques, Predictive Validity, Reliability
Stuck, Ivan – 1995
By focusing on "appropriateness" and "adequacy" of inference and action, unified validity may be misused in rejecting valid test outcomes. The notion of levels of validity is challenged, the necessity of assumption is argued, and experience is proposed as the basis of validity. "Consequential validity" is interpreted as an optional predictive…
Descriptors: Evaluation Methods, Measurement Techniques, Measures (Individuals), Predictive Validity
Baker, Bruce – Education and the Public Interest Center, 2009
The new "Weighted Student Formula Yearbook 2009" from the Reason Foundation provides a simple framework for touting the successes of states and urban school districts that grant greater fiscal autonomy to schools. The report defines the Weighted Student Formula (WSF) reform extremely broadly, presenting a variety of reforms under the WSF umbrella.…
Descriptors: Evidence, Urban Schools, Research Reports, Change Strategies
Carnoy, Martin – Education and the Public Interest Center, 2009
The third-year evaluation of the federally funded Washington, D.C. voucher program shows that low-income students offered vouchers in the first two years of the program had modestly higher reading scores after three years but showed no significant difference in mathematics. Students were randomly assigned to treatment and control groups, and the…
Descriptors: Control Groups, Private Schools, Program Effectiveness, Scoring
Hagermoser Sanetti, Lisa M.; Kratochwill, Thomas R. – School Psychology Review, 2009
Treatment integrity (also referred to as "treatment fidelity," "intervention integrity," and "procedural reliability") is an important methodological concern in both research and practice because treatment integrity data are essential to making valid conclusions regarding treatment outcomes. Despite its relationship to validity, treatment…
Descriptors: Intervention, Research Methodology, Models, Validity
McLeod, Bryce D.; Southam-Gerow, Michael A.; Weisz, John R. – School Psychology Review, 2009
This special series focused on treatment integrity in the child mental health and education field is timely. The articles do a laudable job of reviewing (a) the current status of treatment integrity research and measurement, (b) existing conceptual models of treatment integrity, and (c) the limitations of prior research. Overall, this thoughtful…
Descriptors: Evaluation Research, Children, Intervention, Research Methodology
Miron, Gary; Applegate, Brooks – Education and the Public Interest Center, 2009
The Center for Research on Education Outcomes (CREDO) at Stanford University conducted a large-scale analysis of the impact of charter schools on student performance. The center's data covered 65-70% of the nation's charter schools. Although results varied by state, 17% of the charter school students have significantly higher math results than …
Descriptors: Evidence, Traditional Schools, Charter Schools, Program Effectiveness