ERIC - Search Results

Publication Date

In 2025	2
Since 2024	2
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	19

Descriptor

Evaluation Methods	83
Research Problems	83
Validity	64
Research Methodology	35
Program Evaluation	24
Reliability	20
Research Design	19
Test Validity	17
Educational Research	16
Evaluation Problems	15
Models	13
Educational Policy	12
Elementary Secondary Education	12
Evaluation Criteria	12
Measurement Techniques	12
Program Effectiveness	12
Research Needs	12
Academic Achievement	11
Data Analysis	11
Statistical Bias	11
Educational Assessment	10
Achievement Gains	8
Research Reports	8
Sampling	8
Comparative Analysis	7
More ▼

Publication Type

Journal Articles	41
Reports - Research	27
Speeches/Meeting Papers	19
Opinion Papers	18
Reports - Evaluative	18
Information Analyses	14
Reports - Descriptive	7
Guides - Non-Classroom	2
Reports - General	2
Guides - General	1
Reference Materials -…	1
More ▼

Education Level

Elementary Secondary Education	9
High Schools	1
Higher Education	1
Postsecondary Education	1

Audience

Researchers	5
Policymakers	2
Practitioners	2
Parents	1
Teachers	1

Location

United Kingdom	2
Australia	1
China	1
District of Columbia	1
Europe	1
Indiana	1
New York	1
Ohio	1
South Carolina	1

Laws, Policies, & Programs

Elementary and Secondary…

Assessments and Surveys

National Assessment of…	3
Comprehensive Tests of Basic…	1
Illinois Test of…	1
Teaching and Learning…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 83 results Save | Export

The More the Merrier? A Network Analysis of Construct Content Validity in School Leadership Literature

Peer reviewed

Direct link

Yinying Wang; Joonkil Ahn – Educational Management Administration & Leadership, 2025

School leadership research literature has a large number of widely used constructs. Could fewer constructs bring more clarity? This study evaluates construct content validity, defined as the extent to which a measure's items reflect a theoretical content domain, in school leadership literature. To do so, we reviewed 29 articles that used Teaching…

Descriptors: Network Analysis, Construct Validity, Content Validity, Instructional Leadership

Challenging the Illusion of Objectivity: An In-Depth Analysis of the Preselected Items Evaluation (PIE) Method in Translation Evaluation

Peer reviewed

Direct link

Alireza Akbari; Mohammadtaghi Shahnazari – Journal of Applied Research in Higher Education, 2025

Purpose: The primary objective of this research paper was to examine the objectivity of the preselected items evaluation (PIE) method, a prevalent translation scoring method deployed by international institutions such as UAntwerpen, UGent and the University of Granada. Design/methodology/approach: This research critically analyzed the scientific…

Descriptors: Evaluation Methods, Translation, Difficulty Level, Validity

Causal Inference and Bias in Learning Analytics: A Primer on Pitfalls Using Directed Acyclic Graphs

Peer reviewed
PDF on ERIC

Download full text

Weidlich, Joshua; Gaševic, Dragan; Drachsler, Hendrik – Journal of Learning Analytics, 2022

As a research field geared toward understanding and improving learning, Learning Analytics (LA) must be able to provide empirical support for causal claims. However, as a highly applied field, tightly controlled randomized experiments are not always feasible nor desirable. Instead, researchers often rely on observational data, based on which they…

Descriptors: Causal Models, Inferences, Learning Analytics, Comparative Analysis

Regression Discontinuity and Beyond: Options for Studying External Validity in an Internally Valid Design

Peer reviewed

Direct link

Wing, Coady; Bello-Gomez, Ricardo A. – American Journal of Evaluation, 2018

Treatment effect estimates from a "regression discontinuity design" (RDD) have high internal validity. However, the arguments that support the design apply to a subpopulation that is narrower and usually different from the population of substantive interest in evaluation research. The disconnect between RDD population and the…

Descriptors: Regression (Statistics), Research Design, Validity, Evaluation Methods

Composite and Loose Concepts, Historical Analogies, and the Logic of Control in Comparative Historical Analysis

Peer reviewed

Direct link

Møller, Jørgen – Sociological Methods & Research, 2016

The use of controlled comparisons pervades comparative historical analysis. Heated debates have surrounded the methodological purchase of such comparisons. However, the quality and validity of the conceptual building blocks on which the comparisons are based have largely been ignored. This article discusses a particular problem pertaining to these…

Descriptors: Comparative Analysis, History, Evaluation Methods, Validity

Can Broad Inferences Be Drawn from Lottery Analyses of School Choice Programs? An Exploration of Appropriate Sensitivity Analyses

Peer reviewed

Direct link

Zimmer, Ron; Engberg, John – Journal of School Choice, 2016

School choice programs continue to be controversial, spurring a number of researchers into evaluating them. When possible, researchers evaluate the effect of attending a school of choice using randomized designs to eliminate possible selection bias. Randomized designs are often thought of as the gold standard for research, but many circumstances…

Descriptors: Inferences, School Choice, Educational Vouchers, Charter Schools

Understanding Validity Issues Surrounding Test-Based Accountability Measures in the US

Peer reviewed

Direct link

Nancy Koh; Vikash Reddy; Madhabi Chatterji – Quality Assurance in Education: An International Perspective, 2014

Purpose: This AERI-NEPC eBrief, the fourth in a series titled "Understanding validity issues around the world", looks closely at issues surrounding the validity of test-based actions in educational accountability and school improvement contexts. The specific discussions here focus testing issues in the US. However, the general principles…

Descriptors: Public Education, Test Validity, Research Problems, Accountability

A Comparison of Affect Ratings Obtained with Ecological Momentary Assessment and the Day Reconstruction Method

Peer reviewed

Direct link

Dockray, Samantha; Grant, Nina; Stone, Arthur A.; Kahneman, Daniel; Wardle, Jane; Steptoe, Andrew – Social Indicators Research, 2010

Measurement of affective states in everyday life is of fundamental importance in many types of quality of life, health, and psychological research. Ecological momentary assessment (EMA) is the recognized method of choice, but the respondent burden can be high. The day reconstruction method (DRM) was developed by Kahneman and colleagues ("Science,"…

Descriptors: Employed Women, Quality of Life, Evaluation Methods, Psychological Patterns

The Missing Middle in Validation Research

Peer reviewed

Taylor, Erwin K.; Griess, Thomas – Personnel Psychology, 1976

In most selection validation research, only the upper and lower tails of the criterion distribution are used, often yielding misleading or incorrect results. Provides formulas and tables which enable the researcher to account more accurately for the distribution of criterion within the middle range of population. (Author/RW)

Descriptors: Evaluation Methods, Measurement Techniques, Predictive Validity, Reliability

Heresies of the New Unified Notion of Test Validity.

Download full text

Stuck, Ivan – 1995

By focusing on "appropriateness" and "adequacy" of inference and action, unified validity may be misused in rejecting valid test outcomes. The notion of levels of validity is challenged, the necessity of assumption is argued, and experience is proposed as the basis of validity. "Consequential validity" is interpreted as an optional predictive…

Descriptors: Evaluation Methods, Measurement Techniques, Measures (Individuals), Predictive Validity

Review of "Weighted Student Formula Yearbook 2009"

Download full text

Baker, Bruce – Education and the Public Interest Center, 2009

The new "Weighted Student Formula Yearbook 2009" from the Reason Foundation provides a simple framework for touting the successes of states and urban school districts that grant greater fiscal autonomy to schools. The report defines the Weighted Student Formula (WSF) reform extremely broadly, presenting a variety of reforms under the WSF umbrella.…

Descriptors: Evidence, Urban Schools, Research Reports, Change Strategies

Review of "Evaluation of the DC Opportunity Scholarship Program: Impacts after Three Years"

Download full text

Carnoy, Martin – Education and the Public Interest Center, 2009

The third-year evaluation of the federally funded Washington, D.C. voucher program shows that low-income students offered vouchers in the first two years of the program had modestly higher reading scores after three years but showed no significant difference in mathematics. Students were randomly assigned to treatment and control groups, and the…

Descriptors: Control Groups, Private Schools, Program Effectiveness, Scoring

Toward Developing a Science of Treatment Integrity: Introduction to the Special Series

Peer reviewed

Direct link

Hagermoser Sanetti, Lisa M.; Kratochwill, Thomas R. – School Psychology Review, 2009

Treatment integrity (also referred to as "treatment fidelity," "intervention integrity," and "procedural reliability") is an important methodological concerning both research and practice because treatment integrity data are essential to making valid conclusions regarding treatment outcomes. Despite its relationship to validity, treatment…

Descriptors: Intervention, Research Methodology, Models, Validity

Conceptual and Methodological Issues in Treatment Integrity Measurement

Peer reviewed

Direct link

McLeod, Bryce D.; Southam-Gerow, Michael A.; Weisz, John R. – School Psychology Review, 2009

This special series focused on treatment integrity in the child mental health and education field is timely. The articles do a laudable job of reviewing (a) the current status of treatment integrity research and measurement, (b) existing conceptual models of treatment integrity, and (c) the limitations of prior research. Overall, this thoughtful…

Descriptors: Evaluation Research, Children, Intervention, Research Methodology

Review of "Multiple Choice: Charter School Performance in 16 States"

Download full text

Miron, Gary; Applegate, Brooks – Education and the Public Interest Center, 2009

The Center for Research on Education Outcomes (CREDO) at Stanford University conducted a large-scale analysis of the impact of charter schools on student performance. The center's data covered 65-70% of the nation's charter schools. Although results varied by state, 17% of the charter school students have significantly higher math results than …

Descriptors: Evidence, Traditional Schools, Charter Schools, Program Effectiveness

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

Education and the Public…	5
Educational Evaluation and…	5
School Psychology Review	4
Educational Researcher	2
Evaluation Review	2
American Journal of Distance…	1
American Journal of Evaluation	1
Australian Journal of Reading	1
Behavioral Disorders	1
Canadian Journal of…	1
Children & Society	1
Early Childhood Research…	1
Educational Management…	1
Educational Measurement:…	1
Educational Psychology Review	1
Family Relations	1
Journal of Applied Behavior…	1
Journal of Applied Research…	1
Journal of Black Studies	1
Journal of Consulting and…	1
Journal of Counseling…	1
Journal of Learning Analytics	1
Journal of Marriage and the…	1
Journal of School Choice	1
Journal of Social Work…	1
More ▼

Gresham, Frank M.	2
Airasian, Peter W.	1
Alireza Akbari	1
Altschuld, J. W.	1
Anderson, Edward R.	1
Applegate, Brooks	1
Asante, Molefi K.	1
Baker, Bruce	1
Barnes, Robert E.	1
Bello-Gomez, Ricardo A.	1
Boekaerts, Monique	1
Bryant, Fred B.	1
Cambourne, Brian	1
Campbell, Heather E.	1
Carnoy, Martin	1
Carroll, Kathleen M.	1
Cascallar, Eduardo	1
Chamberlain, Howard	1
Cizek, Gregory J.	1
Cline, Hugh F.	1
Coleman, Marilyn	1
Conroy, Maureen A.	1
Cordray, D. S.	1
Costigan, Tracy	1
More ▼