ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	15

Descriptor

Evaluation Methods	21
Probability	21
Statistical Inference	14
Bayesian Statistics	8
Hypothesis Testing	8
Inferences	8
Measurement Techniques	7
Models	7
Evaluation Problems	6
Experiments	6
Misconceptions	6
Replication (Evaluation)	6
Validity	6
Predictive Measurement	5
Research Methodology	5
Computation	4
Effect Size	4
Scores	4
Statistical Significance	4
Causal Models	3
Research Design	3
Statistical Analysis	3
Statistical Distributions	3
Academic Achievement	2
Cognitive Development	2
More ▼

Source

Psychological Methods	6
Asia Pacific Education Review	1
Cognition	1
Educational Psychologist	1
Educational and Psychological…	1
Harvard Educational Review	1
International Journal of…	1
Journal of Experimental…	1
Journal of Experimental…	1
Journal of Research on…	1
Journal of Statistics…	1
Journal of the American…	1
National Center for Education…	1
Online Submission	1
Sociological Methods &…	1
US Department of Education	1
More ▼

Publication Type

Journal Articles	18
Reports - Research	7
Reports - Evaluative	6
Reports - Descriptive	4
Opinion Papers	3
Guides - Non-Classroom	1
Information Analyses	1
Speeches/Meeting Papers	1

Education Level

Elementary Education	2
Junior High Schools	2
Middle Schools	2
Secondary Education	2
Grade 3	1
Grade 6	1
Grade 9	1
High Schools	1

Audience

Researchers

Location

Germany	1
Indiana	1
United Kingdom (England)	1
United Kingdom (Scotland)	1
United Kingdom (Wales)	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing 1 to 15 of 21 results Save | Export

Propensity Score Methods for Causal Inference and Generalization

Peer reviewed

Direct link

Wendy Chan – Asia Pacific Education Review, 2024

As evidence from evaluation and experimental studies continue to influence decision and policymaking, applied researchers and practitioners require tools to derive valid and credible inferences. Over the past several decades, research in causal inference has progressed with the development and application of propensity scores. Since their…

Descriptors: Probability, Scores, Causal Models, Statistical Inference

A Comparison of Three Popular Methods for Handling Missing Data: Complete-Case Analysis, Inverse Probability Weighting, and Multiple Imputation

Peer reviewed

Direct link

Roderick J. Little; James R. Carpenter; Katherine J. Lee – Sociological Methods & Research, 2024

Missing data are a pervasive problem in data analysis. Three common methods for addressing the problem are (a) complete-case analysis, where only units that are complete on the variables in an analysis are included; (b) weighting, where the complete cases are weighted by the inverse of an estimate of the probability of being complete; and (c)…

Descriptors: Foreign Countries, Probability, Robustness (Statistics), Responses

The BASIE (BAyeSian Interpretation of Estimates) Framework for Interpreting Findings from Impact Evaluations: A Practical Guide for Education Researchers. Toolkit. NCEE 2022-005

Peer reviewed
PDF on ERIC

Download full text

Deke, John; Finucane, Mariel; Thal, Daniel – National Center for Education Evaluation and Regional Assistance, 2022

BASIE is a framework for interpreting impact estimates from evaluations. It is an alternative to null hypothesis significance testing. This guide walks researchers through the key steps of applying BASIE, including selecting prior evidence, reporting impact estimates, interpreting impact estimates, and conducting sensitivity analyses. The guide…

Descriptors: Bayesian Statistics, Educational Research, Data Interpretation, Hypothesis Testing

Partially Identified Treatment Effects for Generalizability

Peer reviewed

Direct link

Chan, Wendy – Journal of Research on Educational Effectiveness, 2017

Recent methods to improve generalizations from nonrandom samples typically invoke assumptions such as the strong ignorability of sample selection, which is challenging to meet in practice. Although researchers acknowledge the difficulty in meeting this assumption, point estimates are still provided and used without considering alternative…

Descriptors: Generalization, Inferences, Probability, Educational Research

Quasi-Experimental Designs for Causal Inference

Peer reviewed

Direct link

Kim, Yongnam; Steiner, Peter – Educational Psychologist, 2016

When randomized experiments are infeasible, quasi-experimental designs can be exploited to evaluate causal treatment effects. The strongest quasi-experimental designs for causal inference are regression discontinuity designs, instrumental variable designs, matching and propensity score designs, and comparative interrupted time series designs. This…

Descriptors: Quasiexperimental Design, Causal Models, Statistical Inference, Randomized Controlled Trials

Rethinking Teacher Evaluation: A Conversation about Statistical Inferences and Value-Added Models

Peer reviewed

Direct link

Callister Everson, Kimberlee; Feinauer, Erika; Sudweeks, Richard R. – Harvard Educational Review, 2013

In this article, the authors provide a methodological critique of the current standard of value-added modeling forwarded in educational policy contexts as a means of measuring teacher effectiveness. Conventional value-added estimates of teacher quality are attempts to determine to what degree a teacher would theoretically contribute, on average,…

Descriptors: Teacher Evaluation, Teacher Effectiveness, Evaluation Methods, Accountability

Taking the Missing Propensity into Account When Estimating Competence Scores: Evaluation of Item Response Theory Models for Nonignorable Omissions

Peer reviewed

Direct link

Köhler, Carmen; Pohl, Steffi; Carstensen, Claus H. – Educational and Psychological Measurement, 2015

When competence tests are administered, subjects frequently omit items. These missing responses pose a threat to correctly estimating the proficiency level. Newer model-based approaches aim to take nonignorable missing data processes into account by incorporating a latent missing propensity into the measurement model. Two assumptions are typically…

Descriptors: Competence, Tests, Evaluation Methods, Adults

A Tutorial Introduction to Bayesian Models of Cognitive Development

Peer reviewed

Direct link

Perfors, Amy; Tenenbaum, Joshua B.; Griffiths, Thomas L.; Xu, Fei – Cognition, 2011

We present an introduction to Bayesian inference as it is used in probabilistic models of cognitive development. Our goal is to provide an intuitive and accessible guide to the "what", the "how", and the "why" of the Bayesian approach: what sorts of problems and data the framework is most relevant for, and how and why it may be useful for…

Descriptors: Bayesian Statistics, Cognitive Psychology, Inferences, Cognitive Development

Killeen's (2005) "p[subscript rep]" Coefficient: Logical and Mathematical Problems

Peer reviewed

Direct link

Maraun, Michael; Gabriel, Stephanie – Psychological Methods, 2010

In his article, "An Alternative to Null-Hypothesis Significance Tests," Killeen (2005) urged the discipline to abandon the practice of "p[subscript obs]"-based null hypothesis testing and to quantify the signal-to-noise characteristics of experimental outcomes with replication probabilities. He described the coefficient that he…

Descriptors: Hypothesis Testing, Statistical Inference, Probability, Statistical Significance

Using the Attribute Hierarchy Method to Make Diagnostic Inferences about Examinees' Knowledge and Skills in Mathematics: An Operational Implementation of Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Gierl, Mark J.; Alves, Cecilia; Majeau, Renate Taylor – International Journal of Testing, 2010

The purpose of this study is to apply the attribute hierarchy method in an operational diagnostic mathematics program at Grades 3 and 6 to promote cognitive inferences about students' problem-solving skills. The attribute hierarchy method is a psychometric procedure for classifying examinees' test item responses into a set of structured attribute…

Descriptors: Test Items, Student Reaction, Diagnostic Tests, Psychometrics

"p[subscript rep]" Replicates: Comment Prompted by Iverson, Wagenmakers, and Lee (2010); Lecoutre, Lecoutre, and Poitevineau (2010); and Maraun and Gabriel (2010)

Peer reviewed

Direct link

Killeen, Peter R. – Psychological Methods, 2010

Lecoutre, Lecoutre, and Poitevineau (2010) have provided sophisticated grounding for "p[subscript rep]." Computing it precisely appears, fortunately, no more difficult than doing so approximately. Their analysis will help move predictive inference into the mainstream. Iverson, Wagenmakers, and Lee (2010) have also validated…

Descriptors: Replication (Evaluation), Measurement Techniques, Research Design, Research Methodology

Killeen's Probability of Replication and Predictive Probabilities: How to Compute, Use, and Interpret Them

Peer reviewed

Direct link

Lecoutre, Bruno; Lecoutre, Marie-Paule; Poitevineau, Jacques – Psychological Methods, 2010

P. R. Killeen's (2005a) probability of replication ("p[subscript rep]") of an experimental result is the fiducial Bayesian predictive probability of finding a same-sign effect in a replication of an experiment. "p[subscript rep]" is now routinely reported in "Psychological Science" and has also begun to appear in…

Descriptors: Research Methodology, Guidelines, Probability, Computation

A Model-Averaging Approach to Replication : The Case of "p[subscript rep]"

Peer reviewed

Direct link

Iverson, Geoffrey J.; Wagenmakers, Eric-Jan; Lee, Michael D. – Psychological Methods, 2010

The purpose of the recently proposed "p[subscript rep]" statistic is to estimate the probability of concurrence, that is, the probability that a replicate experiment yields an effect of the same sign (Killeen, 2005a). The influential journal "Psychological Science" endorses "p[subscript rep]" and recommends its use…

Descriptors: Effect Size, Evaluation Methods, Probability, Experiments

Regarding "p[subscript rep]": Comment Prompted by Iverson, Wagenmakers, and Lee (2010); Lecoutre, Lecoutre, and Poitevineau (2010); and Maraun and Gabriel (2010)

Peer reviewed

Direct link

Serlin, Ronald C. – Psychological Methods, 2010

The sense that replicability is an important aspect of empirical science led Killeen (2005a) to define "p[subscript rep]," the probability that a replication will result in an outcome in the same direction as that found in a current experiment. Since then, several authors have praised and criticized 'p[subscript rep]," culminating…

Descriptors: Epistemology, Effect Size, Replication (Evaluation), Measurement Techniques

Replication, "p[subscript rep]," and Confidence Intervals: Comment Prompted by Iverson, Wagenmakers, and Lee (2010); Lecoutre, Lecoutre, and Poitevineau (2010); and Maraun and Gabriel (2010)

Peer reviewed

Direct link

Cumming, Geoff – Psychological Methods, 2010

This comment offers three descriptions of "p[subscript rep]" that start with a frequentist account of confidence intervals, draw on R. A. Fisher's fiducial argument, and do not make Bayesian assumptions. Links are described among "p[subscript rep]," "p" values, and the probability a confidence interval will capture…

Descriptors: Replication (Evaluation), Measurement Techniques, Research Methodology, Validity

Previous Page | Next Page »

Pages: 1 | 2

Alves, Cecilia	1
Bonnefon, Jean-Francois	1
Callister Everson, Kimberlee	1
Carstensen, Claus H.	1
Chan, Wendy	1
Cumming, Geoff	1
Deke, John	1
Feinauer, Erika	1
Finucane, Mariel	1
Fox, Craig R.	1
Gabriel, Stephanie	1
Gierl, Mark J.	1
Griffiths, Thomas L.	1
Hilton, Denis J.	1
Iverson, Geoffrey J.	1
James R. Carpenter	1
Katherine J. Lee	1
Killeen, Peter R.	1
Kim, Yongnam	1
Köhler, Carmen	1
Lecoutre, Bruno	1
Lecoutre, Marie-Paule	1
Lee, Michael D.	1
Levav, Jonathan	1
Levy, Roy	1
More ▼