ERIC - Search Results

Showing all 15 results

Minimal-Effect Testing, Equivalence Testing, and the Conventional Null Hypothesis Testing for the Analysis of Bi-Factor Models

Peer reviewed

Shunji Wang; Katerina M. Marcoulides; Jiashan Tang; Ke-Hai Yuan – Structural Equation Modeling: A Multidisciplinary Journal, 2024

A necessary step in applying bi-factor models is to evaluate the need for domain factors with a general factor in place. The conventional null hypothesis testing (NHT) was commonly used for such a purpose. However, the conventional NHT meets challenges when the domain loadings are weak or the sample size is insufficient. This article proposes…

Descriptors: Hypothesis Testing, Error of Measurement, Comparative Analysis, Monte Carlo Methods

The BASIE (BAyeSian Interpretation of Estimates) Framework for Interpreting Findings from Impact Evaluations: A Practical Guide for Education Researchers. Toolkit. NCEE 2022-005

Peer reviewed

Deke, John; Finucane, Mariel; Thal, Daniel – National Center for Education Evaluation and Regional Assistance, 2022

BASIE is a framework for interpreting impact estimates from evaluations. It is an alternative to null hypothesis significance testing. This guide walks researchers through the key steps of applying BASIE, including selecting prior evidence, reporting impact estimates, interpreting impact estimates, and conducting sensitivity analyses. The guide…

Descriptors: Bayesian Statistics, Educational Research, Data Interpretation, Hypothesis Testing

Perspectives on the Use of Null Hypothesis Statistical Testing. Part III: the Various Nuts and Bolts of Statistical and Hypothesis Testing

Peer reviewed

Marmolejo-Ramos, Fernando; Cousineau, Denis – Educational and Psychological Measurement, 2017

The number of articles showing dissatisfaction with the null hypothesis statistical testing (NHST) framework has been progressively increasing over the years. Alternatives to NHST have been proposed and the Bayesian approach seems to have achieved the highest amount of visibility. In this last part of the special issue, a few alternative…

Descriptors: Hypothesis Testing, Bayesian Statistics, Evaluation Methods, Statistical Inference

Observation-Oriented Modeling: Going beyond "Is It All a Matter of Chance"?

Peer reviewed

Grice, James W.; Yepez, Maria; Wilson, Nicole L.; Shoda, Yuichi – Educational and Psychological Measurement, 2017

An alternative to null hypothesis significance testing is presented and discussed. This approach, referred to as observation-oriented modeling, is centered on model building in an effort to explicate the structures and processes believed to generate a set of observations. In terms of analysis, this novel approach complements traditional methods…

Descriptors: Hypothesis Testing, Models, Observation, Statistical Inference

Killeen's (2005) "p[subscript rep]" Coefficient: Logical and Mathematical Problems

Peer reviewed

Maraun, Michael; Gabriel, Stephanie – Psychological Methods, 2010

In his article, "An Alternative to Null-Hypothesis Significance Tests," Killeen (2005) urged the discipline to abandon the practice of "p[subscript obs]"-based null hypothesis testing and to quantify the signal-to-noise characteristics of experimental outcomes with replication probabilities. He described the coefficient that he…

Descriptors: Hypothesis Testing, Statistical Inference, Probability, Statistical Significance

"p[subscript rep]" Replicates: Comment Prompted by Iverson, Wagenmakers, and Lee (2010); Lecoutre, Lecoutre, and Poitevineau (2010); and Maraun and Gabriel (2010)

Peer reviewed

Killeen, Peter R. – Psychological Methods, 2010

Lecoutre, Lecoutre, and Poitevineau (2010) have provided sophisticated grounding for "p[subscript rep]." Computing it precisely appears, fortunately, no more difficult than doing so approximately. Their analysis will help move predictive inference into the mainstream. Iverson, Wagenmakers, and Lee (2010) have also validated…

Descriptors: Replication (Evaluation), Measurement Techniques, Research Design, Research Methodology

Killeen's Probability of Replication and Predictive Probabilities: How to Compute, Use, and Interpret Them

Peer reviewed

Lecoutre, Bruno; Lecoutre, Marie-Paule; Poitevineau, Jacques – Psychological Methods, 2010

P. R. Killeen's (2005a) probability of replication ("p[subscript rep]") of an experimental result is the fiducial Bayesian predictive probability of finding a same-sign effect in a replication of an experiment. "p[subscript rep]" is now routinely reported in "Psychological Science" and has also begun to appear in…

Descriptors: Research Methodology, Guidelines, Probability, Computation

A Model-Averaging Approach to Replication: The Case of "p[subscript rep]"

Peer reviewed

Iverson, Geoffrey J.; Wagenmakers, Eric-Jan; Lee, Michael D. – Psychological Methods, 2010

The purpose of the recently proposed "p[subscript rep]" statistic is to estimate the probability of concurrence, that is, the probability that a replicate experiment yields an effect of the same sign (Killeen, 2005a). The influential journal "Psychological Science" endorses "p[subscript rep]" and recommends its use…

Descriptors: Effect Size, Evaluation Methods, Probability, Experiments

Regarding "p[subscript rep]": Comment Prompted by Iverson, Wagenmakers, and Lee (2010); Lecoutre, Lecoutre, and Poitevineau (2010); and Maraun and Gabriel (2010)

Peer reviewed

Serlin, Ronald C. – Psychological Methods, 2010

The sense that replicability is an important aspect of empirical science led Killeen (2005a) to define "p[subscript rep]," the probability that a replication will result in an outcome in the same direction as that found in a current experiment. Since then, several authors have praised and criticized "p[subscript rep]," culminating…

Descriptors: Epistemology, Effect Size, Replication (Evaluation), Measurement Techniques

Replication, "p[subscript rep]," and Confidence Intervals: Comment Prompted by Iverson, Wagenmakers, and Lee (2010); Lecoutre, Lecoutre, and Poitevineau (2010); and Maraun and Gabriel (2010)

Peer reviewed

Cumming, Geoff – Psychological Methods, 2010

This comment offers three descriptions of "p[subscript rep]" that start with a frequentist account of confidence intervals, draw on R. A. Fisher's fiducial argument, and do not make Bayesian assumptions. Links are described among "p[subscript rep]," "p" values, and the probability a confidence interval will capture…

Descriptors: Replication (Evaluation), Measurement Techniques, Research Methodology, Validity

Balkanization and Unification of Probabilistic Inferences

Yu, Chong-Ho – Online Submission, 2005

Many research-related classes in social sciences present probability as a unified approach based upon mathematical axioms, but neglect the diversity of various probability theories and their associated philosophical assumptions. Although currently the dominant statistical and probabilistic approach is the Fisherian tradition, the use of Fisherian…

Descriptors: Probability, Inferences, Social Sciences, Statistical Significance

A Call for Statistical Reform in EAQ

Peer reviewed

Byrd, Jimmy K. – Educational Administration Quarterly, 2007

Purpose: The purpose of this study was to review research published by Educational Administration Quarterly (EAQ) during the past 10 years to determine if confidence intervals and effect sizes were being reported as recommended by the American Psychological Association (APA) Publication Manual. Research Design: The author examined 49 volumes of…

Descriptors: Research Design, Intervals, Statistical Inference, Effect Size

Evaluation of the Magnitude of Differential Item Functioning in Polytomous Items. Program Statistics Research Technical Report No. 94-2.

Zwick, Rebecca; Thayer, Dorothy T. – 1994

Several recent studies have investigated the application of statistical inference procedures to the analysis of differential item functioning (DIF) in test items that are scored on an ordinal scale. Mantel's extension of the Mantel-Haenszel test is a possible hypothesis-testing method for this purpose. The development of descriptive statistics for…

Descriptors: Error of Measurement, Evaluation Methods, Hypothesis Testing, Item Bias

In Search of Golden Rules: Comment on Hypothesis-Testing Approaches to Setting Cutoff Values for Fit Indexes and Dangers in Overgeneralizing Hu and Bentler's (1999) Findings

Peer reviewed

Marsh, Herbert W.; Hau, Kit-Tai; Wen, Zhonglin – Structural Equation Modeling, 2004

Goodness-of-fit (GOF) indexes provide "rules of thumb": recommended cutoff values for assessing fit in structural equation modeling. Hu and Bentler (1999) proposed a more rigorous approach to evaluating decision rules based on GOF indexes and, on this basis, proposed new and more stringent cutoff values for many indexes. This article discusses…

Descriptors: Statistical Significance, Structural Equation Models, Evaluation Methods, Evaluation Research

Applying Statistical Process Quality Control Methodology to Educational Settings.

Blumberg, Carol Joyce – 1989

A subset of Statistical Process Control (SPC) methodology known as Control Charting is introduced. SPC methodology is a collection of graphical and inferential statistics techniques used to study the progress of phenomena over time. The types of control charts covered are the X-bar (mean), R (Range), X (individual observations), MR (moving…

Descriptors: Charts, Data Analysis, Educational Research, Evaluation Methods
