NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)4
Since 2006 (last 20 years)13
Audience
Laws, Policies, & Programs
Assessments and Surveys
Cognitive Abilities Test1
What Works Clearinghouse Rating
Showing 1 to 15 of 21 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Nicholson, James; Ridgway, Jim – Statistics Education Research Journal, 2017
White and Gorard make important and relevant criticisms of some of the methods commonly used in social science research, but go further by criticising the logical basis for inferential statistical tests. This paper comments briefly on matters we broadly agree on with them and more fully on matters where we disagree. We agree that too little…
Descriptors: Statistical Inference, Statistics, Teaching Methods, Criticism
Peer reviewed Peer reviewed
Direct linkDirect link
White, Patrick; Gorard, Stephen – Statistics Education Research Journal, 2017
Recent concerns about a shortage of capacity for statistical and numerical analysis skills among social science students and researchers have prompted a range of initiatives aiming to improve teaching in this area. However, these projects have rarely re-evaluated the content of what is taught to students and have instead focussed primarily on…
Descriptors: Statistical Inference, Statistics, Teaching Methods, Social Science Research
Peer reviewed Peer reviewed
Direct linkDirect link
García-Pérez, Miguel A. – Educational and Psychological Measurement, 2017
Null hypothesis significance testing (NHST) has been the subject of debate for decades and alternative approaches to data analysis have been proposed. This article addresses this debate from the perspective of scientific inquiry and inference. Inference is an inverse problem and application of statistical methods cannot reveal whether effects…
Descriptors: Hypothesis Testing, Statistical Inference, Effect Size, Bayesian Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Neale, Dave – Oxford Review of Education, 2015
Recently, Stephen Gorard has outlined strong objections to the use of significance testing in social research. He has argued, first, that as the samples used in social research are almost always non-random it is not possible to use inferential statistical techniques and, second, that even if a truly random sample were achieved, the logic behind…
Descriptors: Statistical Significance, Statistical Analysis, Sampling, Probability
Randall, David; Welser, Christopher – National Association of Scholars, 2018
A reproducibility crisis afflicts a wide range of scientific and social-scientific disciplines, from epidemiology to social psychology. Improper research techniques, lack of accountability, disciplinary and political groupthink, and a scientific culture biased toward producing positive results together have produced a critical state of affairs.…
Descriptors: Scientific Methodology, Replication (Evaluation), Scientific Research, Guidelines
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Vanhove, Jan – Studies in Second Language Learning and Teaching, 2015
I discuss three common practices that obfuscate or invalidate the statistical analysis of randomized controlled interventions in applied linguistics. These are (a) checking whether randomization produced groups that are balanced on a number of possibly relevant covariates, (b) using repeated measures ANOVA to analyze pretest-posttest designs, and…
Descriptors: Randomized Controlled Trials, Intervention, Applied Linguistics, Statistical Analysis
Spinella, Sarah – Online Submission, 2011
As result replicability is essential to science and difficult to achieve through external replicability, the present paper notes the insufficiency of null hypothesis statistical significance testing (NHSST) and explains the bootstrap as a plausible alternative, with a heuristic example to illustrate the bootstrap method. The bootstrap relies on…
Descriptors: Sampling, Statistical Inference, Statistical Significance, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Sun, Shuyan; Pan, Wei; Wang, Lihshing Leigh – Journal of Educational Psychology, 2010
Null hypothesis significance testing has dominated quantitative research in education and psychology. However, the statistical significance of a test as indicated by a p-value does not speak to the practical significance of the study. Thus, reporting effect size to supplement p-value is highly recommended by scholars, journal editors, and academic…
Descriptors: Effect Size, Statistical Inference, Statistical Significance, Data Interpretation
Peer reviewed Peer reviewed
Direct linkDirect link
Maraun, Michael; Gabriel, Stephanie – Psychological Methods, 2010
In his article, "An Alternative to Null-Hypothesis Significance Tests," Killeen (2005) urged the discipline to abandon the practice of "p[subscript obs]"-based null hypothesis testing and to quantify the signal-to-noise characteristics of experimental outcomes with replication probabilities. He described the coefficient that he…
Descriptors: Hypothesis Testing, Statistical Inference, Probability, Statistical Significance
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Se-Kang – International Journal of Testing, 2010
The aim of the current study is to validate the invariance of major profile patterns derived from multidimensional scaling (MDS) by bootstrapping. Profile Analysis via Multidimensional Scaling (PAMS) was employed to obtain profiles and bootstrapping was used to construct the sampling distributions of the profile coordinates and the empirical…
Descriptors: Intervals, Multidimensional Scaling, Profiles, Evaluation
Wang, Jianjun – International Journal of Research & Method in Education, 2008
As an alternative to statistical testing, effect size has a non-monotonic linkage with practical importance. Besides random variance and systematic bias, a proper interpretation of effect size hinges on its implication to outcomes of deductive and/or inductive enquiries. Consequently, a small effect size might suggest an important finding, and the…
Descriptors: Effect Size, Statistical Significance, Statistical Inference, Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Levine, Timothy R.; Weber, Rene; Hullett, Craig; Park, Hee Sun; Lindsey, Lisa L. Massi – Human Communication Research, 2008
Null hypothesis significance testing (NHST) is the most widely accepted and frequently used approach to statistical inference in quantitative communication research. NHST, however, is highly controversial, and several serious problems with the approach have been identified. This paper reviews NHST and the controversy surrounding it. Commonly…
Descriptors: Communication Research, Testing, Statistical Significance, Statistical Inference
Peer reviewed Peer reviewed
Direct linkDirect link
Serlin, Ronald C. – Psychological Methods, 2010
The sense that replicability is an important aspect of empirical science led Killeen (2005a) to define "p[subscript rep]," the probability that a replication will result in an outcome in the same direction as that found in a current experiment. Since then, several authors have praised and criticized 'p[subscript rep]," culminating…
Descriptors: Epistemology, Effect Size, Replication (Evaluation), Measurement Techniques
Kim, Seock-Ho; Cohen, Allan S. – 1995
The Behrens-Fisher problem arises when one seeks to make inferences about the means of two normal populations without assuming the variances are equal. This paper presents a review of fundamental concepts and applications used to address the Behrens-Fisher problem under fiducial, Bayesian, and frequentist approaches. Methods of approximations to…
Descriptors: Bayesian Statistics, Hypothesis Testing, Probability, Statistical Inference
Peer reviewed Peer reviewed
Kellow, J. Thomas – American Journal of Evaluation, 1998
Many evaluation students are still being taught the use of tests of statistical significance without being warned about their limitations. This paper discusses other estimates of treatment effects necessary to interpret between-group differences correctly. Sources to improve evaluation practice are also suggested. (SLD)
Descriptors: Estimation (Mathematics), Evaluation Utilization, Groups, Probability
Previous Page | Next Page »
Pages: 1  |  2