NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)3
Since 2006 (last 20 years)5
Audience
Researchers3
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Showing 1 to 15 of 19 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Taber, Keith S. – Chemistry Education Research and Practice, 2020
This comment discusses some issues about the use and reporting of experimental studies in education, illustrated by a recently published study that claimed (i) that an educational innovation was effective despite outcomes not reaching statistical significance, and (ii) that this refuted the findings of an earlier study. The two key issues raised…
Descriptors: Chemistry, Educational Innovation, Statistical Significance, Statistical Inference
Peer reviewed Peer reviewed
Direct linkDirect link
Gorard, Stephen; White, Patrick – Statistics Education Research Journal, 2017
In their response to our paper, Nicholson and Ridgway agree with the majority of what we wrote. They echo our concerns about the misuse of inferential statistics and NHST in particular. Very little of their response explicitly challenges the points we made but where it does their defence of the use of inferential techniques does not stand up to…
Descriptors: Statistical Inference, Statistics, Statistical Significance, Probability
Peer reviewed Peer reviewed
Direct linkDirect link
Nicholson, James; Ridgway, Jim – Statistics Education Research Journal, 2017
White and Gorard make important and relevant criticisms of some of the methods commonly used in social science research, but go further by criticising the logical basis for inferential statistical tests. This paper comments briefly on matters we broadly agree on with them and more fully on matters where we disagree. We agree that too little…
Descriptors: Statistical Inference, Statistics, Teaching Methods, Criticism
Wang, Jianjun – International Journal of Research & Method in Education, 2008
As an alternative to statistical testing, effect size has a non-monotonic linkage with practical importance. Besides random variance and systematic bias, a proper interpretation of effect size hinges on its implication to outcomes of deductive and/or inductive enquiries. Consequently, a small effect size might suggest an important finding, and the…
Descriptors: Effect Size, Statistical Significance, Statistical Inference, Evaluation
Peer reviewed Peer reviewed
Direct linkDirect link
Cumming, Geoff – Psychological Methods, 2010
This comment offers three descriptions of "p[subscript rep]" that start with a frequentist account of confidence intervals, draw on R. A. Fisher's fiducial argument, and do not make Bayesian assumptions. Links are described among "p[subscript rep]," "p" values, and the probability a confidence interval will capture…
Descriptors: Replication (Evaluation), Measurement Techniques, Research Methodology, Validity
Peer reviewed Peer reviewed
Schafer, William D. – Measurement and Evaluation in Counseling and Development, 1993
Considers objections to comparisonwise position, which holds that, when conducting simultaneous significance procedures, per-test Type I error rate should be controlled and that it is unnecessary to introduce adjustments designed to control familywise rate. Objections collected by Saville in an attempt to refute them are discussed along with…
Descriptors: Statistical Inference, Statistical Significance, Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Michael D.; Wagenmakers, Eric-Jan – Psychological Review, 2005
D. Trafimow presented an analysis of null hypothesis significance testing (NHST) using Bayes's theorem. Among other points, he concluded that NHST is logically invalid, but that logically valid Bayesian analyses are often not possible. The latter conclusion reflects a fundamental misunderstanding of the nature of Bayesian inference. This view…
Descriptors: Psychology, Statistical Inference, Statistical Significance, Bayesian Statistics
Peer reviewed Peer reviewed
Vacha-Haase, Tammi; Thompson, Bruce – Measurement and Evaluation in Counseling and Development, 1998
Responds to Biskin's comments (this issue) on the significance test controversy. Highlights areas of agreement (importance of replication evidence, importance of effect sizes) and disagreement (influence of sample size, evaluation of populations vs. samples, significance of Carver's article). Includes further recommendations for reporting research…
Descriptors: Data Interpretation, Hypothesis Testing, Psychological Studies, Sampling
Peer reviewed Peer reviewed
Biskin, Bruce H. – Measurement and Evaluation in Counseling and Development, 1998
Significance tests are often used inappropriately in counseling research. In addition to comments on the significance test controversy (Vacha-Haase and Nilsson, this issue), a wider historical context and personal experiences are provided. The controversy is discussed, and seven recommendations for using significance tests are included. (EMK)
Descriptors: Data Interpretation, Hypothesis Testing, Psychological Studies, Psychometrics
Peer reviewed Peer reviewed
Hsu, Louis M. – Journal of Counseling Psychology, 1989
Discusses three topics related to interpretation of discriminant analyses (DA's): (1) partial F ratios and partial Wilks's lambdas for predictor variables in standard, step-down, and stepwise DA's; (2) relation of goals of classification to definition/evaluation of classification rules; and (3) significance tests for total hit rates in internal…
Descriptors: Data Interpretation, Discriminant Analysis, Multivariate Analysis, Predictor Variables
Peer reviewed Peer reviewed
Kirk, Roger E. – Educational and Psychological Measurement, 2001
Makes the case that science is best served when researchers focus on the size of effects and their practical significance. Advocates the use of confidence intervals for deciding whether chance or sampling variability is an unlikely explanation for an observed effect. Calls for more emphasis on effect sizes in the next edition of the American…
Descriptors: Effect Size, Hypothesis Testing, Psychology, Research Reports
Peer reviewed Peer reviewed
Austin, James T.; And Others – Personnel Psychology, 1989
A critical reanalysis of Barrett, Caldwell, and Alexander's (1985) critique of dynamic criteria. Summarizes and questions Barrett, et al.'s three definitions of dynamic criteria and their conclusion that reported temporal changes in criteria could be explained by methodological artifacts. A greater focus on dynamic criteria as constructs is…
Descriptors: Evaluation Criteria, Predictor Variables, Psychometrics, Reader Response
Peer reviewed Peer reviewed
Barrett, Gerald V.; Alexander, Ralph A. – Personnel Psychology, 1989
Responds to Austin, Humphreys, and Hulin's (1989) critique of Barrett, Caldwell, and Alexander, suggesting that the burden of proof still rests on the advocates of the concept of dynamic criteria, and that empirical support is lacking for the existence of dynamic criteria as a simplex. Contrary evidence from educational, organizations, and…
Descriptors: Evaluation Criteria, Predictor Variables, Psychometrics, Reader Response
Peer reviewed Peer reviewed
Suen, Hoi K. – Topics in Early Childhood Special Education, 1992
This commentary on EC 603 695 argues that significance testing is a necessary but insufficient condition for positivistic research, that judgment-based assessment and single-subject research are not substitutes for significance testing, and that sampling fluctuation should be considered as one of numerous epistemological concerns in any…
Descriptors: Evaluation Methods, Evaluative Thinking, Research Design, Research Methodology
Peer reviewed Peer reviewed
Levine, Timothy R.; Banas, John – Communication Monographs, 2002
Documents the recent use of one-tailed F-tests in communication journals, and examines the arguments both for and against their use. Examines the use of these tests within the broader unresolved controversy surrounding the use of one-tailed tests. Recommends that future researchers should most often avoid one-tailed Fs, and generally exercise…
Descriptors: Communication (Thought Transfer), Communication Research, Futures (of Society), Higher Education
Previous Page | Next Page ยป
Pages: 1  |  2