Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 6 |
Descriptor
Evaluation Methods | 12 |
Statistical Inference | 12 |
Statistical Significance | 12 |
Hypothesis Testing | 6 |
Research Methodology | 6 |
Measurement Techniques | 5 |
Research Design | 5 |
Effect Size | 4 |
Error of Measurement | 4 |
Probability | 4 |
Statistical Analysis | 4 |
More ▼ |
Source
Author
Suen, Hoi K. | 2 |
Byrd, Jimmy K. | 1 |
Cumming, Geoff | 1 |
Da Prato, Robert A. | 1 |
Daniel, Larry G. | 1 |
Gabriel, Stephanie | 1 |
Hau, Kit-Tai | 1 |
Lefebvre, Daniel J. | 1 |
Maraun, Michael | 1 |
Marsh, Herbert W. | 1 |
Onwuegbuzie, Anthony J. | 1 |
More ▼ |
Publication Type
Journal Articles | 9 |
Reports - Research | 4 |
Opinion Papers | 3 |
Reports - Evaluative | 3 |
Speeches/Meeting Papers | 2 |
Guides - Non-Classroom | 1 |
Information Analyses | 1 |
Reports - Descriptive | 1 |
Education Level
Audience
Researchers | 2 |
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
What Works Clearinghouse, 2014
This "What Works Clearinghouse Procedures and Standards Handbook (Version 3.0)" provides a detailed description of the standards and procedures of the What Works Clearinghouse (WWC). The remaining chapters of this Handbook are organized to take the reader through the basic steps that the WWC uses to develop a review protocol, identify…
Descriptors: Educational Research, Guides, Intervention, Classification
Maraun, Michael; Gabriel, Stephanie – Psychological Methods, 2010
In his article, "An Alternative to Null-Hypothesis Significance Tests," Killeen (2005) urged the discipline to abandon the practice of "p[subscript obs]"-based null hypothesis testing and to quantify the signal-to-noise characteristics of experimental outcomes with replication probabilities. He described the coefficient that he…
Descriptors: Hypothesis Testing, Statistical Inference, Probability, Statistical Significance
Overall, John E.; Tonidandel, Scott – Multivariate Behavioral Research, 2010
A previous Monte Carlo study examined the relative powers of several simple and more complex procedures for testing the significance of difference in mean rates of change in a controlled, longitudinal, treatment evaluation study. Results revealed that the relative powers depended on the correlation structure of the simulated repeated measurements.…
Descriptors: Monte Carlo Methods, Statistical Significance, Correlation, Depression (Psychology)
Serlin, Ronald C. – Psychological Methods, 2010
The sense that replicability is an important aspect of empirical science led Killeen (2005a) to define "p[subscript rep]," the probability that a replication will result in an outcome in the same direction as that found in a current experiment. Since then, several authors have praised and criticized 'p[subscript rep]," culminating…
Descriptors: Epistemology, Effect Size, Replication (Evaluation), Measurement Techniques
Cumming, Geoff – Psychological Methods, 2010
This comment offers three descriptions of "p[subscript rep]" that start with a frequentist account of confidence intervals, draw on R. A. Fisher's fiducial argument, and do not make Bayesian assumptions. Links are described among "p[subscript rep]," "p" values, and the probability a confidence interval will capture…
Descriptors: Replication (Evaluation), Measurement Techniques, Research Methodology, Validity
Yu, Chong-Ho – Online Submission, 2005
Many research-related classes in social sciences present probability as a unified approach based upon mathematical axioms, but neglect the diversity of various probability theories and their associated philosophical assumptions. Although currently the dominant statistical and probabilistic approach is the Fisherian tradition, the use of Fisherian…
Descriptors: Probability, Inferences, Social Sciences, Statistical Significance

Onwuegbuzie, Anthony J.; Roberts, J. Kyle; Daniel, Larry G. – Measurement and Evaluation in Counseling and Development, 2005
In this article, the authors (a) illustrate how displaying disattenuated correlation coefficients alongside their unadjusted counterparts will allow researchers to assess the impact of unreliability on bivariate relationships and (b) demonstrate how a proposed new "what if reliability" analysis can complement null hypothesis significance…
Descriptors: Correlation, Statistical Significance, Reliability, Error of Measurement
Byrd, Jimmy K. – Educational Administration Quarterly, 2007
Purpose: The purpose of this study was to review research published by Educational Administration Quarterly (EAQ) during the past 10 years to determine if confidence intervals and effect sizes were being reported as recommended by the American Psychological Association (APA) Publication Manual. Research Design: The author examined 49 volumes of…
Descriptors: Research Design, Intervals, Statistical Inference, Effect Size

Suen, Hoi K. – Topics in Early Childhood Special Education, 1992
This commentary on EC 603 695 argues that significance testing is a necessary but insufficient condition for positivistic research, that judgment-based assessment and single-subject research are not substitutes for significance testing, and that sampling fluctuation should be considered as one of numerous epistemological concerns in any…
Descriptors: Evaluation Methods, Evaluative Thinking, Research Design, Research Methodology

Da Prato, Robert A. – Topics in Early Childhood Special Education, 1992
This paper argues that judgment-based assessment of data from multiply replicated single-subject or small-N studies should replace normative-based (p=less than 0.05) assessment of large-N research in the clinical sciences, and asserts that inferential statistics should be abandoned as a method of evaluating clinical research data. (Author/JDD)
Descriptors: Evaluation Methods, Evaluative Thinking, Norms, Research Design
Marsh, Herbert W.; Hau, Kit-Tai; Wen, Zhonglin – Structural Equation Modeling, 2004
Goodness-of-fit (GOF) indexes provide "rules of thumb"?recommended cutoff values for assessing fit in structural equation modeling. Hu and Bentler (1999) proposed a more rigorous approach to evaluating decision rules based on GOF indexes and, on this basis, proposed new and more stringent cutoff values for many indexes. This article discusses…
Descriptors: Statistical Significance, Structural Equation Models, Evaluation Methods, Evaluation Research
Lefebvre, Daniel J.; Suen, Hoi K. – 1990
An empirical investigation of methodological issues associated with evaluating treatment effect in single-subject research (SSR) designs is presented. This investigation: (1) conducted a generalizability (G) study to identify the sources of systematic and random measurement error (SRME); (2) used an analytic approach based on G theory to integrate…
Descriptors: Classroom Observation Techniques, Disabilities, Educational Research, Error of Measurement