ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	6

Descriptor

Evaluation Methods	12
Statistical Inference	12
Statistical Significance	12
Hypothesis Testing	6
Research Methodology	6
Measurement Techniques	5
Research Design	5
Effect Size	4
Error of Measurement	4
Probability	4
Statistical Analysis	4
Evaluation Problems	3
Experiments	3
Misconceptions	3
Replication (Evaluation)	3
Sampling	3
Bayesian Statistics	2
Correlation	2
Data Analysis	2
Educational Research	2
Evaluative Thinking	2
Outcomes of Treatment	2
Predictive Measurement	2
Social Sciences	2
Statistics	2
More ▼

Source

Psychological Methods	3
Topics in Early Childhood…	2
Educational Administration…	1
Measurement and Evaluation in…	1
Multivariate Behavioral…	1
Online Submission	1
Structural Equation Modeling	1
What Works Clearinghouse	1

Author

Suen, Hoi K.	2
Byrd, Jimmy K.	1
Cumming, Geoff	1
Da Prato, Robert A.	1
Daniel, Larry G.	1
Gabriel, Stephanie	1
Hau, Kit-Tai	1
Lefebvre, Daniel J.	1
Maraun, Michael	1
Marsh, Herbert W.	1
Onwuegbuzie, Anthony J.	1
Overall, John E.	1
Roberts, J. Kyle	1
Serlin, Ronald C.	1
Tonidandel, Scott	1
Wen, Zhonglin	1
Yu, Chong-Ho	1
More ▼

Publication Type

Journal Articles	9
Reports - Research	4
Opinion Papers	3
Reports - Evaluative	3
Speeches/Meeting Papers	2
Guides - Non-Classroom	1
Information Analyses	1
Reports - Descriptive	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 12 results Save | Export

What Works Clearinghouse Procedures and Standards Handbook, Version 3.0

Peer reviewed
PDF on ERIC

Download full text

What Works Clearinghouse, 2014

This "What Works Clearinghouse Procedures and Standards Handbook (Version 3.0)" provides a detailed description of the standards and procedures of the What Works Clearinghouse (WWC). The remaining chapters of this Handbook are organized to take the reader through the basic steps that the WWC uses to develop a review protocol, identify…

Descriptors: Educational Research, Guides, Intervention, Classification

Killeen's (2005) "p[subscript rep]" Coefficient: Logical and Mathematical Problems

Peer reviewed

Direct link

Maraun, Michael; Gabriel, Stephanie – Psychological Methods, 2010

In his article, "An Alternative to Null-Hypothesis Significance Tests," Killeen (2005) urged the discipline to abandon the practice of "p[subscript obs]"-based null hypothesis testing and to quantify the signal-to-noise characteristics of experimental outcomes with replication probabilities. He described the coefficient that he…

Descriptors: Hypothesis Testing, Statistical Inference, Probability, Statistical Significance

The Case for Use of Simple Difference Scores to Test the Significance of Differences in Mean Rates of Change in Controlled Repeated Measurements Designs

Peer reviewed

Direct link

Overall, John E.; Tonidandel, Scott – Multivariate Behavioral Research, 2010

A previous Monte Carlo study examined the relative powers of several simple and more complex procedures for testing the significance of difference in mean rates of change in a controlled, longitudinal, treatment evaluation study. Results revealed that the relative powers depended on the correlation structure of the simulated repeated measurements.…

Descriptors: Monte Carlo Methods, Statistical Significance, Correlation, Depression (Psychology)

Regarding "p[subscript rep]": Comment Prompted by Iverson, Wagenmakers, and Lee (2010); Lecoutre, Lecoutre, and Poitevineau (2010); and Maraun and Gabriel (2010)

Peer reviewed

Direct link

Serlin, Ronald C. – Psychological Methods, 2010

The sense that replicability is an important aspect of empirical science led Killeen (2005a) to define "p[subscript rep]," the probability that a replication will result in an outcome in the same direction as that found in a current experiment. Since then, several authors have praised and criticized 'p[subscript rep]," culminating…

Descriptors: Epistemology, Effect Size, Replication (Evaluation), Measurement Techniques

Replication, "p[subscript rep]," and Confidence Intervals: Comment Prompted by Iverson, Wagenmakers, and Lee (2010); Lecoutre, Lecoutre, and Poitevineau (2010); and Maraun and Gabriel (2010)

Peer reviewed

Direct link

Cumming, Geoff – Psychological Methods, 2010

This comment offers three descriptions of "p[subscript rep]" that start with a frequentist account of confidence intervals, draw on R. A. Fisher's fiducial argument, and do not make Bayesian assumptions. Links are described among "p[subscript rep]," "p" values, and the probability a confidence interval will capture…

Descriptors: Replication (Evaluation), Measurement Techniques, Research Methodology, Validity

Balkanization and Unification of Probabilistic Inferences

Download full text

Yu, Chong-Ho – Online Submission, 2005

Many research-related classes in social sciences present probability as a unified approach based upon mathematical axioms, but neglect the diversity of various probability theories and their associated philosophical assumptions. Although currently the dominant statistical and probabilistic approach is the Fisherian tradition, the use of Fisherian…

Descriptors: Probability, Inferences, Social Sciences, Statistical Significance

A Proposed New "What if Reliability" Analysis for Assessing the Statistical Significance of Bivariate Relationships

Peer reviewed

Onwuegbuzie, Anthony J.; Roberts, J. Kyle; Daniel, Larry G. – Measurement and Evaluation in Counseling and Development, 2005

In this article, the authors (a) illustrate how displaying disattenuated correlation coefficients alongside their unadjusted counterparts will allow researchers to assess the impact of unreliability on bivariate relationships and (b) demonstrate how a proposed new "what if reliability" analysis can complement null hypothesis significance…

Descriptors: Correlation, Statistical Significance, Reliability, Error of Measurement

Significance Testing; Necessary but Insufficient.

Peer reviewed

Suen, Hoi K. – Topics in Early Childhood Special Education, 1992

This commentary on EC 603 695 argues that significance testing is a necessary but insufficient condition for positivistic research, that judgment-based assessment and single-subject research are not substitutes for significance testing, and that sampling fluctuation should be considered as one of numerous epistemological concerns in any…

Descriptors: Evaluation Methods, Evaluative Thinking, Research Design, Research Methodology

A Call for Statistical Reform in EAQ

Peer reviewed

Direct link

Byrd, Jimmy K. – Educational Administration Quarterly, 2007

Purpose: The purpose of this study was to review research published by Educational Administration Quarterly (EAQ) during the past 10 years to determine if confidence intervals and effect sizes were being reported as recommended by the American Psychological Association (APA) Publication Manual. Research Design: The author examined 49 volumes of…

Descriptors: Research Design, Intervals, Statistical Inference, Effect Size

Large-Group Fantasies versus Single-Subject Science.

Peer reviewed

Da Prato, Robert A. – Topics in Early Childhood Special Education, 1992

This paper argues that judgment-based assessment of data from multiply replicated single-subject or small-N studies should replace normative-based (p=less than 0.05) assessment of large-N research in the clinical sciences, and asserts that inferential statistics should be abandoned as a method of evaluating clinical research data. (Author/JDD)

Descriptors: Evaluation Methods, Evaluative Thinking, Norms, Research Design

In Search of Golden Rules: Comment on Hypothesis-Testing Approaches to Setting Cutoff Values for Fit Indexes and Dangers in Overgeneralizing Hu and Bentler's (1999) Findings

Peer reviewed

Direct link

Marsh, Herbert W.; Hau, Kit-Tai; Wen, Zhonglin – Structural Equation Modeling, 2004

Goodness-of-fit (GOF) indexes provide "rules of thumb"?recommended cutoff values for assessing fit in structural equation modeling. Hu and Bentler (1999) proposed a more rigorous approach to evaluating decision rules based on GOF indexes and, on this basis, proposed new and more stringent cutoff values for many indexes. This article discusses…

Descriptors: Statistical Significance, Structural Equation Models, Evaluation Methods, Evaluation Research

Applying Generalizability Theory To Evaluate Treatment Effect in Single-Subject Research.

Download full text

Lefebvre, Daniel J.; Suen, Hoi K. – 1990

An empirical investigation of methodological issues associated with evaluating treatment effect in single-subject research (SSR) designs is presented. This investigation: (1) conducted a generalizability (G) study to identify the sources of systematic and random measurement error (SRME); (2) used an analytic approach based on G theory to integrate…

Descriptors: Classroom Observation Techniques, Disabilities, Educational Research, Error of Measurement