Publication Date
| In 2026 | 0 |
| Since 2025 | 4 |
| Since 2022 (last 5 years) | 35 |
| Since 2017 (last 10 years) | 87 |
| Since 2007 (last 20 years) | 253 |
Descriptor
Source
Author
| Spybrook, Jessaca | 8 |
| Konstantopoulos, Spyros | 7 |
| Hedges, Larry V. | 5 |
| Kelcey, Ben | 5 |
| Kromrey, Jeffrey D. | 5 |
| Slavin, Robert E. | 5 |
| Thompson, Bruce | 5 |
| Algina, James | 4 |
| Dong, Nianbo | 4 |
| Olejnik, Stephen | 4 |
| Penfield, Randall D. | 4 |
| More ▼ | |
Publication Type
Education Level
Audience
| Researchers | 13 |
| Policymakers | 1 |
| Practitioners | 1 |
| Teachers | 1 |
Location
| Turkey | 9 |
| Florida | 4 |
| United States | 4 |
| Indonesia | 3 |
| Taiwan | 3 |
| Asia | 2 |
| Indiana | 2 |
| Texas | 2 |
| Australia | 1 |
| California | 1 |
| Canada | 1 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Conrad Borchers – International Educational Data Mining Society, 2025
Algorithmic bias is a pressing concern in educational data mining (EDM), as it risks amplifying inequities in learning outcomes. The Area Between ROC Curves (ABROCA) metric is frequently used to measure discrepancies in model performance across demographic groups to quantify overall model fairness. However, its skewed distribution--especially when…
Descriptors: Algorithms, Bias, Statistics, Simulation
Wenyi Li; Qian Zhang – Society for Research on Educational Effectiveness, 2025
This study compared Stepwise Logistic Regression (Stepwise-LR) and three machine learning (ML) methods--Classification and Regression Trees (CART), Random Forest (RF), and Generalized Boosted Modeling (GBM) for estimating propensity scores (PS) applied in causal inference. A simulation study was conducted considering factors of the sample size,…
Descriptors: Regression (Statistics), Artificial Intelligence, Statistical Analysis, Computation
Peter M. Steiner; Patrick Sheehan; Vivian C. Wong – Grantee Submission, 2023
Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for…
Descriptors: Replication (Evaluation), Measurement Techniques, Statistical Analysis, Effect Size
Terry A. Beehr; Minseo Kim; Ian W. Armstrong – International Journal of Social Research Methodology, 2024
Previous research extensively studied reasons for and ways to avoid low response rates, but it largely ignored the primary research issue of the degree to which response rates matter, which we address. Methodological survey research on response rates has been concerned with how to increase responsiveness and with the effects of response rates on…
Descriptors: Surveys, Response Rates (Questionnaires), Effect Size, Research Methodology
James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024
In this study a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that are appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…
Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size
Cox, Kyle; Kelcey, Benjamin – American Journal of Evaluation, 2023
Analysis of the differential treatment effects across targeted subgroups and contexts is a critical objective in many evaluations because it delineates for whom and under what conditions particular programs, therapies or treatments are effective. Unfortunately, it is unclear how to plan efficient and effective evaluations that include these…
Descriptors: Statistical Analysis, Research Design, Cluster Grouping, Sample Size
Bulus, Metin – Journal of Research on Educational Effectiveness, 2022
Although Cattaneo et al. (2019) provided a data-driven framework for power computations for Regression Discontinuity Designs in line with rdrobust Stata and R commands, which allows higher-order functional forms for the score variable when using the non-parametric local polynomial estimation, analogous advancements in their parametric estimation…
Descriptors: Effect Size, Computation, Regression (Statistics), Statistical Analysis
Chuanjian Zhang; Na Sun; Yueshuai Jiang; Huacong Liu; Qinhui Huang – Asia-Pacific Education Researcher, 2025
Peer tutoring has become a widely used practice in higher education institutions to support students' academic success, although its effects remain controversial. This article synthesizes 27 independent experimental and quasi-experimental studies to examine the relationship between peer tutoring programs and college students' academic performance,…
Descriptors: Peer Teaching, Tutoring, Academic Achievement, Higher Education
Peter M. Steiner; Patrick Sheehan; Vivian C. Wong – Annenberg Institute for School Reform at Brown University, 2022
Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for…
Descriptors: Replication (Evaluation), Measurement Techniques, Statistical Analysis, Effect Size
Caspar J. Van Lissa; Eli-Boaz Clapper; Rebecca Kuiper – Research Synthesis Methods, 2024
The product Bayes factor (PBF) synthesizes evidence for an informative hypothesis across heterogeneous replication studies. It can be used when fixed- or random effects meta-analysis fall short. For example, when effect sizes are incomparable and cannot be pooled, or when studies diverge significantly in the populations, study designs, and…
Descriptors: Hypothesis Testing, Evaluation Methods, Replication (Evaluation), Sample Size
McKay, Brad; Corson, Abbey; Vinh, Mary-Anne; Jeyarajan, Gianna; Tandon, Chitrini; Brooks, Hugh; Hubley, Julie; Carter, Michael J. – Journal of Motor Learning and Development, 2023
A priori power analyses can ensure studies are unlikely to miss interesting effects. Recent metascience has suggested that kinesiology research may be underpowered and selectively reported. Here, we examined whether power analyses are being used to ensure informative studies in motor behavior. We reviewed every article published in three motor…
Descriptors: Incidence, Statistical Analysis, Psychomotor Skills, Motor Development
Weese, James D.; Turner, Ronna C.; Liang, Xinya; Ames, Allison; Crawford, Brandon – Educational and Psychological Measurement, 2023
A study was conducted to implement the use of a standardized effect size and corresponding classification guidelines for polytomous data with the POLYSIBTEST procedure and compare those guidelines with prior recommendations. Two simulation studies were included. The first identifies new unstandardized test heuristics for classifying moderate and…
Descriptors: Effect Size, Classification, Guidelines, Statistical Analysis
Chunhua Cao; Benjamin Lugu; Jujia Li – Structural Equation Modeling: A Multidisciplinary Journal, 2024
This study examined the false positive (FP) rates and sensitivity of Bayesian fit indices to structural misspecification in Bayesian structural equation modeling. The impact of measurement quality, sample size, model size, the magnitude of misspecified path effect, and the choice or prior on the performance of the fit indices was also…
Descriptors: Structural Equation Models, Bayesian Statistics, Measurement, Error of Measurement
Kush, Joseph M.; Konold, Timothy R.; Bradshaw, Catherine P. – Journal of Experimental Education, 2022
In two-level designs, the total sample is a function of both the number of Level 2 clusters and the average number of Level 1 units per cluster. Traditional multilevel power calculations rely on either the arithmetic average or the harmonic mean when estimating the average number of Level 1 units across clusters of unbalanced size. The current…
Descriptors: Multivariate Analysis, Randomized Controlled Trials, Monte Carlo Methods, Sample Size
Mikkel Helding Vembye; James Eric Pustejovsky; Therese Deocampo Pigott – Research Synthesis Methods, 2024
Sample size and statistical power are important factors to consider when planning a research synthesis. Power analysis methods have been developed for fixed effect or random effects models, but until recently these methods were limited to simple data structures with a single, independent effect per study. Recent work has provided power…
Descriptors: Sample Size, Robustness (Statistics), Effect Size, Social Science Research

Peer reviewed
Direct link
