Wendy Chan; Jimin Oh; Chen Li; Jiexuan Huang; Yeran Tong – Society for Research on Educational Effectiveness, 2023
Background: The generalizability of a study's results continues to be at the forefront of concerns in evaluation research in education (Tipton & Olsen, 2018). Over the past decade, statisticians have developed methods, mainly based on propensity scores, to improve generalizations in the absence of random sampling (Stuart et al., 2011; Tipton,…
Descriptors: Generalizability Theory, Probability, Scores, Sampling
Sarah E. Robertson; Jon A. Steingrimsson; Issa J. Dahabreh – Evaluation Review, 2024
When planning a cluster randomized trial, evaluators often have access to an enumerated cohort representing the target population of clusters. Practicalities of conducting the trial, such as the need to oversample clusters with certain characteristics in order to improve trial economy or support inferences about subgroups of clusters, may preclude…
Descriptors: Randomized Controlled Trials, Generalization, Inferences, Hierarchical Linear Modeling
Kelvin Terrell Pompey – ProQuest LLC, 2021
Many methods are used to measure interrater reliability for studies where each target receives ratings by a different set of judges. The purpose of this study is to explore the use of hierarchical modeling for estimating interrater reliability using the intraclass correlation coefficient. This study provides a description of how the ICC can be…
Descriptors: Interrater Reliability, Evaluation Methods, Test Reliability, Correlation
Ellison, George T. H. – Journal of Statistics and Data Science Education, 2021
Temporality-driven covariate classification had limited impact on the specification of directed acyclic graphs (DAGs) by 85 novice analysts (medical undergraduates) or on the risk of bias in DAG-informed multivariable models designed to generate causal inference from observational data. Only 71 students (83.5%) managed to complete the…
Descriptors: Statistics Education, Medical Education, Undergraduate Students, Graphs
Gagnon-Bartsch, J. A.; Sales, A. C.; Wu, E.; Botelho, A. F.; Erickson, J. A.; Miratrix, L. W.; Heffernan, N. T. – Grantee Submission, 2019
Randomized controlled trials (RCTs) admit unconfounded design-based inference--randomization largely justifies the assumptions underlying statistical effect estimates--but often have limited sample sizes. However, researchers may have access to big observational data on covariates and outcomes from RCT non-participants. For example, data from A/B…
Descriptors: Randomized Controlled Trials, Educational Research, Prediction, Algorithms
Valente, Matthew J.; Gonzalez, Oscar; Miocevic, Milica; MacKinnon, David P. – Educational and Psychological Measurement, 2016
Methods to assess the significance of mediated effects in education and the social sciences are well studied and fall into two categories: single sample methods and computer-intensive methods. A popular single sample method to detect the significance of the mediated effect is the test of joint significance, and a popular computer-intensive method…
Descriptors: Structural Equation Models, Sampling, Statistical Inference, Statistical Bias
Deep Learning Based Imbalanced Data Classification and Information Retrieval for Multimedia Big Data
Yan, Yilin – ProQuest LLC, 2018
The development of information science has enabled explosive growth in data, attracting more and more researchers to the field of big data analytics. Notably, in many real-world applications, large amounts of data are imbalanced because the events of interest occur infrequently. Classification of imbalanced data is an…
Descriptors: Information Science, Information Retrieval, Multimedia Materials, Data
Gongjun Xu; Tony Sit; Lan Wang; Chiung-Yu Huang – Grantee Submission, 2017
Biased sampling occurs frequently in economics, epidemiology, and medical studies, either by design or because of the data-collection mechanism. Failing to account for the sampling bias usually leads to incorrect inference. We propose a unified estimation procedure and a computationally fast resampling method to make statistical inference for…
Descriptors: Sampling, Statistical Inference, Computation, Generalization
Mazza, Angelo; Punzo, Antonio – Sociological Methods & Research, 2015
The dissimilarity index of Duncan and Duncan is widely used in a broad range of contexts to assess the overall extent of segregation in the allocation of two groups in two or more units. Its sensitivity to random allocation implies an upward bias with respect to the unknown amount of systematic segregation. In this article, following a multinomial…
Descriptors: Statistical Bias, Error of Measurement, Error Correction, Mathematical Logic
Maeda, Hotaka; Zhang, Bo – International Journal of Testing, 2017
The omega (ω) statistic is reputed to be one of the best indices for detecting answer copying on multiple choice tests, but its performance relies on the accurate estimation of copier ability, which is challenging because responses from the copiers may have been contaminated. We propose an algorithm that aims to identify and delete the suspected…
Descriptors: Cheating, Test Items, Mathematics, Statistics
Leth-Steensen, Craig; Gallitto, Elena – Educational and Psychological Measurement, 2016
A large number of approaches have been proposed for estimating and testing the significance of indirect effects in mediation models. In this study, four sets of Monte Carlo simulations involving full latent variable structural equation models were run in order to contrast the effectiveness of the currently popular bias-corrected bootstrapping…
Descriptors: Mediation Theory, Structural Equation Models, Monte Carlo Methods, Simulation
Bishara, Anthony J.; Hittner, James B. – Educational and Psychological Measurement, 2015
It is more common for educational and psychological data to be nonnormal than to be approximately normal. This tendency may lead to bias and error in point estimates of the Pearson correlation coefficient. In a series of Monte Carlo simulations, the Pearson correlation was examined under conditions of normal and nonnormal data, and it was compared…
Descriptors: Research Methodology, Monte Carlo Methods, Correlation, Simulation
Ugille, Maaike; Moeyaert, Mariola; Beretvas, S. Natasha; Ferron, John M.; Van den Noortgate, Wim – Journal of Experimental Education, 2014
A multilevel meta-analysis can combine the results of several single-subject experimental design studies. However, the estimated effects are biased if the effect sizes are standardized and the number of measurement occasions is small. In this study, the authors investigated 4 approaches to correct for this bias. First, the standardized effect…
Descriptors: Effect Size, Statistical Bias, Sample Size, Regression (Statistics)
Bai, Haiyan – Journal of Experimental Education, 2013
Propensity score estimation plays a fundamental role in propensity score matching for reducing group selection bias in observational data. To increase the accuracy of propensity score estimation, the author developed a bootstrap propensity score. The commonly used propensity score matching methods: nearest neighbor matching, caliper matching, and…
Descriptors: Statistical Inference, Sampling, Probability, Computation
Skaggs, Gary; Wilkins, Jesse L. M.; Hein, Serge F. – International Journal of Testing, 2016
The purpose of this study was to explore the degree of grain size of the attributes and the sample sizes that can support accurate parameter recovery with the General Diagnostic Model (GDM) for a large-scale international assessment. In this resampling study, bootstrap samples were obtained from the 2003 Grade 8 TIMSS in Mathematics at varying…
Descriptors: Achievement Tests, Foreign Countries, Elementary Secondary Education, Science Achievement