Publication Date
In 2025 | 0 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 15 |
Since 2016 (last 10 years) | 38 |
Since 2006 (last 20 years) | 43 |
Descriptor
Sampling | 43 |
Correlation | 9 |
Evaluation Methods | 9 |
Computation | 8 |
Error of Measurement | 8 |
Prediction | 8 |
Statistical Inference | 8 |
Educational Research | 7 |
Models | 7 |
Research Methodology | 7 |
Algorithms | 6 |
More ▼ |
Source
Grantee Submission | 43 |
Author
Hedges, Larry V. | 4 |
Gongjun Xu | 3 |
Peng Ding | 3 |
Zhang, Zhiyong | 3 |
Amisha Jindal | 2 |
Ashish Gurung | 2 |
Brunskill, Emma | 2 |
Chun Wang | 2 |
Doroudi, Shayan | 2 |
Erin Ottmar | 2 |
Ji-Eun Lee | 2 |
More ▼ |
Publication Type
Reports - Research | 33 |
Journal Articles | 9 |
Speeches/Meeting Papers | 6 |
Reports - Evaluative | 5 |
Reports - Descriptive | 3 |
Information Analyses | 2 |
Education Level
Audience
Location
Florida | 2 |
California | 1 |
Canada | 1 |
District of Columbia | 1 |
Kentucky | 1 |
Maine | 1 |
Maryland | 1 |
Minnesota | 1 |
Tennessee | 1 |
Texas | 1 |
United Kingdom (Wales) | 1 |
More ▼ |
Laws, Policies, & Programs
Head Start | 1 |
Assessments and Surveys
Child Behavior Checklist | 1 |
MacArthur Bates Communicative… | 1 |
Program for International… | 1 |
Program for the International… | 1 |
What Works Clearinghouse Rating
Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…
Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis
Magooda, Ahmed; Litman, Diane – Grantee Submission, 2021
This paper explores three simple data manipulation techniques (synthesis, augmentation, curriculum) for improving abstractive summarization models without the need for any additional data. We introduce a method of data synthesis with paraphrasing, a data augmentation technique with sample mixing, and curriculum learning with two new difficulty…
Descriptors: Data Analysis, Synthesis, Documentation, Models
Weicong Lyu; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Data harmonization is an emerging approach to strategically combining data from multiple independent studies, enabling addressing new research questions that are not answerable by a single contributing study. A fundamental psychometric challenge for data harmonization is to create commensurate measures for the constructs of interest across…
Descriptors: Data Analysis, Test Items, Psychometrics, Item Response Theory
Magooda, Ahmed; Elaraby, Mohamed; Litman, Diane – Grantee Submission, 2021
This paper explores the effect of using multitask learning for abstractive summarization in the context of small training corpora. In particular, we incorporate four different tasks (extractive summarization, language modeling, concept detection, and paraphrase detection) both individually and in combination, with the goal of enhancing the target…
Descriptors: Data Analysis, Synthesis, Documentation, Training
Avery H. Closser; Adam Sales; Anthony F. Botelho – Grantee Submission, 2024
Emergent technologies present platforms for educational researchers to conduct randomized controlled trials (RCTs) and collect rich data on study students' performance, behavior, learning processes, and outcomes in authentic learning environments. As educational research increasingly uses methods and data collection from such platforms, it is…
Descriptors: Data Analysis, Educational Research, Randomized Controlled Trials, Sampling
Daniel Litwok; Austin Nichols; Azim Shivji; Robert B. Olsen – Grantee Submission, 2022
Experimental studies of educational interventions are rarely based on representative samples of the target population. This simulation study tests two formal sampling strategies for selecting districts and schools from within strata when they may not agree to participate if selected: (1) balanced selection of the most typical district or school…
Descriptors: Educational Research, School Districts, Schools, Research Methodology
Tamara Broderick; Andrew Gelman; Rachael Meager; Anna L. Smith; Tian Zheng – Grantee Submission, 2022
Probabilistic machine learning increasingly informs critical decisions in medicine, economics, politics, and beyond. To aid the development of trust in these decisions, we develop a taxonomy delineating where trust in an analysis can break down: (1) in the translation of real-world goals to goals on a particular set of training data, (2) in the…
Descriptors: Taxonomy, Trust (Psychology), Algorithms, Probability
Anqi Zhao; Peng Ding; Tirthankar Dasgupta – Grantee Submission, 2018
Given two 2-level factors of interest, a 2[superscript 2] split-plot design (a) takes each of the 2 [superscript 2] = 4 possible factorial combinations as a treatment, (b) identifies one factor as `whole-plot,' (c) divides the experimental units into blocks, and (d) assigns the treatments in such away that all units within the same block receive…
Descriptors: Statistical Inference, Computation, Statistical Analysis, Sampling
Qu, Wen; Liu, Haiyan; Zhang, Zhiyong – Grantee Submission, 2020
In social and behavioral sciences, data are typically not normally distributed, which can invalidate hypothesis testing and lead to unreliable results when being analyzed by methods developed for normal data. The existing methods of generating multivariate non-normal data typically create data according to specific univariate marginal measures…
Descriptors: Social Science Research, Multivariate Analysis, Statistical Distributions, Monte Carlo Methods
Xinran Li; Peng Ding; Donald B. Rubin – Grantee Submission, 2020
With many pretreatment covariates and treatment factors, the classical factorial experiment often fails to balance covariates across multiple factorial effects simultaneously. Therefore, it is intuitive to restrict the randomization of the treatment factors to satisfy certain covariate balance criteria, possibly conforming to the tiers of…
Descriptors: Experiments, Research Design, Randomized Controlled Trials, Sampling
Merkle, Edgar C.; Fitzsimmons, Ellen; Uanhoro, James; Goodrich, Ben – Grantee Submission, 2021
Structural equation models comprise a large class of popular statistical models, including factor analysis models, certain mixed models, and extensions thereof. Model estimation is complicated by the fact that we typically have multiple interdependent response variables and multiple latent variables (which may also be called random effects or…
Descriptors: Bayesian Statistics, Structural Equation Models, Psychometrics, Factor Analysis
Tipton, Elizabeth; Matlen, Bryan J. – Grantee Submission, 2019
Randomized control trials (RCTs) have long been considered the "gold standard" for evaluating the impacts of interventions. However, in most education RCTs, the sample of schools included is recruited based on convenience, potentially compromising a study's ability to generalize to an intended population. An alternative approach is to…
Descriptors: Randomized Controlled Trials, Recruitment, Educational Research, Generalization
Yamashita, Takashi; Smith, Thomas J.; Cummins, Phyllis A. – Grantee Submission, 2020
Background: Several statistical applications including Mplus, STATA, and R are available to conduct analyses such as structural equation modeling and multi-level modeling using large-scale assessment data that employ complex sampling and assessment designs and that provide associated information such as sampling weights, replicate weights, and…
Descriptors: Learning Analytics, Computer Software, Syntax, Adults
Kush, Joseph M.; Konold, Timothy R.; Bradshaw, Catherine P. – Grantee Submission, 2021
Multilevel structural equation (MSEM) models allow researchers to model latent factor structures at multiple levels simultaneously by decomposing within- and between-group variation. Yet the extent to which the sampling ratio (i.e., proportion of cases sampled from each group) influences the results of MSEM models remains unknown. This paper…
Descriptors: Sampling, Structural Equation Models, Factor Structure, Monte Carlo Methods
Qu, Wen; Liu, Haiyan; Zhang, Zhiyong – Grantee Submission, 2020
In social and behavioral sciences, data are typically not normally distributed, which can invalidate hypothesis testing and lead to unreliable results when being analyzed by methods developed for normal data. The existing methods of generating multivariate non-normal data typically create data according to specific univariate marginal measures…
Descriptors: Social Science Research, Statistical Distributions, Multivariate Analysis, Monte Carlo Methods