Publication Date
In 2025 | 20 |
Since 2024 | 62 |
Since 2021 (last 5 years) | 126 |
Since 2016 (last 10 years) | 272 |
Since 2006 (last 20 years) | 643 |
Descriptor
Robustness (Statistics) | 827 |
Statistical Analysis | 161 |
Foreign Countries | 156 |
Comparative Analysis | 133 |
Regression (Statistics) | 109 |
Models | 107 |
Error of Measurement | 97 |
Correlation | 95 |
Research Methodology | 95 |
Simulation | 93 |
Evaluation Methods | 92 |
More ▼ |
Source
Author
Keselman, H. J. | 13 |
Yuan, Ke-Hai | 10 |
Algina, James | 9 |
Zhang, Zhiyong | 8 |
Bentler, Peter M. | 7 |
Lix, Lisa M. | 7 |
Tipton, Elizabeth | 6 |
Wilcox, Rand R. | 6 |
Gorard, Stephen | 5 |
Blankmeyer, Eric | 4 |
Goldhaber, Dan | 4 |
More ▼ |
Publication Type
Education Level
Location
Australia | 17 |
United Kingdom | 16 |
United Kingdom (England) | 16 |
Germany | 13 |
California | 12 |
United States | 11 |
Canada | 10 |
Netherlands | 10 |
Italy | 9 |
China | 8 |
Florida | 7 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 13 |
Individuals with Disabilities… | 2 |
Race to the Top | 2 |
American Recovery and… | 1 |
Deferred Action for Childhood… | 1 |
Social Security Act | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 2 |
Does not meet standards | 2 |
Little, Todd D.; Bontempo, Daniel; Rioux, Charlie; Tracy, Allison – International Journal of Research & Method in Education, 2022
Multilevel modelling (MLM) is the most frequently used approach for evaluating interventions with clustered data. MLM, however, has some limitations that are associated with numerous obstacles to model estimation and valid inferences. Longitudinal multiple-group (LMG) modelling is a longstanding approach for testing intervention effects using…
Descriptors: Longitudinal Studies, Hierarchical Linear Modeling, Alternative Assessment, Intervention
Manapat, Patrick D.; Edwards, Michael C. – Educational and Psychological Measurement, 2022
When fitting unidimensional item response theory (IRT) models, the population distribution of the latent trait ([theta]) is often assumed to be normally distributed. However, some psychological theories would suggest a nonnormal [theta]. For example, some clinical traits (e.g., alcoholism, depression) are believed to follow a positively skewed…
Descriptors: Robustness (Statistics), Computational Linguistics, Item Response Theory, Psychological Patterns
Ashley L. Watts; Bridget A. Makol; Isabella M. Palumbo; Andres De Los Reyes; Thomas M. Olino; Robert D. Latzman; Colin G. DeYoung; Phillip K. Wood; Kenneth J. Sher – Grantee Submission, 2022
We used multitrait-multimethod (MTMM) modeling to examine general factors of psychopathology in three samples of youth (Ns = 2119, 303, 592) for whom three informants reported on the youth's psychopathology (e.g., child, parent, teacher). Empirical support for the "p"-factor diminished in multi-informant models compared with…
Descriptors: Multitrait Multimethod Techniques, Robustness (Statistics), Psychopathology, Youth
Huang, Qi; Bolt, Daniel M. – Educational and Psychological Measurement, 2023
Previous studies have demonstrated evidence of latent skill continuity even in tests intentionally designed for measurement of binary skills. In addition, the assumption of binary skills when continuity is present has been shown to potentially create a lack of invariance in item and latent ability parameters that may undermine applications. In…
Descriptors: Item Response Theory, Test Items, Skill Development, Robustness (Statistics)
Brent J. Goertzen; Kaley Klaus – Research & Practice in Assessment, 2023
When evaluating student learning, educators often employ scoring rubrics, for which quality can be determined through evaluating validity and reliability. This article discusses the norming process utilized in a graduate organizational leadership program for a capstone scoring rubric. Concepts of validity and reliability are discussed, as is the…
Descriptors: Graduate Students, Graduate Study, Graduate School Faculty, Scoring Rubrics
Xing, Wanli; Du, Dongping; Bakhshi, Ali; Chiu, Kuo-Chun; Du, Hanxiang – IEEE Transactions on Learning Technologies, 2021
Predictive modeling in online education is a popular topic in learning analytics research and practice. This study proposes a novel predictive modeling method to improve model transferability over time within the same course and across different courses. The research gaps addressed are limited evidence showing whether a predictive model built on…
Descriptors: Electronic Learning, Bayesian Statistics, Prediction, Models
Sinharay, Sandip – Grantee Submission, 2021
Drasgow, Levine, and Zickar (1996) suggested a statistic based on the Neyman-Pearson lemma (e.g., Lehmann & Romano, 2005, p. 60) for detecting preknowledge on a known set of items. The statistic is a special case of the optimal appropriateness indices of Levine and Drasgow (1988) and is the most powerful statistic for detecting item…
Descriptors: Robustness (Statistics), Hypothesis Testing, Statistics, Test Items
Youmi Suk – Journal of Educational and Behavioral Statistics, 2024
Machine learning (ML) methods for causal inference have gained popularity due to their flexibility to predict the outcome model and the propensity score. In this article, we provide a within-group approach for ML-based causal inference methods in order to robustly estimate average treatment effects in multilevel studies when there is cluster-level…
Descriptors: Artificial Intelligence, Causal Models, Statistical Inference, Maximum Likelihood Statistics
Qinyun Lin; Amy K. Nuttall; Qian Zhang; Kenneth A. Frank – Grantee Submission, 2023
Empirical studies often demonstrate multiple causal mechanisms potentially involving simultaneous or causally related mediators. However, researchers often use simple mediation models to understand the processes because they do not or cannot measure other theoretically relevant mediators. In such cases, another potentially relevant but unobserved…
Descriptors: Causal Models, Mediation Theory, Error of Measurement, Statistical Inference
Josh Leung-Gagné; Sean F. Reardon – Grantee Submission, 2023
Recent studies have shown that U.S. Census-- and American Community Survey (ACS)--based estimates of income segregation are subject to upward finite sampling bias (Logan et al. 2018; Logan et al. 2020; Reardon et al. 2018). We identify two additional sources of bias that are larger and opposite in sign to finite sampling bias: measurement…
Descriptors: Income, Low Income Groups, Social Bias, Statistical Bias
Wang, Shiyu; Xiao, Houping; Cohen, Allan – Journal of Educational and Behavioral Statistics, 2021
An adaptive weight estimation approach is proposed to provide robust latent ability estimation in computerized adaptive testing (CAT) with response revision. This approach assigns different weights to each distinct response to the same item when response revision is allowed in CAT. Two types of weight estimation procedures, nonfunctional and…
Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Robustness (Statistics)
Magdalena Bennett – Society for Research on Educational Effectiveness, 2021
Introduction: Differences-in-Differences (DD) is a commonly-used approach in policy evaluation for identifying the impact of an intervention or treatment. Under a parallel trend assumption (PTA), we can recover a causal effect by comparing the difference in outcomes between a treatment and a control group, both before and after an intervention was…
Descriptors: Educational Vouchers, Preferences, School Segregation, Program Evaluation
Du, Han; Enders, Craig; Keller, Brian; Bradbury, Thomas N.; Karney, Benjamin R. – Grantee Submission, 2022
Missing data are exceedingly common across a variety of disciplines, such as educational, social, and behavioral science areas. Missing not at random (MNAR) mechanism where missingness is related to unobserved data is widespread in real data and has detrimental consequence. However, the existing MNAR-based methods have potential problems such as…
Descriptors: Bayesian Statistics, Data Analysis, Computer Simulation, Sample Size
Slez, Adam – Sociological Methods & Research, 2019
Young and Holsteen (YH) introduce a number of tools for evaluating model uncertainty. In so doing, they are careful to differentiate their method from existing forms of model averaging. The fundamental difference lies in the way in which the underlying estimates are weighted. Whereas standard approaches to model averaging assign higher weight to…
Descriptors: Research Methodology, Models, Ambiguity (Context), Computation
Deng, Lifang; Yuan, Ke-Hai – Grantee Submission, 2022
Structural equation modeling (SEM) has been deemed as a proper method when variables contain measurement errors. In contrast, path analysis with composite-scores is preferred for prediction and diagnosis of individuals. While path analysis with composite-scores has been criticized for yielding biased parameter estimates, recent literature pointed…
Descriptors: Structural Equation Models, Path Analysis, Weighted Scores, Error of Measurement