Showing all 10 results
Peer reviewed
Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
Peer reviewed
Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023
This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…
Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores
Tong, Xin; Zhang, Zhiyong – Grantee Submission, 2020
Despite broad applications of growth curve models, few studies have dealt with a practical issue -- nonnormality of data. Previous studies have used Student's "t" distributions to remedy the nonnormal problems. In this study, robust distributional growth curve models are proposed from a semiparametric Bayesian perspective, in which…
Descriptors: Robustness (Statistics), Bayesian Statistics, Models, Error of Measurement
Peer reviewed
Tong, Xin; Zhang, Zhiyong – Multivariate Behavioral Research, 2012
Growth curve models with different types of distributions of random effects and of intraindividual measurement errors for robust analysis are compared. After demonstrating the influence of distribution specification on parameter estimation, 3 methods for diagnosing the distributions for both random effects and intraindividual measurement errors…
Descriptors: Models, Robustness (Statistics), Statistical Analysis, Error of Measurement
Peer reviewed
Foster, E. Michael – Developmental Psychology, 2010
The relationship between complexity and usefulness can be captured by a U-shaped curve. This comment explores that relationship. Complexity may be useful for one of the main aims of developmental psychology (causal inference) but not for another (description of developmental phenomena). Currently, developmentalists conduct complex analyses that…
Descriptors: Inferences, Developmental Psychology, Models, Methods
Peer reviewed
Rhemtulla, Mijke; Brosseau-Liard, Patricia E.; Savalei, Victoria – Psychological Methods, 2012
A simulation study compared the performance of robust normal theory maximum likelihood (ML) and robust categorical least squares (cat-LS) methodology for estimating confirmatory factor analysis models with ordinal variables. Data were generated from 2 models with 2-7 categories, 4 sample sizes, 2 latent distributions, and 5 patterns of category…
Descriptors: Factor Analysis, Computation, Simulation, Sample Size
Harris, Douglas N. – Phi Delta Kappan, 2010
Current value-added models for teacher accountability are better than models based only on student achievement, but they have their weaknesses. They are subject to systematic and random error, as are all measures, and there are concerns about the tests used for the measurements. However, value-added models are better than the alternatives at the…
Descriptors: School Effectiveness, Error of Measurement, Achievement Gains, Academic Achievement
Murphy, Richard; Weinhardt, Felix – Centre for Economic Performance, 2013
We find an individual's rank within their reference group has effects on later objective outcomes. To evaluate the impact of local rank, we use a large administrative dataset tracking over two million students in England from primary through to secondary school. Academic rank within primary school has sizable, robust and significant effects on…
Descriptors: Foreign Countries, Class Rank, Progress Monitoring, Effect Size
Peer reviewed
Gorard, Stephen – British Educational Research Journal, 2010
This paper considers the model of school effectiveness (SE) currently dominant in research, policy and practice in England (although the concerns it raises are international). It shows, principally through consideration of initial and propagated error, that SE results cannot be relied upon. By considering the residual difference between the…
Descriptors: School Effectiveness, Foreign Countries, Scores, Educational Policy
Beasley, T. Mark – 1994
In educational research, nonessential factors are commonly ignored and when accounted for, they are often treated statistically as fixed effects. Yet many researchers in these situations generalize their findings beyond the specific levels selected; however, the analyses may require treating the factor as a random effect. Such inappropriate…
Descriptors: Analysis of Variance, Behavioral Science Research, Educational Research, Equations (Mathematics)