Publication Date
In 2025 | 13 |
Since 2024 | 65 |
Since 2021 (last 5 years) | 209 |
Since 2016 (last 10 years) | 487 |
Since 2006 (last 20 years) | 1041 |
Descriptor
Source
Author
Kromrey, Jeffrey D. | 21 |
Fan, Xitao | 18 |
Barcikowski, Robert S. | 16 |
DeSarbo, Wayne S. | 14 |
Donoghue, John R. | 12 |
Ferron, John M. | 12 |
Finch, W. Holmes | 12 |
Zhang, Zhiyong | 11 |
Cohen, Allan S. | 10 |
Finch, Holmes | 10 |
Kim, Seock-Ho | 10 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 49 |
Practitioners | 22 |
Teachers | 19 |
Students | 4 |
Administrators | 2 |
Location
Germany | 10 |
Australia | 7 |
United Kingdom | 7 |
Canada | 6 |
Netherlands | 6 |
United States | 6 |
Belgium | 5 |
California | 5 |
Hong Kong | 5 |
South Korea | 5 |
Spain | 5 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 4 |
Pell Grant Program | 2 |
Aid to Families with… | 1 |
American Recovery and… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
Does not meet standards | 1 |
Finch, W. Holmes; Shim, Sungok Serena – Educational and Psychological Measurement, 2018
Collection and analysis of longitudinal data is an important tool in understanding growth and development over time in a whole range of human endeavors. Ideally, researchers working in the longitudinal framework are able to collect data at more than two points in time, as this will provide them with the potential for a deeper understanding of the…
Descriptors: Comparative Analysis, Computation, Time, Change
Porter, Kristin E. – Journal of Research on Educational Effectiveness, 2018
Researchers are often interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time, or across multiple treatment groups. The resulting multiplicity of statistical hypothesis tests can lead to spurious findings of effects. Multiple testing procedures (MTPs) are statistical…
Descriptors: Statistical Analysis, Program Effectiveness, Intervention, Hypothesis Testing
Zheng, Xiaying; Yang, Ji Seung – AERA Online Paper Repository, 2018
Measuring change in an educational or psychological construct over time is often achieved by repeatedly administering the same items to the same examinees over time. When the response data are categorical, item response theory (IRT) model can be used as the measurement model of a second-order latent growth model (referred to as LGM-IRT) to measure…
Descriptors: Statistical Analysis, Item Response Theory, Computation, Longitudinal Studies
Wang, Cheng; Butts, Carter T.; Hipp, John; Lakon, Cynthia M. – Sociological Methods & Research, 2022
The recent popularity of models that capture the dynamic coevolution of both network structure and behavior has driven the need for summary indices to assess the adequacy of these models to reproduce dynamic properties of scientific or practical importance. Whereas there are several existing indices for assessing the ability of the model to…
Descriptors: Models, Goodness of Fit, Comparative Analysis, Computer Software
Hsiao, Yu-Yu; Kwok, Oi-Man; Lai, Mark H. C. – Educational and Psychological Measurement, 2018
Path models with observed composites based on multiple items (e.g., mean or sum score of the items) are commonly used to test interaction effects. Under this practice, researchers generally assume that the observed composites are measured without errors. In this study, we reviewed and evaluated two alternative methods within the structural…
Descriptors: Error of Measurement, Testing, Scores, Models
Lockwood, J. R.; Castellano, Katherine E.; Shear, Benjamin R. – Journal of Educational and Behavioral Statistics, 2018
This article proposes a flexible extension of the Fay--Herriot model for making inferences from coarsened, group-level achievement data, for example, school-level data consisting of numbers of students falling into various ordinal performance categories. The model builds on the heteroskedastic ordered probit (HETOP) framework advocated by Reardon,…
Descriptors: Bayesian Statistics, Mathematical Models, Statistical Inference, Computation
Muñoz, J. F.; Álvarez-Verdejo, E.; García-Fernández, R. M. – Sociological Methods & Research, 2018
Many poverty measures are estimated by using sample data collected from social surveys. Two examples are the poverty gap and the poverty severity indices. A novel method for the estimation of these poverty indicators is described. Social surveys usually contain different variables, some of which can be used to improve the estimation of poverty…
Descriptors: Poverty, Simulation, Income, Socioeconomic Status
Man, Kaiwen; Harring, Jeffery R.; Ouyang, Yunbo; Thomas, Sarah L. – International Journal of Testing, 2018
Many important high-stakes decisions--college admission, academic performance evaluation, and even job promotion--depend on accurate and reliable scores from valid large-scale assessments. However, examinees sometimes cheat by copying answers from other test-takers or practicing with test items ahead of time, which can undermine the effectiveness…
Descriptors: Reaction Time, High Stakes Tests, Test Wiseness, Cheating
Mark L. Davison; David J. Weiss; Ozge Ersan; Joseph N. DeWeese; Gina Biancarosa; Patrick C. Kennedy – Grantee Submission, 2021
MOCCA is an online assessment of inferential reading comprehension for students in 3rd through 6th grades. It can be used to identify good readers and, for struggling readers, identify those who overly rely on either a Paraphrasing process or an Elaborating process when their comprehension is incorrect. Here a propensity to over-rely on…
Descriptors: Reading Tests, Computer Assisted Testing, Reading Comprehension, Elementary School Students
Luo, Yong; Dimitrov, Dimiter M. – Educational and Psychological Measurement, 2019
Plausible values can be used to either estimate population-level statistics or compute point estimates of latent variables. While it is well known that five plausible values are usually sufficient for accurate estimation of population-level statistics in large-scale surveys, the minimum number of plausible values needed to obtain accurate latent…
Descriptors: Item Response Theory, Monte Carlo Methods, Markov Processes, Outcome Measures
López-López, José Antonio; Van den Noortgate, Wim; Tanner-Smith, Emily E.; Wilson, Sandra Jo; Lipsey, Mark W. – Research Synthesis Methods, 2017
Dependent effect sizes are ubiquitous in meta-analysis. Using Monte Carlo simulation, we compared the performance of 2 methods for meta-regression with dependent effect sizes--robust variance estimation (RVE) and 3-level modeling--with the standard meta-analytic method for independent effect sizes. We further compared bias-reduced linearization…
Descriptors: Effect Size, Regression (Statistics), Meta Analysis, Comparative Analysis
Bazaldua, Diego A. Luna; Lee, Young-Sun; Keller, Bryan; Fellers, Lauren – Asia Pacific Education Review, 2017
The performance of various classical test theory (CTT) item discrimination estimators has been compared in the literature using both empirical and simulated data, resulting in mixed results regarding the preference of some discrimination estimators over others. This study analyzes the performance of various item discrimination estimators in CTT:…
Descriptors: Test Items, Monte Carlo Methods, Item Response Theory, Correlation
Bai, Haiyan; Sivo, Stephen A.; Pan, Wei; Fan, Xitao – International Journal of Research & Method in Education, 2016
Among the commonly used resampling methods of dealing with small-sample problems, the bootstrap enjoys the widest applications because it often outperforms its counterparts. However, the bootstrap still has limitations when its operations are contemplated. Therefore, the purpose of this study is to examine an alternative, new resampling method…
Descriptors: Sampling, Structural Equation Models, Statistical Inference, Comparative Analysis
Finch, W. Holmes – Applied Measurement in Education, 2016
Differential item functioning (DIF) assessment is a crucial component in test construction, serving as the primary way in which instrument developers ensure that measures perform in the same way for multiple groups within the population. When such is not the case, scores may not accurately reflect the trait of interest for all individuals in the…
Descriptors: Test Bias, Monte Carlo Methods, Comparative Analysis, Population Groups
Huang, Francis L. – Journal of Experimental Education, 2018
Studies analyzing clustered data sets using both multilevel models (MLMs) and ordinary least squares (OLS) regression have generally concluded that resulting point estimates, but not the standard errors, are comparable with each other. However, the accuracy of the estimates of OLS models is important to consider, as several alternative techniques…
Descriptors: Hierarchical Linear Modeling, Least Squares Statistics, Regression (Statistics), Comparative Analysis