Publication Date
In 2025 | 2 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 17 |
Since 2016 (last 10 years) | 34 |
Since 2006 (last 20 years) | 53 |
Descriptor
Statistical Inference | 61 |
Computation | 24 |
Statistical Analysis | 21 |
Bayesian Statistics | 17 |
Error of Measurement | 14 |
Causal Models | 12 |
Monte Carlo Methods | 11 |
Simulation | 11 |
Item Response Theory | 10 |
Foreign Countries | 9 |
Models | 9 |
More ▼ |
Source
Journal of Educational and… | 61 |
Author
Hong, Guanglei | 3 |
Schochet, Peter Z. | 3 |
Bonett, Douglas G. | 2 |
Gelman, Andrew | 2 |
Kim, Jee-Seon | 2 |
Qin, Xu | 2 |
Robitzsch, Alexander | 2 |
Rubin, Donald B. | 2 |
Suk, Youmi | 2 |
Yamaguchi, Kazuhiro | 2 |
Adrian Quintero | 1 |
More ▼ |
Publication Type
Journal Articles | 61 |
Reports - Research | 35 |
Reports - Evaluative | 17 |
Reports - Descriptive | 8 |
Information Analyses | 1 |
Numerical/Quantitative Data | 1 |
Opinion Papers | 1 |
Education Level
Audience
Teachers | 1 |
Location
Italy | 2 |
California | 1 |
California (Riverside) | 1 |
Canada | 1 |
Massachusetts | 1 |
New York | 1 |
North Carolina | 1 |
Pennsylvania | 1 |
Puerto Rico | 1 |
South Korea | 1 |
United Kingdom (Scotland) | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 5 |
Early Childhood Longitudinal… | 4 |
Trends in International… | 4 |
Center for Epidemiologic… | 1 |
National Assessment of… | 1 |
SAT (College Admission Test) | 1 |
What Works Clearinghouse Rating
Adrian Quintero; Emmanuel Lesaffre; Geert Verbeke – Journal of Educational and Behavioral Statistics, 2024
Bayesian methods to infer model dimensionality in factor analysis generally assume a lower triangular structure for the factor loadings matrix. Consequently, the ordering of the outcomes influences the results. Therefore, we propose a method to infer model dimensionality without imposing any prior restriction on the loadings matrix. Our approach…
Descriptors: Bayesian Statistics, Factor Analysis, Factor Structure, Sampling
Bonett, Douglas G. – Journal of Educational and Behavioral Statistics, 2022
The limitations of Cohen's ? are reviewed and an alternative G-index is recommended for assessing nominal-scale agreement. Maximum likelihood estimates, standard errors, and confidence intervals for a two-rater G-index are derived for one-group and two-group designs. A new G-index of agreement for multirater designs is proposed. Statistical…
Descriptors: Statistical Inference, Statistical Data, Interrater Reliability, Design
Daniel Koretz – Journal of Educational and Behavioral Statistics, 2024
A critically important balance in educational measurement between practical concerns and matters of technique has atrophied in recent decades, and as a result, some important issues in the field have not been adequately addressed. I start with the work of E. F. Lindquist, who exemplified the balance that is now wanting. Lindquist was arguably the…
Descriptors: Educational Assessment, Evaluation Methods, Achievement Tests, Educational History
Mingya Huang; David Kaplan – Journal of Educational and Behavioral Statistics, 2025
The issue of model uncertainty has been gaining interest in education and the social sciences community over the years, and the dominant methods for handling model uncertainty are based on Bayesian inference, particularly, Bayesian model averaging. However, Bayesian model averaging assumes that the true data-generating model is within the…
Descriptors: Bayesian Statistics, Hierarchical Linear Modeling, Statistical Inference, Predictor Variables
Wendy Chan; Larry Vernon Hedges – Journal of Educational and Behavioral Statistics, 2022
Multisite field experiments using the (generalized) randomized block design that assign treatments to individuals within sites are common in education and the social sciences. Under this design, there are two possible estimands of interest and they differ based on whether sites or blocks have fixed or random effects. When the average treatment…
Descriptors: Research Design, Educational Research, Statistical Analysis, Statistical Inference
Lang, Joseph B. – Journal of Educational and Behavioral Statistics, 2023
This article is concerned with the statistical detection of copying on multiple-choice exams. As an alternative to existing permutation- and model-based copy-detection approaches, a simple randomization p-value (RP) test is proposed. The RP test, which is based on an intuitive match-score statistic, makes no assumptions about the distribution of…
Descriptors: Identification, Cheating, Multiple Choice Tests, Item Response Theory
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2022
Takers of educational tests often receive proficiency levels instead of or in addition to scaled scores. For example, proficiency levels are reported for the Advanced Placement (AP®) and U.S. Medical Licensing examinations. Technical difficulties and other unforeseen events occasionally lead to missing item scores and hence to incomplete data on…
Descriptors: Computation, Data Analysis, Educational Testing, Accuracy
Yamaguchi, Kazuhiro; Okada, Kensuke – Journal of Educational and Behavioral Statistics, 2020
In this article, we propose a variational Bayes (VB) inference method for the deterministic input noisy AND gate model of cognitive diagnostic assessment. The proposed method, which applies the iterative algorithm for optimization, is derived based on the optimal variational posteriors of the model parameters. The proposed VB inference enables…
Descriptors: Bayesian Statistics, Statistical Inference, Cognitive Measurement, Mathematics
Lee, Daniel Y.; Harring, Jeffrey R. – Journal of Educational and Behavioral Statistics, 2023
A Monte Carlo simulation was performed to compare methods for handling missing data in growth mixture models. The methods considered in the current study were (a) a fully Bayesian approach using a Gibbs sampler, (b) full information maximum likelihood using the expectation-maximization algorithm, (c) multiple imputation, (d) a two-stage multiple…
Descriptors: Monte Carlo Methods, Research Problems, Statistical Inference, Bayesian Statistics
Martinková, Patrícia; Bartoš, František; Brabec, Marek – Journal of Educational and Behavioral Statistics, 2023
Inter-rater reliability (IRR), which is a prerequisite of high-quality ratings and assessments, may be affected by contextual variables, such as the rater's or ratee's gender, major, or experience. Identification of such heterogeneity sources in IRR is important for the implementation of policies with the potential to decrease measurement error…
Descriptors: Interrater Reliability, Bayesian Statistics, Statistical Inference, Hierarchical Linear Modeling
Youmi Suk – Journal of Educational and Behavioral Statistics, 2024
Machine learning (ML) methods for causal inference have gained popularity due to their flexibility to predict the outcome model and the propensity score. In this article, we provide a within-group approach for ML-based causal inference methods in order to robustly estimate average treatment effects in multilevel studies when there is cluster-level…
Descriptors: Artificial Intelligence, Causal Models, Statistical Inference, Maximum Likelihood Statistics
Bartolucci, Francesco; Pennoni, Fulvia; Vittadini, Giorgio – Journal of Educational and Behavioral Statistics, 2023
In order to evaluate the effect of a policy or treatment with pre- and post-treatment outcomes, we propose an approach based on a transition model, which may be applied with multivariate outcomes and accounts for unobserved heterogeneity. This model is based on potential versions of discrete latent variables representing the individual…
Descriptors: Causal Models, Multivariate Analysis, Markov Processes, Human Capital
Köhler, Carmen; Robitzsch, Alexander; Hartig, Johannes – Journal of Educational and Behavioral Statistics, 2020
Testing whether items fit the assumptions of an item response theory model is an important step in evaluating a test. In the literature, numerous item fit statistics exist, many of which show severe limitations. The current study investigates the root mean squared deviation (RMSD) item fit statistic, which is used for evaluating item fit in…
Descriptors: Test Items, Goodness of Fit, Statistics, Bias
Joshua B. Gilbert; Luke W. Miratrix; Mridul Joshi; Benjamin W. Domingue – Journal of Educational and Behavioral Statistics, 2025
Analyzing heterogeneous treatment effects (HTEs) plays a crucial role in understanding the impacts of educational interventions. A standard practice for HTE analysis is to examine interactions between treatment status and preintervention participant characteristics, such as pretest scores, to identify how different groups respond to treatment.…
Descriptors: Causal Models, Item Response Theory, Statistical Inference, Psychometrics
Kang, Hyeon-Ah; Zheng, Yi; Chang, Hua-Hua – Journal of Educational and Behavioral Statistics, 2020
With the widespread use of computers in modern assessment, online calibration has become increasingly popular as a way of replenishing an item pool. The present study discusses online calibration strategies for a joint model of responses and response times. The study proposes likelihood inference methods for item paramter estimation and evaluates…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Response Theory, Reaction Time