Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 15 |
Descriptor
Statistical Inference | 17 |
Inferences | 6 |
Monte Carlo Methods | 6 |
Models | 5 |
Computation | 4 |
Error of Measurement | 4 |
Evaluation Methods | 4 |
Item Response Theory | 4 |
Psychometrics | 4 |
Statistical Analysis | 4 |
Bayesian Statistics | 3 |
More ▼ |
Source
Journal of Educational and… | 23 |
Author
Sinharay, Sandip | 2 |
Adrian Quintero | 1 |
Baram, Tallie Z. | 1 |
Bonett, Douglas G. | 1 |
Bradlow, Eric T. | 1 |
Daniel Koretz | 1 |
Davis, Elysia Poggi | 1 |
De Boeck, Paul | 1 |
Draper, David | 1 |
Emmanuel Lesaffre | 1 |
Ferrer, Emilio | 1 |
More ▼ |
Publication Type
Journal Articles | 23 |
Reports - Evaluative | 23 |
Opinion Papers | 1 |
Education Level
Early Childhood Education | 1 |
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Kindergarten | 1 |
Primary Education | 1 |
Audience
Teachers | 1 |
Location
Italy | 1 |
United Kingdom (Scotland) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Early Childhood Longitudinal… | 1 |
Program for International… | 1 |
What Works Clearinghouse Rating
Adrian Quintero; Emmanuel Lesaffre; Geert Verbeke – Journal of Educational and Behavioral Statistics, 2024
Bayesian methods to infer model dimensionality in factor analysis generally assume a lower triangular structure for the factor loadings matrix. Consequently, the ordering of the outcomes influences the results. Therefore, we propose a method to infer model dimensionality without imposing any prior restriction on the loadings matrix. Our approach…
Descriptors: Bayesian Statistics, Factor Analysis, Factor Structure, Sampling
Bonett, Douglas G. – Journal of Educational and Behavioral Statistics, 2022
The limitations of Cohen's ? are reviewed and an alternative G-index is recommended for assessing nominal-scale agreement. Maximum likelihood estimates, standard errors, and confidence intervals for a two-rater G-index are derived for one-group and two-group designs. A new G-index of agreement for multirater designs is proposed. Statistical…
Descriptors: Statistical Inference, Statistical Data, Interrater Reliability, Design
Daniel Koretz – Journal of Educational and Behavioral Statistics, 2024
A critically important balance in educational measurement between practical concerns and matters of technique has atrophied in recent decades, and as a result, some important issues in the field have not been adequately addressed. I start with the work of E. F. Lindquist, who exemplified the balance that is now wanting. Lindquist was arguably the…
Descriptors: Educational Assessment, Evaluation Methods, Achievement Tests, Educational History
Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2022
Takers of educational tests often receive proficiency levels instead of or in addition to scaled scores. For example, proficiency levels are reported for the Advanced Placement (AP®) and U.S. Medical Licensing examinations. Technical difficulties and other unforeseen events occasionally lead to missing item scores and hence to incomplete data on…
Descriptors: Computation, Data Analysis, Educational Testing, Accuracy
Hao, Jiangang; Ho, Tin Kam – Journal of Educational and Behavioral Statistics, 2019
Machine learning is a popular topic in data analysis and modeling. Many different machine learning algorithms have been developed and implemented in a variety of programming languages over the past 20 years. In this article, we first provide an overview of machine learning and clarify its difference from statistical inference. Then, we review…
Descriptors: Artificial Intelligence, Statistical Inference, Data Analysis, Programming Languages
Lai, Mark H. C. – Journal of Educational and Behavioral Statistics, 2019
Previous studies have detailed the consequence of ignoring a level of clustering in multilevel models with straightly hierarchical structures and have proposed methods to adjust for the fixed effect standard errors (SEs). However, in behavioral and social science research, there are usually two or more crossed clustering levels, such as when…
Descriptors: Error of Measurement, Hierarchical Linear Modeling, Least Squares Statistics, Statistical Bias
Vegetabile, Brian G.; Stout-Oswald, Stephanie A.; Davis, Elysia Poggi; Baram, Tallie Z.; Stern, Hal S. – Journal of Educational and Behavioral Statistics, 2019
Predictability of behavior is an important characteristic in many fields including biology, medicine, marketing, and education. When a sequence of actions performed by an individual can be modeled as a stationary time-homogeneous Markov chain the predictability of the individual's behavior can be quantified by the entropy rate of the process. This…
Descriptors: Markov Processes, Prediction, Behavior, Computation
Monroe, Scott – Journal of Educational and Behavioral Statistics, 2019
In item response theory (IRT) modeling, the Fisher information matrix is used for numerous inferential procedures such as estimating parameter standard errors, constructing test statistics, and facilitating test scoring. In principal, these procedures may be carried out using either the expected information or the observed information. However, in…
Descriptors: Item Response Theory, Error of Measurement, Scoring, Inferences
Wainer, Howard – Journal of Educational and Behavioral Statistics, 2016
The usual role of a discussant is to clarify and correct the paper being discussed, but in this case, the author, Howard Wainer, generally agrees with everything David Thissen says in his essay, "Bad Questions: An Essay Involving Item Response Theory." This essay expands on David Thissen's statement that there are typically two principal…
Descriptors: Item Response Theory, Educational Assessment, Sample Size, Statistical Inference
VanHoudnos, Nathan M.; Greenhouse, Joel B. – Journal of Educational and Behavioral Statistics, 2016
When cluster randomized experiments are analyzed as if units were independent, test statistics for treatment effects can be anticonservative. Hedges proposed a correction for such tests by scaling them to control their Type I error rate. This article generalizes the Hedges correction from a posttest-only experimental design to more common designs…
Descriptors: Statistical Analysis, Randomized Controlled Trials, Error of Measurement, Scaling
Gu, Fei; Preacher, Kristopher J.; Ferrer, Emilio – Journal of Educational and Behavioral Statistics, 2014
Mediation is a causal process that evolves over time. Thus, a study of mediation requires data collected throughout the process. However, most applications of mediation analysis use cross-sectional rather than longitudinal data. Another implicit assumption commonly made in longitudinal designs for mediation analysis is that the same mediation…
Descriptors: Statistical Analysis, Models, Research Design, Case Studies
Furno, Marilena – Journal of Educational and Behavioral Statistics, 2011
The article considers a test of specification for quantile regressions. The test relies on the increase of the objective function and the worsening of the fit when unnecessary constraints are imposed. It compares the objective functions of restricted and unrestricted models and, in its different formulations, it verifies (a) forecast ability, (b)…
Descriptors: Goodness of Fit, Statistical Inference, Regression (Statistics), Least Squares Statistics
Hong, Guanglei – Journal of Educational and Behavioral Statistics, 2010
Defining causal effects as comparisons between marginal population means, this article introduces marginal mean weighting through stratification (MMW-S) to adjust for selection bias in multilevel educational data. The article formally shows the inherent connections among the MMW-S method, propensity score stratification, and…
Descriptors: Statistical Analysis, Scores, Statistical Inference, Homogeneous Grouping

Kasim, Rafa M.; Raudenbush, Stephen W. – Journal of Educational and Behavioral Statistics, 1998
Gibbs sampling was applied to obtain Bayes inferences in the case of unbalanced multilevel data when the homogeneity of variance assumption fails and when interest focuses on inferences for some or all of the groups' variances. This approach is compared to a more standard analysis based on restricted maximum-likelihood statistics. (SLD)
Descriptors: Bayesian Statistics, Statistical Inference
Hafdahl, Adam R. – Journal of Educational and Behavioral Statistics, 2007
The originally proposed multivariate meta-analysis approach for correlation matrices--analyze Pearson correlations, with each study's observed correlations replacing their population counterparts in its conditional-covariance matrix--performs poorly. Two refinements are considered: Analyze Fisher Z-transformed correlations, and substitute better…
Descriptors: Monte Carlo Methods, Correlation, Meta Analysis, Matrices
Previous Page | Next Page »
Pages: 1 | 2