Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 15 |
Descriptor
Models | 19 |
Statistical Inference | 19 |
Bayesian Statistics | 8 |
Evaluation Methods | 5 |
Regression (Statistics) | 5 |
Comparative Analysis | 4 |
Computation | 4 |
Hypothesis Testing | 4 |
Statistics | 4 |
Difficulty Level | 3 |
Probability | 3 |
More ▼ |
Source
Author
Publication Type
Reports - Descriptive | 19 |
Journal Articles | 18 |
Opinion Papers | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 4 |
Elementary Education | 2 |
Postsecondary Education | 2 |
Early Childhood Education | 1 |
Elementary Secondary Education | 1 |
Grade 4 | 1 |
Intermediate Grades | 1 |
Audience
Researchers | 1 |
Teachers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
Progress in International… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Yamaguchi, Kazuhiro; Okada, Kensuke – Journal of Educational and Behavioral Statistics, 2020
In this article, we propose a variational Bayes (VB) inference method for the deterministic input noisy AND gate model of cognitive diagnostic assessment. The proposed method, which applies the iterative algorithm for optimization, is derived based on the optimal variational posteriors of the model parameters. The proposed VB inference enables…
Descriptors: Bayesian Statistics, Statistical Inference, Cognitive Measurement, Mathematics
Daniel Kasper; Katrin Schulz-Heidorf; Knut Schwippert – Sociological Methods & Research, 2024
In this article, we extend Liao's test for across-group comparisons of the fixed effects from the generalized linear model to the fixed and random effects of the generalized linear mixed model (GLMM). Using as our basis the Wald statistic, we developed an asymptotic test statistic for across-group comparisons of these effects. The test can be…
Descriptors: Models, Achievement Tests, Foreign Countries, International Assessment
Marmolejo-Ramos, Fernando; Cousineau, Denis – Educational and Psychological Measurement, 2017
The number of articles showing dissatisfaction with the null hypothesis statistical testing (NHST) framework has been progressively increasing over the years. Alternatives to NHST have been proposed and the Bayesian approach seems to have achieved the highest amount of visibility. In this last part of the special issue, a few alternative…
Descriptors: Hypothesis Testing, Bayesian Statistics, Evaluation Methods, Statistical Inference
Finch, Holmes – Practical Assessment, Research & Evaluation, 2022
Researchers in many disciplines work with ranking data. This data type is unique in that it is often deterministic in nature (the ranks of items "k"-1 determine the rank of item "k"), and the difference in a pair of rank scores separated by "k" units is equivalent regardless of the actual values of the two ranks in…
Descriptors: Data Analysis, Statistical Inference, Models, College Faculty
Ames, Allison; Myers, Aaron – Educational Measurement: Issues and Practice, 2019
Drawing valid inferences from modern measurement models is contingent upon a good fit of the data to the model. Violations of model-data fit have numerous consequences, limiting the usefulness and applicability of the model. As Bayesian estimation is becoming more common, understanding the Bayesian approaches for evaluating model-data fit models…
Descriptors: Bayesian Statistics, Psychometrics, Models, Predictive Measurement
France, Stephen L.; Batchelder, William H. – Educational and Psychological Measurement, 2015
Cultural consensus theory (CCT) is a data aggregation technique with many applications in the social and behavioral sciences. We describe the intuition and theory behind a set of CCT models for continuous type data using maximum likelihood inference methodology. We describe how bias parameters can be incorporated into these models. We introduce…
Descriptors: Maximum Likelihood Statistics, Test Items, Difficulty Level, Test Theory
Callister Everson, Kimberlee; Feinauer, Erika; Sudweeks, Richard R. – Harvard Educational Review, 2013
In this article, the authors provide a methodological critique of the current standard of value-added modeling forwarded in educational policy contexts as a means of measuring teacher effectiveness. Conventional value-added estimates of teacher quality are attempts to determine to what degree a teacher would theoretically contribute, on average,…
Descriptors: Teacher Evaluation, Teacher Effectiveness, Evaluation Methods, Accountability
Berenson, Mark L. – Decision Sciences Journal of Innovative Education, 2013
There is consensus in the statistical literature that severe departures from its assumptions invalidate the use of regression modeling for purposes of inference. The assumptions of regression modeling are usually evaluated subjectively through visual, graphic displays in a residual analysis but such an approach, taken alone, may be insufficient…
Descriptors: Spreadsheets, Computer Software, Regression (Statistics), Models
Piantadosi, Steven T.; Tenenbaum, Joshua B.; Goodman, Noah D. – Cognition, 2012
In acquiring number words, children exhibit a qualitative leap in which they transition from understanding a few number words, to possessing a rich system of interrelated numerical concepts. We present a computational framework for understanding this inductive leap as the consequence of statistical inference over a sufficiently powerful…
Descriptors: Statistical Inference, Number Concepts, Models, Computation
Coffman, Donna L. – Structural Equation Modeling: A Multidisciplinary Journal, 2011
Mediation is usually assessed by a regression-based or structural equation modeling (SEM) approach that we refer to as the classical approach. This approach relies on the assumption that there are no confounders that influence both the mediator, "M", and the outcome, "Y". This assumption holds if individuals are randomly…
Descriptors: Structural Equation Models, Simulation, Regression (Statistics), Probability
Frederickx, Sofie; Tuerlinckx, Francis; De Boeck, Paul; Magis, David – Journal of Educational Measurement, 2010
In this paper we present a new methodology for detecting differential item functioning (DIF). We introduce a DIF model, called the random item mixture (RIM), that is based on a Rasch model with random item difficulties (besides the common random person abilities). In addition, a mixture model is assumed for the item difficulties such that the…
Descriptors: Test Bias, Models, Test Items, Difficulty Level
Kemp, Charles; Tenenbaum, Joshua B. – Psychological Review, 2009
Everyday inductive inferences are often guided by rich background knowledge. Formal models of induction should aim to incorporate this knowledge and should explain how different kinds of knowledge lead to the distinctive patterns of reasoning found in different inductive contexts. This article presents a Bayesian framework that attempts to meet…
Descriptors: Logical Thinking, Inferences, Statistical Inference, Models
LeMire, Steven D. – Journal of Statistics Education, 2010
This paper proposes an argument framework for the teaching of null hypothesis statistical testing and its application in support of research. Elements of the Toulmin (1958) model of argument are used to illustrate the use of p values and Type I and Type II error rates in support of claims about statistical parameters and subject matter research…
Descriptors: Hypothesis Testing, Relationship, Statistical Significance, Models
Brownstein, Naomi; Pensky, Marianna – Journal of Statistics Education, 2008
The objective of the present paper is to provide a simple approach to statistical inference using the method of transformations of variables. We demonstrate performance of this powerful tool on examples of constructions of various estimation procedures, hypothesis testing, Bayes analysis and statistical inference for the stress-strength systems.…
Descriptors: Transformations (Mathematics), Computation, Hypothesis Testing, Models
Iverson, Geoffrey J.; Wagenmakers, Eric-Jan; Lee, Michael D. – Psychological Methods, 2010
The purpose of the recently proposed "p[subscript rep]" statistic is to estimate the probability of concurrence, that is, the probability that a replicate experiment yields an effect of the same sign (Killeen, 2005a). The influential journal "Psychological Science" endorses "p[subscript rep]" and recommends its use…
Descriptors: Effect Size, Evaluation Methods, Probability, Experiments
Previous Page | Next Page ยป
Pages: 1 | 2