ERIC - Search Results

Showing all 10 results

A Note on Statistical Hypothesis Testing: Probabilifying "Modus Tollens" Invalidates Its Force? Not True!

Peer reviewed

Widaman, Keith F. – Educational and Psychological Measurement, 2023

The import or force of the result of a statistical test has long been portrayed as consistent with deductive reasoning. The simplest form of deductive argument has a first premise with conditional form, such as p → q, which means that "if p is true, then q must be true." Given the first premise, one can either affirm or deny…

Descriptors: Hypothesis Testing, Statistical Analysis, Logical Thinking, Probability

Three Insights from a Bayesian Interpretation of the One-Sided "P" Value

Peer reviewed

Marsman, Maarten; Wagenmakers, Eric-Jan – Educational and Psychological Measurement, 2017

P values have been critiqued on several grounds but remain entrenched as the dominant inferential method in the empirical sciences. In this article, we elaborate on the fact that in many statistical models, the one-sided "P" value has a direct Bayesian interpretation as the approximate posterior mass for values lower than zero. The…

Descriptors: Bayesian Statistics, Statistical Inference, Probability, Statistical Analysis

Using the Coefficient of Confidence to Make the Philosophical Switch from a Posteriori to a Priori Inferential Statistics

Peer reviewed

Trafimow, David – Educational and Psychological Measurement, 2017

There has been much controversy over the null hypothesis significance testing procedure, with much of the criticism centered on the problem of inverse inference. Specifically, p gives the probability of the finding (or one more extreme) given the null hypothesis, whereas the null hypothesis significance testing procedure involves drawing a…

Descriptors: Statistical Inference, Hypothesis Testing, Probability, Intervals

Testing Mediation in Structural Equation Modeling: The Effectiveness of the Test of Joint Significance

Peer reviewed

Leth-Steensen, Craig; Gallitto, Elena – Educational and Psychological Measurement, 2016

A large number of approaches have been proposed for estimating and testing the significance of indirect effects in mediation models. In this study, four sets of Monte Carlo simulations involving full latent variable structural equation models were run in order to contrast the effectiveness of the currently popular bias-corrected bootstrapping…

Descriptors: Mediation Theory, Structural Equation Models, Monte Carlo Methods, Simulation

Taking the Missing Propensity into Account When Estimating Competence Scores: Evaluation of Item Response Theory Models for Nonignorable Omissions

Peer reviewed

Köhler, Carmen; Pohl, Steffi; Carstensen, Claus H. – Educational and Psychological Measurement, 2015

When competence tests are administered, subjects frequently omit items. These missing responses pose a threat to correctly estimating the proficiency level. Newer model-based approaches aim to take nonignorable missing data processes into account by incorporating a latent missing propensity into the measurement model. Two assumptions are typically…

Descriptors: Competence, Tests, Evaluation Methods, Adults

Confidence Intervals for Squared Semipartial Correlation Coefficients: The Effect of Nonnormality

Peer reviewed

Algina, James; Keselman, H. J.; Penfield, Randall D. – Educational and Psychological Measurement, 2010

The increase in the squared multiple correlation coefficient (ΔR²) associated with a variable in a regression equation is a commonly used measure of importance in regression analysis. Algina, Keselman, and Penfield found that intervals based on asymptotic principles were typically very inaccurate, even though the sample size…

Descriptors: Computation, Statistical Analysis, Correlation, Statistical Inference

Confidence Intervals Make a Difference: Effects of Showing Confidence Intervals on Inferential Reasoning

Peer reviewed

Hoekstra, Rink; Johnson, Addie; Kiers, Henk A. L. – Educational and Psychological Measurement, 2012

The use of confidence intervals (CIs) as an addition or as an alternative to null hypothesis significance testing (NHST) has been promoted as a means to make researchers more aware of the uncertainty that is inherent in statistical inference. Little is known, however, about whether presenting results via CIs affects how readers judge the…

Descriptors: Computation, Statistical Analysis, Hypothesis Testing, Statistical Significance

Formulating the Rasch Differential Item Functioning Model under the Marginal Maximum Likelihood Estimation Context and Its Comparison with Mantel-Haenszel Procedure in Short Test and Small Sample Conditions

Peer reviewed

Paek, Insu; Wilson, Mark – Educational and Psychological Measurement, 2011

This study elaborates the Rasch differential item functioning (DIF) model formulation under the marginal maximum likelihood estimation context. Also, the Rasch DIF model performance was examined and compared with the Mantel-Haenszel (MH) procedure in small sample and short test length conditions through simulations. The theoretically known…

Descriptors: Test Bias, Test Length, Statistical Inference, Geometric Concepts

Assessing Goodness of Fit in Item Response Theory with Nonparametric Models: A Comparison of Posterior Probabilities and Kernel-Smoothing Approaches

Peer reviewed

Sueiro, Manuel J.; Abad, Francisco J. – Educational and Psychological Measurement, 2011

The distance between nonparametric and parametric item characteristic curves has been proposed as an index of goodness of fit in item response theory in the form of a root integrated squared error index. This article proposes to use the posterior distribution of the latent trait as the nonparametric model and compares the performance of an index…

Descriptors: Goodness of Fit, Item Response Theory, Nonparametric Statistics, Probability

Modeling Nonignorable Missing Data in Speeded Tests

Peer reviewed

Glas, Cees A. W.; Pimentel, Jonald L. – Educational and Psychological Measurement, 2008

In tests with time limits, items at the end are often not reached. Usually, the pattern of missing responses depends on the ability level of the respondents; therefore, missing data are not ignorable in statistical inference. This study models data using a combination of two item response theory (IRT) models: one for the observed response data and…

Descriptors: Intelligence Tests, Statistical Inference, Item Response Theory, Modeling (Psychology)
