ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	11

Descriptor

Error of Measurement	14
Hypothesis Testing	14
Simulation	14
Statistical Analysis	7
Evaluation Methods	6
Test Items	5
Comparative Analysis	4
Effect Size	3
Sample Size	3
Statistical Inference	3
Accuracy	2
Adaptive Testing	2
Computer Assisted Testing	2
Data Interpretation	2
Error Correction	2
Item Analysis	2
Item Bias	2
Monte Carlo Methods	2
Research Design	2
Sampling	2
Social Science Research	2
Animal Behavior	1
Animals	1
Bayesian Statistics	1
Change	1
More ▼

Source

Journal of Educational and…	2
Applied Psychological…	1
Bioscene: Journal of College…	1
Educational and Psychological…	1
Journal of Educational…	1
National Center for Education…	1
ProQuest LLC	1
Psicologica: International…	1
Psychological Methods	1
Research Synthesis Methods	1
Society for Research on…	1
Structural Equation Modeling:…	1
More ▼

Publication Type

Journal Articles	10
Reports - Research	10
Reports - Evaluative	2
Speeches/Meeting Papers	2
Dissertations/Theses -…	1
Guides - Non-Classroom	1

Education Level

Audience

Researchers

Location

Pennsylvania

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Minimal-Effect Testing, Equivalence Testing, and the Conventional Null Hypothesis Testing for the Analysis of Bi-Factor Models

Peer reviewed

Direct link

Shunji Wang; Katerina M. Marcoulides; Jiashan Tang; Ke-Hai Yuan – Structural Equation Modeling: A Multidisciplinary Journal, 2024

A necessary step in applying bi-factor models is to evaluate the need for domain factors with a general factor in place. The conventional null hypothesis testing (NHT) was commonly used for such a purpose. However, the conventional NHT meets challenges when the domain loadings are weak or the sample size is insufficient. This article proposes…

Descriptors: Hypothesis Testing, Error of Measurement, Comparative Analysis, Monte Carlo Methods

Robustness of Adaptive Measurement of Change to Item Parameter Estimation Error

Peer reviewed

Direct link

Cooperman, Allison W.; Weiss, David J.; Wang, Chun – Educational and Psychological Measurement, 2022

Adaptive measurement of change (AMC) is a psychometric method for measuring intra-individual change on one or more latent traits across testing occasions. Three hypothesis tests--a Z test, likelihood ratio test, and score ratio index--have demonstrated desirable statistical properties in this context, including low false positive rates and high…

Descriptors: Error of Measurement, Psychometrics, Hypothesis Testing, Simulation

Cluster Wild Bootstrapping to Handle Dependent Effect Sizes in Meta-Analysis with a Small Number of Studies

Peer reviewed

Direct link

Joshi, Megha; Pustejovsky, James E.; Beretvas, S. Natasha – Research Synthesis Methods, 2022

The most common and well-known meta-regression models work under the assumption that there is only one effect size estimate per study and that the estimates are independent. However, meta-analytic reviews of social science research often include multiple effect size estimates per primary study, leading to dependence in the estimates. Some…

Descriptors: Meta Analysis, Regression (Statistics), Models, Effect Size

Statistical Power When Adjusting for Multiple Hypothesis Tests: Methodology Expansions and Software Tools

Peer reviewed

Direct link

Kristin Porter; Luke Miratrix; Kristen Hunter – Society for Research on Educational Effectiveness, 2021

Background: Researchers are often interested in testing the effectiveness of an intervention on multiple outcomes, for multiple subgroups, at multiple points in time, or across multiple treatment groups. The resulting multiplicity of statistical hypothesis tests can lead to spurious findings of effects. Multiple testing procedures (MTPs)…

Descriptors: Statistical Analysis, Hypothesis Testing, Computer Software, Randomized Controlled Trials

The BASIE (BAyeSian Interpretation of Estimates) Framework for Interpreting Findings from Impact Evaluations: A Practical Guide for Education Researchers. Toolkit. NCEE 2022-005

Peer reviewed
PDF on ERIC

Download full text

Deke, John; Finucane, Mariel; Thal, Daniel – National Center for Education Evaluation and Regional Assistance, 2022

BASIE is a framework for interpreting impact estimates from evaluations. It is an alternative to null hypothesis significance testing. This guide walks researchers through the key steps of applying BASIE, including selecting prior evidence, reporting impact estimates, interpreting impact estimates, and conducting sensitivity analyses. The guide…

Descriptors: Bayesian Statistics, Educational Research, Data Interpretation, Hypothesis Testing

Methods to Estimate the Variance of Some Indices of the Signal Detection Theory: A Simulation Study

Peer reviewed
PDF on ERIC

Download full text

Suero, Manuel; Privado, Jesús; Botella, Juan – Psicologica: International Journal of Methodology and Experimental Psychology, 2017

A simulation study is presented to evaluate and compare three methods to estimate the variance of the estimates of the parameters d and "C" of the signal detection theory (SDT). Several methods have been proposed to calculate the variance of their estimators, "d'" and "c." Those methods have been mostly assessed by…

Descriptors: Evaluation Methods, Theories, Simulation, Statistical Analysis

Exploring Experimental Design: An Excel-Based Simulation Using Steller Sea Lion Behavior

Peer reviewed
PDF on ERIC

Download full text

Ryan, Wendy L.; St. Iago-McRae, Ezry – Bioscene: Journal of College Biology Teaching, 2016

Experimentation is the foundation of science and an important process for students to understand and experience. However, it can be difficult to teach some aspects of experimentation within the time and resource constraints of an academic semester. Interactive models can be a useful tool in bridging this gap. This freely accessible simulation…

Descriptors: Research Design, Simulation, Animals, Animal Behavior

Monitoring Items in Real Time to Enhance CAT Security

Peer reviewed

Direct link

Zhang, Jinming; Li, Jie – Journal of Educational Measurement, 2016

An IRT-based sequential procedure is developed to monitor items for enhancing test security. The procedure uses a series of statistical hypothesis tests to examine whether the statistical characteristics of each item under inspection have changed significantly during CAT administration. This procedure is compared with a previously developed…

Descriptors: Computer Assisted Testing, Test Items, Difficulty Level, Item Response Theory

Robust Means Modeling: An Alternative for Hypothesis Testing of Independent Means under Variance Heterogeneity and Nonnormality

Peer reviewed

Direct link

Fan, Weihua; Hancock, Gregory R. – Journal of Educational and Behavioral Statistics, 2012

This study proposes robust means modeling (RMM) approaches for hypothesis testing of mean differences for between-subjects designs in order to control the biasing effects of nonnormality and variance inequality. Drawing from structural equation modeling (SEM), the RMM approaches make no assumption of variance homogeneity and employ robust…

Descriptors: Robustness (Statistics), Hypothesis Testing, Monte Carlo Methods, Simulation

Appropriate Statistical Analysis for Two Independent Groups of Likert-Type Data

Direct link

Warachan, Boonyasit – ProQuest LLC, 2011

The objective of this research was to determine the robustness and statistical power of three different methods for testing the hypothesis that ordinal samples of five and seven Likert categories come from equal populations. The three methods are the two sample t-test with equal variances, the Mann-Whitney test, and the Kolmogorov-Smirnov test. In…

Descriptors: Statistical Analysis, Likert Scales, Hypothesis Testing, Data

Evaluating the Magnitude of Differential Item Functioning in Polytomous Items.

Peer reviewed

Zwick, Rebecca; Thayer, Dorothy T. – Journal of Educational and Behavioral Statistics, 1996

Two possible standard error formulas for the polytomous differential item functioning index proposed by N. J. Dorans and A. P. Schmitt (1991) were derived. These standard errors, and associated hypothesis-testing procedures, were evaluated through simulated data. The standard error that performed better is based on N. Mantel's (1963)…

Descriptors: Error of Measurement, Evaluation Methods, Hypothesis Testing, Item Bias

Detecting Answer Copying Using the Kappa Statistic

Peer reviewed

Direct link

Sotaridona, Leonardo S.; van der Linden, Wim J.; Meijer, Rob R. – Applied Psychological Measurement, 2006

A statistical test for detecting answer copying on multiple-choice tests based on Cohen's kappa is proposed. The test is free of any assumptions on the response processes of the examinees suspected of copying and having served as the source, except for the usual assumption that these processes are probabilistic. Because the asymptotic null and…

Descriptors: Cheating, Test Items, Simulation, Statistical Analysis

Evaluation of the Magnitude of Differential Item Functioning in Polytomous Items. Program Statistics Research Technical Report No. 94-2.

Download full text

Zwick, Rebecca; Thayer, Dorothy T. – 1994

Several recent studies have investigated the application of statistical inference procedures to the analysis of differential item functioning (DIF) in test items that are scored on an ordinal scale. Mantel's extension of the Mantel-Haenszel test is a possible hypothesis-testing method for this purpose. The development of descriptive statistics for…

Descriptors: Error of Measurement, Evaluation Methods, Hypothesis Testing, Item Bias

Omnibus Hypothesis Testing in Dominance-Based Ordinal Multiple Regression

Peer reviewed

Direct link

Long, Jeffrey D. – Psychological Methods, 2005

Often quantitative data in the social sciences have only ordinal justification. Problems of interpretation can arise when least squares multiple regression (LSMR) is used with ordinal data. Two ordinal alternatives are discussed, dominance-based ordinal multiple regression (DOMR) and proportional odds multiple regression. The Q[superscript 2]…

Descriptors: Simulation, Social Science Research, Error of Measurement, Least Squares Statistics

Thayer, Dorothy T.	2
Zwick, Rebecca	2
Beretvas, S. Natasha	1
Botella, Juan	1
Cooperman, Allison W.	1
Deke, John	1
Fan, Weihua	1
Finucane, Mariel	1
Hancock, Gregory R.	1
Jiashan Tang	1
Joshi, Megha	1
Katerina M. Marcoulides	1
Ke-Hai Yuan	1
Kristen Hunter	1
Kristin Porter	1
Li, Jie	1
Long, Jeffrey D.	1
Luke Miratrix	1
Meijer, Rob R.	1
Privado, Jesús	1
Pustejovsky, James E.	1
Ryan, Wendy L.	1
Shunji Wang	1
Sotaridona, Leonardo S.	1
St. Iago-McRae, Ezry	1
More ▼