Showing 1 to 15 of 19 results
Peer reviewed
Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025
Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable gives rise to the incidental parameter problem, since the number of parameters grows as data from new persons are included. Therefore, IRT models require a specific estimation method…
Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics
Peer reviewed
Daoxuan Fu; Chunying Qin; Zhaosheng Luo; Yujun Li; Xiaofeng Yu; Ziyu Ye – Journal of Educational and Behavioral Statistics, 2025
One of the central components of cognitive diagnostic assessment is the Q-matrix, which is an essential loading indicator matrix and is typically constructed by subject matter experts. Nonetheless, the construction of the Q-matrix remains a largely subjective process and can lead to misspecification. Many researchers have recognized the…
Descriptors: Q Methodology, Matrices, Diagnostic Tests, Cognitive Measurement
Peer reviewed
Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025
The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…
Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies
Peer reviewed
Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022
To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…
Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences
Peer reviewed
Chan, Wendy – Journal of Educational and Behavioral Statistics, 2018
Policymakers have grown increasingly interested in how experimental results may generalize to a larger population. However, recently developed propensity score-based methods are limited by small sample sizes, where the experimental study is generalized to a population that is at least 20 times larger. This is particularly problematic for methods…
Descriptors: Computation, Generalization, Probability, Sample Size
Peer reviewed
Chen, Ping – Journal of Educational and Behavioral Statistics, 2017
Calibration of new items online has been an important topic in item replenishment for multidimensional computerized adaptive testing (MCAT). Several online calibration methods have been proposed for MCAT, such as multidimensional "one expectation-maximization (EM) cycle" (M-OEM) and multidimensional "multiple EM cycles"…
Descriptors: Test Items, Item Response Theory, Test Construction, Adaptive Testing
Peer reviewed
Tipton, Elizabeth; Pustejovsky, James E. – Journal of Educational and Behavioral Statistics, 2015
Meta-analyses often include studies that report multiple effect sizes based on a common pool of subjects or that report effect sizes from several samples that were treated with very similar research protocols. The inclusion of such studies introduces dependence among the effect size estimates. When the number of studies is large, robust variance…
Descriptors: Meta Analysis, Effect Size, Computation, Robustness (Statistics)
Peer reviewed
Bonett, Douglas G. – Journal of Educational and Behavioral Statistics, 2015
Paired-samples designs are used frequently in educational and behavioral research. In applications where the response variable is quantitative, researchers are encouraged to supplement the results of a paired-samples t-test with a confidence interval (CI) for a mean difference or a standardized mean difference. Six CIs for standardized mean…
Descriptors: Educational Research, Sample Size, Statistical Analysis, Effect Size
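As a rough illustration of the quantity being interval-estimated, the sketch below computes a paired-samples standardized mean difference with a generic normal-approximation confidence interval. This is a textbook-style sketch, not one of the six CIs the article compares; the function name and the approximate standard-error formula are assumptions for illustration.

```python
import math
from statistics import mean, stdev

def paired_smd_ci(x, y, z=1.96):
    """Standardized mean difference for paired data (difference-score
    standardizer) with a normal-approximation confidence interval."""
    d = [a - b for a, b in zip(x, y)]          # difference scores
    n = len(d)
    g = mean(d) / stdev(d)                     # standardized mean difference
    # Approximate standard error for a one-sample standardized mean
    se = math.sqrt(1 / n + g ** 2 / (2 * (n - 1)))
    return g, (g - z * se, g + z * se)

pre = [5, 6, 7, 8]
post = [4, 5, 5, 7]
g, (lo_ci, hi_ci) = paired_smd_ci(pre, post)
print(round(g, 2))  # 2.5
```

With large effects and tiny samples like this, normal-approximation intervals are known to be inaccurate, which is part of what motivates comparing alternative CI constructions.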
Peer reviewed
Safarkhani, Maryam; Moerbeek, Mirjam – Journal of Educational and Behavioral Statistics, 2013
In a randomized controlled trial, a decision needs to be made about the total number of subjects required for adequate statistical power. One way to increase the power of a trial is to include a predictive covariate in the model. In this article, the effects of various covariate adjustment strategies on increasing the power are studied for discrete-time…
Descriptors: Statistical Analysis, Scientific Methodology, Research Design, Sample Size
Peer reviewed
Fan, Weihua; Hancock, Gregory R. – Journal of Educational and Behavioral Statistics, 2012
This study proposes robust means modeling (RMM) approaches for hypothesis testing of mean differences for between-subjects designs in order to control the biasing effects of nonnormality and variance inequality. Drawing from structural equation modeling (SEM), the RMM approaches make no assumption of variance homogeneity and employ robust…
Descriptors: Robustness (Statistics), Hypothesis Testing, Monte Carlo Methods, Simulation
Peer reviewed
Rotondi, Michael A.; Donner, Allan – Journal of Educational and Behavioral Statistics, 2009
The educational field has now accumulated an extensive literature reporting on values of the intraclass correlation coefficient, a parameter essential to determining the required size of a planned cluster randomized trial. We propose here a simple simulation-based approach including all relevant information that can facilitate this task. An…
Descriptors: Sample Size, Computation, Correlation, Bayesian Statistics
Peer reviewed
Sinharay, Sandip; Dorans, Neil J.; Grant, Mary C.; Blew, Edwin O. – Journal of Educational and Behavioral Statistics, 2009
Test administrators often face the challenge of detecting differential item functioning (DIF) with samples of size smaller than that recommended by experts. A Bayesian approach can incorporate, in the form of a prior distribution, existing information on the inference problem at hand, which yields more stable estimation, especially for small…
Descriptors: Test Bias, Computation, Bayesian Statistics, Data
Peer reviewed
von Davier, Matthias; Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2007
Reporting methods used in large-scale assessments such as the National Assessment of Educational Progress (NAEP) rely on latent regression models. To fit the latent regression model using the maximum likelihood estimation technique, multivariate integrals must be evaluated. In the computer program MGROUP used by the Educational Testing Service for…
Descriptors: Simulation, Computer Software, Sampling, Data Analysis
Peer reviewed
Bonett, Douglas G.; Seier, Edith – Journal of Educational and Behavioral Statistics, 2003
Derived a confidence interval for a ratio of correlated mean absolute deviations. Simulation results show that it performs well in small sample sizes across realistically nonnormal distributions and that it is almost as powerful as the most powerful test examined by R. Wilcox (1990). (SLD)
Descriptors: Correlation, Equations (Mathematics), Hypothesis Testing, Sample Size
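The point estimate behind the interval is straightforward to compute; the sketch below shows a minimal version (the CI derivation itself is not reproduced here, and the function names are assumptions).

```python
from statistics import mean

def mad(v):
    """Mean absolute deviation of a sample from its mean."""
    m = mean(v)
    return mean(abs(x - m) for x in v)

def mad_ratio(x, y):
    """Ratio of mean absolute deviations for two (possibly correlated) samples."""
    return mad(x) / mad(y)

print(mad_ratio([0, 2, 0, 2], [0, 4, 0, 4]))  # 0.5
```

A ratio of 1 indicates equal dispersion in the two variables, which is why an interval that excludes 1 serves as a test of equal variability.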
Peer reviewed
Coombs, William T.; Algina, James – Journal of Educational and Behavioral Statistics, 1996
Type I error rates for the Johansen test were estimated using simulated data for a variety of conditions. Results indicate that Type I error rates for the Johansen test depend heavily on the number of groups and the ratio of the smallest sample size to the number of dependent variables. Sample size guidelines are presented. (SLD)
Descriptors: Group Membership, Hypothesis Testing, Multivariate Analysis, Robustness (Statistics)