Publication Date
In 2025 | 3 |
Since 2024 | 7 |
Since 2021 (last 5 years) | 15 |
Since 2016 (last 10 years) | 34 |
Since 2006 (last 20 years) | 75 |
Descriptor
Models | 92 |
Sample Size | 92 |
Simulation | 82 |
Item Response Theory | 34 |
Error of Measurement | 24 |
Evaluation Methods | 22 |
Statistical Analysis | 21 |
Goodness of Fit | 20 |
Test Items | 20 |
Comparative Analysis | 18 |
Computation | 18 |
More ▼ |
Source
Author
Paek, Insu | 3 |
de la Torre, Jimmy | 3 |
Beretvas, S. Natasha | 2 |
Chason, Walter M. | 2 |
Cho, Sun-Joo | 2 |
Kromrey, Jeffrey D. | 2 |
Lee, Young-Sun | 2 |
Murphy, Daniel L. | 2 |
Neale, Michael C. | 2 |
Parshall, Cynthia G. | 2 |
Suh, Youngsuk | 2 |
More ▼ |
Publication Type
Journal Articles | 75 |
Reports - Research | 60 |
Reports - Evaluative | 19 |
Speeches/Meeting Papers | 8 |
Dissertations/Theses -… | 7 |
Reports - Descriptive | 6 |
Tests/Questionnaires | 1 |
Education Level
Secondary Education | 4 |
High Schools | 2 |
Junior High Schools | 2 |
Middle Schools | 2 |
Adult Education | 1 |
Elementary Education | 1 |
Grade 7 | 1 |
Grade 8 | 1 |
Audience
Researchers | 1 |
Location
Florida (Miami) | 1 |
Turkey | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Jean-Paul Fox – Journal of Educational and Behavioral Statistics, 2025
Popular item response theory (IRT) models are considered complex, mainly due to the inclusion of a random factor variable (latent variable). The random factor variable represents the incidental parameter problem since the number of parameters increases when including data of new persons. Therefore, IRT models require a specific estimation method…
Descriptors: Sample Size, Item Response Theory, Accuracy, Bayesian Statistics
Daoxuan Fu; Chunying Qin; Zhaosheng Luo; Yujun Li; Xiaofeng Yu; Ziyu Ye – Journal of Educational and Behavioral Statistics, 2025
One of the central components of cognitive diagnostic assessment is the Q-matrix, which is an essential loading indicator matrix and is typically constructed by subject matter experts. Nonetheless, to a large extent, the construction of Q-matrix remains a subjective process and might lead to misspecifications. Many researchers have recognized the…
Descriptors: Q Methodology, Matrices, Diagnostic Tests, Cognitive Measurement
Jansen, Katrin; Holling, Heinz – Research Synthesis Methods, 2023
In meta-analyses of rare events, it can be challenging to obtain a reliable estimate of the pooled effect, in particular when the meta-analysis is based on a small number of studies. Recent simulation studies have shown that the beta-binomial model is a promising candidate in this situation, but have thus far only investigated its performance in a…
Descriptors: Bayesian Statistics, Meta Analysis, Probability, Simulation
Christopher E. Shank – ProQuest LLC, 2024
This dissertation compares the performance of equivalence test (EQT) and null hypothesis test (NHT) procedures for identifying invariant and noninvariant factor loadings under a range of experimental manipulations. EQT is the statistically appropriate approach when the research goal is to find evidence of group similarity rather than group…
Descriptors: Factor Analysis, Goodness of Fit, Intervals, Comparative Analysis
Mostafa Hosseinzadeh; Ki Lynn Matlock Cole – Educational and Psychological Measurement, 2024
In real-world situations, multidimensional data may appear on large-scale tests or psychological surveys. The purpose of this study was to investigate the effects of the quantity and magnitude of cross-loadings and model specification on item parameter recovery in multidimensional Item Response Theory (MIRT) models, especially when the model was…
Descriptors: Item Response Theory, Models, Maximum Likelihood Statistics, Algorithms
Olsson, Ulf – Practical Assessment, Research & Evaluation, 2022
We discuss analysis of 5-grade Likert type data in the two-sample case. Analysis using two-sample "t" tests, nonparametric Wilcoxon tests, and ordinal regression methods, are compared using simulated data based on an ordinal regression paradigm. One thousand pairs of samples of size "n"=10 and "n"=30 were generated,…
Descriptors: Regression (Statistics), Likert Scales, Sampling, Nonparametric Statistics
Jehanzeb Rashid Cheema – Journal of Education in Muslim Societies, 2024
This study explores the relationship between the Spiral Dynamics and the 3H (head, heart, hands) models of human growth and development, using constructs such as empathy, moral reasoning, forgiveness, and community mindedness that have been shown to have implications for education. The specific research question is, "Can a combination of…
Descriptors: Correlation, Factor Analysis, Computer Software, Moral Values
Emma Somer; Carl Falk; Milica Miocevic – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Factor Score Regression (FSR) is increasingly employed as an alternative to structural equation modeling (SEM) in small samples. Despite its popularity in psychology, the performance of FSR in multigroup models with small samples remains relatively unknown. The goal of this study was to examine the performance of FSR, namely Croon's correction and…
Descriptors: Scores, Structural Equation Models, Comparative Analysis, Sample Size
Paek, Insu; Lin, Zhongtian; Chalmers, Robert Philip – Educational and Psychological Measurement, 2023
To reduce the chance of Heywood cases or nonconvergence in estimating the 2PL or the 3PL model in the marginal maximum likelihood with the expectation-maximization (MML-EM) estimation method, priors for the item slope parameter in the 2PL model or for the pseudo-guessing parameter in the 3PL model can be used and the marginal maximum a posteriori…
Descriptors: Models, Item Response Theory, Test Items, Intervals
Kalkan, Ömür Kaya; Toprak, Emre – International Journal of Psychology and Educational Studies, 2022
All cognitive diagnostic models that evaluate educational test data require a Q-matrix that combines every item in a test with the required cognitive skills for each item to be answered correctly. Generally, the Q-matrix is constructed by education experts' judgment, leading to some uncertainty in its elements. Various statistical methods are…
Descriptors: Q Methodology, Matrices, Input Output Analysis, Models
Aidoo, Eric Nimako; Appiah, Simon K.; Boateng, Alexander – Journal of Experimental Education, 2021
This study investigated the small sample biasness of the ordered logit model parameters under multicollinearity using Monte Carlo simulation. The results showed that the level of biasness associated with the ordered logit model parameters consistently decreases for an increasing sample size while the distribution of the parameters becomes less…
Descriptors: Statistical Bias, Monte Carlo Methods, Simulation, Sample Size
Du, Han; Enders, Craig; Keller, Brian; Bradbury, Thomas N.; Karney, Benjamin R. – Grantee Submission, 2022
Missing data are exceedingly common across a variety of disciplines, such as educational, social, and behavioral science areas. Missing not at random (MNAR) mechanism where missingness is related to unobserved data is widespread in real data and has detrimental consequence. However, the existing MNAR-based methods have potential problems such as…
Descriptors: Bayesian Statistics, Data Analysis, Computer Simulation, Sample Size
Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models
Na Shan; Ping-Feng Xu – Journal of Educational and Behavioral Statistics, 2025
The detection of differential item functioning (DIF) is important in psychological and behavioral sciences. Standard DIF detection methods perform an item-by-item test iteratively, often assuming that all items except the one under investigation are DIF-free. This article proposes a Bayesian adaptive Lasso method to detect DIF in graded response…
Descriptors: Bayesian Statistics, Item Response Theory, Adolescents, Longitudinal Studies
Fu, Yanyan; Strachan, Tyler; Ip, Edward H.; Willse, John T.; Chen, Shyh-Huei; Ackerman, Terry – International Journal of Testing, 2020
This research examined correlation estimates between latent abilities when using the two-dimensional and three-dimensional compensatory and noncompensatory item response theory models. Simulation study results showed that the recovery of the latent correlation was best when the test contained 100% of simple structure items for all models and…
Descriptors: Item Response Theory, Models, Test Items, Simulation
Hung, Su-Pin; Huang, Hung-Yu – Journal of Educational and Behavioral Statistics, 2022
To address response style or bias in rating scales, forced-choice items are often used to request that respondents rank their attitudes or preferences among a limited set of options. The rating scales used by raters to render judgments on ratees' performance also contribute to rater bias or errors; consequently, forced-choice items have recently…
Descriptors: Evaluation Methods, Rating Scales, Item Analysis, Preferences