Publication Date
In 2025 | 0 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 8 |
Since 2016 (last 10 years) | 24 |
Since 2006 (last 20 years) | 47 |
Descriptor
Goodness of Fit | 52 |
Models | 52 |
Sample Size | 52 |
Simulation | 20 |
Error of Measurement | 15 |
Item Response Theory | 15 |
Statistical Analysis | 14 |
Factor Analysis | 13 |
Monte Carlo Methods | 12 |
Evaluation Methods | 10 |
Computation | 9 |
Author
Hong, Sehee | 2 |
Lee, Taehun | 2 |
Liang, Tie | 2 |
Lubke, Gitta | 2 |
Murphy, Daniel L. | 2 |
Shi, Dexin | 2 |
Wells, Craig S. | 2 |
Abdous, Belkacem | 1 |
Alexandrowicz, Rainer W. | 1 |
Arnold, Carolyn L. | 1 |
Baghaei, Purya | 1 |
Publication Type
Journal Articles | 39 |
Reports - Research | 37 |
Dissertations/Theses -… | 6 |
Reports - Evaluative | 6 |
Speeches/Meeting Papers | 4 |
Information Analyses | 2 |
Reports - Descriptive | 2 |
Opinion Papers | 1 |
Education Level
Adult Education | 1 |
Elementary Secondary Education | 1 |
High Schools | 1 |
Higher Education | 1 |
Middle Schools | 1 |
Postsecondary Education | 1 |
Secondary Education | 1 |
Location
Australia | 1 |
Assessments and Surveys
National Longitudinal Survey… | 1 |
Schools and Staffing Survey… | 1 |
Self Description Questionnaire | 1 |
Test of English as a Foreign… | 1 |
Trends in International… | 1 |
Woodcock Johnson Tests of… | 1 |
Dubravka Svetina Valdivia; Shenghai Dai – Journal of Experimental Education, 2024
Applications of polytomous IRT models in applied fields (e.g., health, education, psychology) abound. However, little is known about the impact of the number of categories and sample size requirements for precise parameter recovery. In a simulation study, we investigated the impact of the number of response categories and required sample size…
Descriptors: Item Response Theory, Sample Size, Models, Classification
David Goretzko; Karik Siemund; Philipp Sterner – Educational and Psychological Measurement, 2024
Confirmatory factor analyses (CFA) are often used in psychological research when developing measurement models for psychological constructs. Evaluating CFA model fit can be quite challenging, as tests for exact model fit may focus on negligible deviances, while fit indices cannot be interpreted absolutely without specifying thresholds or cutoffs.…
Descriptors: Factor Analysis, Goodness of Fit, Psychological Studies, Measurement
Christopher E. Shank – ProQuest LLC, 2024
This dissertation compares the performance of equivalence test (EQT) and null hypothesis test (NHT) procedures for identifying invariant and noninvariant factor loadings under a range of experimental manipulations. EQT is the statistically appropriate approach when the research goal is to find evidence of group similarity rather than group…
Descriptors: Factor Analysis, Goodness of Fit, Intervals, Comparative Analysis
Jang, Yoona; Hong, Sehee – Educational and Psychological Measurement, 2023
The purpose of this study was to evaluate the degree of classification quality in the basic latent class model when covariates are either included or are not included in the model. To accomplish this task, Monte Carlo simulations were conducted in which the results of models with and without a covariate were compared. Based on these simulations,…
Descriptors: Classification, Models, Prediction, Sample Size
Silva Diaz, John Alexander; Köhler, Carmen; Hartig, Johannes – Applied Measurement in Education, 2022
Testing item fit is central in item response theory (IRT) modeling, since a good fit is necessary to draw valid inferences from estimated model parameters. "Infit" and "outfit" fit statistics, widespread indices for detecting deviations from the Rasch model, are affected by data factors, such as sample size. Consequently, the…
Descriptors: Intervals, Item Response Theory, Item Analysis, Inferences
Cao, Chunhua; Kim, Eun Sook; Chen, Yi-Hsin; Ferron, John – Educational and Psychological Measurement, 2021
This study examined the impact of omitting the covariate interaction effect on parameter estimates in multilevel multiple-indicator multiple-cause models as well as the sensitivity of fit indices to model misspecification when the between-level, within-level, or cross-level interaction effect was left out of the models. The parameter estimates…
Descriptors: Goodness of Fit, Hierarchical Linear Modeling, Computation, Models
Ben Stenhaug; Ben Domingue – Grantee Submission, 2022
The fit of an item response model is typically conceptualized as whether a given model could have generated the data. We advocate for an alternative view of fit, "predictive fit", based on the model's ability to predict new data. We derive two predictive fit metrics for item response models that assess how well an estimated item response…
Descriptors: Goodness of Fit, Item Response Theory, Prediction, Models
Tzou, Hueying; Yang, Ya-Huei – International Journal of Assessment Tools in Education, 2019
Selecting an appropriate cognitive diagnostic model (CDM) for data analysis is always challenging. Studies have explored several model fit indices for CDMs. The common results of these studies indicate that Q-matrix misspecifications lead to poor performance of the model fit indices in the context of CDMs. Thus, this study explored whether model…
Descriptors: Goodness of Fit, Sample Size, Cognitive Measurement, Models
Shi, Dexin; Lee, Taehun; Fairchild, Amanda J.; Maydeu-Olivares, Alberto – Educational and Psychological Measurement, 2020
This study compares two missing data procedures in the context of ordinal factor analysis models: pairwise deletion (PD; the default setting in Mplus) and multiple imputation (MI). We examine which procedure demonstrates parameter estimates and model fit indices closer to those of complete data. The performance of PD and MI is compared under a…
Descriptors: Factor Analysis, Statistical Analysis, Computation, Goodness of Fit
Su, Shiyang; Wang, Chun; Weiss, David J. – Educational and Psychological Measurement, 2021
S-X² is a popular item fit index that is available in commercial software packages such as flexMIRT. However, no research has systematically examined the performance of S-X² for detecting item misfit within the context of the multidimensional graded response model (MGRM). The primary goal of this study was…
Descriptors: Statistics, Goodness of Fit, Test Items, Models
Sen, Sedat; Cohen, Allan S. – Measurement: Interdisciplinary Research and Perspectives, 2019
Mixture item response theory (MixIRT) models combine IRT models with latent class models and assume that there exist latent subpopulations in the data. Identification of latent subpopulations via MixIRT models produces more detailed information. Detailed information about the response processing of examinees provides a better understanding of the…
Descriptors: Item Response Theory, Models, Item Analysis, Personality Traits
Xu, Jie – ProQuest LLC, 2019
Research has shown that cross-sectional mediation analysis cannot accurately reflect a true longitudinal mediated effect. To investigate longitudinal mediated effects, different longitudinal mediation models have been proposed and these models focus on different research questions related to longitudinal mediation. When fitting mediation models to…
Descriptors: Case Studies, Error of Measurement, Longitudinal Studies, Models
Ravand, Hamdollah; Baghaei, Purya – International Journal of Testing, 2020
More than three decades after their introduction, diagnostic classification models (DCM) do not seem to have been implemented in educational systems for the purposes for which they were devised. Most DCM research is either methodological for model development and refinement or retrofitting to existing nondiagnostic tests and, in the latter case, basically…
Descriptors: Classification, Models, Diagnostic Tests, Test Construction
DiStefano, Christine; McDaniel, Heather L.; Zhang, Liyun; Shi, Dexin; Jiang, Zhehan – Educational and Psychological Measurement, 2019
A simulation study was conducted to investigate the model size effect when confirmatory factor analysis (CFA) models include many ordinal items. CFA models including between 15 and 120 ordinal items were analyzed with mean- and variance-adjusted weighted least squares to determine how varying sample size, number of ordered categories, and…
Descriptors: Factor Analysis, Effect Size, Data, Sample Size
Kang, Yoonjeong; McNeish, Daniel M.; Hancock, Gregory R. – Educational and Psychological Measurement, 2016
Although differences in goodness-of-fit indices (ΔGOFs) have been advocated for assessing measurement invariance, studies that advanced recommended differential cutoffs for adjudicating invariance actually utilized a very limited range of values representing the quality of indicator variables (i.e., magnitude of loadings). Because quality of…
Descriptors: Measurement, Goodness of Fit, Guidelines, Models