Publication Date
In 2025 | 0 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 11 |
Since 2016 (last 10 years) | 16 |
Since 2006 (last 20 years) | 29 |
Descriptor
Comparative Analysis | 44 |
Goodness of Fit | 44 |
Simulation | 44 |
Item Response Theory | 14 |
Models | 14 |
Statistical Analysis | 12 |
Test Items | 12 |
Evaluation Methods | 8 |
Mathematical Models | 7 |
Computation | 6 |
Factor Analysis | 6 |
More ▼ |
Source
Author
Reckase, Mark D. | 3 |
Holt, Judith A. | 2 |
Macready, George B. | 2 |
Sinharay, Sandip | 2 |
Smith, Richard M. | 2 |
Abad, Francisco J. | 1 |
Afdhal, Nezam | 1 |
Andreas Gold | 1 |
Baek, Eunkyeng | 1 |
Baker, Ryan S. | 1 |
Beauducel, Andre | 1 |
More ▼ |
Publication Type
Reports - Research | 32 |
Journal Articles | 30 |
Reports - Evaluative | 7 |
Speeches/Meeting Papers | 7 |
Dissertations/Theses -… | 3 |
Information Analyses | 2 |
Reports - Descriptive | 1 |
Education Level
Secondary Education | 3 |
Elementary Secondary Education | 2 |
Elementary Education | 1 |
Grade 12 | 1 |
Grade 5 | 1 |
Grade 7 | 1 |
Grade 8 | 1 |
High Schools | 1 |
Higher Education | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
More ▼ |
Audience
Practitioners | 1 |
Laws, Policies, & Programs
Assessments and Surveys
ACT Assessment | 1 |
National Longitudinal Study… | 1 |
Raven Advanced Progressive… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
Karl Schweizer; Andreas Gold; Dorothea Krampen; Stefan Troche – Educational and Psychological Measurement, 2024
Conceptualizing two-variable disturbances preventing good model fit in confirmatory factor analysis as item-level method effects instead of correlated residuals avoids violating the principle that residual variation is unique for each item. The possibility of representing such a disturbance by a method factor of a bifactor measurement model was…
Descriptors: Correlation, Factor Analysis, Measurement Techniques, Item Analysis
Zsuzsa Bakk – Structural Equation Modeling: A Multidisciplinary Journal, 2024
A standard assumption of latent class (LC) analysis is conditional independence, that is the items of the LC are independent of the covariates given the LCs. Several approaches have been proposed for identifying violations of this assumption. The recently proposed likelihood ratio approach is compared to residual statistics (bivariate residuals…
Descriptors: Goodness of Fit, Error of Measurement, Comparative Analysis, Models
James Ohisei Uanhoro – Educational and Psychological Measurement, 2024
Accounting for model misspecification in Bayesian structural equation models is an active area of research. We present a uniquely Bayesian approach to misspecification that models the degree of misspecification as a parameter--a parameter akin to the correlation root mean squared residual. The misspecification parameter can be interpreted on its…
Descriptors: Bayesian Statistics, Structural Equation Models, Simulation, Statistical Inference
Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024
Item response theory (IRT) models the relationship between the possible scores on a test item against a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…
Descriptors: Item Response Theory, Test Items, Models, Scoring
Christopher E. Shank – ProQuest LLC, 2024
This dissertation compares the performance of equivalence test (EQT) and null hypothesis test (NHT) procedures for identifying invariant and noninvariant factor loadings under a range of experimental manipulations. EQT is the statistically appropriate approach when the research goal is to find evidence of group similarity rather than group…
Descriptors: Factor Analysis, Goodness of Fit, Intervals, Comparative Analysis
Fu, Yuanshu; Wen, Zhonglin; Wang, Yang – Educational and Psychological Measurement, 2022
Composite reliability, or coefficient omega, can be estimated using structural equation modeling. Composite reliability is usually estimated under the basic independent clusters model of confirmatory factor analysis (ICM-CFA). However, due to the existence of cross-loadings, the model fit of the exploratory structural equation model (ESEM) is…
Descriptors: Comparative Analysis, Structural Equation Models, Factor Analysis, Reliability
Xue Zhang; Chun Wang – Grantee Submission, 2022
Item-level fit analysis not only serves as a complementary check to global fit analysis, it is also essential in scale development because the fit results will guide item revision and/or deletion (Liu & Maydeu-Olivares, 2014). During data collection, missing response data may likely happen due to various reasons. Chi-square-based item fit…
Descriptors: Goodness of Fit, Item Response Theory, Scores, Test Length
Baek, Eunkyeng; Luo, Wen; Henri, Maria – Journal of Experimental Education, 2022
It is common to include multiple dependent variables (DVs) in single-case experimental design (SCED) meta-analyses. However, statistical issues associated with multiple DVs in the multilevel modeling approach (i.e., possible dependency of error, heterogeneous treatment effects, and heterogeneous error structures) have not been fully investigated.…
Descriptors: Meta Analysis, Hierarchical Linear Modeling, Comparative Analysis, Statistical Inference
Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021
Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…
Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making
Monroe, Scott – Journal of Educational and Behavioral Statistics, 2021
This research proposes a new statistic for testing latent variable distribution fit for unidimensional item response theory (IRT) models. If the typical assumption of normality is violated, then item parameter estimates will be biased, and dependent quantities such as IRT score estimates will be adversely affected. The proposed statistic compares…
Descriptors: Item Response Theory, Simulation, Scores, Comparative Analysis
Wang, Cheng; Butts, Carter T.; Hipp, John; Lakon, Cynthia M. – Sociological Methods & Research, 2022
The recent popularity of models that capture the dynamic coevolution of both network structure and behavior has driven the need for summary indices to assess the adequacy of these models to reproduce dynamic properties of scientific or practical importance. Whereas there are several existing indices for assessing the ability of the model to…
Descriptors: Models, Goodness of Fit, Comparative Analysis, Computer Software
Do Adaptive Representations of the Item-Position Effect in APM Improve Model Fit? A Simulation Study
Zeller, Florian; Krampen, Dorothea; Reiß, Siegbert; Schweizer, Karl – Educational and Psychological Measurement, 2017
The item-position effect describes how an item's position within a test, that is, the number of previous completed items, affects the response to this item. Previously, this effect was represented by constraints reflecting simple courses, for example, a linear increase. Due to the inflexibility of these representations our aim was to examine…
Descriptors: Goodness of Fit, Simulation, Factor Analysis, Intelligence Tests
Sinharay, Sandip – Applied Measurement in Education, 2017
Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…
Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis
Lamprianou, Iasonas – Educational and Psychological Measurement, 2018
It is common practice for assessment programs to organize qualifying sessions during which the raters (often known as "markers" or "judges") demonstrate their consistency before operational rating commences. Because of the high-stakes nature of many rating activities, the research community tends to continuously explore new…
Descriptors: Social Networks, Network Analysis, Comparative Analysis, Innovation
Leahy, Joy; O'Leary, Aisling; Afdhal, Nezam; Gray, Emma; Milligan, Scott; Wehmeyer, Malte H.; Walsh, Cathal – Research Synthesis Methods, 2018
The use of individual patient data (IPD) in network meta-analysis (NMA) is becoming increasingly popular. However, as most studies do not report IPD, most NMAs are performed using aggregate data for at least some, if not all, of the studies. We investigate the benefits of including varying proportions of IPD studies in an NMA. Several models have…
Descriptors: Patients, Medical Research, Meta Analysis, Network Analysis