Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 12 |
Descriptor
Goodness of Fit | 15 |
Simulation | 15 |
Item Response Theory | 8 |
Evaluation Methods | 7 |
Structural Equation Models | 6 |
Models | 5 |
Computer Software | 4 |
Test Items | 4 |
Computation | 3 |
Error of Measurement | 3 |
Accuracy | 2 |
More ▼ |
Source
Author
Asilkalkan, Abdullah | 1 |
Asparouhov, Tihomir | 1 |
Bai, Yun | 1 |
Bolsinova, Maria | 1 |
Butts, Carter T. | 1 |
Choi, Youn-Jeng | 1 |
Dauvier, Bruno | 1 |
Drasgow, Fritz | 1 |
Ferrando, Pere J. | 1 |
Hau, Kit-Tai | 1 |
Hipp, John | 1 |
More ▼ |
Publication Type
Journal Articles | 15 |
Reports - Descriptive | 15 |
Education Level
Secondary Education | 2 |
Elementary Education | 1 |
Grade 3 | 1 |
Audience
Researchers | 1 |
Location
United Kingdom (Glasgow) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 1 |
What Works Clearinghouse Rating
Wan, Siyu; Keller, Lisa A. – Practical Assessment, Research & Evaluation, 2023
Statistical process control (SPC) charts have been widely used in the field of educational measurement. The cumulative sum (CUSUM) is an established SPC method to detect aberrant responses for educational assessments. There are many studies that investigated the performance of CUSUM in different test settings. This paper describes the CUSUM…
Descriptors: Visual Aids, Educational Assessment, Evaluation Methods, Item Response Theory
Choi, Youn-Jeng; Asilkalkan, Abdullah – Measurement: Interdisciplinary Research and Perspectives, 2019
About 45 R packages to analyze data using item response theory (IRT) have been developed over the last decade. This article introduces these 45 R packages with their descriptions and features. It also describes possible advanced IRT models using R packages, as well as dichotomous and polytomous IRT models, and R packages that contain applications…
Descriptors: Item Response Theory, Data Analysis, Computer Software, Test Bias
Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020
Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…
Descriptors: Test Items, Goodness of Fit, Probability, Accuracy
Wang, Cheng; Butts, Carter T.; Hipp, John; Lakon, Cynthia M. – Sociological Methods & Research, 2022
The recent popularity of models that capture the dynamic coevolution of both network structure and behavior has driven the need for summary indices to assess the adequacy of these models to reproduce dynamic properties of scientific or practical importance. Whereas there are several existing indices for assessing the ability of the model to…
Descriptors: Models, Goodness of Fit, Comparative Analysis, Computer Software
Willse, John T. – Measurement and Evaluation in Counseling and Development, 2017
This article provides a brief introduction to the Rasch model. Motivation for using Rasch analyses is provided. Important Rasch model concepts and key aspects of result interpretation are introduced, with major points reinforced using a simulation demonstration. Concrete guidelines are provided regarding sample size and the evaluation of items.
Descriptors: Item Response Theory, Test Results, Test Interpretation, Simulation
Tay, Louis; Drasgow, Fritz – Educational and Psychological Measurement, 2012
Two Monte Carlo simulation studies investigated the effectiveness of the mean adjusted X[superscript 2]/df statistic proposed by Drasgow and colleagues and, because of problems with the method, a new approach for assessing the goodness of fit of an item response theory model was developed. It has been previously recommended that mean adjusted…
Descriptors: Test Length, Monte Carlo Methods, Goodness of Fit, Item Response Theory
Ryu, Ehri; West, Stephen G. – Structural Equation Modeling: A Multidisciplinary Journal, 2009
In multilevel structural equation modeling, the "standard" approach to evaluating the goodness of model fit has a potential limitation in detecting the lack of fit at the higher level. Level-specific model fit evaluation can address this limitation and is more informative in locating the source of lack of model fit. We proposed level-specific test…
Descriptors: Structural Equation Models, Evaluation Methods, Goodness of Fit, Simulation
Bai, Yun; Poon, Wai-Yin – Structural Equation Modeling: A Multidisciplinary Journal, 2009
Two-level data sets are frequently encountered in social and behavioral science research. They arise when observations are drawn from a known hierarchical structure, such as when individuals are randomly drawn from groups that are randomly drawn from a target population. Although 2-level data analysis in the context of structural equation modeling…
Descriptors: Structural Equation Models, Data Analysis, Simulation, Goodness of Fit
Ferrando, Pere J. – Structural Equation Modeling: A Multidisciplinary Journal, 2009
Most personality tests are made up of Likert-type items and analyzed by means of factor analysis (FA). In this type of application, the fit of the model at the level of individual respondents is almost never assessed. This article proposes procedures for assessing individual fit (scalability). The procedures are intended for the analysis of…
Descriptors: Personality, Factor Analysis, Personality Measures, Item Response Theory
Ogasawara, Haruhiko – Psychometrika, 2007
Higher-order approximations to the distributions of fit indexes for structural equation models under fixed alternative hypotheses are obtained in nonnormal samples as well as normal ones. The fit indexes include the normal-theory likelihood ratio chi-square statistic for a posited model, the corresponding statistic for the baseline model of…
Descriptors: Intervals, Structural Equation Models, Goodness of Fit, Simulation

Millsap, Roger E. – Structural Equation Modeling, 2001
Different sets of uniqueness constraints may lead to different fit results when applied to the same data in confirmatory factor analysis. Provides several examples of this phenomenon in simulated data and describes reasons for the variation in fit results. Discusses the choice of uniqueness constraints under these circumstances. (SLD)
Descriptors: Goodness of Fit, Simulation

Wen, Zhonglin; Marsh, Herbert W.; Hau, Kit-Tai – Structural Equation Modeling, 2002
Points out two concerns with recent research by F. Li and others (2000) and T. Duncan and others (1999) that extended the structural equation model of latent interactions developed by K. Joreskog and F. Yang (1996) to latent growth modeling. Used mathematical derivation and a comparison of alternative models fitted to simulated data to develop a…
Descriptors: Goodness of Fit, Interaction, Simulation, Structural Equation Models
Asparouhov, Tihomir; Muthen, Bengt – Structural Equation Modeling: A Multidisciplinary Journal, 2009
Exploratory factor analysis (EFA) is a frequently used multivariate analysis technique in statistics. Jennrich and Sampson (1966) solved a significant EFA factor loading matrix rotation problem by deriving the direct Quartimin rotation. Jennrich was also the first to develop standard errors for rotated solutions, although these have still not made…
Descriptors: Structural Equation Models, Testing, Factor Analysis, Research Methodology
Noel, Yvonnick; Dauvier, Bruno – Applied Psychological Measurement, 2007
An item response model is proposed for the analysis of continuous response formats in an item response theory (IRT) framework. With such formats, respondents are asked to report their response as a mark on a fixed-length graphical segment whose ends are labeled with extreme responses. An interpolation process is proposed as the response mechanism…
Descriptors: Simulation, Item Response Theory, Models, Responses
Revuelta, Javier – Psychometrika, 2005
Complete response vectors of all answer options in multiple-choice items can be used to estimate ability. The rising selection ratios criterion is necessary for scoring individuals because it implies that estimated ability always increases when the correct alternative is selected. This paper introduces the generalized DLT model, which assumes…
Descriptors: Multiple Choice Tests, Simulation, Item Response Theory, Models