ERIC - Search Results


Showing all 15 results

Using Cumulative Sum Control Chart to Detect Aberrant Responses in Educational Assessments

Peer reviewed

Wan, Siyu; Keller, Lisa A. – Practical Assessment, Research & Evaluation, 2023

Statistical process control (SPC) charts have been widely used in the field of educational measurement. The cumulative sum (CUSUM) is an established SPC method to detect aberrant responses for educational assessments. Many studies have investigated the performance of CUSUM in different test settings. This paper describes the CUSUM…

Descriptors: Visual Aids, Educational Assessment, Evaluation Methods, Item Response Theory

R Packages for Item Response Theory Analysis: Descriptions and Features

Peer reviewed

Choi, Youn-Jeng; Asilkalkan, Abdullah – Measurement: Interdisciplinary Research and Perspectives, 2019

About 45 R packages to analyze data using item response theory (IRT) have been developed over the last decade. This article introduces these 45 R packages with their descriptions and features. It also describes possible advanced IRT models using R packages, as well as dichotomous and polytomous IRT models, and R packages that contain applications…

Descriptors: Item Response Theory, Data Analysis, Computer Software, Test Bias

Sensitivity of the RMSD for Detecting Item-Level Misfit in Low-Performing Countries

Peer reviewed

Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020

Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…

Descriptors: Test Items, Goodness of Fit, Probability, Accuracy

Model Adequacy Checking/Goodness-of-Fit Testing for Behavior in Joint Dynamic Network/Behavior Models, with an Extension to Two-Mode Networks

Peer reviewed

Wang, Cheng; Butts, Carter T.; Hipp, John; Lakon, Cynthia M. – Sociological Methods & Research, 2022

The recent popularity of models that capture the dynamic coevolution of both network structure and behavior has driven the need for summary indices to assess the adequacy of these models to reproduce dynamic properties of scientific or practical importance. Whereas there are several existing indices for assessing the ability of the model to…

Descriptors: Models, Goodness of Fit, Comparative Analysis, Computer Software

Polytomous Rasch Models in Counseling Assessment

Peer reviewed

Willse, John T. – Measurement and Evaluation in Counseling and Development, 2017

This article provides a brief introduction to the Rasch model. Motivation for using Rasch analyses is provided. Important Rasch model concepts and key aspects of result interpretation are introduced, with major points reinforced using a simulation demonstration. Concrete guidelines are provided regarding sample size and the evaluation of items.

Descriptors: Item Response Theory, Test Results, Test Interpretation, Simulation

Adjusting the Adjusted χ²/df Ratio Statistic for Dichotomous Item Response Theory Analyses: Does the Model Fit?

Peer reviewed

Tay, Louis; Drasgow, Fritz – Educational and Psychological Measurement, 2012

Two Monte Carlo simulation studies investigated the effectiveness of the mean adjusted χ²/df statistic proposed by Drasgow and colleagues and, because of problems with the method, a new approach for assessing the goodness of fit of an item response theory model was developed. It has been previously recommended that mean adjusted…

Descriptors: Test Length, Monte Carlo Methods, Goodness of Fit, Item Response Theory

Level-Specific Evaluation of Model Fit in Multilevel Structural Equation Modeling

Peer reviewed

Ryu, Ehri; West, Stephen G. – Structural Equation Modeling: A Multidisciplinary Journal, 2009

In multilevel structural equation modeling, the "standard" approach to evaluating the goodness of model fit has a potential limitation in detecting the lack of fit at the higher level. Level-specific model fit evaluation can address this limitation and is more informative in locating the source of lack of model fit. We proposed level-specific test…

Descriptors: Structural Equation Models, Evaluation Methods, Goodness of Fit, Simulation

Using Mx to Analyze Cross-Level Effects in Two-Level Structural Equation Models

Peer reviewed

Bai, Yun; Poon, Wai-Yin – Structural Equation Modeling: A Multidisciplinary Journal, 2009

Two-level data sets are frequently encountered in social and behavioral science research. They arise when observations are drawn from a known hierarchical structure, such as when individuals are randomly drawn from groups that are randomly drawn from a target population. Although 2-level data analysis in the context of structural equation modeling…

Descriptors: Structural Equation Models, Data Analysis, Simulation, Goodness of Fit

Multidimensional Factor-Analysis-Based Procedures for Assessing Scalability in Personality Measurement

Peer reviewed

Ferrando, Pere J. – Structural Equation Modeling: A Multidisciplinary Journal, 2009

Most personality tests are made up of Likert-type items and analyzed by means of factor analysis (FA). In this type of application, the fit of the model at the level of individual respondents is almost never assessed. This article proposes procedures for assessing individual fit (scalability). The procedures are intended for the analysis of…

Descriptors: Personality, Factor Analysis, Personality Measures, Item Response Theory

Higher-Order Approximations to the Distributions of Fit Indexes under Fixed Alternatives in Structural Equation Models

Peer reviewed

Ogasawara, Haruhiko – Psychometrika, 2007

Higher-order approximations to the distributions of fit indexes for structural equation models under fixed alternative hypotheses are obtained in nonnormal samples as well as normal ones. The fit indexes include the normal-theory likelihood ratio chi-square statistic for a posited model, the corresponding statistic for the baseline model of…

Descriptors: Intervals, Structural Equation Models, Goodness of Fit, Simulation

When Trivial Constraints Are Not Trivial: The Choice of Uniqueness Constraints in Confirmatory Factor Analysis

Peer reviewed

Millsap, Roger E. – Structural Equation Modeling, 2001

Different sets of uniqueness constraints may lead to different fit results when applied to the same data in confirmatory factor analysis. The article provides several examples of this phenomenon in simulated data, describes reasons for the variation in fit results, and discusses the choice of uniqueness constraints under these circumstances.

Descriptors: Goodness of Fit, Simulation

Interaction Effects in Growth Modeling: A Full Model

Peer reviewed

Wen, Zhonglin; Marsh, Herbert W.; Hau, Kit-Tai – Structural Equation Modeling, 2002

The authors point out two concerns with recent research by F. Li and others (2000) and T. Duncan and others (1999) that extended the structural equation model of latent interactions developed by K. Jöreskog and F. Yang (1996) to latent growth modeling. They used mathematical derivation and a comparison of alternative models fitted to simulated data to develop a…

Descriptors: Goodness of Fit, Interaction, Simulation, Structural Equation Models

Exploratory Structural Equation Modeling

Peer reviewed

Asparouhov, Tihomir; Muthén, Bengt – Structural Equation Modeling: A Multidisciplinary Journal, 2009

Exploratory factor analysis (EFA) is a frequently used multivariate analysis technique in statistics. Jennrich and Sampson (1966) solved a significant EFA factor loading matrix rotation problem by deriving the direct Quartimin rotation. Jennrich was also the first to develop standard errors for rotated solutions, although these have still not made…

Descriptors: Structural Equation Models, Testing, Factor Analysis, Research Methodology

A Beta Item Response Model for Continuous Bounded Responses

Peer reviewed

Noel, Yvonnick; Dauvier, Bruno – Applied Psychological Measurement, 2007

An item response model is proposed for the analysis of continuous response formats in an item response theory (IRT) framework. With such formats, respondents are asked to report their response as a mark on a fixed-length graphical segment whose ends are labeled with extreme responses. An interpolation process is proposed as the response mechanism…

Descriptors: Simulation, Item Response Theory, Models, Responses

An Item Response Model for Nominal Data Based on the Rising Selection Ratios Criterion

Peer reviewed

Revuelta, Javier – Psychometrika, 2005

Complete response vectors of all answer options in multiple-choice items can be used to estimate ability. The rising selection ratios criterion is necessary for scoring individuals because it implies that estimated ability always increases when the correct alternative is selected. This paper introduces the generalized DLT model, which assumes…

Descriptors: Multiple Choice Tests, Simulation, Item Response Theory, Models
