ERIC - Search Results

Publication Date

In 2025	1
Since 2024	10
Since 2021 (last 5 years)	23
Since 2016 (last 10 years)	46
Since 2006 (last 20 years)	88

Descriptor

Error of Measurement	96
Item Response Theory	96
Models	96
Simulation	31
Comparative Analysis	27
Computation	26
Test Items	25
Goodness of Fit	23
Sample Size	20
Item Analysis	17
Evaluation Methods	16
Scores	16
Measurement Techniques	13
Accuracy	12
Correlation	12
Maximum Likelihood Statistics	12
Psychometrics	12
Test Length	12
Foreign Countries	11
Monte Carlo Methods	11
Statistical Analysis	11
Regression (Statistics)	8
Statistical Bias	8
Test Bias	8
Difficulty Level	7

Publication Type

Journal Articles	76
Reports - Research	65
Reports - Evaluative	16
Dissertations/Theses -…	9
Reports - Descriptive	5
Speeches/Meeting Papers	5

Education Level

Elementary Secondary Education	9
Higher Education	5
Postsecondary Education	5
Secondary Education	3
Adult Education	1
Elementary Education	1
Grade 7	1
Grade 8	1
High Schools	1
Junior High Schools	1
Middle Schools	1

Location

Bahrain	1
China	1
Italy	1
Philippines	1
Saudi Arabia	1
South Korea	1
Turkey	1

Assessments and Surveys

National Assessment of…	3
Trends in International…	3
Behavioral Risk Factor…	1
Big Five Inventory	1
General Aptitude Test Battery	1
Law School Admission Test	1
Program for International…	1

Showing 1 to 15 of 96 results

Detecting Differential Item Functioning with Multiple Causes: A Comparison of Three Methods

Peer reviewed

Xiaowen Liu – International Journal of Testing, 2024

Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…

Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation

A Note on Standard Errors for Multidimensional Two-Parameter Logistic Models Using Gaussian Variational Estimation

Peer reviewed

Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…

Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis

The Goodness of Fit Evaluation against Local Dependence in Polytomous IRT Models: What Global Fit Indices Can Tell Us?

Jiangqiong Li – ProQuest LLC, 2024

When measuring latent constructs, for example, language ability, we use statistical models to specify appropriate relationships between the latent construct and observed responses to test items. These models rely on theoretical assumptions to ensure accurate parameter estimates for valid inferences based on the test results. This dissertation…

Descriptors: Goodness of Fit, Item Response Theory, Models, Measurement Techniques

Detecting Multidimensional DIF in Polytomous Items with IRT Methods and Estimation Approaches

Peer reviewed

Güler Yavuz Temel – Journal of Educational Measurement, 2024

The purpose of this study was to investigate multidimensional DIF with a simple and nonsimple structure in the context of multidimensional Graded Response Model (MGRM). This study examined and compared the performance of the IRT-LR and Wald test using MML-EM and MHRM estimation approaches with different test factors and test structures in…

Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models

Latent Class Analysis with Measurement Invariance Testing: Simulation Study to Compare Overall Likelihood Ratio vs Residual Fit Statistics Based Model Selection

Peer reviewed

Zsuzsa Bakk – Structural Equation Modeling: A Multidisciplinary Journal, 2024

A standard assumption of latent class (LC) analysis is conditional independence, that is, the items of the LC are independent of the covariates given the LCs. Several approaches have been proposed for identifying violations of this assumption. The recently proposed likelihood ratio approach is compared to residual statistics (bivariate residuals…

Descriptors: Goodness of Fit, Error of Measurement, Comparative Analysis, Models

IRT Observed-Score Equating for Rater-Mediated Assessments Using a Hierarchical Rater Model

Peer reviewed

Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025

While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…

Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity

Rotation Local Solutions in Multidimensional Item Response Theory Models

Peer reviewed

Hoang V. Nguyen; Niels G. Waller – Educational and Psychological Measurement, 2024

We conducted an extensive Monte Carlo study of factor-rotation local solutions (LS) in multidimensional, two-parameter logistic (M2PL) item response models. In this study, we simulated more than 19,200 data sets that were drawn from 96 model conditions and performed more than 7.6 million rotations to examine the influence of (a) slope parameter…

Descriptors: Monte Carlo Methods, Item Response Theory, Correlation, Error of Measurement

A Matrix Lie Group Formulation of Measurement Theory: Symmetries of Classical Measurement Theory

Peer reviewed

William R. Nugent – Measurement: Interdisciplinary Research and Perspectives, 2024

Symmetry considerations are important in science, and Group Theory is a theory of symmetry. Classical Measurement Theory is the most used measurement theory in the social and behavioral sciences. In this article, the author uses Matrix Lie (Lee) group theory to formulate a measurement model. Symmetry is defined and illustrated using symmetries of…

Descriptors: Item Response Theory, Measurement Techniques, Models, Simulation

3PL and 4PL Multiprocess Models

Ryan Derickson – ProQuest LLC, 2022

Item Response Theory (IRT) models are a popular analytic method for self-report data. We show how traditional IRT models can be vulnerable to specific kinds of asymmetric measurement error (AME) in self-report data, because the models spread the error to all estimates -- even those of items that do not contribute error. We quantify the impact of…

Descriptors: Item Response Theory, Measurement Techniques, Error of Measurement, Models

Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores

Peer reviewed

Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023

This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…

Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores

Sample Size and Item Parameter Estimation Precision When Utilizing the Masters' Partial Credit Model

Custer, Michael; Kim, Jongpil – Online Submission, 2023

This study utilizes an analysis of diminishing returns to examine the relationship between sample size and item parameter estimation precision when utilizing the Masters' Partial Credit Model for polytomous items. Item data from the standardization of the Battelle Developmental Inventory, 3rd Edition were used. Each item was scored with a…

Descriptors: Sample Size, Item Response Theory, Test Items, Computation

Comparison of Methods for Identifying Differential Step Functioning with Polytomous Item Response Data

Peer reviewed

Finch, Holmes – Applied Measurement in Education, 2022

Much research has been devoted to identification of differential item functioning (DIF), which occurs when the item responses for individuals from two groups differ after they are conditioned on the latent trait being measured by the scale. There has been less work examining differential step functioning (DSF), which is present for polytomous…

Descriptors: Comparative Analysis, Item Response Theory, Item Analysis, Simulation

Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models

Peer reviewed

Paek, Insu; Lin, Zhongtian; Chalmers, Robert Philip – Educational and Psychological Measurement, 2023

To reduce the chance of Heywood cases or nonconvergence in estimating the 2PL or the 3PL model in the marginal maximum likelihood with the expectation-maximization (MML-EM) estimation method, priors for the item slope parameter in the 2PL model or for the pseudo-guessing parameter in the 3PL model can be used and the marginal maximum a posteriori…

Descriptors: Models, Item Response Theory, Test Items, Intervals

Effects of Using Double Ratings as Item Scores on IRT Proficiency Estimation

Peer reviewed

Song, Yoon Ah; Lee, Won-Chan – Applied Measurement in Education, 2022

This article presents the performance of item response theory (IRT) models when double ratings are used as item scores over single ratings when rater effects are present. Study 1 examined the influence of the number of ratings on the accuracy of proficiency estimation in the generalized partial credit model (GPCM). Study 2 compared the accuracy of…

Descriptors: Item Response Theory, Item Analysis, Scores, Accuracy

Performance of Infit and Outfit Confidence Intervals Calculated via Parametric Bootstrapping

Peer reviewed

Silva Diaz, John Alexander; Köhler, Carmen; Hartig, Johannes – Applied Measurement in Education, 2022

Testing item fit is central in item response theory (IRT) modeling, since a good fit is necessary to draw valid inferences from estimated model parameters. "Infit" and "outfit" fit statistics, widespread indices for detecting deviations from the Rasch model, are affected by data factors, such as sample size. Consequently, the…

Descriptors: Intervals, Item Response Theory, Item Analysis, Inferences

Source

Educational and Psychological…	23
ProQuest LLC	9
Journal of Educational…	8
Journal of Educational and…	7
Applied Measurement in…	5
Applied Psychological…	5
Grantee Submission	4
ETS Research Report Series	3
Online Submission	3
Educational Sciences: Theory…	2
Measurement:…	2
Multivariate Behavioral…	2
Psychometrika	2
Assessment	1
Assessment in Education:…	1
Educational Measurement:…	1
Educational Psychologist	1
Educational Testing Service	1
Health Education Research	1
International Journal of…	1
International Journal of…	1
Journal of Education and…	1
Journal of Psychoeducational…	1
Language Testing	1
Measurement and Evaluation in…	1

Author

Finch, Holmes	4
Cai, Li	3
DeMars, Christine E.	3
Monroe, Scott	3
Chun Wang	2
Custer, Michael	2
Dimitrov, Dimiter M.	2
Falk, Carl F.	2
Ferrando, Pere J.	2
Gongjun Xu	2
Kelecioglu, Hülya	2
Lee, Won-Chan	2
Li, Deping	2
Marcoulides, George A.	2
Maydeu-Olivares, Alberto	2
Oranje, Andreas	2
Paek, Insu	2
Raykov, Tenko	2
Rizavi, Saba	2
Tsaousis, Ioannis	2
Wilson, Mark	2
A. Corinne Huggins-Manley	1
Ackerman, Terry A.	1
Al Harbi, Khaleel	1
Al-harbi, Khaleel A.	1