Showing 1 to 15 of 34 results
Peer reviewed
Zsuzsa Bakk – Structural Equation Modeling: A Multidisciplinary Journal, 2024
A standard assumption of latent class (LC) analysis is conditional independence; that is, the items are independent of the covariates given the latent classes. Several approaches have been proposed for identifying violations of this assumption. The recently proposed likelihood ratio approach is compared with residual statistics (bivariate residuals…
Descriptors: Goodness of Fit, Error of Measurement, Comparative Analysis, Models
Peer reviewed
Hoang V. Nguyen; Niels G. Waller – Educational and Psychological Measurement, 2024
We conducted an extensive Monte Carlo study of factor-rotation local solutions (LS) in multidimensional, two-parameter logistic (M2PL) item response models. In this study, we simulated more than 19,200 data sets that were drawn from 96 model conditions and performed more than 7.6 million rotations to examine the influence of (a) slope parameter…
Descriptors: Monte Carlo Methods, Item Response Theory, Correlation, Error of Measurement
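The Monte Carlo design above rests on simulating dichotomous responses from an M2PL model. A minimal sketch of that data-generating step, with hypothetical dimensions and parameter ranges (the counts and ranges below are illustrative, not those of the study):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical setup: 500 examinees, 10 items, 2 latent traits.
n_persons, n_items, n_dims = 500, 10, 2

theta = rng.standard_normal((n_persons, n_dims))    # latent traits
a = rng.uniform(0.8, 2.0, size=(n_items, n_dims))   # slope (discrimination) parameters
d = rng.uniform(-1.0, 1.0, size=n_items)            # item intercepts

# M2PL: P(X = 1 | theta) = logistic(theta . a' + d)
logits = theta @ a.T + d
p = 1.0 / (1.0 + np.exp(-logits))

# Draw dichotomous responses by comparing uniforms to the model probabilities.
responses = (rng.random((n_persons, n_items)) < p).astype(int)
print(responses.shape)
```

Each simulated data set of this kind would then be factor-analyzed and rotated to study local solutions.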
Peer reviewed
E. Damiano D'Urso; Jesper Tijmstra; Jeroen K. Vermunt; Kim De Roover – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Measurement invariance (MI) is required for validly comparing latent constructs measured by multiple ordinal self-report items. Non-invariances may occur when disregarding (group differences in) an acquiescence response style (ARS; an agreeing tendency regardless of item content). If non-invariance results solely from neglecting ARS, one should…
Descriptors: Error of Measurement, Structural Equation Models, Construct Validity, Measurement Techniques
Peer reviewed
Finch, Holmes – Applied Measurement in Education, 2022
Much research has been devoted to identification of differential item functioning (DIF), which occurs when the item responses for individuals from two groups differ after they are conditioned on the latent trait being measured by the scale. There has been less work examining differential step functioning (DSF), which is present for polytomous…
Descriptors: Comparative Analysis, Item Response Theory, Item Analysis, Simulation
Peer reviewed
Cooperman, Allison W.; Weiss, David J.; Wang, Chun – Educational and Psychological Measurement, 2022
Adaptive measurement of change (AMC) is a psychometric method for measuring intra-individual change on one or more latent traits across testing occasions. Three hypothesis tests--a Z test, likelihood ratio test, and score ratio index--have demonstrated desirable statistical properties in this context, including low false positive rates and high…
Descriptors: Error of Measurement, Psychometrics, Hypothesis Testing, Simulation
Peer reviewed
Sahin Kursad, Merve; Cokluk Bokeoglu, Omay; Cikrikci, Rahime Nukhet – International Journal of Assessment Tools in Education, 2022
Item parameter drift (IPD) is the systematic change in item parameter values over time, which can arise for various reasons. If it occurs in computerized adaptive tests (CAT), it introduces errors into the estimation of item and ability parameters. Identifying the conditions under which IPD arises in CAT is important for estimating item and…
Descriptors: Item Analysis, Computer Assisted Testing, Test Items, Error of Measurement
Peer reviewed
Koziol, Natalie A.; Goodrich, J. Marc; Yoon, HyeonJin – Educational and Psychological Measurement, 2022
Differential item functioning (DIF) is often used to examine validity evidence of alternate form test accommodations. Unfortunately, traditional approaches for evaluating DIF are prone to selection bias. This article proposes a novel DIF framework that capitalizes on regression discontinuity design analysis to control for selection bias. A…
Descriptors: Regression (Statistics), Item Analysis, Validity, Testing Accommodations
Hosseinzadeh, Mostafa – ProQuest LLC, 2021
In real-world situations, multidimensional data may appear on large-scale tests or attitudinal surveys. A simple-structure multidimensional model may be used to evaluate the items, ignoring the cross-loadings of some items on a secondary dimension. The purpose of this study was to investigate the influence of structure complexity magnitude of…
Descriptors: Item Response Theory, Models, Simulation, Evaluation Methods
Peer reviewed
Lu, Ru; Guo, Hongwen; Dorans, Neil J. – ETS Research Report Series, 2021
Two families of analysis methods can be used for differential item functioning (DIF) analysis. One family is DIF analysis based on observed scores, such as the Mantel-Haenszel (MH) and the standardized proportion-correct metric for DIF procedures; the other is analysis based on latent ability, in which the statistic is a measure of departure from…
Descriptors: Robustness (Statistics), Weighted Scores, Test Items, Item Analysis
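The observed-score family mentioned above includes the Mantel-Haenszel procedure, which pools 2x2 tables (group by item right/wrong) across strata of the matching variable, usually total score. A minimal sketch with hypothetical counts (the three tables below are invented for illustration):

```python
import numpy as np

# One 2x2 table per total-score stratum k:
# rows = reference/focal group, columns = item right/wrong.
tables = [
    np.array([[30, 20], [25, 25]]),
    np.array([[40, 10], [35, 15]]),
    np.array([[45,  5], [42,  8]]),
]

# MH common odds ratio: sum(A_k * D_k / N_k) / sum(B_k * C_k / N_k)
num = sum(t[0, 0] * t[1, 1] / t.sum() for t in tables)
den = sum(t[0, 1] * t[1, 0] / t.sum() for t in tables)
alpha_mh = num / den

# ETS delta scale: MH D-DIF = -2.35 * ln(alpha_MH);
# values near 0 indicate negligible DIF.
mh_d_dif = -2.35 * np.log(alpha_mh)
print(alpha_mh, mh_d_dif)
```

The latent-ability family instead compares estimated item response functions across groups, which is what makes the robustness comparison in the report interesting.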
Xu, Jie – ProQuest LLC, 2019
Research has shown that cross-sectional mediation analysis cannot accurately reflect a true longitudinal mediated effect. To investigate longitudinal mediated effects, different longitudinal mediation models have been proposed and these models focus on different research questions related to longitudinal mediation. When fitting mediation models to…
Descriptors: Case Studies, Error of Measurement, Longitudinal Studies, Models
Peer reviewed
Yesiltas, Gonca; Paek, Insu – Educational and Psychological Measurement, 2020
A log-linear model (LLM) is a well-known statistical method to examine the relationship among categorical variables. This study investigated the performance of LLM in detecting differential item functioning (DIF) for polytomously scored items via simulations where various sample sizes, ability mean differences (impact), and DIF types were…
Descriptors: Simulation, Sample Size, Item Analysis, Scores
Peer reviewed
Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…
Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics
Peer reviewed
Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022
When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…
Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis
Peer reviewed
Lee, HyeSun; Smith, Weldon Z. – Educational and Psychological Measurement, 2020
Based on the framework of testlet models, the current study suggests the Bayesian random block item response theory (BRB IRT) model to fit forced-choice formats where an item block is composed of three or more items. To account for local dependence among items within a block, the BRB IRT model incorporated a random block effect into the response…
Descriptors: Bayesian Statistics, Item Response Theory, Monte Carlo Methods, Test Format
Peer reviewed
Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020
Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…
Descriptors: Test Items, Goodness of Fit, Probability, Accuracy
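The sensitivity issue raised here can be made concrete: the RMSD weights squared deviations between the observed and model-implied item response curves by the proficiency density, so the same misfit produces different RMSD values under different proficiency distributions. A minimal numerical sketch (all parameter values and the injected misfit are hypothetical):

```python
import numpy as np

theta = np.linspace(-4, 4, 81)                  # proficiency grid
a, b = 1.2, 0.0                                 # international 2PL item parameters
p_model = 1 / (1 + np.exp(-a * (theta - b)))    # model-implied ICC

# Country-specific curve with misfit concentrated in the upper tail.
p_obs = p_model.copy()
p_obs[theta > 1.5] -= 0.10

def rmsd(p_obs, p_model, density):
    """RMSD between observed and model curves, weighted by f(theta)."""
    w = density / density.sum()
    return np.sqrt(np.sum(w * (p_obs - p_model) ** 2))

# Two countries with the same item misfit but different proficiency means.
f_low  = np.exp(-0.5 * (theta + 1.0) ** 2)      # lower-performing country
f_high = np.exp(-0.5 * (theta - 1.0) ** 2)      # higher-performing country

# The country whose density covers the misfit region gets the larger RMSD.
print(rmsd(p_obs, p_model, f_low), rmsd(p_obs, p_model, f_high))
```

Identical item-level misfit thus registers as much larger in the high-proficiency country, which is the sensitivity problem the paper analyzes.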