ERIC - Search Results

Showing 1 to 15 of 58 results

Combining Mokken Scale Analysis with Rasch Measurement Theory to Explore Differences in Measurement Quality between Subgroups

Peer reviewed

Stefanie A. Wind; Benjamin Lugu; Yurou Wang – International Journal of Testing, 2025

Mokken Scale Analysis (MSA) is a nonparametric approach that offers exploratory tools for understanding the nature of item responses while emphasizing invariance requirements. MSA is often discussed as it relates to Rasch measurement theory, which also emphasizes invariance, but uses parametric models. Researchers who have compared and combined…

Descriptors: Item Response Theory, Scaling, Surveys, Evaluation Methods

A Note on Standard Errors for Multidimensional Two-Parameter Logistic Models Using Gaussian Variational Estimation

Peer reviewed

Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…

Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis

IRT Observed-Score Equating for Rater-Mediated Assessments Using a Hierarchical Rater Model

Peer reviewed

Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025

While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…

Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity

Evaluating the Performance of Estimators in SEM and IRT with Ordinal Variables

Klauth, Bo – ProQuest LLC, 2023

In conducting confirmatory factor analysis with ordered response items, the literature suggests that when the number of responses is five and item skewness (IS) is approximately normal, researchers can employ maximum likelihood with robust standard errors (MLR). However, MLR can yield biased factor loadings (FL) and FL standard errors (FLSE) when…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Error of Measurement

Is the Area under Curve Appropriate for Evaluating the Fit of Psychometric Models?

Peer reviewed

Han, Yuting; Zhang, Jihong; Jiang, Zhehan; Shi, Dexin – Educational and Psychological Measurement, 2023

In the literature of modern psychometric modeling, mostly related to item response theory (IRT), the fit of model is evaluated through known indices, such as χ², M2, and root mean square error of approximation (RMSEA) for absolute assessments as well as Akaike information criterion (AIC), consistent AIC (CAIC), and Bayesian…

Descriptors: Goodness of Fit, Psychometrics, Error of Measurement, Item Response Theory

Resolving and Re-Scoring Constructed Response Items in Mixed-Format Assessments: An Exploration of Three Approaches

Peer reviewed

Stefanie A. Wind; Yangmeng Xu – Educational Assessment, 2024

We explored three approaches to resolving or re-scoring constructed-response items in mixed-format assessments: rater agreement, person fit, and targeted double scoring (TDS). We used a simulation study to consider how the three approaches impact the psychometric properties of student achievement estimates, with an emphasis on person fit. We found…

Descriptors: Interrater Reliability, Error of Measurement, Evaluation Methods, Examiners

Comparing MIMIC and MIMIC-Interaction to Alignment Methods for Investigating Measurement Invariance Concerning a Continuous Violator

Peer reviewed

Yuanfang Liu; Mark H. C. Lai; Ben Kelcey – Structural Equation Modeling: A Multidisciplinary Journal, 2024

Measurement invariance holds when a latent construct is measured in the same way across different levels of background variables (continuous or categorical) while controlling for the true value of that construct. Using Monte Carlo simulation, this paper compares the multiple indicators, multiple causes (MIMIC) model and MIMIC-interaction to a…

Descriptors: Classification, Accuracy, Error of Measurement, Correlation

Modified Item-Fit Indices for Dichotomous IRT Models with Missing Data

Peer reviewed

Xue Zhang; Chun Wang – Grantee Submission, 2022

Item-level fit analysis not only serves as a complementary check to global fit analysis, it is also essential in scale development because the fit results will guide item revision and/or deletion (Liu & Maydeu-Olivares, 2014). During data collection, missing response data may likely happen due to various reasons. Chi-square-based item fit…

Descriptors: Goodness of Fit, Item Response Theory, Scores, Test Length

Some Methods and Evaluation for Linking and Equating with Small Samples

Peer reviewed

Peabody, Michael R. – Applied Measurement in Education, 2020

The purpose of the current article is to introduce the equating and evaluation methods used in this special issue. Although a comprehensive review of all existing models and methodologies would be impractical given the format, a brief introduction to some of the more popular models will be provided. A brief discussion of the conditions required…

Descriptors: Evaluation Methods, Equated Scores, Sample Size, Item Response Theory

Evaluation of Structure Complexity Magnitude, Degree of Cross-Loading on Secondary Dimension and Model Specification on MIRT Parameter Estimation

Hosseinzadeh, Mostafa – ProQuest LLC, 2021

In real-world situations, multidimensional data may appear on large-scale tests or attitudinal surveys. A simple structure, multidimensional model may be used to evaluate the items, ignoring the cross-loading of some items on the secondary dimension. The purpose of this study was to investigate the influence of structure complexity magnitude of…

Descriptors: Item Response Theory, Models, Simulation, Evaluation Methods

Correction for Item Response Theory Latent Trait Measurement Error in Linear Mixed Effects Models

Peer reviewed

Wang, Chun; Xu, Gongjun; Zhang, Xue – Grantee Submission, 2019

When latent variables are used as outcomes in regression analysis, a common approach that is used to solve the ignored measurement error issue is to take a multilevel perspective on item response modeling (IRT). Although recent computational advancements allow efficient and accurate estimation of multilevel IRT models, we argue that a two-stage…

Descriptors: Error of Measurement, Item Response Theory, Regression (Statistics), Evaluation Methods

Optimal Linking Design for Response Model Parameters

Peer reviewed

Barrett, Michelle D.; van der Linden, Wim J. – Journal of Educational Measurement, 2017

Linking functions adjust for differences between identifiability restrictions used in different instances of the estimation of item response model parameters. These adjustments are necessary when results from those instances are to be compared. As linking functions are derived from estimated item response model parameters, parameter estimation…

Descriptors: Item Response Theory, Error of Measurement, Programming, Evaluation Methods

Differential Item Functioning Effect Size from the Multigroup Confirmatory Factor Analysis for a Meta-Analysis: A Simulation Study

Peer reviewed

Park, Sung Eun; Ahn, Soyeon; Zopluoglu, Cengiz – Educational and Psychological Measurement, 2021

This study presents a new approach to synthesizing differential item functioning (DIF) effect size: First, using correlation matrices from each study, we perform a multigroup confirmatory factor analysis (MGCFA) that examines measurement invariance of a test item between two subgroups (i.e., focal and reference groups). Then we synthesize, across…

Descriptors: Item Analysis, Effect Size, Difficulty Level, Monte Carlo Methods

Evaluating Local Independence in Rasch Models with WLSMV Global Fit Indices

Hyunsuk Han – ProQuest LLC, 2018

In Huggins-Manley & Han (2017), it was shown that WLSMV global model fit indices used in structural equation modeling practice are sensitive to person parameter estimate RMSE and item difficulty parameter estimate RMSE that results from local dependence in 2-PL IRT models, particularly when conditioning on number of test items and sample size.…

Descriptors: Models, Statistical Analysis, Item Response Theory, Evaluation Methods

From OLS to Multilevel Multidimensional Mixture IRT: A Model Refinement Approach to Investigating Patterns of Relationships in PISA 2012 Data

Gulsah Gurkan – ProQuest LLC, 2021

Secondary analyses of international large-scale assessments (ILSA) commonly characterize relationships between variables of interest using correlations. However, the accuracy of correlation estimates is impaired by artefacts such as measurement error and clustering. Despite advancements in methodology, conventional correlation estimates or…

Descriptors: Secondary School Students, Achievement Tests, International Assessment, Foreign Countries
