Publication Date
In 2025: 4
Since 2024: 32
Since 2021 (last 5 years): 82
Since 2016 (last 10 years): 174
Since 2006 (last 20 years): 345
Descriptor
Error of Measurement: 445
Item Response Theory: 445
Test Items: 151
Simulation: 111
Models: 96
Comparative Analysis: 94
Computation: 87
Scores: 77
Statistical Analysis: 62
Goodness of Fit: 61
Sample Size: 60
Author
Cai, Li: 10
Kolen, Michael J.: 7
Lee, Won-Chan: 7
DeMars, Christine E.: 6
Finch, Holmes: 6
Sinharay, Sandip: 6
Wang, Wen-Chung: 6
van der Linden, Wim J.: 6
Wang, Chun: 5
Dimitrov, Dimiter M.: 5
Monroe, Scott: 5
Audience
Researchers: 3
Practitioners: 1
Location
New York: 5
Turkey: 5
Australia: 4
Taiwan: 4
China: 3
Germany: 3
Indonesia: 3
United Kingdom: 3
United Kingdom (England): 3
Canada: 2
Japan: 2
Laws, Policies, & Programs
No Child Left Behind Act 2001: 1
Stefanie A. Wind; Benjamin Lugu; Yurou Wang – International Journal of Testing, 2025
Mokken Scale Analysis (MSA) is a nonparametric approach that offers exploratory tools for understanding the nature of item responses while emphasizing invariance requirements. MSA is often discussed in relation to Rasch measurement theory, which also emphasizes invariance but uses parametric models. Researchers who have compared and combined…
Descriptors: Item Response Theory, Scaling, Surveys, Evaluation Methods
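Since this entry names a concrete statistic, here is a minimal numpy sketch of the scale-level Loevinger H coefficient that MSA is built around. It is an illustrative simplification (no standard errors, no tie handling), not the paper's procedure or the estimator in dedicated MSA software:

```python
import numpy as np

def loevinger_H(X):
    """Scale-level Loevinger H for a persons-by-items 0/1 matrix X.

    A Guttman error is passing the less popular (harder) item of a
    pair while failing the more popular one. H compares observed
    Guttman errors to those expected under item independence:
    H = 1 - sum(observed) / sum(expected).
    """
    p = X.mean(axis=0)            # item popularities
    order = np.argsort(-p)        # most popular (easiest) first
    Xs, ps = X[:, order], p[order]
    obs = exp = 0.0
    k = X.shape[1]
    for i in range(k - 1):
        for j in range(i + 1, k):
            # error pattern: fail easier item i, pass harder item j
            obs += np.mean((Xs[:, i] == 0) & (Xs[:, j] == 1))
            exp += (1 - ps[i]) * ps[j]
    return 1.0 - obs / exp
```

Values near 1 indicate strong scalability; MSA practice conventionally treats H of at least 0.3 as the lower bound for an acceptable scale.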
The Impact of Measurement Noninvariance across Time and Group in Longitudinal Item Response Modeling
In-Hee Choi – Asia Pacific Education Review, 2024
Longitudinal item response data often exhibit two types of measurement noninvariance: the noninvariance of item parameters between subject groups and that of item parameters across multiple time points. This study proposes a comprehensive approach to the simultaneous modeling of both types of measurement noninvariance in terms of longitudinal item…
Descriptors: Longitudinal Studies, Item Response Theory, Growth Models, Error of Measurement
William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024
Testing for differential item functioning (DIF) has recently undergone rapid statistical development. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity), and regularization has shown promising results for identifying DIF among…
Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement
Seyma Erbay Mermer – Pegem Journal of Education and Instruction, 2024
This study aims to compare item and student parameters of dichotomously scored multidimensional constructs estimated under unidimensional and multidimensional Item Response Theory (IRT) across different conditions of sample size, interdimensional correlation, and number of dimensions. This research, conducted with simulations, is of a basic…
Descriptors: Item Response Theory, Correlation, Error of Measurement, Comparative Analysis
Xiaowen Liu – International Journal of Testing, 2024
Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…
Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation
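Of the three methods named, Mantel-Haenszel is the simplest to sketch: examinees are stratified on a matching score and a common odds ratio is pooled across strata. A minimal illustration (numpy arrays in, no continuity corrections or zero-cell safeguards), not the authors' implementation:

```python
import numpy as np

def mh_ddif(item, group, total):
    """Mantel-Haenszel D-DIF for one dichotomous item.

    item  : 0/1 responses to the studied item
    group : 0 = reference, 1 = focal
    total : matching variable (here, raw total score)
    """
    num = den = 0.0
    for s in np.unique(total):
        m = total == s
        a = np.sum((group[m] == 0) & (item[m] == 1))  # reference, correct
        b = np.sum((group[m] == 0) & (item[m] == 0))  # reference, incorrect
        c = np.sum((group[m] == 1) & (item[m] == 1))  # focal, correct
        d = np.sum((group[m] == 1) & (item[m] == 0))  # focal, incorrect
        n = a + b + c + d
        if n > 0:
            num += a * d / n
            den += b * c / n
    alpha = num / den                # pooled (common) odds ratio
    return -2.35 * np.log(alpha)     # ETS delta scale (MH D-DIF)
```

By convention, negative values indicate DIF against the focal group, and ETS practice treats an absolute D-DIF of 1.5 or more (with statistical significance) as large.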
Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024
This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…
Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction
Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…
Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis
Jiangqiong Li – ProQuest LLC, 2024
When measuring latent constructs, such as language ability, we use statistical models to specify appropriate relationships between the latent construct and observed responses to test items. These models rely on theoretical assumptions to ensure accurate parameter estimates for valid inferences based on the test results. This dissertation…
Descriptors: Goodness of Fit, Item Response Theory, Models, Measurement Techniques
Güler Yavuz Temel – Journal of Educational Measurement, 2024
The purpose of this study was to investigate multidimensional DIF with simple and nonsimple structure in the context of the multidimensional Graded Response Model (MGRM). This study examined and compared the performance of the IRT-LR and Wald tests using MML-EM and MHRM estimation approaches with different test factors and test structures in…
Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models
Zsuzsa Bakk – Structural Equation Modeling: A Multidisciplinary Journal, 2024
A standard assumption of latent class (LC) analysis is conditional independence, that is, the items of the LC model are independent of the covariates given the LCs. Several approaches have been proposed for identifying violations of this assumption. The recently proposed likelihood ratio approach is compared to residual statistics (bivariate residuals…
Descriptors: Goodness of Fit, Error of Measurement, Comparative Analysis, Models
Christine E. DeMars; Paulius Satkus – Educational and Psychological Measurement, 2024
Marginal maximum likelihood, a common estimation method for item response theory models, is not inherently a Bayesian procedure. However, due to estimation difficulties, Bayesian priors are often applied to the likelihood when estimating 3PL models, especially with small samples. Little focus has been placed on choosing the priors for marginal…
Descriptors: Item Response Theory, Statistical Distributions, Error of Measurement, Bayesian Statistics
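To make "priors applied to the likelihood" concrete: in MML-EM estimation, the M-step objective for one item is the expected log-likelihood at the quadrature nodes, and a Bayesian prior simply adds a log-density penalty on the item parameters. A sketch, with illustrative priors (the Beta(5, 17) on the guessing parameter and the lognormal on the slope are assumptions for this example, not the paper's choices):

```python
import numpy as np
from scipy.stats import beta, norm

def penalized_m_step_objective(params, nodes, r1, r0):
    """Expected log-likelihood for one 3PL item plus log-priors.

    nodes  : quadrature points on the theta scale
    r1, r0 : E-step expected counts of correct / incorrect
             responses at each node for this item
    """
    a, b, c = params
    p = c + (1 - c) / (1 + np.exp(-1.7 * a * (nodes - b)))
    loglik = np.sum(r1 * np.log(p) + r0 * np.log(1 - p))
    logprior = beta.logpdf(c, 5, 17)            # pulls c toward its mode of 0.2
    logprior += norm.logpdf(np.log(a), 0, 0.5)  # lognormal prior on the slope
    return loglik + logprior
```

Maximizing this penalized objective (e.g., passing its negative to scipy.optimize.minimize) keeps the guessing parameter away from the boundary in small samples, which is the stabilizing role the abstract describes.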
Hung-Yu Huang – Educational and Psychological Measurement, 2025
The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing have led to greater focus on continuous response formats, which offer numerous advantages in both respondent…
Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability
Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
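For background on the equating machinery involved: once item parameters are estimated separately on two forms, a linear transformation places one theta scale onto the other. A minimal mean-sigma sketch using common-item difficulties (generic IRT linking, not the rater-error adjustment this study proposes):

```python
import numpy as np

def mean_sigma_link(b_new, b_old):
    """Mean-sigma linking from common-item difficulty estimates.

    Returns (A, B) such that theta_old = A * theta_new + B;
    difficulties transform as b' = A*b + B and slopes as a' = a/A.
    """
    A = np.std(b_old) / np.std(b_new)
    B = np.mean(b_old) - A * np.mean(b_new)
    return A, B
```

With (A, B) in hand, new-form person estimates map onto the old scale as A*theta + B; characteristic-curve methods such as Stocking-Lord and Haebara refine the same idea.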
Hoang V. Nguyen; Niels G. Waller – Educational and Psychological Measurement, 2024
We conducted an extensive Monte Carlo study of factor-rotation local solutions (LS) in multidimensional, two-parameter logistic (M2PL) item response models. In this study, we simulated more than 19,200 data sets that were drawn from 96 model conditions and performed more than 7.6 million rotations to examine the influence of (a) slope parameter…
Descriptors: Monte Carlo Methods, Item Response Theory, Correlation, Error of Measurement
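Generating data for such a Monte Carlo study follows directly from the M2PL response function, P(x = 1 | theta) = logistic(a'theta + d). A sketch of one two-dimensional replication (parameter values and the latent correlation are left to the caller; an illustration, not the authors' 96-condition design):

```python
import numpy as np

def simulate_m2pl(n_persons, slopes, intercepts, rho, seed=0):
    """Simulate 0/1 responses from a two-dimensional 2PL model.

    slopes     : (n_items, 2) slope matrix a
    intercepts : (n_items,) intercept vector d
    rho        : correlation between the two latent dimensions
    """
    rng = np.random.default_rng(seed)
    cov = np.array([[1.0, rho], [rho, 1.0]])
    theta = rng.multivariate_normal([0.0, 0.0], cov, size=n_persons)
    logits = theta @ slopes.T + intercepts         # (n_persons, n_items)
    prob = 1.0 / (1.0 + np.exp(-logits))
    return (rng.random(prob.shape) < prob).astype(int)
```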
Jiayi Deng – ProQuest LLC, 2024
Test score comparability in international large-scale assessments (LSAs) is of utmost importance in measuring the effectiveness of education systems and understanding the impact of education on economic growth. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic…
Descriptors: Item Response Theory, Scoring Rubrics, Scoring, Error of Measurement