Publication Date
In 2025 | 3
Since 2024 | 7
Since 2021 (last 5 years) | 16
Since 2016 (last 10 years) | 31
Since 2006 (last 20 years) | 57
Descriptor
Item Response Theory | 79
Robustness (Statistics) | 79
Test Items | 24
Simulation | 19
Models | 14
Computation | 12
Estimation (Mathematics) | 11
Item Analysis | 11
Maximum Likelihood Statistics | 11
Comparative Analysis | 10
Correlation | 10
Education Level
Higher Education | 12
Postsecondary Education | 6
Grade 7 | 5
Secondary Education | 5
Elementary Education | 4
Grade 4 | 3
Grade 8 | 3
Early Childhood Education | 2
Grade 3 | 2
Grade 6 | 2
Intermediate Grades | 2
Location
Australia | 2
United States | 2
Canada | 1
China (Shanghai) | 1
Delaware | 1
Hong Kong | 1
Mississippi | 1
Singapore | 1
Turkey | 1
United Kingdom (England) | 1
Washington | 1
Jiaying Xiao – ProQuest LLC, 2024
Multidimensional Item Response Theory (MIRT) has been widely used in educational and psychological assessments. It estimates multiple constructs simultaneously and models the correlations among latent constructs. While MIRT provides more accurate results, the unidimensional IRT model still dominates real applications. One major reason is that…
Descriptors: Item Response Theory, Algorithms, Computation, Efficiency
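For orientation, here is a minimal sketch (not from the dissertation) of the compensatory multidimensional 2PL item response function that MIRT models of this kind typically build on; the function name and all parameter values are illustrative only.

```python
import numpy as np

def mirt_prob(theta, a, d):
    """Compensatory multidimensional 2PL: P(x=1 | theta) = logistic(a.theta + d).

    theta : (n_dims,) latent trait vector for one respondent
    a     : (n_dims,) item discrimination (slope) vector
    d     : scalar item intercept (easiness)
    """
    return 1.0 / (1.0 + np.exp(-(np.dot(a, theta) + d)))

# Example: a respondent high on dimension 1, low on dimension 2
theta = np.array([1.2, -0.4])
a = np.array([1.0, 0.6])        # item loads on both dimensions
d = -0.3
print(mirt_prob(theta, a, d))   # probability of a correct response
```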
Christine E. DeMars; Paulius Satkus – Educational and Psychological Measurement, 2024
Marginal maximum likelihood, a common estimation method for item response theory models, is not inherently a Bayesian procedure. However, due to estimation difficulties, Bayesian priors are often applied to the likelihood when estimating 3PL models, especially with small samples. Little focus has been placed on choosing the priors for marginal…
Descriptors: Item Response Theory, Statistical Distributions, Error of Measurement, Bayesian Statistics
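To make the role of the prior concrete, the following sketch adds a Beta prior on the 3PL guessing parameter to an otherwise ordinary log-likelihood. It simplifies what the article studies: it conditions on known abilities rather than marginalizing over them as true MML does, and the Beta(5, 17) prior (mode near 0.2) is just one commonly cited choice, not necessarily the authors'.

```python
import numpy as np
from scipy.optimize import minimize

def logistic(z):
    return 1.0 / (1.0 + np.exp(-z))

def p3pl(theta, a, b, c):
    """3PL item response function."""
    return c + (1.0 - c) * logistic(a * (theta - b))

def neg_penalized_loglik(params, theta, x, beta_prior=(5.0, 17.0)):
    """Negative log-likelihood for one item, with a Beta prior on c."""
    a, b, c = params
    if not (0.0 < c < 1.0) or a <= 0:
        return np.inf
    p = p3pl(theta, a, b, c)
    loglik = np.sum(x * np.log(p) + (1 - x) * np.log(1 - p))
    alpha, beta = beta_prior
    log_prior = (alpha - 1) * np.log(c) + (beta - 1) * np.log(1 - c)
    return -(loglik + log_prior)

rng = np.random.default_rng(1)
theta = rng.normal(size=500)
x = (rng.random(500) < p3pl(theta, a=1.2, b=0.0, c=0.2)).astype(float)
fit = minimize(neg_penalized_loglik, x0=[1.0, 0.0, 0.15],
               args=(theta, x), method="Nelder-Mead")
print(fit.x)  # estimated (a, b, c)
```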
Hung-Yu Huang – Educational and Psychological Measurement, 2025
The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…
Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability
Tong Wu; Stella Y. Kim; Carl Westine; Michelle Boyer – Journal of Educational Measurement, 2025
While significant attention has been given to test equating to ensure score comparability, limited research has explored equating methods for rater-mediated assessments, where human raters inherently introduce error. If not properly addressed, these errors can undermine score interchangeability and test validity. This study proposes an equating…
Descriptors: Item Response Theory, Evaluators, Error of Measurement, Test Validity
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2025
To mitigate the potentially damaging consequences of rapid guessing (RG), a form of noneffortful responding, researchers have proposed a number of scoring approaches. The present simulation study examines the robustness of the most popular of these approaches, the unidimensional effort-moderated (EM) scoring procedure, to multidimensional RG (i.e.,…
Descriptors: Scoring, Guessing (Tests), Reaction Time, Item Response Theory
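The flag-and-exclude logic behind effort-moderated scoring fits in a few lines. This is a deliberately simplified, proportion-correct version (in practice the flagged responses are dropped from an IRT likelihood instead), and the threshold values are hypothetical.

```python
import numpy as np

def effort_moderated_score(responses, response_times, thresholds):
    """Effort-moderated (EM) scoring logic: responses faster than an item's
    rapid-guessing threshold are flagged as noneffortful and excluded.

    responses      : (n_items,) 0/1 scored responses
    response_times : (n_items,) response times in seconds
    thresholds     : (n_items,) per-item rapid-guessing thresholds
    Returns the proportion-correct score over effortful responses only.
    """
    responses = np.asarray(responses, dtype=float)
    effortful = np.asarray(response_times) >= np.asarray(thresholds)
    if not effortful.any():
        return np.nan  # no effortful responses left to score
    return responses[effortful].mean()

# Example: item 2 answered in 1.5 s against a 3 s threshold -> excluded
print(effort_moderated_score([1, 1, 0, 1], [12.0, 1.5, 20.0, 9.0],
                             [3.0, 3.0, 3.0, 3.0]))
```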
Lim, Hwanggyu; Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022
Differential item functioning (DIF) of test items should be evaluated using practical methods that can produce accurate and useful results. Among a plethora of DIF detection techniques, we introduce the new "Residual DIF" (RDIF) framework, which stands out for its accessibility without sacrificing efficacy. This framework consists of…
Descriptors: Test Items, Item Response Theory, Identification, Robustness (Statistics)
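The truncated abstract does not spell out the framework's components, but the core residual idea can be sketched: compare observed-minus-expected residuals across groups. The published RDIF statistics and their sampling distributions are in the article and are not reproduced here; all data below are simulated.

```python
import numpy as np

def logistic(z):
    return 1.0 / (1.0 + np.exp(-z))

def residual_dif(x_focal, theta_focal, x_ref, theta_ref, a, b):
    """Core idea behind residual-based DIF: the gap in mean raw residuals
    (observed response minus 2PL model-expected probability) between the
    focal and reference groups for one item. A nonzero gap suggests
    uniform DIF favoring one group."""
    res_focal = x_focal - logistic(a * (theta_focal - b))
    res_ref = x_ref - logistic(a * (theta_ref - b))
    return res_focal.mean() - res_ref.mean()

rng = np.random.default_rng(7)
theta_r, theta_f = rng.normal(size=1000), rng.normal(size=1000)
# Simulate uniform DIF: the item is harder (b shifted) for the focal group
x_r = (rng.random(1000) < logistic(theta_r - 0.0)).astype(float)
x_f = (rng.random(1000) < logistic(theta_f - 0.5)).astype(float)
print(residual_dif(x_f, theta_f, x_r, theta_r, a=1.0, b=0.0))
```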
Peter F. Halpin – Society for Research on Educational Effectiveness, 2024
Background: Meta-analyses of educational interventions have consistently documented the importance of methodological factors related to the choice of outcome measures. In particular, when interventions are evaluated using measures developed by researchers involved with the intervention or its evaluation, the effect sizes tend to be larger than…
Descriptors: College Students, College Faculty, STEM Education, Item Response Theory
von Davier, Matthias; Bezirhan, Ummugul – Educational and Psychological Measurement, 2023
Viable methods for the identification of item misfit or Differential Item Functioning (DIF) are central to scale construction and sound measurement. Many approaches rely on the derivation of a limiting distribution under the assumption that a certain model fits the data perfectly. Typical DIF assumptions such as the monotonicity and population…
Descriptors: Robustness (Statistics), Test Items, Item Analysis, Goodness of Fit
Wallin, Gabriel; Wiberg, Marie – Journal of Educational and Behavioral Statistics, 2023
This study explores the usefulness of covariates on equating test scores from nonequivalent test groups. The covariates are captured by an estimated propensity score, which is used as a proxy for latent ability to balance the test groups. The objective is to assess the sensitivity of the equated scores to various misspecifications in the…
Descriptors: Models, Error of Measurement, Robustness (Statistics), Equated Scores
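One way to operationalize "propensity score as a proxy for ability" is to weight one group by its estimated propensity odds before equating. The sketch below does that with a simple weighted linear equating; it illustrates the general idea only, not the authors' procedure, and the covariates, sample sizes, and score model are all made up.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(3)
n = 800
covs = rng.normal(size=(2 * n, 2))           # e.g., prior GPA, grade level
group = np.r_[np.zeros(n), np.ones(n)]       # 0 = took form X, 1 = form Y

# Step 1: estimate propensity scores (probability of being in group Y)
ps = LogisticRegression().fit(covs, group).predict_proba(covs)[:, 1]

# Step 2: weight group-X examinees toward group Y (propensity odds), then
# do a simple weighted linear equating of X onto Y's scale.
scores_x = rng.normal(50 + 5 * covs[:n, 0], 8)
scores_y = rng.normal(52 + 5 * covs[n:, 0], 8)
w = ps[:n] / (1 - ps[:n])                    # odds weights for form-X group
mx = np.average(scores_x, weights=w)
sx = np.sqrt(np.average((scores_x - mx) ** 2, weights=w))
equated = scores_y.std() / sx * (scores_x - mx) + scores_y.mean()
print(equated[:5])
```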
Manapat, Patrick D.; Edwards, Michael C. – Educational and Psychological Measurement, 2022
When fitting unidimensional item response theory (IRT) models, the population distribution of the latent trait (θ) is often assumed to be normally distributed. However, some psychological theories would suggest a nonnormal θ. For example, some clinical traits (e.g., alcoholism, depression) are believed to follow a positively skewed…
Descriptors: Robustness (Statistics), Computational Linguistics, Item Response Theory, Psychological Patterns
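An illustrative simulation of the setup the abstract describes: generate a positively skewed θ, standardize it, and produce 2PL responses that a normal-θ model would then be fit to. The distribution and parameter choices are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(11)

# A positively skewed latent trait, as hypothesized for some clinical
# constructs: a shifted log-normal, standardized to mean 0, SD 1.
theta_skewed = rng.lognormal(mean=0.0, sigma=0.8, size=5000)
theta_skewed = (theta_skewed - theta_skewed.mean()) / theta_skewed.std()

# The third standardized moment highlights the departure from N(0, 1)
skewness = np.mean(theta_skewed ** 3)   # ~0 for a normal distribution
print(f"skewness of simulated theta: {skewness:.2f}")

# Responses generated under a 2PL with this skewed theta; fitting a model
# that assumes normal theta to such data is what the study probes.
p = 1.0 / (1.0 + np.exp(-(theta_skewed - 0.0)))
x = (rng.random(5000) < p).astype(int)
print("proportion correct:", x.mean())
```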
Huang, Qi; Bolt, Daniel M. – Educational and Psychological Measurement, 2023
Previous studies have demonstrated evidence of latent skill continuity even in tests intentionally designed for measurement of binary skills. In addition, the assumption of binary skills when continuity is present has been shown to potentially create a lack of invariance in item and latent ability parameters that may undermine applications. In…
Descriptors: Item Response Theory, Test Items, Skill Development, Robustness (Statistics)
Wang, Shiyu; Xiao, Houping; Cohen, Allan – Journal of Educational and Behavioral Statistics, 2021
An adaptive weight estimation approach is proposed to provide robust latent ability estimation in computerized adaptive testing (CAT) with response revision. This approach assigns different weights to each distinct response to the same item when response revision is allowed in CAT. Two types of weight estimation procedures, nonfunctional and…
Descriptors: Computer Assisted Testing, Adaptive Testing, Computation, Robustness (Statistics)
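The weighting idea can be sketched as a weighted log-likelihood in which a revised item contributes one term per distinct response. How the weights themselves are estimated (the nonfunctional and functional procedures) is the article's contribution and is not reproduced; the weights and item parameters below are made up.

```python
import numpy as np
from scipy.optimize import minimize_scalar

def logistic(z):
    return 1.0 / (1.0 + np.exp(-z))

def weighted_theta_estimate(history, a, b):
    """Weighted MLE of theta when response revision is allowed in CAT.

    history : list of (item_index, response, weight) triples; an item that
              was answered and later revised contributes one triple per
              distinct response, each with its own weight.
    a, b    : arrays of 2PL item parameters
    """
    def neg_wll(theta):
        total = 0.0
        for j, x, w in history:
            p = logistic(a[j] * (theta - b[j]))
            total += w * (x * np.log(p) + (1 - x) * np.log(1 - p))
        return -total
    return minimize_scalar(neg_wll, bounds=(-4, 4), method="bounded").x

a = np.array([1.2, 0.8, 1.5])
b = np.array([-0.5, 0.3, 0.0])
# Item 1 was first answered 0, then revised to 1; the revision gets more weight
history = [(0, 1, 1.0), (1, 0, 0.3), (1, 1, 0.7), (2, 1, 1.0)]
print(weighted_theta_estimate(history, a, b))
```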
Zachary J. Roman; Patrick Schmidt; Jason M. Miller; Holger Brandt – Structural Equation Modeling: A Multidisciplinary Journal, 2024
Careless and insufficient effort responding (C/IER) occurs when participants respond to survey instruments without considering the item content. This phenomenon adds noise to the data, leading to erroneous inferences. There are multiple approaches to identifying and accounting for C/IER in survey settings; of these approaches, the best performing…
Descriptors: Structural Equation Models, Bayesian Statistics, Response Style (Tests), Robustness (Statistics)
Ranger, Jochen; Kuhn, Jörg-Tobias; Wolgast, Anett – Journal of Educational Measurement, 2021
Van der Linden's hierarchical model for responses and response times can be used to infer the ability and mental speed of test takers from their responses and response times in an educational test. A standard approach for this is maximum likelihood estimation. In real-world applications, the data of some test takers might be partly…
Descriptors: Models, Reaction Time, Item Response Theory, Tests
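A sketch of the joint likelihood the abstract refers to: a 2PL for responses plus a lognormal model for response times, with missing entries skipped (the natural ML treatment when data are missing at random). The parameterization follows the common form of van der Linden's model, but treat the details and all numbers as illustrative.

```python
import numpy as np

def joint_loglik(x, log_t, theta, tau, a, b, alpha, beta):
    """Joint log-likelihood of responses and response times for one test
    taker under a van der Linden-style hierarchical model:
      responses: 2PL with ability theta
      log times: Normal(beta_j - tau, sd = 1/alpha_j), tau = person speed
    Missing entries (np.nan) are simply skipped."""
    p = 1.0 / (1.0 + np.exp(-a * (theta - b)))
    ll = 0.0
    for j in range(len(x)):
        if not np.isnan(x[j]):
            ll += x[j] * np.log(p[j]) + (1 - x[j]) * np.log(1 - p[j])
        if not np.isnan(log_t[j]):
            sd = 1.0 / alpha[j]
            z = (log_t[j] - (beta[j] - tau)) / sd
            ll += -np.log(sd * np.sqrt(2 * np.pi)) - 0.5 * z * z
    return ll

x = np.array([1.0, np.nan, 0.0])          # second response missing
log_t = np.log(np.array([30.0, 45.0, 60.0]))
print(joint_loglik(x, log_t, theta=0.5, tau=0.1,
                   a=np.array([1.0, 1.2, 0.9]), b=np.array([0.0, 0.4, -0.2]),
                   alpha=np.array([1.5, 1.5, 1.5]),
                   beta=np.array([3.4, 3.7, 4.0])))
```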
Bürkner, Paul-Christian – Journal of Intelligence, 2020
Raven's Standard Progressive Matrices (SPM) test and related matrix-based tests are widely applied measures of cognitive ability. Using Bayesian Item Response Theory (IRT) models, I reanalyzed data from an SPM short form proposed by Myszkowski and Storme (2018) and, at the same time, illustrated the application of these models. Results indicate that…
Descriptors: Intelligence Tests, Matrices, Bayesian Statistics, Item Response Theory
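As a minimal illustration of the Bayesian IRT machinery involved, the sketch below computes a grid-approximated posterior over ability under a Rasch model with known item difficulties and a standard-normal prior. The article fits far richer models; this shows only the basic idea, with made-up responses and difficulties.

```python
import numpy as np

def posterior_theta_grid(x, b, grid=np.linspace(-4, 4, 401)):
    """Posterior over ability for one respondent under a Rasch model with
    known item difficulties b, a N(0, 1) prior on theta, and a grid
    approximation of the posterior density."""
    p = 1.0 / (1.0 + np.exp(-(grid[:, None] - b[None, :])))
    loglik = (x[None, :] * np.log(p)
              + (1 - x[None, :]) * np.log(1 - p)).sum(axis=1)
    log_post = loglik - 0.5 * grid ** 2      # add log N(0, 1) prior
    post = np.exp(log_post - log_post.max())
    post /= np.trapz(post, grid)             # normalize to a density
    return grid, post

x = np.array([1.0, 1.0, 0.0, 1.0, 0.0])      # scored matrix-type items
b = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])    # assumed difficulties
grid, post = posterior_theta_grid(x, b)
print("posterior mean ability:", np.trapz(grid * post, grid))
```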