ERIC - Search Results

Publication Date

In 2025	1
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	17

Descriptor

Comparative Analysis	40
Models	24
Item Response Theory	18
Mathematical Models	15
Test Items	12
Simulation	8
Estimation (Mathematics)	7
Statistical Analysis	7
Test Bias	6
Equated Scores	5
Equations (Mathematics)	5
Item Analysis	5
Sample Size	5
Scores	5
Computer Simulation	4
Difficulty Level	4
Evaluation Methods	4
Factor Analysis	4
Latent Trait Theory	4
Reliability	4
Statistical Distributions	4
Test Reliability	4
Accuracy	3
Achievement Tests	3
Bayesian Statistics	3
More ▼

Source

Journal of Educational…

Publication Type

Journal Articles	38
Reports - Research	22
Reports - Evaluative	12
Reports - Descriptive	3
Guides - Non-Classroom	1
Information Analyses	1
Speeches/Meeting Papers	1

Education Level

High Schools	1
Higher Education	1
Postsecondary Education	1
Secondary Education	1

Audience

Researchers

Location

South Carolina

Laws, Policies, & Programs

Defunis v Odegaard

Assessments and Surveys

Comprehensive Tests of Basic…	1
General Educational…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 40 results Save | Export

Comparing and Combining IRTree Models and Anchoring Vignettes in Addressing Response Styles

Peer reviewed

Direct link

Mingfeng Xue; Ping Chen – Journal of Educational Measurement, 2025

Response styles pose great threats to psychological measurements. This research compares IRTree models and anchoring vignettes in addressing response styles and estimating the target traits. It also explores the potential of combining them at the item level and total-score level (ratios of extreme and middle responses to vignettes). Four models…

Descriptors: Item Response Theory, Models, Comparative Analysis, Vignettes

Bayesian Model Selection Methods for Multilevel IRT Models: A Comparison of Five DIC-Based Indices

Peer reviewed

Direct link

Zhang, Xue; Tao, Jian; Wang, Chun; Shi, Ning-Zhong – Journal of Educational Measurement, 2019

Model selection is important in any statistical analysis, and the primary goal is to find the preferred (or most parsimonious) model, based on certain criteria, from a set of candidate models given data. Several recent publications have employed the deviance information criterion (DIC) to do model selection among different forms of multilevel item…

Descriptors: Bayesian Statistics, Item Response Theory, Measurement, Models

An Alternative to the 3PL: Using Asymmetric Item Characteristic Curves to Address Guessing Effects

Peer reviewed

Direct link

Lee, Sora; Bolt, Daniel M. – Journal of Educational Measurement, 2018

Both the statistical and interpretational shortcomings of the three-parameter logistic (3PL) model in accommodating guessing effects on multiple-choice items are well documented. We consider the use of a residual heteroscedasticity (RH) model as an alternative, and compare its performance to the 3PL with real test data sets and through simulation…

Descriptors: Statistical Analysis, Models, Guessing (Tests), Multiple Choice Tests

Scale Alignment in Between-Item Multidimensional Rasch Models

Peer reviewed

Direct link

Feuerstahler, Leah; Wilson, Mark – Journal of Educational Measurement, 2019

Scores estimated from multidimensional item response theory (IRT) models are not necessarily comparable across dimensions. In this article, the concept of aligned dimensions is formalized in the context of Rasch models, and two methods are described--delta dimensional alignment (DDA) and logistic regression alignment (LRA)--to transform estimated…

Descriptors: Item Response Theory, Models, Scores, Comparative Analysis

A Comparison of Strategies for Smoothing Parameter Selection for Mixed-Format Tests under the Random Groups Design

Peer reviewed

Direct link

Liu, Chunyan; Kolen, Michael J. – Journal of Educational Measurement, 2018

Smoothing techniques are designed to improve the accuracy of equating functions. The main purpose of this study is to compare seven model selection strategies for choosing the smoothing parameter (C) for polynomial loglinear presmoothing and one procedure for model selection in cubic spline postsmoothing for mixed-format pseudo tests under the…

Descriptors: Comparative Analysis, Accuracy, Models, Sample Size

Lord's Wald Test for Detecting Dif in Multidimensional Irt Models: A Comparison of Two Estimation Approaches

Peer reviewed

Direct link

Lee, Soo; Suh, Youngsuk – Journal of Educational Measurement, 2018

Lord's Wald test for differential item functioning (DIF) has not been studied extensively in the context of the multidimensional item response theory (MIRT) framework. In this article, Lord's Wald test was implemented using two estimation approaches, marginal maximum likelihood estimation and Bayesian Markov chain Monte Carlo estimation, to detect…

Descriptors: Item Response Theory, Sample Size, Models, Error of Measurement

IRT-Estimated Reliability for Tests Containing Mixed Item Formats

Peer reviewed

Direct link

Shu, Lianghua; Schwarz, Richard D. – Journal of Educational Measurement, 2014

As a global measure of precision, item response theory (IRT) estimated reliability is derived for four coefficients (Cronbach's a, Feldt-Raju, stratified a, and marginal reliability). Models with different underlying assumptions concerning test-part similarity are discussed. A detailed computational example is presented for the targeted…

Descriptors: Item Response Theory, Reliability, Models, Computation

An Odds Ratio Approach for Detecting DDF under the Nested Logit Modeling Framework

Peer reviewed

Direct link

Terzi, Ragip; Suh, Youngsuk – Journal of Educational Measurement, 2015

An odds ratio approach (ORA) under the framework of a nested logit model was proposed for evaluating differential distractor functioning (DDF) in multiple-choice items and was compared with an existing ORA developed under the nominal response model. The performances of the two ORAs for detecting DDF were investigated through an extensive…

Descriptors: Test Bias, Multiple Choice Tests, Test Items, Comparative Analysis

Diagnostic Profiles: A Standard Setting Method for Use with a Cognitive Diagnostic Model

Peer reviewed

Direct link

Skaggs, Gary; Hein, Serge F.; Wilkins, Jesse L. M. – Journal of Educational Measurement, 2016

This article introduces the Diagnostic Profiles (DP) standard setting method for setting a performance standard on a test developed from a cognitive diagnostic model (CDM), the outcome of which is a profile of mastered and not-mastered skills or attributes rather than a single test score. In the DP method, the key judgment task for panelists is a…

Descriptors: Models, Standard Setting, Profiles, Diagnostic Tests

Differential Item Functioning Assessment in Cognitive Diagnostic Modeling: Application of the Wald Test to Investigate DIF in the DINA Model

Peer reviewed

Direct link

Hou, Likun; de la Torre, Jimmy; Nandakumar, Ratna – Journal of Educational Measurement, 2014

Analyzing examinees' responses using cognitive diagnostic models (CDMs) has the advantage of providing diagnostic information. To ensure the validity of the results from these models, differential item functioning (DIF) in CDMs needs to be investigated. In this article, the Wald test is proposed to examine DIF in the context of CDMs. This study…

Descriptors: Test Bias, Models, Simulation, Error Patterns

A Comparison of Item Calibration Procedures in the Presence of Test Speededness

Peer reviewed

Direct link

Suh, Youngsuk; Cho, Sun-Joo; Wollack, James A. – Journal of Educational Measurement, 2012

In the presence of test speededness, the parameter estimates of item response theory models can be poorly estimated due to conditional dependencies among items, particularly for end-of-test items (i.e., speeded items). This article conducted a systematic comparison of five-item calibration procedures--a two-parameter logistic (2PL) model, a…

Descriptors: Response Style (Tests), Timed Tests, Test Items, Item Response Theory

Classification Consistency and Accuracy for Complex Assessments Using Item Response Theory

Peer reviewed

Direct link

Lee, Won-Chan – Journal of Educational Measurement, 2010

In this article, procedures are described for estimating single-administration classification consistency and accuracy indices for complex assessments using item response theory (IRT). This IRT approach was applied to real test data comprising dichotomous and polytomous items. Several different IRT model combinations were considered. Comparisons…

Descriptors: Classification, Item Response Theory, Comparative Analysis, Models

A Comparison of Chained Linear and Poststratification Linear Equating under Different Testing Conditions

Peer reviewed

Direct link

Puhan, Gautam – Journal of Educational Measurement, 2010

In this study I compared results of chained linear, Tucker, and Levine-observed score equatings under conditions where the new and old forms samples were similar in ability and also when they were different in ability. The length of the anchor test was also varied to examine its effect on the three different equating methods. The three equating…

Descriptors: Testing, Equated Scores, Comparative Analysis, Causal Models

Estimation Methods for One-Parameter Testlet Models

Peer reviewed

Direct link

Jiao, Hong; Wang, Shudong; He, Wei – Journal of Educational Measurement, 2013

This study demonstrated the equivalence between the Rasch testlet model and the three-level one-parameter testlet model and explored the Markov Chain Monte Carlo (MCMC) method for model parameter estimation in WINBUGS. The estimation accuracy from the MCMC method was compared with those from the marginalized maximum likelihood estimation (MMLE)…

Descriptors: Computation, Item Response Theory, Models, Monte Carlo Methods

RIM: A Random Item Mixture Model to Detect Differential Item Functioning

Peer reviewed

Direct link

Frederickx, Sofie; Tuerlinckx, Francis; De Boeck, Paul; Magis, David – Journal of Educational Measurement, 2010

In this paper we present a new methodology for detecting differential item functioning (DIF). We introduce a DIF model, called the random item mixture (RIM), that is based on a Rasch model with random item difficulties (besides the common random person abilities). In addition, a mixture model is assumed for the item difficulties such that the…

Descriptors: Test Bias, Models, Test Items, Difficulty Level

Previous Page | Next Page »

Pages: 1 | 2 | 3

Kolen, Michael J.	4
Suh, Youngsuk	3
Kane, Michael T.	2
Marsh, Herbert W.	2
Nandakumar, Ratna	2
Al-Karni, Ali	1
Baker, Frank B.	1
Bejar, Isaac I.	1
Beretvas, S. Natasha	1
Bolt, Daniel M.	1
Breland, Hunter M.	1
Brennan, Robert L.	1
Cho, Sun-Joo	1
Cohen, Allan S.	1
De Boeck, Paul	1
DeMars, Christine E.	1
Feuerstahler, Leah	1
Frederickx, Sofie	1
Hanna, Gila	1
Hanson, Bradley A.	1
He, Wei	1
Hein, Serge F.	1
Hirsch, Thomas M.	1
Hocevar, Dennis	1
More ▼