ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	4

Descriptor

Monte Carlo Methods	14
Test Length	14
Item Response Theory	9
Maximum Likelihood Statistics	6
Mathematical Models	5
Sample Size	5
Computation	4
Computer Simulation	4
Estimation (Mathematics)	4
Ability	3
Bayesian Statistics	3
Comparative Analysis	3
Error of Measurement	3
Markov Processes	3
Statistical Distributions	3
Adaptive Testing	2
Item Analysis	2
Item Bias	2
Models	2
Psychometrics	2
Simulation	2
Statistical Analysis	2
Test Bias	2
Test Interpretation	2
Test Items	2
More ▼

Source

Applied Psychological…	9
Educational and Psychological…	1
Journal of Educational…	1

Publication Type

Reports - Evaluative	14
Journal Articles	11
Speeches/Meeting Papers	4

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Marginal Maximum A Posteriori Item Parameter Estimation for the Generalized Graded Unfolding Model

Peer reviewed

Direct link

Roberts, James S.; Thompson, Vanessa M. – Applied Psychological Measurement, 2011

A marginal maximum a posteriori (MMAP) procedure was implemented to estimate item parameters in the generalized graded unfolding model (GGUM). Estimates from the MMAP method were compared with those derived from marginal maximum likelihood (MML) and Markov chain Monte Carlo (MCMC) procedures in a recovery simulation that varied sample size,…

Descriptors: Statistical Analysis, Markov Processes, Computation, Monte Carlo Methods

Recovery of Graded Response Model Parameters: A Comparison of Marginal Maximum Likelihood and Markov Chain Monte Carlo Estimation

Peer reviewed

Direct link

Kieftenbeld, Vincent; Natesan, Prathiba – Applied Psychological Measurement, 2012

Markov chain Monte Carlo (MCMC) methods enable a fully Bayesian approach to parameter estimation of item response models. In this simulation study, the authors compared the recovery of graded response model parameters using marginal maximum likelihood (MML) and Gibbs sampling (MCMC) under various latent trait distributions, test lengths, and…

Descriptors: Test Length, Markov Processes, Item Response Theory, Monte Carlo Methods

Simultaneous Estimation of Overall and Domain Abilities: A Higher-Order IRT Model Approach

Peer reviewed

Direct link

de la Torre, Jimmy; Song, Hao – Applied Psychological Measurement, 2009

Assessments consisting of different domains (e.g., content areas, objectives) are typically multidimensional in nature but are commonly assumed to be unidimensional for estimation purposes. The different domains of these assessments are further treated as multi-unidimensional tests for the purpose of obtaining diagnostic information. However, when…

Descriptors: Ability, Tests, Item Response Theory, Data Analysis

Bias of Exploratory and Cross-Validated DETECT Index under Unidimensionality

Peer reviewed

Direct link

Monahan, Patrick O.; Stump, Timothy E.; Finch, Holmes; Hambleton, Ronald K. – Applied Psychological Measurement, 2007

DETECT is a nonparametric "full" dimensionality assessment procedure that clusters dichotomously scored items into dimensions and provides a DETECT index of magnitude of multidimensionality. Four factors (test length, sample size, item response theory [IRT] model, and DETECT index) were manipulated in a Monte Carlo study of bias, standard error,…

Descriptors: Test Length, Sample Size, Monte Carlo Methods, Geometric Concepts

Comparing BILOG and LOGIST Estimates for Normal, Truncated Normal, and Beta Ability Distributions.

Download full text

Abdel-fattah, Abdel-fattah A. – 1994

The accuracy of estimation procedures in item response theory was studied using Monte Carlo methods and varying sample size, number of subjects, and distribution of ability parameters for: (1) joint maximum likelihood as implemented in the computer program LOGIST; (2) marginal maximum likelihood; and (3) marginal Bayesian procedures as implemented…

Descriptors: Ability, Bayesian Statistics, Estimation (Mathematics), Maximum Likelihood Statistics

Monte Carlo Evaluation of Implied Orders as a Basis for Tailored Testing.

Peer reviewed

Cudeck, Robert; And Others – Applied Psychological Measurement, 1979

TAILOR, a computer program which implements an approach to tailored testing, was examined by Monte Carlo methods. The evaluation showed the procedure to be highly reliable and capable of reducing the required number of tests items by about one half. (Author/JKS)

Descriptors: Adaptive Testing, Computer Programs, Feasibility Studies, Item Analysis

The Effect of Test Length and IRT Model on the Distribution and Stability of Three Appropriateness Indexes.

Peer reviewed

Noonan, Brian W.; And Others – Applied Psychological Measurement, 1992

Studied the extent to which three appropriateness indexes, Z(sub 3), ECIZ4, and W, are well standardized in a Monte Carlo study. The ECIZ4 most closely approximated a normal distribution, and its skewness and kurtosis were more stable and less affected by test length and item response theory model than the others. (SLD)

Descriptors: Comparative Analysis, Item Response Theory, Mathematical Models, Maximum Likelihood Statistics

Recovery of Marginal Maximum Likelihood Estimates in the Two-Parameter Logistic Response Model: An Evaluation of MULTILOG.

Peer reviewed

Stone, Clement A. – Applied Psychological Measurement, 1992

Monte Carlo methods are used to evaluate marginal maximum likelihood estimation of item parameters and maximum likelihood estimates of theta in the two-parameter logistic model for varying test lengths, sample sizes, and assumed theta distributions. Results with 100 datasets demonstrate the methods' general precision and stability. Exceptions are…

Descriptors: Computer Software Evaluation, Estimation (Mathematics), Mathematical Models, Maximum Likelihood Statistics

The Influence of Test Characteristics on the Detection of Aberrant Response Patterns.

Peer reviewed

Reise, Steven P.; Due, Allan M. – Applied Psychological Measurement, 1991

Previous person-fit research is extended through explication of an unexplored model for generating aberrant response patterns. The proposed model is then implemented to investigate the influence of test properties on the aberrancy detection power of a person-fit statistic. Difficulties of aberrancy detection are discussed. (SLD)

Descriptors: Algorithms, Computer Simulation, Item Response Theory, Mathematical Models

A Monte Carlo Study of Marginal Maximum Likelihood Parameter Estimates for the Graded Model.

Download full text

Ankenmann, Robert D.; Stone, Clement A. – 1992

Effects of test length, sample size, and assumed ability distribution were investigated in a multiple replication Monte Carlo study under the 1-parameter (1P) and 2-parameter (2P) logistic graded model with five score levels. Accuracy and variability of item parameter and ability estimates were examined. Monte Carlo methods were used to evaluate…

Descriptors: Computer Simulation, Estimation (Mathematics), Item Bias, Mathematical Models

The MIMIC Model as a Method for Detecting DIF: Comparison With Mantel-Haenszel, SIBTEST, and the IRT Likelihood Ratio

Peer reviewed

Direct link

Finch, Holmes – Applied Psychological Measurement, 2005

This study compares the ability of the multiple indicators, multiple causes (MIMIC) confirmatory factor analysis model to correctly identify cases of differential item functioning (DIF) with more established methods. Although the MIMIC model might have application in identifying DIF for multiple grouping variables, there has been little…

Descriptors: Identification, Factor Analysis, Test Bias, Models

Item Parameter Recovery, Standard Error Estimates, and Fit Statistics of the Winsteps Program for the Family of Rasch Models

Peer reviewed

Direct link

Wang, Wen-Chung; Chen, Cheng-Te – Educational and Psychological Measurement, 2005

This study investigates item parameter recovery, standard error estimates, and fit statistics yielded by the WINSTEPS program under the Rasch model and the rating scale model through Monte Carlo simulations. The independent variables were item response model, test length, and sample size. WINSTEPS yielded practically unbiased estimates for the…

Descriptors: Statistics, Test Length, Rating Scales, Item Response Theory

Monte Carlo Simulation Comparison of Two-Stage Testing and Computerized Adaptive Testing.

Download full text

Kim, Haeok; Plake, Barbara S. – 1993

A two-stage testing strategy is one method of adapting the difficulty of a test to an individual's ability level in an effort to achieve more precise measurement. A routing test provides an initial estimate of ability level, and a second-stage measurement test then evaluates the examinee further. The measurement accuracy and efficiency of item…

Descriptors: Ability, Adaptive Testing, Comparative Testing, Computer Assisted Testing

Thin versus Thick Matching in the Mantel-Haenszel Procedure for Detecting DIF.

Peer reviewed

Donoghue, John R.; Allen, Nancy L. – Journal of Educational Statistics, 1993

Forming the matching variable for the Mantel-Haenszel differential item functioning (DIF) procedure through use of the total score as the matching variable (thin) and forming the matching variable by pooling total score levels (thick) were compared in a Monte Carlo study. Reasons thick matching is superior are discussed. (SLD)

Descriptors: Comparative Analysis, Computer Simulation, Equations (Mathematics), Graphs

Finch, Holmes	2
Stone, Clement A.	2
Abdel-fattah, Abdel-fattah A.	1
Allen, Nancy L.	1
Ankenmann, Robert D.	1
Chen, Cheng-Te	1
Cudeck, Robert	1
Donoghue, John R.	1
Due, Allan M.	1
Hambleton, Ronald K.	1
Kieftenbeld, Vincent	1
Kim, Haeok	1
Monahan, Patrick O.	1
Natesan, Prathiba	1
Noonan, Brian W.	1
Plake, Barbara S.	1
Reise, Steven P.	1
Roberts, James S.	1
Song, Hao	1
Stump, Timothy E.	1
Thompson, Vanessa M.	1
Wang, Wen-Chung	1
de la Torre, Jimmy	1
More ▼