Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 10 |
Descriptor
Simulation | 21 |
Item Response Theory | 17 |
Comparative Analysis | 6 |
Models | 6 |
Monte Carlo Methods | 6 |
Test Items | 6 |
Item Bias | 5 |
Bayesian Statistics | 4 |
Estimation (Mathematics) | 4 |
Evaluation Methods | 4 |
Sample Size | 4 |
More ▼ |
Source
Applied Psychological… | 10 |
Educational and Psychological… | 2 |
Journal of Educational and… | 2 |
International Journal of… | 1 |
International Journal of… | 1 |
Journal of Educational… | 1 |
Psychometrika | 1 |
Author
Cohen, Allan S. | 21 |
Kim, Seock-Ho | 9 |
Wollack, James A. | 4 |
Bolt, Daniel M. | 2 |
Cho, Sun-Joo | 2 |
Jang, Yoonsun | 2 |
Kang, Taehoon | 2 |
Choi, Youn-Jeng | 1 |
De Boeck, Paul | 1 |
DiStefano, Christine A. | 1 |
Eckerly, Carol A. | 1 |
More ▼ |
Publication Type
Journal Articles | 18 |
Reports - Research | 10 |
Reports - Evaluative | 9 |
Speeches/Meeting Papers | 5 |
Reports - Descriptive | 2 |
Education Level
Elementary Education | 1 |
Grade 3 | 1 |
Audience
Location
Florida | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Florida Comprehensive… | 1 |
What Works Clearinghouse Rating
Jang, Yoonsun; Cohen, Allan S. – Educational and Psychological Measurement, 2020
A nonconverged Markov chain can potentially lead to invalid inferences about model parameters. The purpose of this study was to assess the effect of a nonconverged Markov chain on the estimation of parameters for mixture item response theory models using a Markov chain Monte Carlo algorithm. A simulation study was conducted to investigate the…
Descriptors: Markov Processes, Item Response Theory, Accuracy, Inferences
Jang, Yoonsun; Kim, Seock-Ho; Cohen, Allan S. – Journal of Educational Measurement, 2018
This study investigates the effect of multidimensionality on extraction of latent classes in mixture Rasch models. In this study, two-dimensional data were generated under varying conditions. The two-dimensional data sets were analyzed with one- to five-class mixture Rasch models. Results of the simulation study indicate the mixture Rasch model…
Descriptors: Item Response Theory, Simulation, Correlation, Multidimensional Scaling
Lee, Sunbok; Choi, Youn-Jeng; Cohen, Allan S. – International Journal of Assessment Tools in Education, 2018
A simulation study is a useful tool in examining how validly item response theory (IRT) models can be applied in various settings. Typically, a large number of replications are required to obtain the desired precision. However, many standard software packages in IRT, such as MULTILOG and BILOG, are not well suited for a simulation study requiring…
Descriptors: Item Response Theory, Simulation, Replication (Evaluation), Automation
Wollack, James A.; Cohen, Allan S.; Eckerly, Carol A. – Educational and Psychological Measurement, 2015
Test tampering, especially on tests for educational accountability, is an unfortunate reality, necessitating that the state (or its testing vendor) perform data forensic analyses, such as erasure analyses, to look for signs of possible malfeasance. Few statistical approaches exist for detecting fraudulent erasures, and those that do largely do not…
Descriptors: Tests, Cheating, Item Response Theory, Accountability
Kang, Taehoon; Cohen, Allan S.; Sung, Hyun-Jung – Applied Psychological Measurement, 2009
This study examines the utility of four indices for use in model selection with nested and nonnested polytomous item response theory (IRT) models: a cross-validation index and three information-based indices. Four commonly used polytomous IRT models are considered: the graded response model, the generalized partial credit model, the partial credit…
Descriptors: Item Response Theory, Models, Selection, Simulation
Cho, Sun-Joo; Cohen, Allan S. – Journal of Educational and Behavioral Statistics, 2010
Mixture item response theory models have been suggested as a potentially useful methodology for identifying latent groups formed along secondary, possibly nuisance dimensions. In this article, we describe a multilevel mixture item response theory (IRT) model (MMixIRTM) that allows for the possibility that this nuisance dimensionality may function…
Descriptors: Simulation, Mathematics Tests, Item Response Theory, Student Behavior
Wells, Craig S.; Cohen, Allan S.; Patton, Jeffrey – International Journal of Testing, 2009
A primary concern with testing differential item functioning (DIF) using a traditional point-null hypothesis is that a statistically significant result does not imply that the magnitude of DIF is of practical interest. Similarly, for a given sample size, a non-significant result does not allow the researcher to conclude the item is free of DIF. To…
Descriptors: Test Bias, Test Items, Statistical Analysis, Hypothesis Testing
Li, Feiming; Cohen, Allan S.; Kim, Seock-Ho; Cho, Sun-Joo – Applied Psychological Measurement, 2009
This study examines model selection indices for use with dichotomous mixture item response theory (IRT) models. Five indices are considered: Akaike's information coefficient (AIC), Bayesian information coefficient (BIC), deviance information coefficient (DIC), pseudo-Bayes factor (PsBF), and posterior predictive model checks (PPMC). The five…
Descriptors: Item Response Theory, Models, Selection, Methods
Goegebeur, Yuri; De Boeck, Paul; Wollack, James A.; Cohen, Allan S. – Psychometrika, 2008
An item response theory model for dealing with test speededness is proposed. The model consists of two random processes, a problem solving process and a random guessing process, with the random guessing gradually taking over from the problem solving process. The involved change point and change rate are considered random parameters in order to…
Descriptors: Problem Solving, Item Response Theory, Models, Case Studies
Kang, Taehoon; Cohen, Allan S. – Applied Psychological Measurement, 2007
Fit of the model to the data is important if the benefits of item response theory (IRT) are to be obtained. In this study, the authors compared model selection results using the likelihood ratio test, two information-based criteria, and two Bayesian methods. An example illustrated the potential for inconsistency in model selection depending on…
Descriptors: Simulation, Item Response Theory, Comparative Analysis, Bayesian Statistics

Cohen, Allan S.; Kane, Michael T.; Kim, Seock-Ho – Applied Psychological Measurement, 2001
Discusses reasons why increasing the number of replications in Monte Carlo simulation studies is not necessary for satisfactory levels of precision and offers guidelines in the context of error tolerance analysis for determining how much precision is needed. (SLD)
Descriptors: Monte Carlo Methods, Simulation
Kim, Seock-Ho; Cohen, Allan S.; DiStefano, Christine A.; Kim, Sooyeon – 1998
Type I error rates of the likelihood ratio test for the detection of differential item functioning (DIF) in the partial credit model were investigated using simulated data. The partial credit model with four ordered performance levels was used to generate data sets of a 30-item test for samples of 300 and 1,000 simulated examinees. Three different…
Descriptors: Item Bias, Simulation, Test Items

Wollack, James A.; Bolt, Daniel M.; Cohen, Allan S.; Lee, Young-Sun – Applied Psychological Measurement, 2002
Compared the quality of item parameter estimates for marginal maximum likelihood (MML) and Markov Chain Monte Carlo (MCMC) with the nominal response model using simulation. The quality of item parameter recovery was nearly identical for MML and MCMC, and both methods tended to produce good estimates. (SLD)
Descriptors: Estimation (Mathematics), Markov Processes, Monte Carlo Methods, Simulation

Cohen, Allan S.; Kim, Seock-Ho – Applied Psychological Measurement, 1998
Studied results from five linking methods under the graded-response model using simulated data. Results show that differences in the linking coefficients are small. The five methods yielded similar results for longer common-item links with large sample sizes and when the distribution of item-location parameters matched the underlying trait…
Descriptors: Equated Scores, Estimation (Mathematics), Item Response Theory, Sample Size

Kim, Seock-Ho; Cohen, Allan S. – Applied Psychological Measurement, 1998
Compared three methods for developing a common metric under item response theory through simulation. For smaller numbers of common items, linking using the characteristic curve method yielded smaller root mean square differences for both item discrimination and difficulty parameters. For larger numbers of common items, the three methods were…
Descriptors: Comparative Analysis, Difficulty Level, Item Response Theory, Simulation
Previous Page | Next Page ยป
Pages: 1 | 2