NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 9 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Kalinowski, Steven T. – Educational and Psychological Measurement, 2019
Item response theory (IRT) is a statistical paradigm for developing educational tests and assessing students. IRT, however, currently lacks an established graphical method for examining model fit for the three-parameter logistic model, the most flexible and popular IRT model in educational testing. A method is presented here to do this. The graph,…
Descriptors: Item Response Theory, Educational Assessment, Goodness of Fit, Probability
Peer reviewed Peer reviewed
Direct linkDirect link
Makransky, Guido; Mayer, Richard; Nøremølle, Anne; Cordoba, Ainara Lopez; Wandall, Jakob; Bonde, Mads – Educational Technology Research and Development, 2020
There is great potential in making assessment and learning complementary. In this study, we investigated the feasibility of developing a desktop virtual reality (VR) laboratory simulation on the topic of genetics, with integrated assessment using multiple choice questions based on item response theory (IRT) and feedback based on the cognitive…
Descriptors: Student Evaluation, Feedback (Response), Computer Simulation, Computer Uses in Education
Peer reviewed Peer reviewed
Liou, Michelle – Applied Psychological Measurement, 1993
Accuracy of three exact person tests for assessing model-data fit in the Rasch model was investigated in a simulation study. Empirical Type I error rates and statistical power of the person tests were computed. The exact person test conditioned on total score is a promising tool for assessing consistency of response patterns with the Rasch model.…
Descriptors: Comparative Analysis, Computer Simulation, Equations (Mathematics), Goodness of Fit
Liu, Xiufeng – 1992
The difference between compensatory and non-compensatory item response theory (IRT) models in terms of the dimensionality of test data generated by them, and its effect on the model-data-fit were examined. The STRESS (proportion of variance not accounted for by the multidimensional scaling model) and RSQ (proportion of variance accounted for by…
Descriptors: Chi Square, Comparative Analysis, Computer Simulation, Foreign Countries
Peer reviewed Peer reviewed
Smith, Richard M. – Educational and Psychological Measurement, 1991
This study reports results of an investigation based on simulated data of the distributional properties of the item fit statistics that are commonly used in the Rasch model calibration programs as indices of the fit of responses to individual items to the measurement model. (SLD)
Descriptors: Computer Simulation, Equations (Mathematics), Goodness of Fit, Item Response Theory
Peer reviewed Peer reviewed
Muraki, Eiji – Applied Psychological Measurement, 1990
This study examined the application of the marginal maximum likelihood-EM algorithm to the parameter estimation problems of the normal ogive and logistic polytomous response models for Likert-type items. A rating scale model, based on F. Samejima's (1969) graded response model, was developed. (TJH)
Descriptors: Algorithms, Computer Simulation, Equations (Mathematics), Goodness of Fit
Peer reviewed Peer reviewed
De Ayala, R. J.; And Others – Applied Measurement in Education, 1992
A study involving 1,000 simulated examinees compared the partial credit and graded response models in computerized adaptive testing (CAT). The graded response model fit the data well and provided slightly more accurate ability estimates than those of the partial credit model. Benefits of polytomous model-based CATs are discussed. (SLD)
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Computer Simulation
Peer reviewed Peer reviewed
Luecht, Richard M.; Miller, Timothy R. – Applied Psychological Measurement, 1992
A two-stage process that considers the multidimensionality of tests under the framework of unidimensional item response theory is described and evaluated. Findings from a simulation indicate that gains in estimation robustness and score interpretation are possible with almost no sacrifice of goodness of fit when using this two-stage approach. (SLD)
Descriptors: Cluster Analysis, Computer Simulation, Equations (Mathematics), Estimation (Mathematics)
Linacre, John M. – 1990
Advantages and disadvantages of standard Rasch analysis computer programs are discussed. The unconditional maximum likelihood algorithm allows all observations to participate equally in determining the measures and calibrations to be obtained quickly from a data set. On the advantage side, standard Rasch programs can be used immediately, are…
Descriptors: Algorithms, Computer Assisted Testing, Computer Graphics, Computer Simulation