NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 11 results Save | Export
Peer reviewed Peer reviewed
Lee, Guemin; Dunbar, Stephen B.; Frisbie, David A. – Educational and Psychological Measurement, 2001
Conceptualized eight different types of measurement models for a test composed of testlets and studied the goodness of fit of those models to data using data from the Iowa Tests of Basic Skills and simulated data. The essentially tau-equivalent model and the congeneric model provided worse model fit than the other measurement models. (SLD)
Descriptors: Goodness of Fit, Measurement Techniques, Models, Scores
Peer reviewed Peer reviewed
Wang, Wen-chung – Journal of Outcome Measurement, 1998
A Rasch-type analysis is presented for multiple-choice items in which one parameter is assigned to each distractor. Results of a small simulation study show that the parameter recovery of the distractor model is very satisfactory. Analysis of a real dataset shows that some items fit the Rasch model rather than the distractor model. (SLD)
Descriptors: Distractors (Tests), Goodness of Fit, Item Response Theory, Multiple Choice Tests
Peer reviewed Peer reviewed
Rost, Jurgen; von Davier, Matthias – Applied Psychological Measurement, 1994
A new item-fit index is proposed that is both a descriptive measure of deviance of single items and an index for statistical inference. This index is based on assumptions of the dichotomous and polytomous Rasch models for items with ordered categories. A simulation study is described. (SLD)
Descriptors: Equations (Mathematics), Goodness of Fit, Item Response Theory, Simulation
Rudas, Tamas; Zwick, Rebecca – 1995
A method is proposed to assess the importance of differential item functioning (DIF) by estimating the largest possible fraction of the population in which DIF does not occur, or equivalently, the smallest possible portion of the population in which DIF may occur. The approach is based on latent class (C. C. Clogg, 1981) or mixture concepts, and…
Descriptors: Estimation (Mathematics), Goodness of Fit, Item Bias, Maximum Likelihood Statistics
McKinley, Robert – 1989
A confirmatory approach to assessing test structure using multidimensional item response theory (MIRT) was developed and evaluated. The approach involved adding to the exponent of the MIRT model an item structure matrix that allows the user to specify the ability dimensions measured by an item. Various combinations of item structures were fit to…
Descriptors: Ability, Chi Square, Goodness of Fit, Item Response Theory
Dinero, Thomas E.; Haertel, Edward – 1976
This paper will discuss the results of a series of computer simulations comparing the Rasch logistic model to a series of models departing to various degrees from its assumption of equal discrimination power for all items. The results have implications for test construction and test scoring, indicating how closely the conventional raw score…
Descriptors: Comparative Analysis, Computer Programs, Goodness of Fit, Individual Differences
Peer reviewed Peer reviewed
Reckase, Mark D. – Journal of Educational Statistics, 1979
Since all commonly used latent trait models assume a unidimensional test, the applicability of the procedure to obviously multidimensional tests is questionable. This paper presents the results of the application of latent trait, traditional, and factor analyses to a series of actual and hypothetical tests that vary in factoral complexity.…
Descriptors: Achievement Tests, Factor Analysis, Goodness of Fit, Higher Education
Stone, Gregory Ethan – 1994
The quality of fit between the data and the measurement model is fundamental to any discussion of results. Fit has been the subject of inquiry since as early as the 1920s. Most early explorations concentrated on assessing global fit or subset fits on fixed length, traditional paper and pencil tests given as a single unit. The detection of aberrant…
Descriptors: Adaptive Testing, Computer Assisted Testing, Educational Assessment, Educational History
Hambleton, Ronald K.; Cook, Linda L. – 1978
The purpose of the present research was to study, systematically, the "goodness-of-fit" of the one-, two-, and three-parameter logistic models. We studied, using computer-simulated test data, the effects of four variables: variation in item discrimination parameters, the average value of the pseudo-chance level parameters, test length,…
Descriptors: Career Development, Difficulty Level, Goodness of Fit, Item Analysis
Tang, Huixing – 1994
This paper describes an item response theory (IRT) based method of differential item functioning (DIF) detection that involves neither separate calibration nor ability grouping. IRT is used to generate residual scores, scores free of the effects of person or group ability and item difficulty. Analysis of variance is then used to test the group…
Descriptors: Ability Grouping, Analysis of Variance, Goodness of Fit, Identification
Curry, Allen R.; And Others – 1978
The efficacy of employing subsets of items from a calibrated item pool to estimate the Rasch model person parameters was investigated. Specifically, the degree of invariance of Rasch model ability-parameter estimates was examined across differing collections of simulated items. The ability-parameter estimates were obtained from a simulation of…
Descriptors: Career Development, Difficulty Level, Equated Scores, Error of Measurement