NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 11 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Han, Kyung T. – Applied Psychological Measurement, 2013
Most computerized adaptive testing (CAT) programs do not allow test takers to review and change their responses because it could seriously deteriorate the efficiency of measurement and make tests vulnerable to manipulative test-taking strategies. Several modified testing methods have been developed that provide restricted review options while…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Tendeiro, Jorge N.; Meijer, Rob R. – Applied Psychological Measurement, 2013
To classify an item score pattern as not fitting a nonparametric item response theory (NIRT) model, the probability of exceedance (PE) of an observed response vector x can be determined as the sum of the probabilities of all response vectors that are, at most, as likely as x, conditional on the test's total score. Vector x is to be considered…
Descriptors: Probability, Nonparametric Statistics, Goodness of Fit, Test Length
Peer reviewed Peer reviewed
Direct linkDirect link
Veldkamp, Bernard P.; Matteucci, Mariagiulia; de Jong, Martijn G. – Applied Psychological Measurement, 2013
Item response theory parameters have to be estimated, and because of the estimation process, they do have uncertainty in them. In most large-scale testing programs, the parameters are stored in item banks, and automated test assembly algorithms are applied to assemble operational test forms. These algorithms treat item parameters as fixed values,…
Descriptors: Test Construction, Test Items, Item Banks, Automation
Peer reviewed Peer reviewed
Sachs, John – Applied Psychological Measurement, 1994
Use of the method of reciprocal biweighted means (MBM) for dealing with the outlier problem in dual scaling compared favorably with other robust estimation procedures, such as the method of trimmed reciprocal averages. Four datasets from previous studies were used. Advantages of the MBM are discussed. (SLD)
Descriptors: Estimation (Mathematics), Research Methodology, Robustness (Statistics), Scaling
Peer reviewed Peer reviewed
Hattie, John; And Others – Applied Psychological Measurement, 1996
A simulation study was conducted to evaluate the dependability of the "T" index of unidimensionality developed by W. F. Stout and used in his DIMTEST procedure. DIMTEST was found to provide dependable indications of unidimensionality, to be reasonably robust, and to allow for practical demarcation between one and many dimensions. (SLD)
Descriptors: Factor Analysis, Item Response Theory, Robustness (Statistics), Simulation
Peer reviewed Peer reviewed
Kirisci, Levent; Hsu, Tse-chi; Yu, Lifa – Applied Psychological Measurement, 2001
Studied the effects of test dimensionality, theta distribution shape, and estimation program (BILOG, MULTILOG, or XCALIBRE) on the accuracy of item and person parameter estimates through simulation. Derived guidelines for estimating parameters of multidimensional test items using unidimensional item response theory models. (SLD)
Descriptors: Ability, Computer Software, Estimation (Mathematics), Item Response Theory
Peer reviewed Peer reviewed
Sinar, Evan F.; Zickar, Michael J. – Applied Psychological Measurement, 2002
Examined the influence of deviant scale items on item parameter estimates of focal scale items and person parameter estimates through a comparison of item response theory (IRT) and classical test theory (CTT) models. Used Monte Carlo methods to explore results from a pilot investigation of job attitude data. Discusses implications for researchers…
Descriptors: Attitudes, Estimation (Mathematics), Monte Carlo Methods, Robustness (Statistics)
Peer reviewed Peer reviewed
Direct linkDirect link
Bonett, Douglas G. – Applied Psychological Measurement, 2006
Comparing variability of test scores across alternate forms, test conditions, or subpopulations is a fundamental problem in psychometrics. A confidence interval for a ratio of standard deviations is proposed that performs as well as the classic method with normal distributions and performs dramatically better with nonnormal distributions. A simple…
Descriptors: Intervals, Mathematical Concepts, Comparative Analysis, Psychometrics
Peer reviewed Peer reviewed
Zwinderman, Aeilko; van den Wollenberg, Arnold L. – Applied Psychological Measurement, 1990
Simulation studies (N=4,000 simulees) examined the effect of misspecification of the latent ability distribution (theta) on the accuracy and efficiency of marginal maximum likelihood (MML) item parameter estimates and on MML statistics to test sufficiency and conditional independence. Results were compared to those of the conditional maximum…
Descriptors: Comparative Analysis, Computer Simulation, Estimation (Mathematics), Item Response Theory
Peer reviewed Peer reviewed
Eiting, Mindert H. – Applied Psychological Measurement, 1991
A method is proposed for sequential evaluation of reliability of psychometric instruments. Sample size is unfixed; a test statistic is computed after each person is sampled and a decision is made in each stage of the sampling process. Results from a series of Monte-Carlo experiments establish the method's efficiency. (SLD)
Descriptors: Computer Simulation, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
Peer reviewed Peer reviewed
Luecht, Richard M.; Miller, Timothy R. – Applied Psychological Measurement, 1992
A two-stage process that considers the multidimensionality of tests under the framework of unidimensional item response theory is described and evaluated. Findings from a simulation indicate that gains in estimation robustness and score interpretation are possible with almost no sacrifice of goodness of fit when using this two-stage approach. (SLD)
Descriptors: Cluster Analysis, Computer Simulation, Equations (Mathematics), Estimation (Mathematics)