ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	4

Descriptor

Robustness (Statistics)	11
Estimation (Mathematics)	6
Item Response Theory	6
Test Items	4
Computer Simulation	3
Simulation	3
Test Construction	3
Adaptive Testing	2
Comparative Analysis	2
Computer Assisted Testing	2
Equations (Mathematics)	2
Goodness of Fit	2
Mathematical Models	2
Monte Carlo Methods	2
Probability	2
Psychometrics	2
Sample Size	2
Scores	2
Ability	1
Adolescents	1
Attitudes	1
Automation	1
Cluster Analysis	1
College Entrance Examinations	1
Computation	1
More ▼

Source

Applied Psychological…

Publication Type

Journal Articles	11
Reports - Evaluative	5
Reports - Research	5
Reports - Descriptive	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Item Pocket Method to Allow Response Review and Change in Computerized Adaptive Testing

Peer reviewed

Direct link

Han, Kyung T. – Applied Psychological Measurement, 2013

Most computerized adaptive testing (CAT) programs do not allow test takers to review and change their responses because it could seriously deteriorate the efficiency of measurement and make tests vulnerable to manipulative test-taking strategies. Several modified testing methods have been developed that provide restricted review options while…

Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Testing

The Probability of Exceedance as a Nonparametric Person-Fit Statistic for Tests of Moderate Length

Peer reviewed

Direct link

Tendeiro, Jorge N.; Meijer, Rob R. – Applied Psychological Measurement, 2013

To classify an item score pattern as not fitting a nonparametric item response theory (NIRT) model, the probability of exceedance (PE) of an observed response vector x can be determined as the sum of the probabilities of all response vectors that are, at most, as likely as x, conditional on the test's total score. Vector x is to be considered…

Descriptors: Probability, Nonparametric Statistics, Goodness of Fit, Test Length

Uncertainties in the Item Parameter Estimates and Robust Automated Test Assembly

Peer reviewed

Direct link

Veldkamp, Bernard P.; Matteucci, Mariagiulia; de Jong, Martijn G. – Applied Psychological Measurement, 2013

Item response theory parameters have to be estimated, and because of the estimation process, they do have uncertainty in them. In most large-scale testing programs, the parameters are stored in item banks, and automated test assembly algorithms are applied to assemble operational test forms. These algorithms treat item parameters as fixed values,…

Descriptors: Test Construction, Test Items, Item Banks, Automation

Robust Dual Scaling Weights with Tukey's Biweight.

Peer reviewed

Sachs, John – Applied Psychological Measurement, 1994

Use of the method of reciprocal biweighted means (MBM) for dealing with the outlier problem in dual scaling compared favorably with other robust estimation procedures, such as the method of trimmed reciprocal averages. Four datasets from previous studies were used. Advantages of the MBM are discussed. (SLD)

Descriptors: Estimation (Mathematics), Research Methodology, Robustness (Statistics), Scaling

An Assessment of Stout's Index of Essential Unidimensionality.

Peer reviewed

Hattie, John; And Others – Applied Psychological Measurement, 1996

A simulation study was conducted to evaluate the dependability of the "T" index of unidimensionality developed by W. F. Stout and used in his DIMTEST procedure. DIMTEST was found to provide dependable indications of unidimensionality, to be reasonably robust, and to allow for practical demarcation between one and many dimensions. (SLD)

Descriptors: Factor Analysis, Item Response Theory, Robustness (Statistics), Simulation

Robustness of Item Parameter Estimation Programs to Assumptions of Unidimensionality and Normality.

Peer reviewed

Kirisci, Levent; Hsu, Tse-chi; Yu, Lifa – Applied Psychological Measurement, 2001

Studied the effects of test dimensionality, theta distribution shape, and estimation program (BILOG, MULTILOG, or XCALIBRE) on the accuracy of item and person parameter estimates through simulation. Derived guidelines for estimating parameters of multidimensional test items using unidimensional item response theory models. (SLD)

Descriptors: Ability, Computer Software, Estimation (Mathematics), Item Response Theory

Evaluating the Robustness of Graded Response Model and Classical Test Theory Parameter Estimates to Deviant Items.

Peer reviewed

Sinar, Evan F.; Zickar, Michael J. – Applied Psychological Measurement, 2002

Examined the influence of deviant scale items on item parameter estimates of focal scale items and person parameter estimates through a comparison of item response theory (IRT) and classical test theory (CTT) models. Used Monte Carlo methods to explore results from a pilot investigation of job attitude data. Discusses implications for researchers…

Descriptors: Attitudes, Estimation (Mathematics), Monte Carlo Methods, Robustness (Statistics)

Robust Confidence Interval for a Ratio of Standard Deviations

Peer reviewed

Direct link

Bonett, Douglas G. – Applied Psychological Measurement, 2006

Comparing variability of test scores across alternate forms, test conditions, or subpopulations is a fundamental problem in psychometrics. A confidence interval for a ratio of standard deviations is proposed that performs as well as the classic method with normal distributions and performs dramatically better with nonnormal distributions. A simple…

Descriptors: Intervals, Mathematical Concepts, Comparative Analysis, Psychometrics

Robustness of Marginal Maximum Likelihood Estimation in the Rasch Model.

Peer reviewed

Zwinderman, Aeilko; van den Wollenberg, Arnold L. – Applied Psychological Measurement, 1990

Simulation studies (N=4,000 simulees) examined the effect of misspecification of the latent ability distribution (theta) on the accuracy and efficiency of marginal maximum likelihood (MML) item parameter estimates and on MML statistics to test sufficiency and conditional independence. Results were compared to those of the conditional maximum…

Descriptors: Comparative Analysis, Computer Simulation, Estimation (Mathematics), Item Response Theory

Sequential Reliability Tests.

Peer reviewed

Eiting, Mindert H. – Applied Psychological Measurement, 1991

A method is proposed for sequential evaluation of reliability of psychometric instruments. Sample size is unfixed; a test statistic is computed after each person is sampled and a decision is made in each stage of the sampling process. Results from a series of Monte-Carlo experiments establish the method's efficiency. (SLD)

Descriptors: Computer Simulation, Equations (Mathematics), Estimation (Mathematics), Mathematical Models

Unidimensional Calibrations and Interpretations of Composite Traits for Multidimensional Tests.

Peer reviewed

Luecht, Richard M.; Miller, Timothy R. – Applied Psychological Measurement, 1992

A two-stage process that considers the multidimensionality of tests under the framework of unidimensional item response theory is described and evaluated. Findings from a simulation indicate that gains in estimation robustness and score interpretation are possible with almost no sacrifice of goodness of fit when using this two-stage approach. (SLD)

Descriptors: Cluster Analysis, Computer Simulation, Equations (Mathematics), Estimation (Mathematics)

Bonett, Douglas G.	1
Eiting, Mindert H.	1
Han, Kyung T.	1
Hattie, John	1
Hsu, Tse-chi	1
Kirisci, Levent	1
Luecht, Richard M.	1
Matteucci, Mariagiulia	1
Meijer, Rob R.	1
Miller, Timothy R.	1
Sachs, John	1
Sinar, Evan F.	1
Tendeiro, Jorge N.	1
Veldkamp, Bernard P.	1
Yu, Lifa	1
Zickar, Michael J.	1
Zwinderman, Aeilko	1
de Jong, Martijn G.	1
van den Wollenberg, Arnold L.	1
More ▼