NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 9 results Save | Export
Peer reviewed Peer reviewed
van der Linden, Wim J. – Applied Psychological Measurement, 1979
The restrictions on item difficulties that must be met when binomial models are applied to domain-referenced testing are examined. Both a deterministic and a stochastic conception of item responses are discussed with respect to difficulty and Guttman-type items. (Author/BH)
Descriptors: Difficulty Level, Item Sampling, Latent Trait Theory, Mathematical Models
Peer reviewed Peer reviewed
Cicchetti, Domenic V.; Fleiss, Joseph L. – Applied Psychological Measurement, 1977
The weighted kappa coefficient is a measure of interrater agreement when the relative seriousness of each possible disagreement can be quantified. This monte carlo study demonstrates the utility of the kappa coefficient for ordinal data. Sample size is also briefly discussed. (Author/JKS)
Descriptors: Mathematical Models, Rating Scales, Reliability, Sampling
Peer reviewed Peer reviewed
Alsawalmeh, Yousef M.; Feldt, Leonard S. – Applied Psychological Measurement, 1992
An approximate statistical test is derived for the hypothesis that the intraclass reliability coefficients associated with two measurement procedures are equal. Control of Type 1 error is investigated by comparing empirical sampling distributions of the test statistic with its derived theoretical distribution. A numerical illustration is…
Descriptors: Equations (Mathematics), Hypothesis Testing, Mathematical Models, Measurement Techniques
Peer reviewed Peer reviewed
Raju, Nambury S. – Applied Psychological Measurement, 1990
The asymptotic sampling distributions (means and variances) are presented for the signed and unsigned estimates for the Rasch model, two-parameter model, and the three-parameter model with fixed lower asymptotes. Applications for item-bias research are discussed. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Item Bias, Item Response Theory
Peer reviewed Peer reviewed
Fowler, Robert L. – Applied Psychological Measurement, 1992
A Monte Carlo simulation explored how to optimize power in the extreme groups strategy when sampling from nonnormal distributions. Results show that the optimum percent for the extreme group selection was approximately the same for all population shapes, except the extremely platykurtic (uniform) distribution. (SLD)
Descriptors: Construct Validity, Equations (Mathematics), Mathematical Models, Monte Carlo Methods
Peer reviewed Peer reviewed
Berger, Martjin P. F. – Applied Psychological Measurement, 1991
A generalized variance criterion is proposed to measure efficiency in item-response-theory (IRT) models. Heuristic arguments are given to formulate the efficiency of a design in terms of an asymptotic generalized variance criterion. Efficiencies of designs for one-, two-, and three-parameter models are compared. (SLD)
Descriptors: Comparative Analysis, Efficiency, Equations (Mathematics), Error of Measurement
Peer reviewed Peer reviewed
Levin, Joel R.; Subkoviak, Michael J. – Applied Psychological Measurement, 1977
Textbook calculations of statistical power or sample size follow from formulas that assume that the variables under consideration are measured without error. However, in the real world of behavioral research, errors of measurement cannot be neglected. The determination of sample size is discussed, and an example illustrates blocking strategy.…
Descriptors: Analysis of Covariance, Analysis of Variance, Error of Measurement, Hypothesis Testing
Peer reviewed Peer reviewed
Eiting, Mindert H. – Applied Psychological Measurement, 1991
A method is proposed for sequential evaluation of reliability of psychometric instruments. Sample size is unfixed; a test statistic is computed after each person is sampled and a decision is made in each stage of the sampling process. Results from a series of Monte-Carlo experiments establish the method's efficiency. (SLD)
Descriptors: Computer Simulation, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
Peer reviewed Peer reviewed
Linn, Robert L.; Slinde, Jeffrey A. – Applied Psychological Measurement, 1979
This study investigated the adequacy of the Rasch model in equating existing standardized tests with groups of examinees not widely separated in ability. With the exception of one test pair and one grade level, the Rasch model using the anchor test procedure provided a reasonably satisfactory means of equating. (Author/CTM)
Descriptors: Equated Scores, Goodness of Fit, Intermediate Grades, Item Analysis