ERIC - Search Results

Source

Applied Psychological…

Author

Alsawalmeh, Yousef M.	1
Berger, Martjin P. F.	1
Cicchetti, Domenic V.	1
Eiting, Mindert H.	1
Feldt, Leonard S.	1
Fleiss, Joseph L.	1
Fowler, Robert L.	1
Levin, Joel R.	1
Linn, Robert L.	1
Raju, Nambury S.	1
Slinde, Jeffrey A.	1
Subkoviak, Michael J.	1
van der Linden, Wim J.	1
More ▼

Publication Type

Journal Articles	7
Reports - Evaluative	5
Reports - General	1
Reports - Research	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Binomial Test Models and Item Difficulty.

Peer reviewed

van der Linden, Wim J. – Applied Psychological Measurement, 1979

The restrictions on item difficulties that must be met when binomial models are applied to domain-referenced testing are examined. Both a deterministic and a stochastic conception of item responses are discussed with respect to difficulty and Guttman-type items. (Author/BH)

Descriptors: Difficulty Level, Item Sampling, Latent Trait Theory, Mathematical Models

Comparison of the Null Distributions of Weighted Kappa and the C Ordinal Statistic

Peer reviewed

Cicchetti, Domenic V.; Fleiss, Joseph L. – Applied Psychological Measurement, 1977

The weighted kappa coefficient is a measure of interrater agreement when the relative seriousness of each possible disagreement can be quantified. This monte carlo study demonstrates the utility of the kappa coefficient for ordinal data. Sample size is also briefly discussed. (Author/JKS)

Descriptors: Mathematical Models, Rating Scales, Reliability, Sampling

Test of the Hypothesis that the Intraclass Reliability Coefficient Is the Same for Two Measurement Procedures.

Peer reviewed

Alsawalmeh, Yousef M.; Feldt, Leonard S. – Applied Psychological Measurement, 1992

An approximate statistical test is derived for the hypothesis that the intraclass reliability coefficients associated with two measurement procedures are equal. Control of Type 1 error is investigated by comparing empirical sampling distributions of the test statistic with its derived theoretical distribution. A numerical illustration is…

Descriptors: Equations (Mathematics), Hypothesis Testing, Mathematical Models, Measurement Techniques

Determining the Significance of Estimated Signed and Unsigned Areas between Two Item Response Functions.

Peer reviewed

Raju, Nambury S. – Applied Psychological Measurement, 1990

The asymptotic sampling distributions (means and variances) are presented for the signed and unsigned estimates for the Rasch model, two-parameter model, and the three-parameter model with fixed lower asymptotes. Applications for item-bias research are discussed. (SLD)

Descriptors: Equations (Mathematics), Estimation (Mathematics), Item Bias, Item Response Theory

Using Extreme Groups Strategy When Measures Are Not Normally Distributed.

Peer reviewed

Fowler, Robert L. – Applied Psychological Measurement, 1992

A Monte Carlo simulation explored how to optimize power in the extreme groups strategy when sampling from nonnormal distributions. Results show that the optimum percent for the extreme group selection was approximately the same for all population shapes, except the extremely platykurtic (uniform) distribution. (SLD)

Descriptors: Construct Validity, Equations (Mathematics), Mathematical Models, Monte Carlo Methods

On the Efficiency of IRT Models When Applied to Different Sampling Designs.

Peer reviewed

Berger, Martjin P. F. – Applied Psychological Measurement, 1991

A generalized variance criterion is proposed to measure efficiency in item-response-theory (IRT) models. Heuristic arguments are given to formulate the efficiency of a design in terms of an asymptotic generalized variance criterion. Efficiencies of designs for one-, two-, and three-parameter models are compared. (SLD)

Descriptors: Comparative Analysis, Efficiency, Equations (Mathematics), Error of Measurement

Planning an Experiment in the Company of Measurement Error

Peer reviewed

Levin, Joel R.; Subkoviak, Michael J. – Applied Psychological Measurement, 1977

Textbook calculations of statistical power or sample size follow from formulas that assume that the variables under consideration are measured without error. However, in the real world of behavioral research, errors of measurement cannot be neglected. The determination of sample size is discussed, and an example illustrates blocking strategy.…

Descriptors: Analysis of Covariance, Analysis of Variance, Error of Measurement, Hypothesis Testing

Sequential Reliability Tests.

Peer reviewed

Eiting, Mindert H. – Applied Psychological Measurement, 1991

A method is proposed for sequential evaluation of reliability of psychometric instruments. Sample size is unfixed; a test statistic is computed after each person is sampled and a decision is made in each stage of the sampling process. Results from a series of Monte-Carlo experiments establish the method's efficiency. (SLD)

Descriptors: Computer Simulation, Equations (Mathematics), Estimation (Mathematics), Mathematical Models

The Rasch Model, Objective Measurement, Equating, and Robustness.

Peer reviewed

Linn, Robert L.; Slinde, Jeffrey A. – Applied Psychological Measurement, 1979

This study investigated the adequacy of the Rasch model in equating existing standardized tests with groups of examinees not widely separated in ability. With the exception of one test pair and one grade level, the Rasch model using the anchor test procedure provided a reasonably satisfactory means of equating. (Author/CTM)

Descriptors: Equated Scores, Goodness of Fit, Intermediate Grades, Item Analysis

Mathematical Models	9
Sampling	9
Equations (Mathematics)	5
Estimation (Mathematics)	3
Test Reliability	3
Error of Measurement	2
Hypothesis Testing	2
Item Response Theory	2
Latent Trait Theory	2
Monte Carlo Methods	2
Reliability	2
Research Design	2
Statistical Distributions	2
Test Interpretation	2
Analysis of Covariance	1
Analysis of Variance	1
Comparative Analysis	1
Computer Simulation	1
Construct Validity	1
Difficulty Level	1
Efficiency	1
Equated Scores	1
Goodness of Fit	1
Heuristics	1
Intermediate Grades	1
More ▼