Descriptor
Test Reliability | 9 |
Sampling | 7 |
Test Validity | 5 |
Error of Measurement | 3 |
Mathematical Models | 3 |
Measurement | 3 |
Testing Problems | 3 |
Achievement Tests | 2 |
Correlation | 2 |
Item Analysis | 2 |
Item Sampling | 2 |
More ▼ |
Source
Journal of Educational… | 9 |
Author
Askegaard, Lewis D. | 1 |
Callender, John C. | 1 |
Frisbee, David A. | 1 |
Garg, Rashmi | 1 |
Jackson, Rex | 1 |
Kolen, Michael J. | 1 |
Levin, Joel R. | 1 |
Osburn, H. G. | 1 |
Reilly, Richard R. | 1 |
Subkoviak, Michael J. | 1 |
Umila, Benwardo V. | 1 |
More ▼ |
Publication Type
Journal Articles | 4 |
Reports - Research | 4 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
General Educational… | 1 |
What Works Clearinghouse Rating

Subkoviak, Michael J.; Levin, Joel R. – Journal of Educational Measurement, 1977
Measurement error in dependent variables reduces the power of statistical tests to detect mean differences of specified magnitude. Procedures for determining power and sample size that consider the reliability of the dependent variable are discussed and illustrated. Methods for estimating reliability coefficients used in these procedures are…
Descriptors: Error of Measurement, Hypothesis Testing, Power (Statistics), Sampling

Callender, John C.; Osburn, H. G. – Journal of Educational Measurement, 1979
Some procedures for estimating internal consistency reliability may be superior mathematically to the more commonly used methods such as Coefficient Alpha. One problem is computational difficulty; the other is the possibility of overestimation due to capitalization on chance. (Author/CTM)
Descriptors: Higher Education, Mathematical Formulas, Research Problems, Sampling

Garg, Rashmi; And Others – Journal of Educational Measurement, 1986
For the purpose of obtaining data to use in test development, multiple matrix sampling plans were compared to examinee sampling plans. Data were simulated for examinees, sampled from a population with a normal distribution of ability, responding to items selected from an item universe. (Author/LMO)
Descriptors: Difficulty Level, Monte Carlo Methods, Sampling, Statistical Studies

Whitely, Susan E. – Journal of Educational Measurement, 1977
A debate concerning specific issues and the general usefulness of the Rasch latent trait test model is continued. Methods of estimation, necessary sample size, and the applicability of the model are discussed. (JKS)
Descriptors: Error of Measurement, Item Analysis, Mathematical Models, Measurement

Frisbee, David A. – Journal of Educational Measurement, 1973
The purpose of this study was to gather empirical evidence to compare the reliabilities and concurrent validities of multiple choice and true-false tests that were written to measure understandings and relationships in the same content areas. (Author)
Descriptors: Achievement Tests, Correlation, High School Students, Measurement

Wright, Benjamin D. – Journal of Educational Measurement, 1977
Statements made in a previous article of this journal concerning the Rasch latent trait test model are questioned. Methods of estimation, necessary sample sizes, several formuli, and the general usefulness of the Rasch model are discussed. (JKS)
Descriptors: Computers, Error of Measurement, Item Analysis, Mathematical Models

Reilly, Richard R.; Jackson, Rex – Journal of Educational Measurement, 1973
The present study suggests that although the reliability of an academic aptitude test given under formula-score condition can be increased substantially through empirical option weighting, much of the increase is due to the capitalization of the keying procedure on omitting tendencies which are reliable but not valid. (Author)
Descriptors: Aptitude Tests, Correlation, Factor Analysis, Item Sampling

Askegaard, Lewis D.; Umila, Benwardo V. – Journal of Educational Measurement, 1982
Multiple matrix sampling of items and examinees was applied to an 18-item rank order instrument administered to a randomly assigned group and compared to the ordering and ranking of all items by control subjects. High correlations between ranks suggest the methodology may viably reduce respondent effort on long rank ordering tasks. (Author/CM)
Descriptors: Evaluation Methods, Item Sampling, Junior High Schools, Student Reaction

Kolen, Michael J.; Whitney, Douglas R. – Journal of Educational Measurement, 1982
The adequacy of equipercentile, linear, one-parameter (Rasch), and three-parameter logistic item-response theory procedures for equating 12 forms of five tests of general educational development were compared. Results indicated the equating method adequacy depends on a variety of factors such as test characteristics, equating design, and sample…
Descriptors: Achievement Tests, Comparative Analysis, Equated Scores, Equivalency Tests