ERIC - Search Results

Descriptor

Test Reliability	9
Sampling	7
Test Validity	5
Error of Measurement	3
Mathematical Models	3
Measurement	3
Testing Problems	3
Achievement Tests	2
Correlation	2
Item Analysis	2
Item Sampling	2
Multiple Choice Tests	2
Tables (Data)	2
Test Construction	2
Test Interpretation	2
Test Items	2
Aptitude Tests	1
Comparative Analysis	1
Computers	1
Difficulty Level	1
Equated Scores	1
Equivalency Tests	1
Evaluation Methods	1
Factor Analysis	1
High School Students	1
More ▼

Source

Journal of Educational…

Author

Askegaard, Lewis D.	1
Callender, John C.	1
Frisbee, David A.	1
Garg, Rashmi	1
Jackson, Rex	1
Kolen, Michael J.	1
Levin, Joel R.	1
Osburn, H. G.	1
Reilly, Richard R.	1
Subkoviak, Michael J.	1
Umila, Benwardo V.	1
Whitely, Susan E.	1
Whitney, Douglas R.	1
Wright, Benjamin D.	1
More ▼

Publication Type

Journal Articles	4
Reports - Research	4

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

General Educational…

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Fallibility of Measurement and the Power of a Statistical Test

Peer reviewed

Subkoviak, Michael J.; Levin, Joel R. – Journal of Educational Measurement, 1977

Measurement error in dependent variables reduces the power of statistical tests to detect mean differences of specified magnitude. Procedures for determining power and sample size that consider the reliability of the dependent variable are discussed and illustrated. Methods for estimating reliability coefficients used in these procedures are…

Descriptors: Error of Measurement, Hypothesis Testing, Power (Statistics), Sampling

An Empirical Comparison of Coefficient Alpha, Guttman's Lambda-2, and MSPLIT Maximized Split-Half Reliability Estimates.

Peer reviewed

Callender, John C.; Osburn, H. G. – Journal of Educational Measurement, 1979

Some procedures for estimating internal consistency reliability may be superior mathematically to the more commonly used methods such as Coefficient Alpha. One problem is computational difficulty; the other is the possibility of overestimation due to capitalization on chance. (Author/CTM)

Descriptors: Higher Education, Mathematical Formulas, Research Problems, Sampling

A Comparison of Examinee Sampling and Multiple Matrix Sampling in Test Development.

Peer reviewed

Garg, Rashmi; And Others – Journal of Educational Measurement, 1986

For the purpose of obtaining data to use in test development, multiple matrix sampling plans were compared to examinee sampling plans. Data were simulated for examinees, sampled from a population with a normal distribution of ability, responding to items selected from an item universe. (Author/LMO)

Descriptors: Difficulty Level, Monte Carlo Methods, Sampling, Statistical Studies

Models, Meanings and Misunderstandings: Some Issues in Applying Rasch's Theory

Peer reviewed

Whitely, Susan E. – Journal of Educational Measurement, 1977

A debate concerning specific issues and the general usefulness of the Rasch latent trait test model is continued. Methods of estimation, necessary sample size, and the applicability of the model are discussed. (JKS)

Descriptors: Error of Measurement, Item Analysis, Mathematical Models, Measurement

Multiple Choice Versus True-False: A Comparison of Reliabilities and Concurrent Validities

Peer reviewed

Frisbee, David A. – Journal of Educational Measurement, 1973

The purpose of this study was to gather empirical evidence to compare the reliabilities and concurrent validities of multiple choice and true-false tests that were written to measure understandings and relationships in the same content areas. (Author)

Descriptors: Achievement Tests, Correlation, High School Students, Measurement

Misunderstanding the Rasch Model

Peer reviewed

Wright, Benjamin D. – Journal of Educational Measurement, 1977

Statements made in a previous article of this journal concerning the Rasch latent trait test model are questioned. Methods of estimation, necessary sample sizes, several formuli, and the general usefulness of the Rasch model are discussed. (JKS)

Descriptors: Computers, Error of Measurement, Item Analysis, Mathematical Models

Effects of Empirical Option Weighting on Reliability and Validity of an Academic Aptitude Test

Peer reviewed

Reilly, Richard R.; Jackson, Rex – Journal of Educational Measurement, 1973

The present study suggests that although the reliability of an academic aptitude test given under formula-score condition can be increased substantially through empirical option weighting, much of the increase is due to the capitalization of the keying procedure on omitting tendencies which are reliable but not valid. (Author)

Descriptors: Aptitude Tests, Correlation, Factor Analysis, Item Sampling

An Empirical Investigation of the Applicability of Multiple Matrix Sampling to the Method of Rank Order.

Peer reviewed

Askegaard, Lewis D.; Umila, Benwardo V. – Journal of Educational Measurement, 1982

Multiple matrix sampling of items and examinees was applied to an 18-item rank order instrument administered to a randomly assigned group and compared to the ordering and ranking of all items by control subjects. High correlations between ranks suggest the methodology may viably reduce respondent effort on long rank ordering tasks. (Author/CM)

Descriptors: Evaluation Methods, Item Sampling, Junior High Schools, Student Reaction

Comparison of Four Procedures for Equating the Tests of General Educational Development.

Peer reviewed

Kolen, Michael J.; Whitney, Douglas R. – Journal of Educational Measurement, 1982

The adequacy of equipercentile, linear, one-parameter (Rasch), and three-parameter logistic item-response theory procedures for equating 12 forms of five tests of general educational development were compared. Results indicated the equating method adequacy depends on a variety of factors such as test characteristics, equating design, and sample…

Descriptors: Achievement Tests, Comparative Analysis, Equated Scores, Equivalency Tests