Descriptor
Test Theory | 8 |
Test Reliability | 5 |
Error of Measurement | 4 |
Correlation | 3 |
Hypothesis Testing | 2 |
Mathematical Formulas | 2 |
Mathematical Models | 2 |
Pretests Posttests | 2 |
Scores | 2 |
Statistical Studies | 2 |
Testing Problems | 2 |
More ▼ |
Source
Journal of Experimental… | 8 |
Publication Type
Journal Articles | 8 |
Reports - Research | 6 |
Opinion Papers | 2 |
Reports - Evaluative | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

MacMillan, Peter D. – Journal of Experimental Education, 2000
Compared classical test theory (CTT), generalizability theory (GT), and multifaceted Rasch model (MFRM) approaches to detecting and correcting for rater variability using responses of 4,930 high school students graded by 3 raters on 9 scales. The MFRM approach identified far more raters as different than did the CTT analysis. GT and Rasch…
Descriptors: Generalizability Theory, High School Students, High Schools, Interrater Reliability

Zimmerman, Donald W. – Journal of Experimental Education, 1986
A computer program randomly sampled ordered pairs of scores from known populations that departed from bivariate normal form and calculated correlation coefficients from sample values. Hypotheses were tested (1) that population correlations are zero using the t statistic; and (2) that population correlations have non-zero values using the r to z…
Descriptors: Correlation, Hypothesis Testing, Sampling, Statistical Distributions

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1982
A mathematical link between test reliability and test validity is derived, taking into account the correlation between error scores on a test and error scores on a criterion measure. When this correlation is positive, the "paradoxical" nonmonotonic relation between test reliability and test validity occurs universally. (Author/BW)
Descriptors: Correlation, Error of Measurement, Mathematical Models, Test Reliability

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1984
This paper provides a list of 10 salient features of the standard error of measurement, contrasting it to the reliability coefficient. It is concluded that the standard error of measurement should be regarded as a primary characteristic of a mental test. (Author/DWH)
Descriptors: Educational Testing, Error of Measurement, Evaluation Methods, Psychological Testing

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1980
It is suggested that error of measurement cannot be routinely incorporated into the "error term" in statistical tests, and that the reliability of test scores does not have the simple relationship to statistical inference that one might expect. (Author/GK)
Descriptors: Error of Measurement, Hypothesis Testing, Mathematical Formulas, Test Reliability

Zimmerman, Donald W.; And Others – Journal of Experimental Education, 1981
Reliability coefficients of linear combinations of observed scores have anomalous properties which have led to difficulties in the investigation of difference scores and gain scores in test theory. Discrepancies between classical results and correct results obtained from more general formulas, which allow for correlated errors, are examined…
Descriptors: Error of Measurement, Mathematical Formulas, Mathematical Models, Scores

Gupta, J. K.; And Others – Journal of Experimental Education, 1988
How the validity of gain scores varies with the standard deviations of pretest and posttest scores and the correlation between the two are analyzed. Earlier findings that under realistic testing conditions difference scores can have excellent predictive value are supported. Conditions under which gain scores have optimum validity are specified.…
Descriptors: Educational Change, Equations (Mathematics), Measures (Individuals), Predictive Validity

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1982
The reliability of simple difference scores is greater than, less than, or equal to that of residualized difference scores, depending on whether the correlation between pretest and posttest scores is greater than, less than, or equal to the ratio of the standard deviations of pretest and posttest scores. (Author)
Descriptors: Achievement Gains, Comparative Analysis, Correlation, Pretests Posttests