ERIC - Search Results

Source

Journal of Experimental…

Author

Zimmerman, Donald W.	6
Williams, Richard H.	4
Gupta, J. K.	1
MacMillan, Peter D.	1

Publication Type

Journal Articles	8
Reports - Research	6
Opinion Papers	2
Reports - Evaluative	1

Showing all 8 results

Classical, Generalizability, and Multifaceted Rasch Detection of Interrater Variability in Large, Sparse Data Sets.

Peer reviewed

MacMillan, Peter D. – Journal of Experimental Education, 2000

Compared classical test theory (CTT), generalizability theory (GT), and multifaceted Rasch model (MFRM) approaches to detecting and correcting for rater variability using responses of 4,930 high school students graded by 3 raters on 9 scales. The MFRM approach identified far more raters as different than did the CTT analysis. GT and Rasch…

Descriptors: Generalizability Theory, High School Students, High Schools, Interrater Reliability

Tests of Significance of Correlation Coefficients in the Absence of Bivariate Normal Populations.

Peer reviewed

Zimmerman, Donald W. – Journal of Experimental Education, 1986

A computer program randomly sampled ordered pairs of scores from known populations that departed from bivariate normal form and calculated correlation coefficients from sample values. Hypotheses were tested (1) that population correlations are zero using the t statistic; and (2) that population correlations have non-zero values using the r to z…

Descriptors: Correlation, Hypothesis Testing, Sampling, Statistical Distributions
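The abstract names the two tests but does not give formulas. As a minimal sketch, assuming the standard textbook forms (the t statistic with n - 2 degrees of freedom for a zero population correlation, and the Fisher r to z transformation with standard error 1/sqrt(n - 3) for a non-zero value), the statistics might be computed as below. The function names are illustrative and are not taken from the article.

```python
import math


def t_statistic_zero_correlation(r, n):
    """t statistic (n - 2 degrees of freedom) for H0: population correlation = 0.

    Textbook form; it assumes bivariate normality, which is the
    assumption the study deliberately violates.
    """
    if n < 3:
        raise ValueError("need at least 3 pairs of scores")
    if not -1.0 < r < 1.0:
        raise ValueError("r must lie strictly between -1 and 1")
    return r * math.sqrt((n - 2) / (1.0 - r * r))


def fisher_z_statistic(r, rho0, n):
    """Approximately standard normal statistic for H0: population correlation = rho0.

    Uses the r to z transformation z = atanh(r), with standard
    error 1 / sqrt(n - 3).
    """
    if n < 4:
        raise ValueError("need at least 4 pairs of scores")
    if not (-1.0 < r < 1.0 and -1.0 < rho0 < 1.0):
        raise ValueError("correlations must lie strictly between -1 and 1")
    return (math.atanh(r) - math.atanh(rho0)) * math.sqrt(n - 3)
```

Both statistics are compared with their reference distributions (Student's t and the standard normal, respectively) to obtain a p value; how well those reference distributions hold up under non-normal populations is the question the article examines.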

Reconsideration of the "Attenuation Paradox"--and Some New Paradoxes in Test Validity.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1982

A mathematical link between test reliability and test validity is derived, taking into account the correlation between error scores on a test and error scores on a criterion measure. When this correlation is positive, the "paradoxical" nonmonotonic relation between test reliability and test validity occurs universally. (Author/BW)

Descriptors: Correlation, Error of Measurement, Mathematical Models, Test Reliability

On the Virtues and Vices of the Standard Error of Measurement.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1984

This paper provides a list of 10 salient features of the standard error of measurement, contrasting it to the reliability coefficient. It is concluded that the standard error of measurement should be regarded as a primary characteristic of a mental test. (Author/DWH)

Descriptors: Educational Testing, Error of Measurement, Evaluation Methods, Psychological Testing
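For readers unfamiliar with the quantity, a minimal sketch of the usual classical test theory definition follows: the standard error of measurement equals the observed-score standard deviation times the square root of one minus the reliability coefficient. The abstract does not state the formula, so this is the textbook definition rather than anything specific to the paper, and the function name is illustrative.

```python
import math


def standard_error_of_measurement(sd_observed, reliability):
    """Classical SEM: observed-score SD times sqrt(1 - reliability)."""
    if sd_observed < 0:
        raise ValueError("standard deviation must be non-negative")
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return sd_observed * math.sqrt(1.0 - reliability)


# Hypothetical scale with SD 15 and reliability 0.91
sem = standard_error_of_measurement(15.0, 0.91)
```

Unlike the reliability coefficient, the result is expressed in the units of the test score itself.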

Error of Measurement and Statistical Inference: Some Anomalies.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1980

It is suggested that error of measurement cannot be routinely incorporated into the "error term" in statistical tests, and that the reliability of test scores does not have the simple relationship to statistical inference that one might expect. (Author/GK)

Descriptors: Error of Measurement, Hypothesis Testing, Mathematical Formulas, Test Reliability

The Reliability of Sums and Differences of Test Scores: Some New Results and Anomalies.

Peer reviewed

Zimmerman, Donald W.; And Others – Journal of Experimental Education, 1981

Reliability coefficients of linear combinations of observed scores have anomalous properties which have led to difficulties in the investigation of difference scores and gain scores in test theory. Discrepancies between classical results and correct results obtained from more general formulas, which allow for correlated errors, are examined…

Descriptors: Error of Measurement, Mathematical Formulas, Mathematical Models, Scores
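The "classical results" mentioned in the abstract are presumably the familiar formulas for the reliability of a sum or difference of two observed scores, which assume uncorrelated errors. A minimal sketch of that classical form follows; the more general formulas the article derives for correlated errors are not given in the abstract and are not reproduced here. Function and parameter names are illustrative.

```python
def classical_composite_reliability(sd_x, sd_y, rel_x, rel_y, r_xy, difference=True):
    """Classical reliability of Y - X (difference=True) or X + Y (difference=False).

    Assumes errors on X and Y are uncorrelated, so the true-score
    covariance equals the observed covariance r_xy * sd_x * sd_y.
    """
    sign = -1.0 if difference else 1.0
    covariance = r_xy * sd_x * sd_y
    true_variance = sd_x**2 * rel_x + sd_y**2 * rel_y + sign * 2.0 * covariance
    observed_variance = sd_x**2 + sd_y**2 + sign * 2.0 * covariance
    if observed_variance <= 0:
        raise ValueError("composite has zero observed variance")
    return true_variance / observed_variance


# Hypothetical values: equal SDs, reliabilities of 0.8, correlation of 0.6
rel_diff = classical_composite_reliability(1.0, 1.0, 0.8, 0.8, 0.6)
rel_sum = classical_composite_reliability(1.0, 1.0, 0.8, 0.8, 0.6, difference=False)
```

With these illustrative inputs the difference score is less reliable than either component while the sum is more reliable, which is the usual pattern under the uncorrelated-errors assumption.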

On the Optimum Predictive Potential of Change Measure.

Peer reviewed

Gupta, J. K.; And Others – Journal of Experimental Education, 1988

How the validity of gain scores varies with the standard deviations of pretest and posttest scores and the correlation between the two is analyzed. Earlier findings that under realistic testing conditions difference scores can have excellent predictive value are supported. Conditions under which gain scores have optimum validity are specified.…

Descriptors: Educational Change, Equations (Mathematics), Measures (Individuals), Predictive Validity
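To illustrate the dependence the abstract describes, the sketch below uses the standard covariance identity for the correlation between a gain score D = Y - X and an external criterion Z. The abstract does not give the article's own equations or its optimality conditions, so this is only the general identity; the criterion Z and all names are assumptions made for illustration.

```python
import math


def gain_score_validity(sd_pre, sd_post, r_pre_post, r_pre_crit, r_post_crit):
    """Correlation of the gain (posttest - pretest) with a criterion.

    Follows from cov(Y - X, Z) = cov(Y, Z) - cov(X, Z) and
    var(Y - X) = var(X) + var(Y) - 2 cov(X, Y).
    """
    gain_variance = sd_pre**2 + sd_post**2 - 2.0 * r_pre_post * sd_pre * sd_post
    if gain_variance <= 0:
        raise ValueError("gain score has zero variance")
    numerator = sd_post * r_post_crit - sd_pre * r_pre_crit
    return numerator / math.sqrt(gain_variance)


# Hypothetical values: equal SDs, pretest-posttest correlation of 0.5
validity = gain_score_validity(1.0, 1.0, 0.5, 0.2, 0.6)
```

Varying the two standard deviations and the pretest-posttest correlation in this function shows how the validity of the gain score shifts with them.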

The Comparative Reliability of Simple and Residualized Difference Scores.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1982

The reliability of simple difference scores is greater than, less than, or equal to that of residualized difference scores, depending on whether the correlation between pretest and posttest scores is greater than, less than, or equal to the ratio of the standard deviations of pretest and posttest scores. (Author)

Descriptors: Achievement Gains, Comparative Analysis, Correlation, Pretests Posttests

Descriptor

Test Theory	8
Test Reliability	5
Error of Measurement	4
Correlation	3
Hypothesis Testing	2
Mathematical Formulas	2
Mathematical Models	2
Pretests Posttests	2
Scores	2
Statistical Studies	2
Testing Problems	2
Achievement Gains	1
Comparative Analysis	1
Educational Change	1
Educational Testing	1
Equations (Mathematics)	1
Evaluation Methods	1
Generalizability Theory	1
High School Students	1
High Schools	1
Interrater Reliability	1
Measures (Individuals)	1
Predictive Validity	1
Psychological Testing	1
Sampling	1