Showing all 8 results
Peer reviewed
Zimmerman, Donald W. – Educational and Psychological Measurement, 1983
A definition of test validity as the ratio of a covariance term to a variance term, analogous to the classical definition of test reliability, is proposed. When error scores on distinct tests are uncorrelated, the proposed definition coincides with the usual one, but it remains meaningful when error scores are correlated. (Author/BW)
Descriptors: Definitions, Mathematical Formulas, Mathematical Models, Test Theory
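As background for the Zimmerman (1983) entry above, a minimal sketch of the classical quantities the abstract refers to; the exact form of the proposed covariance-to-variance ratio is developed in the article itself and is not reproduced here.

\[
X = T_X + E_X, \qquad \rho_{XX'} = \frac{\sigma^2_{T_X}}{\sigma^2_X} \quad \text{(classical reliability as a variance ratio)}
\]
\[
\rho_{XY} = \frac{\sigma_{XY}}{\sigma_X\,\sigma_Y} \quad \text{(the usual validity coefficient, with which the proposed definition coincides when } \sigma_{E_X E_Y} = 0\text{)}
\]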
Peer reviewed
Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1982
A mathematical link between test reliability and test validity is derived, taking into account the correlation between error scores on a test and error scores on a criterion measure. When this correlation is positive, the "paradoxical" nonmonotonic relation between test reliability and test validity occurs universally. (Author/BW)
Descriptors: Correlation, Error of Measurement, Mathematical Models, Test Reliability
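The classical baseline that the Williams and Zimmerman (1982) derivation generalizes is the standard attenuation relation, which assumes error scores on the test and the criterion are uncorrelated; this is only that baseline, not the article's expression for correlated errors.

\[
\rho_{XY} = \rho_{T_X T_Y}\,\sqrt{\rho_{XX'}\,\rho_{YY'}}
\]

Here \(\rho_{XX'}\) and \(\rho_{YY'}\) are the reliabilities of the test and the criterion. Under this assumption validity increases monotonically with reliability; the article shows how the relation becomes nonmonotonic once the error correlation is positive.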
Holmes, Susan E. – 1982
The purpose of the present study was to examine the accuracy of indirect trait estimates, i.e., estimates of some primary trait obtained from a second measure that has been equated to the first. The California Achievement Test in Reading was the primary measure and the Prescriptive Reading Inventory was the indirect measure. Four kinds of…
Descriptors: Content Analysis, Elementary Education, Equated Scores, Item Analysis
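The equating step described in the Holmes (1982) entry could take several forms; the snippet does not say which was used, so the following is only an illustrative linear-equating sketch that places Prescriptive Reading Inventory scores on the California Achievement Test scale.

\[
y^{*} = \mu_Y + \frac{\sigma_Y}{\sigma_X}\,(x - \mu_X)
\]

where \(x\) is a score on the indirect measure, \(y^{*}\) is its equated value on the primary measure, and the means and standard deviations come from the respective score distributions.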
Kane, Michael T. – 1980
The reliability and validity of measurement are analyzed by a sampling model based on generalizability theory. A model for the relationship between a measurement procedure and an attribute is developed from an analysis of how measurements are used and interpreted in science. The model provides a basis for analyzing the concept of an error of…
Descriptors: Attribution Theory, Behavioral Sciences, Error of Measurement, Mathematical Models
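As context for the Kane (1980) entry, the standard persons-by-items decomposition from generalizability theory, written here only as a reminder of the framework the sampling model builds on, not as the article's own notation.

\[
\sigma^2(X_{pi}) = \sigma^2_p + \sigma^2_i + \sigma^2_{pi,e},
\qquad
E\rho^2 = \frac{\sigma^2_p}{\sigma^2_p + \sigma^2_{pi,e}/n_i}
\]

where \(\sigma^2_p\) is universe-score (person) variance and \(E\rho^2\) is the generalizability coefficient for a test of \(n_i\) items.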
Cason, Gerald J.; And Others – 1983
Prior research in a single clinical training setting has shown that Cason and Cason's (1981) simplified model of their performance rating theory can improve rating reliability and validity through statistical control of rater stringency error. Here, the model was applied to clinical performance ratings of 14 cohorts (about 250 students and 200 raters)…
Descriptors: Clinical Experience, Error of Measurement, Evaluation Methods, Higher Education
Forster, Fred – 1987
Studies carried out over a 12-year period addressed fundamental questions on the use of Rasch-based item banks. Large field tests of reading, mathematics, and science items administered in grades 3-8, as well as standardized test results, were used to explore the possible effects of many factors on item calibrations. In general, the results…
Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis
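The item calibrations discussed in the Forster (1987) entry rest on the Rasch model; its standard form is included here as general context rather than as anything specific to that study.

\[
P(X_{pi} = 1 \mid \theta_p, b_i) = \frac{\exp(\theta_p - b_i)}{1 + \exp(\theta_p - b_i)}
\]

where \(\theta_p\) is the ability of person p and \(b_i\) is the difficulty of item i; calibration estimates the \(b_i\) values that banked items carry across administrations.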
Wainer, Howard; Kiely, Gerard L. – 1986
Recent experience with the Computerized Adaptive Test (CAT) has raised a number of concerns about its practical applications. The concerns principally involve having the computer construct the test from a precalibrated item pool, substituting statistical characteristics for the test developer's skills. Problems with…
Descriptors: Adaptive Testing, Algorithms, Computer Assisted Testing, Construct Validity
Haladyna, Tom; Roid, Gale – 1976
Three approaches to the construction of achievement tests are compared: construct, operational, and empirical. The construct approach is based upon classical test theory and measures an abstract representation of the instructional objectives. The operational approach specifies instructional intent through instructional objectives, facet design,…
Descriptors: Academic Achievement, Achievement Tests, Career Development, Comparative Analysis