ERIC - Search Results

Descriptor

Error of Measurement	14
Mathematical Models	14
Test Validity	14
Test Reliability	10
Sampling	7
Item Analysis	4
Test Theory	4
Academic Achievement	3
Achievement Tests	3
Latent Trait Theory	3
Measurement	3
Measurement Techniques	3
Statistical Analysis	3
Test Interpretation	3
Estimation (Mathematics)	2
Higher Education	2
Scores	2
Student Evaluation	2
Test Items	2
Testing Problems	2
Adults	1
Annotated Bibliographies	1
Attribution Theory	1
Behavioral Sciences	1
Bibliographies	1
More ▼

Source

Journal of Educational…	5
Educational and Psychological…	1
Journal of Experimental…	1
Psychometrika	1

Author

Brennan, Robert L.	1
Cason, Gerald J.	1
Chang, Yu-Wen	1
Davison, Mark L.	1
Douglass, James B.	1
Embretson, Susan	1
Haladyna, Tom	1
Harris, Chester W.	1
Kane, Michael T.	1
Kristof, Walter	1
Prediger, Dale J.	1
Roid, Gale	1
Shavelson, Richard J.	1
Whitely, Susan E.	1
Williams, Richard H.	1
Woodruff, David	1
Wright, Benjamin D.	1
Zimmerman, Donald W.	1
More ▼

Publication Type

Reports - Research	7
Journal Articles	5
Reports - Evaluative	3
Speeches/Meeting Papers	3
Reference Materials -…	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Woodcock Johnson Psycho…

What Works Clearinghouse Rating

Showing all 14 results Save | Export

Coefficient Kappa: Some Uses, Misuses, and Alternatives.

Peer reviewed

Brennan, Robert L.; Prediger, Dale J. – Educational and Psychological Measurement, 1981

This paper considers some appropriate and inappropriate uses of coefficient kappa and alternative kappa-like statistics. Discussion is restricted to the descriptive characteristics of these statistics for measuring agreement with categorical data in studies of reliability and validity. (Author)

Descriptors: Classification, Error of Measurement, Mathematical Models, Test Reliability

Reconsideration of the "Attenuation Paradox"--and Some New Paradoxes in Test Validity.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Journal of Experimental Education, 1982

A mathematical link between test reliability and test validity is derived, taking into account the correlation between error scores on a test and error scores on a criterion measure. When this correlation is positive, the "paradoxical" nonmonotonic relation between test reliability and test validity occurs universally. (Author/BW)

Descriptors: Correlation, Error of Measurement, Mathematical Models, Test Reliability

Models, Meanings and Misunderstandings: Some Issues in Applying Rasch's Theory

Peer reviewed

Whitely, Susan E. – Journal of Educational Measurement, 1977

A debate concerning specific issues and the general usefulness of the Rasch latent trait test model is continued. Methods of estimation, necessary sample size, and the applicability of the model are discussed. (JKS)

Descriptors: Error of Measurement, Item Analysis, Mathematical Models, Measurement

Conditional Standard Error of Measurement in Prediction.

Peer reviewed

Woodruff, David – Journal of Educational Measurement, 1990

A method of estimating conditional standard error of measurement at specific score/ability levels is described that avoids theoretical problems identified for previous methods. The method focuses on variance of observed scores conditional on a fixed value of an observed parallel measurement, decomposing these variances into true and error parts.…

Descriptors: Error of Measurement, Estimation (Mathematics), Mathematical Models, Predictive Measurement

Misunderstanding the Rasch Model

Peer reviewed

Wright, Benjamin D. – Journal of Educational Measurement, 1977

Statements made in a previous article of this journal concerning the Rasch latent trait test model are questioned. Methods of estimation, necessary sample sizes, several formuli, and the general usefulness of the Rasch model are discussed. (JKS)

Descriptors: Computers, Error of Measurement, Item Analysis, Mathematical Models

On the Theory of a Set of Tests Which Differ Only in Length

Peer reviewed

Kristof, Walter – Psychometrika, 1971

Descriptors: Cognitive Measurement, Error of Measurement, Mathematical Models, Psychological Testing

Interpreting Variance Components as Evidence for Reliability and Validity.

Download full text

Kane, Michael T. – 1980

The reliability and validity of measurement is analyzed by a sampling model based on generalizability theory. A model for the relationship between a measurement procedure and an attribute is developed from an analysis of how measurements are used and interpreted in science. The model provides a basis for analyzing the concept of an error of…

Descriptors: Attribution Theory, Behavioral Sciences, Error of Measurement, Mathematical Models

A Process for Testing a Mathematical Model for the Solution of a Practical Problem: Application to Test Equating. LES Paper on Learning and Teaching. Paper #79.

Douglass, James B. – 1979

A general process for testing the feasibility of applying alternative mathematical or statistical models to the solution of a practical problem is presented and flowcharted. The system is used to develop a plan to compare models for test equating. The five alternative models to be considered for equating are: (1) anchor test equating using…

Descriptors: Equated Scores, Error of Measurement, Latent Trait Theory, Mathematical Models

Multiple Processing Strategies and the Construct Validity of Verbal Reasoning Tests.

Peer reviewed

Embretson, Susan; And Others – Journal of Educational Measurement, 1986

This study examined the influence of processing strategies, and the metacomponents that determine when to apply them, on the construct validity of a verbal reasoning test. A rule-oriented strategy, an association strategy, and a partial rule strategy were examined. All three strategies contributed to individual differences in verbal reasoning.…

Descriptors: Cognitive Processes, Elementary Secondary Education, Error of Measurement, Latent Trait Theory

Sampling Variability of Performance Assessments.

Peer reviewed

Shavelson, Richard J.; And Others – Journal of Educational Measurement, 1993

Evidence is presented on the generalizability and convergent validity of performance assessments using data from six studies of student achievement that sampled a wide range of measurement facets and methods. Results at individual and school levels indicate that task-sampling variability is the major source of measurement error. (SLD)

Descriptors: Academic Achievement, Educational Assessment, Error of Measurement, Generalizability Theory

Achievement Test Items--Methods of Study. CSE Monograph Series in Evaluation, 6.

Harris, Chester W.; And Others – 1977

The implications of a mathematical model of test scores are explored where the data are limited to a random sample of items without replacement from an indefinitely large population or item domain in which items are scored either zero or one. The purpose is to obtain an unbiased estimate of a student's proportion of items correct in the item…

Descriptors: Academic Achievement, Achievement Tests, Annotated Bibliographies, Bibliographies

Controlling Rater Stringency Error in Clinical Performance Rating: Further Validation of a Performance Rating Theory.

Cason, Gerald J.; And Others – 1983

Prior research in a single clinical training setting has shown Cason and Cason's (1981) simplified model of their performance rating theory can improve rating reliability and validity through statistical control of rater stringency error. Here, the model was applied to clinical performance ratings of 14 cohorts (about 250 students and 200 raters)…

Descriptors: Clinical Experience, Error of Measurement, Evaluation Methods, Higher Education

Measurement Accuracy: An Application of Multidimensional Item Response Theory to the Woodcock-Johnson Psycho-Educational Battery-Revised Achievement Scales.

Download full text

Davison, Mark L.; Chang, Yu-Wen – 1992

A two-dimensional, compensatory item response model and a unidimensional model were fitted to the reading and mathematics items in the Woodcock-Johnson Psycho-Educational Battery-Revised for a sample of 1,000 adults aged 20-39 years. Multidimensional information theory predicts that if the unidimensional abilities can be represented as vectors in…

Descriptors: Achievement Tests, Adults, Equations (Mathematics), Error of Measurement

A Theoretical and Empirical Comparison of Three Approaches to Achievement Testing.

Haladyna, Tom; Roid, Gale – 1976

Three approaches to the construction of achievement tests are compared: construct, operational, and empirical. The construct approach is based upon classical test theory and measures an abstract representation of the instructional objectives. The operational approach specifies instructional intent through instructional objectives, facet design,…

Descriptors: Academic Achievement, Achievement Tests, Career Development, Comparative Analysis