Descriptor
Author
Borrello, Gloria M. | 1 |
Cope, Ronald T. | 1 |
Hwang, Dae-Yeop | 1 |
Kolen, Michael J. | 1 |
Lockwood, Robert E. | 1 |
Molenaar, Ivo W. | 1 |
Sarvela, Paul D. | 1 |
Sijtsma, Klaas | 1 |
Thompson, Bruce | 1 |
Wheeler, Patricia H. | 1 |
Yen, Wendy M. | 1 |
More ▼ |
Publication Type
Reports - Research | 7 |
Speeches/Meeting Papers | 5 |
Journal Articles | 2 |
Reports - Evaluative | 1 |
Education Level
Audience
Researchers | 4 |
Location
Laws, Policies, & Programs
Assessments and Surveys
ACT Assessment | 1 |
Alabama High School… | 1 |
What Works Clearinghouse Rating
Hwang, Dae-Yeop – 2002
This study compared classical test theory (CTT) and item response theory (IRT). The behavior of the item and person statistics derived from these two measurement frameworks was examined analytically and empirically using a data set obtained from BILOG (R. Mislay and D. Block, 1997). The example was a 15-item test with a sample size of 600…
Descriptors: Comparative Analysis, Measurement Techniques, Scores, Statistical Distributions

Yen, Wendy M. – Journal of Educational Measurement, 1986
Two methods of constucting equal-interval scales for educational achievement are discussed: Thurstone's absolute scaling method and Item Response Theory. Alternative criteria for choosing a scale are contrasted. It is argued that clearer criteria are needed for judging the appropriateness and usefulness of alternative scaling procedures.…
Descriptors: Achievement Tests, Latent Trait Theory, Mathematical Models, Scaling
Wheeler, Patricia H. – 1993
A person's obtained score on a test provides an estimate of the individual's "true" score on that test. The obtained score is considered to have two parts, the true component and the error component. Classical test theory assumes that obtained scores for an individual over multiple administrations of the same test will lie symmetrically…
Descriptors: Cutting Scores, Error of Measurement, Scores, Statistical Distributions

Sijtsma, Klaas; Molenaar, Ivo W. – Psychometrika, 1987
Three methods for estimating reliability are studied within the context of nonparametric item response theory. Two were proposed originally by Mokken and a third is developed in this paper. Using a Monte Carlo strategy, these three estimation methods are compared with four "classical" lower bounds to reliability. (Author/JAZ)
Descriptors: Estimation (Mathematics), Latent Trait Theory, Measurement Techniques, Monte Carlo Methods
Cope, Ronald T.; Kolen, Michael J. – 1987
This study compared five density estimation techniques applied to samples from a population of 272,244 examinees' ACT English Usage and Mathematics Usage raw scores. Unsmoothed frequencies, kernel method, negative hypergeometric, four-parameter beta compound binomial, and Cureton-Tukey methods were applied to 500 replications of random samples of…
Descriptors: College Entrance Examinations, Estimation (Mathematics), Higher Education, Mathematical Models
Thompson, Bruce; Borrello, Gloria M. – 1987
Attitude measures frequently produce distributions of item scores that attenuate interitem correlations and thus also distort findings regarding the factor structure underlying the items. An actual data set involving 260 adult subjects' responses to 55 items on the Love Relationships Scale is employed to illustrate empirical methods for…
Descriptors: Adults, Analysis of Covariance, Attitude Measures, Correlation
Lockwood, Robert E.; And Others – 1986
Standards, passing scores, or cut scores have been seen as an element of criterion-referenced tests since their introduction. This paper discusses at least two issues surrounding the establishment of cut scores which appear to need clarification: (1) the theoretical definition of a cut score; and (2) decisions which must be made in selecting a…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, High Schools
Sarvela, Paul D. – 1986
Four discrimination indices were compared, using score distributions which were normal, bimodal, and negatively skewed. The score distributions were systematically varied to represent the common circumstances of a military training situation using criterion-referenced mastery tests. Three 20-item tests were administered to 110 simulated subjects.…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Item Analysis, Mastery Tests