NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 8 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Almehrizi, Rashid S. – Applied Psychological Measurement, 2013
The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (a; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…
Descriptors: Raw Scores, Scaling, Reliability, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Yi, Qing; Harris, Deborah J.; Gao, Xiaohong – Applied Psychological Measurement, 2008
This study investigated the group invariance of equating results using a science achievement test. Examinees were divided into different subgroups based on the average composite score for test centers, whether they had taken a physics course, and self-reported science grade point average. The reason for dividing examinees into subgroups using such…
Descriptors: Grade Point Average, Science Achievement, Academic Achievement, Physics
Peer reviewed Peer reviewed
Brennan, Robert L. – Applied Psychological Measurement, 1998
Provides a comprehensive and integrated treatment of both conditional absolute standard errors of measurement (SEM) and conditional relative SEMs from the perspective of generalizability theory. Illustrates the approach with examples from commercial standardized tests. Examples support the conclusion that both types of conditional SEMs tend to be…
Descriptors: Error of Measurement, Generalizability Theory, Raw Scores, Standardized Tests
Peer reviewed Peer reviewed
Vander Linden, Wim J.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 1978
A general coefficient for tests, delta, is derived from a decision theoretic point of view. The situations are considered in which a true score is estimated by a function of the observed score, observed scores are split into more than two categories, and observed scores are split into only two categories. (Author/CTM)
Descriptors: Criterion Referenced Tests, Decision Making, Mathematical Models, Raw Scores
Peer reviewed Peer reviewed
van der Linden, Wim J.; Mellenbergh, Gideon J. – Applied Psychological Measurement, 1977
Using a linear loss function, a procedure is described for computing a cutting score that minimizes the risk for a given decision rule. The procedure is demonstrated with a criterion-referenced achievement test of elementary statistics administered to 167 students. (Author/CTM)
Descriptors: Cutting Scores, Higher Education, Latent Trait Theory, Mastery Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Tong, Ye; Kolen, Michael – Applied Psychological Measurement, 2005
The performance of three equating methods--the presmoothed equipercentile method, the item response theory (IRT) true score method, and the IRT observed score method--were examined based on three equating criteria: the same distributions property, the first-order equity property, and the second-order equity property. The magnitude of the…
Descriptors: True Scores, Criteria, Raw Scores, Item Response Theory
Peer reviewed Peer reviewed
Tinsley, Howard E. A.; Dawis, Rene V. – Applied Psychological Measurement, 1977
This research investigated the use of the Rasch simple logistic model in obtaining test-free ability estimates. Raw-score ability estimates were influenced by the difficulty of the items used in measurement but Rasch ability estimates were relatively independent of the item difficulty. (Author/CTM)
Descriptors: Equated Scores, Higher Education, Individual Testing, Item Analysis
Peer reviewed Peer reviewed
Downey, Ronald G. – Applied Psychological Measurement, 1979
This research attempted to interrelate several methods of producing option weights (i.e., Guttman internal and external weights and judges' weights) and examined their effects on reliability and on concurrent, predictive, and face validity. It was concluded that option weighting offered limited, if any, improvement over unit weighting. (Author/CTM)
Descriptors: Achievement Tests, Answer Keys, Comparative Testing, High Schools