Thomas, Michael L. – Assessment, 2011
Item response theory (IRT) and related latent variable models represent modern psychometric theory, the successor to classical test theory in psychological assessment. Although IRT has become prevalent in the measurement of ability and achievement, its contributions to clinical domains have been less extensive. Applications of IRT to clinical…
Descriptors: Item Response Theory, Psychological Evaluation, Reliability, Error of Measurement

Raykov, Tenko – Multivariate Behavioral Research, 1997
The population discrepancy between Cronbach's coefficient alpha (L. Cronbach, 1951) and scale reliability with fixed congeneric measures, uncorrelated errors, and sampling of subjects was studied. The difference is expressed in terms of the individual component violations of the assumption of essential tau-equivalence that is necessary and sufficient…
Descriptors: Error of Measurement, Reliability, Sampling, Scaling
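The relationship summarized in this abstract can be illustrated with a minimal sketch. It assumes a one-factor congeneric model with factor variance 1 and uncorrelated errors, and it uses the standard population expressions for alpha and for the reliability of the unit-weighted sum. The function names and the loading and error-variance values are hypothetical, and the code is not the article's own derivation.

```python
def congeneric_reliability(loadings, error_vars):
    """Reliability of the unit-weighted sum score under a one-factor
    congeneric model (factor variance fixed at 1, uncorrelated errors)."""
    true_var = sum(loadings) ** 2
    return true_var / (true_var + sum(error_vars))


def population_alpha(loadings, error_vars):
    """Population coefficient alpha implied by the same model."""
    k = len(loadings)
    # Each component's variance is its squared loading plus its error variance.
    component_vars = [l * l + e for l, e in zip(loadings, error_vars)]
    total_var = sum(loadings) ** 2 + sum(error_vars)
    return k / (k - 1) * (1 - sum(component_vars) / total_var)


# Hypothetical values: equal loadings (tau-equivalent) vs. unequal loadings.
equal = ([0.7, 0.7, 0.7, 0.7], [0.5, 0.5, 0.5, 0.5])
unequal = ([0.9, 0.7, 0.5, 0.3], [0.5, 0.5, 0.5, 0.5])

for name, (lam, theta) in {"equal": equal, "unequal": unequal}.items():
    print(name, population_alpha(lam, theta), congeneric_reliability(lam, theta))
```

With equal loadings the two quantities coincide. With unequal loadings alpha falls below the scale reliability, and the size of the gap depends on how far the loadings depart from equality.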
Moses, Tim; Kim, Sooyeon – ETS Research Report Series, 2007
This study evaluated the impact of unequal reliability on test equating methods in the nonequivalent groups with anchor test (NEAT) design. Classical true score-based models were compared in terms of their assumptions about how reliability impacts test scores. These models were related to treatment of population ability differences by different…
Descriptors: Reliability, Equated Scores, Test Items, Statistical Analysis

Feldt, Leonard S.; Qualls, Audrey L. – Applied Measurement in Education, 1998
Two relatively simple methods for estimating the conditional standard error of measurement (SEM) for nonlinearly derived score scales are proposed. Applications indicate that these two procedures produce fairly consistent estimates that tend to peak near the high end of the scale and reach a minimum in the middle of the raw score scale. (SLD)
Descriptors: Error of Measurement, Estimation (Mathematics), Raw Scores, Reliability

Camilli, Gregory – Journal of Educational Measurement, 1999
Yen and Burket suggested that shrinkage in vertical equating cannot be understood apart from multidimensionality. Reviews research on reliability, multidimensionality, and scale shrinkage, and explores issues of practical importance to educators. (SLD)
Descriptors: Equated Scores, Error of Measurement, Item Response Theory, Reliability

Kolen, Michael J.; Zeng, Lingjia; Hanson, Bradley A. – Journal of Educational Measurement, 1996
Presents an Item Response Theory (IRT) method for estimating standard errors of measurement of scale scores for the situation in which scale scores are nonlinear transformations of number-correct scores. Also describes procedures for estimating the average conditional standard error of measurement for scale scores and the reliability of scale…
Descriptors: Error of Measurement, Estimation (Mathematics), Item Response Theory, Reliability

Wang, Tianyou; And Others – 1996
M. J. Kolen, B. A. Hanson, and R. L. Brennan (1992) presented a procedure for assessing the conditional standard error of measurement (CSEM) of scale scores using a strong true-score model. They also investigated ways of using a nonlinear transformation from number-correct raw scores to scale scores to equalize the conditional standard error along…
Descriptors: Ability, Classification, Error of Measurement, Goodness of Fit

Kolen, Michael J.; And Others – Journal of Educational Measurement, 1992
A procedure is described for estimating the reliability and conditional standard errors of measurement of scale scores incorporating the discrete transformation of raw scores to scale scores. The method is illustrated using a strong true score model, and practical applications are described. (SLD)
Descriptors: College Entrance Examinations, Equations (Mathematics), Error of Measurement, Estimation (Mathematics)
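
The idea of a conditional standard error for scale scores obtained through a discrete raw-to-scale conversion can be sketched as follows. This sketch assumes a simple binomial error model conditional on a true proportion-correct score. That is a simplification chosen for illustration and may differ from the strong true score model used in the article. The function name, the conversion table, and all numeric values are hypothetical.

```python
from math import comb, sqrt


def scale_score_csem(p_true, n_items, raw_to_scale):
    """Conditional SEM of the scale score for an examinee whose true
    proportion-correct is p_true, assuming binomially distributed raw scores.

    raw_to_scale[x] is the scale score assigned to raw score x (0..n_items).
    """
    # Probability of each possible raw score given the true proportion-correct.
    probs = [
        comb(n_items, x) * p_true**x * (1 - p_true) ** (n_items - x)
        for x in range(n_items + 1)
    ]
    # Mean and variance of the converted (scale) score over that distribution.
    mean = sum(p * raw_to_scale[x] for x, p in enumerate(probs))
    var = sum(p * (raw_to_scale[x] - mean) ** 2 for x, p in enumerate(probs))
    return sqrt(var)


# Hypothetical 10-item test with a nonlinear, rounded conversion table.
n_items = 10
conversion = [1, 1, 2, 3, 5, 8, 11, 13, 14, 15, 15]

for p in (0.2, 0.5, 0.8):
    print(p, scale_score_csem(p, n_items, conversion))
```

Because the conversion table is applied before the variance is taken, the conditional standard error reflects the shape of the transformation. Flat regions of the table compress error, and steep regions expand it.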