Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 12 |
| Since 2017 (last 10 years) | 26 |
| Since 2007 (last 20 years) | 90 |
Descriptor
| True Scores | 416 |
| Error of Measurement | 121 |
| Test Reliability | 110 |
| Statistical Analysis | 107 |
| Mathematical Models | 97 |
| Item Response Theory | 87 |
| Correlation | 76 |
| Equated Scores | 76 |
| Reliability | 64 |
| Test Theory | 52 |
| Test Items | 51 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Audience
| Researchers | 12 |
| Practitioners | 2 |
| Administrators | 1 |
| Teachers | 1 |
Location
| Australia | 1 |
| Canada | 1 |
| China | 1 |
| Colorado | 1 |
| Illinois | 1 |
| Israel | 1 |
| New York | 1 |
| Oregon | 1 |
| Taiwan | 1 |
| Texas | 1 |
| United Kingdom (England) | 1 |
| More ▼ | |
Laws, Policies, & Programs
| Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedKristof, Walter – Psychometrika, 1973
Paper is concerned with the hypothesis that two variables have a perfect disattenuated correlation, hence measure the same trait except for errors of measurement. (Author/RK)
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Mathematical Models
Peer reviewedWerts, Charles E.; Linn, Robert L. – Educational and Psychological Measurement, 1972
The general problem of using group status to estimate true scores given multiple measures is considered in this paper. (Authors)
Descriptors: Error of Measurement, Group Status, Mathematical Applications, Multiple Regression Analysis
Peer reviewedCollet, Leverne S. – Journal of Educational Measurement, 1971
The purpose of this paper was to provide an empirical test of the hypothesis that elimination scores are more reliable and valid than classical corrected-for-guessing scores or weighted-choice scores. The evidence presented supports the hypothesized superiority of elimination scoring. (Author)
Descriptors: Evaluation, Guessing (Tests), Multiple Choice Tests, Scoring Formulas
Peer reviewedWinne, Philip H.; Belfry, M. Joan – Journal of Educational Measurement, 1982
This review of issues about correcting for attenuation concludes that the basic difficulty lies in being able to identify and equate sources of variance in estimates of validity and reliability. Recommendations are proposed for cautious use of correction for attenuation. (Author/CM)
Descriptors: Correlation, Error of Measurement, Research Methodology, Statistical Analysis
Peer reviewedWilcox, Rand R. – Educational and Psychological Measurement, 1981
The paper considers the problem of selecting the t best of k normal populations and simultaneously determining whether the selected populations have a mean larger than a known standard. Illustrations are given for selecting the t best of k examinees when the binomial error model applies. (Author)
Descriptors: Competitive Selection, Criterion Referenced Tests, Decision Making, Mathematical Models
Peer reviewedSchulman, Robert S. – Psychometrika, 1979
An alternative to the uniform probability distribution model for ordinal data is considered. Implications for statistics and for test theory are discussed. (JKS)
Descriptors: Career Development, Correlation, Mathematical Models, Nonparametric Statistics
Peer reviewedKleven, Thor Arnfinn – Scandinavian Journal of Educational Research, 1979
Supposing different values of the standard measurement error, the relation of scale coarseness to the total amount of error is studied on the basis of probability distribution of error. The analyses are performed within two models of error and with two criteria of amount of error. (Editor/SJL)
Descriptors: Cutting Scores, Error of Measurement, Goodness of Fit, Grading
Peer reviewedKolen, Michael J. – Journal of Educational Measurement, 1991
Estimation/smoothing methods that are flexible enough to fit a wide variety of test score distributions are reviewed: kernel method, strong true-score model-based method, and method that uses polynomial log-linear models. Applications of these methods include describing/comparing test score distributions, estimating norms, and estimating…
Descriptors: Comparative Analysis, Equated Scores, Equations (Mathematics), Estimation (Mathematics)
Peer reviewedBird, Kevin D. – Educational and Psychological Measurement, 1991
A method is outlined for analysis of the shape of an individual profile of scores on a standardized test battery. The method uses a simultaneous test procedure allowing for an overall test of profile flatness, with follow-up tests on all contrasts of interest. (SLD)
Descriptors: Equations (Mathematics), Hypothesis Testing, Mathematical Models, Profiles
Peer reviewedDe Champlain, Andre F. – Journal of Educational Measurement, 1996
Assessed the dimensionality of two forms of a large-scale standardized test separately for three ethnic groups of examinees. Investigated whether differences in their latent trait composites have impacts on unidimensional item response theory true-score equating functions. On both forms, a two-dimensional model accounted for item responses of…
Descriptors: Black Students, Diversity (Student), Equated Scores, Ethnicity
Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2006
Assuming errors of measurement are distributed binomially, this article reviews various procedures for constructing an interval for an individual's true number-correct score; presents two general interval estimation procedures for an individual's true scale score (i.e., normal approximation and endpoints conversion methods); compares various…
Descriptors: Probability, Intervals, Guidelines, Computer Simulation
Yang, Wen-Ling – 1997
Using an anchor-item design of test equating, the effects of three equating methods (Tucker linear and two three-parameter item-response-theory-based (3PL-IRT) methods), and the content representativeness of anchor items on the accuracy of equating were examined; and an innovative way of evaluating equating accuracy appropriate for the particular…
Descriptors: Equated Scores, Item Response Theory, Raw Scores, Test Construction
Lord, Frederic M.; Wingersky, Marilyn S. – 1983
Two methods of 'equating' tests using item response theory (IRT) are compared, one using true scores, the other using the estimated distribution of observed scores. On the data studied, they yield almost indistinguishable results. This is a reassuring result for users of IRT equating methods. (Author)
Descriptors: Comparative Analysis, Equated Scores, Estimation (Mathematics), Latent Trait Theory
Reuman, David A.; And Others – 1982
According to classical test theory, the presence of random measurement error in a psychological test has important implications for validation studies. The more comprehensive application of classical test theory in construct validation is distinguished from that in criterion-oriented validation. Critics of thematic apperceptive measurement of the…
Descriptors: Academic Achievement, Achievement Need, Adults, Error of Measurement
Peer reviewedRichards, James M.; And Others – Educational and Psychological Measurement, 1975
Descriptors: Academic Ability, Academic Achievement, Behavior Change, Computer Programs

Direct link
