Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 1 |
Descriptor
Statistical Analysis | 35 |
Test Reliability | 35 |
True Scores | 35 |
Mathematical Models | 13 |
Error of Measurement | 11 |
Correlation | 10 |
Measurement Techniques | 9 |
Test Validity | 8 |
Analysis of Variance | 7 |
Criterion Referenced Tests | 7 |
Tests | 6 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 16 |
Journal Articles | 5 |
Speeches/Meeting Papers | 2 |
Reports - Evaluative | 1 |
Education Level
Elementary Education | 1 |
Elementary Secondary Education | 1 |
Grade 8 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
Trends in International… | 1 |
What Works Clearinghouse Rating
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

Kristof, Walter – Psychometrika, 1974
Descriptors: Models, Statistical Analysis, Test Reliability, Testing

Ramsay, J. O. – Educational and Psychological Measurement, 1971
The consequences of the assumption that the expected score is equal to the true score are shown and alternatives discussed. (MS)
Descriptors: Psychological Testing, Statistical Analysis, Test Reliability, Testing

Joe, George W.; Woodward, J. Arthur – Psychometrika, 1976
This article is concerned with estimation of components of maximum generalizability in multifacet experimental designs involving multiple dependent measures. An example of a two-facet partially nested design is provided. (Author/RC)
Descriptors: Analysis of Variance, Correlation, Matrices, Reliability

Charter, Richard A.; Feldt, Leonard S. – Measurement and Evaluation in Counseling and Development, 2002
Presented is a detailed description of two true score confidence interval approaches, their use, interpretation, and a philosophical conflict that arises in many applied instances. (Contains 27 references.) (Author)
Descriptors: Error of Measurement, Psychometrics, Research Methodology, Statistical Analysis

Lu, K. H. – Educational and Psychological Measurement, 1971
Descriptors: Difficulty Level, Statistical Analysis, Statistical Significance, Test Items

Cureton, Edward E. – Educational and Psychological Measurement, 1971
A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)
Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability

Conger, Anthony J. – Multivariate Behavioral Research, 1974
Two indices of profile reliability are shown to be equivalent in terms of the individual independent canonical composites; however, because of different weighting procedures, they yield different overall indices of profile reliability. A common formula is provided from which both indices can be derived. (Author)
Descriptors: Analysis of Variance, Correlation, Matrices, Measurement Techniques
Stocking, Martha; And Others – 1973
For two tests measuring the same trait, the program, BIV20, equates the scores using the two True score distributions estimated by the univariate method 20 program (see Wingersky, Lees, Lennon, and Lord, 1969) and, with these equated true scores and their distributions, estimates the bivariate distribution scores and the relative efficiency of the…
Descriptors: Computer Programs, Equated Scores, Statistical Analysis, Test Reliability
Livingston, Samuel A. – 1970
The procedure of estimating true scores by means of a transformation of the obtained score based on the reliability coefficient is compared with the use of the obtained score without transformation. Using the mean squared error as a criterion, the transformed score is a better estimate for most examinees but poorer for those whose true scores lie…
Descriptors: Analysis of Variance, Measurement, Raw Scores, Scores

Zimmerman, Donald W. – Educational and Psychological Measurement, 1976
Using the concepts of conditional probability, conditional expectation, and conditional independence, the main results of the classical test theory model can be derived in a very few steps with minimal assumptions. The present effort explores the possibility that present classical test theories can be further condensed. (Author/RC)
Descriptors: Career Development, Correlation, Mathematical Models, Measurement

Marks, Edmond; Martin, Charles G. – American Educational Research Journal, 1973
Purpose of this study was to examine the effects of the true change-true initial score correlation on one aspect of the true simple change estimate, namely its error variance. (Authors/CB)
Descriptors: Analysis of Variance, Mathematical Applications, Measurement Techniques, Scoring Formulas
Gavin, Anne T.; Martin, Charles G. – 1976
A procedure for estimating the degree to which a subtest uniquely contributes to total test performance is presented and discussed. Uniqueness analysis may be appropriately applied to any composite measurement instrument such as a multipart test or a multitest battery to assess the unique contribution of each component to the total test. The…
Descriptors: Aptitude Tests, Correlation, Occupational Tests, Scores

Th.van der Kamp, Leo J.; Mellenbergh, Gideon J. – Educational and Psychological Measurement, 1976
Joreskog's model of cogeneric tests is used to analyze agreement between raters. Raters are treated as measuring instruments. The model of cogeneric tests, of which classical parallelism and tau-equivalence are shown to be special cases, is applied to teachers' ratings of students' responses on open-end questions. (Author/RC)
Descriptors: Goodness of Fit, Mathematical Models, Rating Scales, Statistical Analysis

Winne, Philip H.; Belfry, M. Joan – Journal of Educational Measurement, 1982
This review of issues about correcting for attenuation concludes that the basic difficulty lies in being able to identify and equate sources of variance in estimates of validity and reliability. Recommendations are proposed for cautious use of correction for attenuation. (Author/CM)
Descriptors: Correlation, Error of Measurement, Research Methodology, Statistical Analysis