ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	1

Descriptor

Statistical Analysis	35
Test Reliability	35
True Scores	35
Mathematical Models	13
Error of Measurement	11
Correlation	10
Measurement Techniques	9
Test Validity	8
Analysis of Variance	7
Criterion Referenced Tests	7
Tests	6
Raw Scores	5
Scores	5
Test Interpretation	5
Test Theory	5
Testing	5
Comparative Analysis	4
Decision Making	4
Statistical Significance	4
Test Construction	4
Academic Achievement	3
Career Development	3
Computer Programs	3
Equated Scores	3
Evaluation Methods	3
More ▼

Source

Educational and Psychological…	8
Psychometrika	4
American Educational Research…	1
Applied Psychological…	1
Educational Sciences: Theory…	1
Journal of Educational…	1
Measurement and Evaluation in…	1
Multivariate Behavioral…	1

Publication Type

Reports - Research	16
Journal Articles	5
Speeches/Meeting Papers	2
Reports - Evaluative	1

Education Level

Elementary Education	1
Elementary Secondary Education	1
Grade 8	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…

What Works Clearinghouse Rating

Showing 1 to 15 of 35 results Save | Export

The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

Peer reviewed
PDF on ERIC

Download full text

Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016

The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores

Estimation of Reliability and True Score Variance From a Split of a Test Into Three Arbitrary Parts

Peer reviewed

Kristof, Walter – Psychometrika, 1974

Descriptors: Models, Statistical Analysis, Test Reliability, Testing

True Score Theory: A Paradox

Peer reviewed

Ramsay, J. O. – Educational and Psychological Measurement, 1971

The consequences of the assumption that the expected score is equal to the true score are shown and alternatives discussed. (MS)

Descriptors: Psychological Testing, Statistical Analysis, Test Reliability, Testing

Some Developments in Multivariate Generalizability

Peer reviewed

Joe, George W.; Woodward, J. Arthur – Psychometrika, 1976

This article is concerned with estimation of components of maximum generalizability in multifacet experimental designs involving multiple dependent measures. An example of a two-facet partially nested design is provided. (Author/RC)

Descriptors: Analysis of Variance, Correlation, Matrices, Reliability

The Importance of Reliability as It Relates to True Score Confidence Intervals.

Peer reviewed

Charter, Richard A.; Feldt, Leonard S. – Measurement and Evaluation in Counseling and Development, 2002

Presented is a detailed description of two true score confidence interval approaches, their use, interpretation, and a philosophical conflict that arises in many applied instances. (Contains 27 references.) (Author)

Descriptors: Error of Measurement, Psychometrics, Research Methodology, Statistical Analysis

Statistical Control of "Impurity" in the Estimation of Test Reliability

Peer reviewed

Lu, K. H. – Educational and Psychological Measurement, 1971

Descriptors: Difficulty Level, Statistical Analysis, Statistical Significance, Test Items

The Stability Coefficient

Peer reviewed

Cureton, Edward E. – Educational and Psychological Measurement, 1971

A derivation of a formula for the stability coefficient is presented and discussed in terms of test reliability over time. (PR)

Descriptors: Error of Measurement, Raw Scores, Statistical Analysis, Test Reliability

Estimating Profile Reliability and Maximally Reliable Composites

Peer reviewed

Conger, Anthony J. – Multivariate Behavioral Research, 1974

Two indices of profile reliability are shown to be equivalent in terms of the individual independent canonical composites; however, because of different weighting procedures, they yield different overall indices of profile reliability. A common formula is provided from which both indices can be derived. (Author)

Descriptors: Analysis of Variance, Correlation, Matrices, Measurement Techniques

A Program for Estimating the Relative Efficiency of Tests at Various Ability Levels, for Equating True Scores, and for Predicting Bivariate Distributions of Observed Scores.

Download full text

Stocking, Martha; And Others – 1973

For two tests measuring the same trait, the program, BIV20, equates the scores using the two True score distributions estimated by the univariate method 20 program (see Wingersky, Lees, Lennon, and Lord, 1969) and, with these equated true scores and their distributions, estimates the bivariate distribution scores and the relative efficiency of the…

Descriptors: Computer Programs, Equated Scores, Statistical Analysis, Test Reliability

Some Observations on the Estimation of True Scores.

Livingston, Samuel A. – 1970

The procedure of estimating true scores by means of a transformation of the obtained score based on the reliability coefficient is compared with the use of the obtained score without transformation. Using the mean squared error as a criterion, the transformed score is a better estimate for most examinees but poorer for those whose true scores lie…

Descriptors: Analysis of Variance, Measurement, Raw Scores, Scores

Test Theory with Minimal Assumptions

Peer reviewed

Zimmerman, Donald W. – Educational and Psychological Measurement, 1976

Using the concepts of conditional probability, conditional expectation, and conditional independence, the main results of the classical test theory model can be derived in a very few steps with minimal assumptions. The present effort explores the possibility that present classical test theories can be further condensed. (Author/RC)

Descriptors: Career Development, Correlation, Mathematical Models, Measurement

Further Comments Relating to the Measurement of Change

Peer reviewed

Marks, Edmond; Martin, Charles G. – American Educational Research Journal, 1973

Purpose of this study was to examine the effects of the true change-true initial score correlation on one aspect of the true simple change estimate, namely its error variance. (Authors/CB)

Descriptors: Analysis of Variance, Mathematical Applications, Measurement Techniques, Scoring Formulas

A Procedure for Estimating the Unique Contribution of Each Component of a Composite Test: Uniqueness Analysis of Test 500. Technical Memorandum 76-8.

Download full text

Gavin, Anne T.; Martin, Charles G. – 1976

A procedure for estimating the degree to which a subtest uniquely contributes to total test performance is presented and discussed. Uniqueness analysis may be appropriately applied to any composite measurement instrument such as a multipart test or a multitest battery to assess the unique contribution of each component to the total test. The…

Descriptors: Aptitude Tests, Correlation, Occupational Tests, Scores

Agreement between Raters

Peer reviewed

Th.van der Kamp, Leo J.; Mellenbergh, Gideon J. – Educational and Psychological Measurement, 1976

Joreskog's model of cogeneric tests is used to analyze agreement between raters. Raters are treated as measuring instruments. The model of cogeneric tests, of which classical parallelism and tau-equivalence are shown to be special cases, is applied to teachers' ratings of students' responses on open-end questions. (Author/RC)

Descriptors: Goodness of Fit, Mathematical Models, Rating Scales, Statistical Analysis

Interpretive Problems When Correcting for Attenuation.

Peer reviewed

Winne, Philip H.; Belfry, M. Joan – Journal of Educational Measurement, 1982

This review of issues about correcting for attenuation concludes that the basic difficulty lies in being able to identify and equate sources of variance in estimates of validity and reliability. Recommendations are proposed for cautious use of correction for attenuation. (Author/CM)

Descriptors: Correlation, Error of Measurement, Research Methodology, Statistical Analysis

Previous Page | Next Page »

Pages: 1 | 2 | 3

Livingston, Samuel A.	3
Martin, Charles G.	2
Mellenbergh, Gideon J.	2
Belfry, M. Joan	1
Brennan, Robert L.	1
Bresler, Samuel	1
Cahan, Sorel	1
Charter, Richard A.	1
Conger, Anthony J.	1
Cureton, Edward E.	1
Donlon, Thomas F.	1
Feldt, Leonard S.	1
Gavin, Anne T.	1
Gleser, Leon Jay	1
Haladyna, Thomas	1
Harris, Chester W.	1
Horn, John L.	1
Joe, George W.	1
Joreskog, K. G.	1
Kelecioglu, Hülya	1
Kristof, Walter	1
Linn, Robert L.	1
Lu, K. H.	1
Marks, Edmond	1
More ▼