ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	2

Source

Applied Psychological…

Author

Biswas, Ajoy Kumar	1
Collins, Linda M.	1
Culpepper, Steven Andrew	1
Drasgow, Fritz	1
Embretson, Susan E.	1
Humphreys, Lloyd G.	1
Williams, Richard H.	1
Zimmerman, Donald W.	1

Publication Type

Journal Articles	7
Reports - Evaluative	6
Book/Product Reviews	3
Reports - Descriptive	1

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

Armed Services Vocational…

What Works Clearinghouse Rating

Showing all 7 results Save | Export

The Reliability and Precision of Total Scores and IRT Estimates as a Function of Polytomous IRT Parameters and Latent Trait Distribution

Peer reviewed

Direct link

Culpepper, Steven Andrew – Applied Psychological Measurement, 2013

A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…

Descriptors: Item Response Theory, Reliability, Scores, Error of Measurement

Reliability of Total Test Scores When Considered as Ordinal Measurements

Peer reviewed

Direct link

Biswas, Ajoy Kumar – Applied Psychological Measurement, 2006

This article studies the ordinal reliability of (total) test scores. This study is based on a classical-type linear model of observed score (X), true score (T), and random error (E). Based on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small-examinee populations is developed. This measure is extended to large…

Descriptors: True Scores, Test Theory, Test Reliability, Scores

Is Reliability Obsolete? A Commentary on "Are Simple Gain Scores Obsolete?"

Peer reviewed

Collins, Linda M. – Applied Psychological Measurement, 1996

The clarification provided by Williams and Zimmerman on the reliability of gain scores is translated into recognizable patterns of change that tend to produce reliable or unreliable gain scores. The relevance of the traditional idea of reliability to the measurement of change is also discussed. (SLD)

Descriptors: Achievement Gains, Change, Measurement Techniques, Reliability

Item Response Theory Models and Spurious Interaction Effects in Factorial ANOVA Designs.

Peer reviewed

Embretson, Susan E. – Applied Psychological Measurement, 1996

Conditions under which interaction effects estimated from classical total scores, rather than item response theory trait scores, can be misleading are discussed with reference to analysis of variance (ANOVA). When no interaction effects exist on the true latent variable, spurious interaction effects can be observed from the total score scale. (SLD)

Descriptors: Analysis of Variance, Interaction, Item Response Theory, Models

Linear Dependence on Gain Scores in Their Components Imposes Constraints on Their Use and Interpretation: Comment on "Are Simple Gain Scores Obsolete?"

Peer reviewed

Humphreys, Lloyd G. – Applied Psychological Measurement, 1996

The reliability of a gain is determined by the reliabilities of the components, the correlation between them, and their standard deviations. Reliability is not inherently low, but the components of gains in many investigations make low reliability likely and require caution in the use of gain scores. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Commentary on the Commentaries of Collins and Humphreys.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Applied Psychological Measurement, 1996

The critiques by L. Collins and L. Humphreys in this issue illustrate problems with the use of gain scores. Collins' examples show that familiar formulas for the reliability of differences do not reflect the precision of measures of change. Additional examples demonstrate flaws in the conventional approach to reliability. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Modeling Incorrect Responses to Multiple-Choice Items with Multilinear Formula Score Theory.

Peer reviewed

Drasgow, Fritz; And Others – Applied Psychological Measurement, 1989

Multilinear formula scoring (MFS) is reviewed, with emphasis on estimating option characteristic curves (OCSs). MFS was used to estimate OCSs for the arithmetic reasoning subtest of the Armed Services Vocational Aptitude Battery for 2,978 examinees. A second analysis obtained OCSs for simulated data. The use of MFS is discussed. (SLD)

Descriptors: Estimation (Mathematics), Mathematical Models, Multiple Choice Tests, Scores

Scores	7
Test Theory	7
Measurement Techniques	4
Reliability	4
Achievement Gains	3
Change	3
Correlation	3
Error of Measurement	3
Item Response Theory	2
Analysis of Variance	1
Comparative Analysis	1
Computation	1
Equations (Mathematics)	1
Estimation (Mathematics)	1
Interaction	1
Mathematical Models	1
Measures (Individuals)	1
Models	1
Multiple Choice Tests	1
Scoring Formulas	1
Statistical Inference	1
Test Interpretation	1
Test Reliability	1
True Scores	1
More ▼