Showing 1 to 15 of 110 results
Peer reviewed
Direct link
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although an extensive body of research exists on subscores and their properties, little research has examined categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Peer reviewed
Direct link
Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023
We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed that, for unidimensional scales, (a) all indices except omega h performed similarly well under most conditions; (b) alpha still performed well; (c) GLB and coefficient H overestimated reliability with small…
Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length
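The indices compared in the entry above have standard textbook definitions. For reference (these formulas are general background, not results from the article), coefficient alpha and omega total for a k-item scale can be written as:

```latex
% Standard definitions for a k-item scale (background, not from the article):
\alpha = \frac{k}{k-1}\left(1 - \frac{\sum_{i=1}^{k}\sigma_i^2}{\sigma_X^2}\right),
\qquad
\omega_{\text{total}} = \frac{\bigl(\sum_{i=1}^{k}\lambda_i\bigr)^2}
                             {\bigl(\sum_{i=1}^{k}\lambda_i\bigr)^2 + \sum_{i=1}^{k}\psi_i}
```

Here sigma_i^2 is the variance of item i, sigma_X^2 the total-score variance, and lambda_i and psi_i the loadings and unique variances of a single-factor model. Omega RT, omega h, GLB, and coefficient H have their own definitions; the article compares all of them under the same simulation conditions.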
Peer reviewed
Direct link
Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017
Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…
Descriptors: Test Bias, Test Reliability, Performance, Scores
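The reliability criterion named in this abstract is the classical ratio of true-score to observed-score variance, and the IRT counterpart it mentions is the standard error of the ability estimate. As a point of reference (standard definitions, not findings of the study):

```latex
% Classical reliability and IRT standard error of ability (textbook definitions):
\rho_{XX'} = \frac{\sigma_T^2}{\sigma_X^2} = \frac{\sigma_T^2}{\sigma_T^2 + \sigma_E^2},
\qquad
\operatorname{SE}(\hat\theta) = \frac{1}{\sqrt{I(\theta)}}
```

where I(theta) is the test information function. DIF that shifts item parameters across groups changes both quantities, which is the kind of measurement consequence the simulations examine.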
Peer reviewed
PDF on ERIC: Download full text
Öztürk-Gübes, Nese; Kelecioglu, Hülya – Educational Sciences: Theory and Practice, 2016
The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…
Descriptors: Test Format, Item Response Theory, True Scores, Equated Scores
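For context on the two equating methods named above (standard definitions, independent of this study's mixed-format design): IRT true-score equating maps number-correct true scores on the two forms through the common ability scale,

```latex
% IRT true-score equating (standard formulation):
\tau_X(\theta) = \sum_{i \in X} P_i(\theta), \qquad
\tau_Y(\theta) = \sum_{j \in Y} P_j(\theta);
\quad \text{for a form-X score } t, \ \text{solve } \tau_X(\theta^\ast) = t
\ \text{and equate } t \mapsto \tau_Y(\theta^\ast)
```

where P_i(theta) is the expected score on item i at ability theta. IRT observed-score equating instead generates each form's model-implied observed-score distribution from the item parameters and applies conventional equipercentile equating to those distributions.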
Peer reviewed
Traub, Ross E.; Rowley, Glenn L. – Educational Measurement: Issues and Practice, 1991
The idea of test consistency is illustrated, with reference to two sets of test scores. A mathematical model is used to explain the relative consistency and relative inconsistency of measurements, and a means of indexing reliability is derived using the model. Practical aspects of estimating reliability are considered. (TJH)
Descriptors: Mathematical Models, Test Reliability, True Scores
Peer reviewed
Morrison, Donald G. – Psychometrika, 1981
A simple stochastic model is formulated to determine the optimal interval between the first and second administrations when the test-retest method of assessing reliability is used. A forgetting process and a change-in-true-score process are postulated. Some numerical examples and suggestions are presented. (Author/JKS)
Descriptors: Correlation, Test Reliability, Test Theory, True Scores
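A minimal illustration of why the retest interval matters (my own sketch under classical assumptions, not Morrison's model): with uncorrelated errors, the observed test-retest correlation mixes reliability with true-score stability,

```latex
% Test-retest correlation under classical assumptions with uncorrelated errors:
\operatorname{Corr}(X_1, X_2) \;=\; \rho_{T_1 T_2}\,
\sqrt{\rho_{X_1 X_1'}\;\rho_{X_2 X_2'}}
```

so too short an interval risks carryover and memory effects that violate the error assumptions, while too long an interval lets the true-score correlation fall as true scores change. The article's stochastic model makes this trade-off explicit by postulating a forgetting process and a true-score change process.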
Peer reviewed
Direct link
Biswas, Ajoy Kumar – Applied Psychological Measurement, 2006
This article studies the ordinal reliability of (total) test scores, based on a classical-type linear model of observed score (X), true score (T), and random error (E). Building on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small examinee populations is developed. This measure is extended to large…
Descriptors: True Scores, Test Theory, Test Reliability, Scores
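Kendall's tau-a, the building block named in this abstract, counts concordant minus discordant examinee pairs over all pairs. Below is a minimal sketch of that coefficient applied to scores from two administrations; the function name and the toy scores are illustrative, and the article's ordinal-reliability measure is developed from this idea rather than being this computation itself.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) pairs divided by all n(n-1)/2 pairs.
    Tied pairs count as neither concordant nor discordant."""
    n = len(x)
    concordant = discordant = 0
    for i, j in combinations(range(n), 2):
        s = (x[i] - x[j]) * (y[i] - y[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
    return (concordant - discordant) / (n * (n - 1) / 2)

# Illustrative use: agreement in examinee ordering across two administrations.
# (Hypothetical scores; an ordinal-reliability estimator would aggregate such comparisons.)
form_a = [12, 18, 15, 20, 9]
form_b = [11, 17, 16, 19, 10]
print(kendall_tau_a(form_a, form_b))
```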
Peer reviewed
Kristof, Walter – Psychometrika, 1974
Descriptors: Models, Statistical Analysis, Test Reliability, Testing
Peer reviewed
Nicewander, W. Alan – Psychometrika, 1975
Shows that the Harris factors of R have psychometric properties similar to those discussed by Kaiser and Caffrey (1965) and Bentler (1968). Specifically, it is shown that the Harris factors of R maximize a lower bound to the reliability of a composite measure derived by Guttman (1945). (Author/RC)
Descriptors: Correlation, Factor Analysis, Matrices, Prediction
Peer reviewed
Marks, Edmond; Lindsay, Carl A. – Journal of Educational Measurement, 1972
Examines the effects of four parameters on the accuracy of test equating under a relaxed definition of test form equivalence. The four parameters studied were sample size, test form length, test form reliability, and the correlation between true scores of the test forms to be equated. (CK)
Descriptors: Scores, Test Interpretation, Test Reliability, Test Results
Peer reviewed
Ramsay, J. O. – Educational and Psychological Measurement, 1971
The consequences of the assumption that the expected score is equal to the true score are shown and alternatives discussed. (MS)
Descriptors: Psychological Testing, Statistical Analysis, Test Reliability, Testing
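The assumption at issue in the entry above is the classical identification of the true score with the expected observed score over replications; in symbols (standard notation, not the article's):

```latex
% Classical definition of true score as expected observed score:
T_p \equiv \mathcal{E}(X_p), \qquad E_p = X_p - T_p, \qquad \mathcal{E}(E_p) = 0
```

so the error term is mean-zero by construction; the article examines what follows when this identification is treated as an assumption and what the alternatives look like.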
Peer reviewed
Bowers, John – Educational and Psychological Measurement, 1971
Descriptors: Error of Measurement, Mathematical Models, Test Reliability, True Scores
Peer reviewed
Conger, Anthony J. – Educational and Psychological Measurement, 1980
Reliability maximizing weights are related to theoretically specified true-score scaling weights to show a constant relationship that is invariant under separate linear transformations on each variable in the system. Test-theoretic relations should be derived for the most general model available and not for unnecessarily constrained models.…
Descriptors: Mathematical Formulas, Scaling, Test Reliability, Test Theory
Peer reviewed
Schulman, Robert S.; Haden, Richard L. – Psychometrika, 1975
A model is proposed for the description of ordinal test scores based on the definition of true score as expected rank; its derivations are compared with results from classical test theory. An unbiased estimator of population true score from sample data is calculated. Score variance and population reliability are examined. (Author/BJG)
Descriptors: Career Development, Mathematical Models, Test Reliability, Test Theory
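The definition underlying this model, written out (the sampling details and the unbiased estimator are in the article):

```latex
% Ordinal true score defined as an expected rank:
\tau_p \;=\; \mathcal{E}(R_p)
```

where R_p is examinee p's rank in the group over replications of the test, so the ordinal true score is an expected rank rather than an expected raw score.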
Peer reviewed
Ng, K. T. – Educational and Psychological Measurement, 1974
This paper is aimed at demonstrating that Charles Spearman postulated neither a platonic true-error distinction nor a requirement for constant true scores under repeated measurement. (Author/RC)
Descriptors: Career Development, Correlation, Models, Test Reliability