ERIC - Search Results

Showing 1 to 15 of 18 results

Modifying Spearman's Attenuation Equation to Yield Partial Corrections for Measurement Error--With Application to Sample Size Calculations

Peer reviewed

Nicewander, W. Alan – Educational and Psychological Measurement, 2018

Spearman's correction for attenuation (measurement error) corrects a correlation coefficient for measurement errors in either-or-both of two variables, and follows from the assumptions of classical test theory. Spearman's equation removes all measurement error from a correlation coefficient which translates into "increasing the reliability of…

Descriptors: Error of Measurement, Correlation, Sample Size, Computation
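For orientation, here is a minimal sketch of the classical full correction that the abstract above refers to. It is not Nicewander's modified partial correction, which the truncated abstract does not spell out. The function name and the example values are illustrative assumptions.

```python
import math


def disattenuate(r_xy: float, rel_x: float = 1.0, rel_y: float = 1.0) -> float:
    """Spearman's full correction for attenuation.

    Divides the observed correlation by the square root of the product
    of the two reliabilities. Leaving a reliability at 1.0 corrects for
    error in only the other variable.
    """
    if not (0.0 < rel_x <= 1.0 and 0.0 < rel_y <= 1.0):
        raise ValueError("reliabilities must lie in (0, 1]")
    return r_xy / math.sqrt(rel_x * rel_y)


# Hypothetical values: observed r = .40, both reliabilities = .80.
corrected = disattenuate(0.40, rel_x=0.80, rel_y=0.80)
print(round(corrected, 3))
```

With these assumed inputs the corrected correlation is about 0.5, because the denominator is sqrt(0.64) = 0.8.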

Measurement Error Correction Formula for Cluster-Level Group Differences in Cluster Randomized and Observational Studies

Peer reviewed

Cho, Sun-Joo; Preacher, Kristopher J. – Educational and Psychological Measurement, 2016

Multilevel modeling (MLM) is frequently used to detect cluster-level group differences in cluster randomized trial and observational studies. Group differences on the outcomes (posttest scores) are detected by controlling for the covariate (pretest scores) as a proxy variable for unobserved factors that predict future attributes. The pretest and…

Descriptors: Error of Measurement, Error Correction, Multivariate Analysis, Hierarchical Linear Modeling

The Reliability and Precision of Total Scores and IRT Estimates as a Function of Polytomous IRT Parameters and Latent Trait Distribution

Peer reviewed

Culpepper, Steven Andrew – Applied Psychological Measurement, 2013

A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…

Descriptors: Item Response Theory, Reliability, Scores, Error of Measurement

Generalizability Theory as a Unifying Framework of Measurement Reliability in Adolescent Research

Peer reviewed

Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014

In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…

Descriptors: Generalizability Theory, Measurement, Reliability, Correlation

Taking the Error Term of the Factor Model into Account: The Factor Score Predictor Interval

Peer reviewed

Beauducel, Andre – Applied Psychological Measurement, 2013

The problem of factor score indeterminacy implies that the factor and the error scores cannot be completely disentangled in the factor model. It is therefore proposed to compute Harman's factor score predictor that contains an additive combination of factor and error variance. This additive combination is discussed in the framework of classical…

Descriptors: Factor Analysis, Predictor Variables, Reliability, Error of Measurement

Defensible Progress Monitoring Data for Medium- and High-Stakes Decisions

Peer reviewed

Parker, Richard I.; Vannest, Kimberly J.; Davis, John L.; Clemens, Nathan H. – Journal of Special Education, 2012

Within a response to intervention model, educators increasingly use progress monitoring (PM) to support medium- to high-stakes decisions for individual students. For PM to serve these more demanding decisions requires more careful consideration of measurement error. That error should be calculated within a fixed linear regression model rather than…

Descriptors: Measurement, Computation, Response to Intervention, Regression (Statistics)

Errors of Measurement, Theory, and Public Policy. William H. Angoff Memorial Lecture Series

Kane, Michael – Educational Testing Service, 2010

The 12th annual William H. Angoff Memorial Lecture was presented by Dr. Michael T. Kane, ETS's (Educational Testing Service) Samuel J. Messick Chair in Test Validity and the former Director of Research at the National Conference of Bar Examiners. Dr. Kane argues that it is important for policymakers to recognize the impact of errors of measurement…

Descriptors: Error of Measurement, Scores, Public Policy, Test Theory

Correcting Fallacies in Validity, Reliability, and Classification

Peer reviewed

Sijtsma, Klaas – International Journal of Testing, 2009

This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…

Descriptors: Construct Validity, Reliability, Classification, Test Theory

The Paradoxical Attenuation Effect in Tests Based on Classical Test Theory: Mathematical Background and Practical Implications for the Measurement of High Abilities

Peer reviewed

Ziegler, Albert; Ziegler, Albert – High Ability Studies, 2009

The aim of this paper is to demonstrate the dramatic consequences the application of cut-off points can have in the practice of identifying gifted individuals. The paradoxical attenuation effect describes the frequent situation in which measurements of the gifts and talents individuals possess are lower than their true values. However, in…

Descriptors: Gifted, Academic Achievement, Test Theory, Measurement

Standardized Conditional "SEM": A Case for Conditional Reliability

Peer reviewed

Raju, Nambury S.; Price, Larry R.; Oshima, T. C.; Nering, Michael L. – Applied Psychological Measurement, 2007

An examinee-level (or conditional) reliability is proposed for use in both classical test theory (CTT) and item response theory (IRT). The well-known group-level reliability is shown to be the average of conditional reliabilities of examinees in a group or a population. This relationship is similar to the known relationship between the square of…

Descriptors: Item Response Theory, Error of Measurement, Reliability, Test Theory

Estimation of Reliability Coefficients Using the Test Information Function and Its Modifications.

Peer reviewed

Samejima, Fumiko – Applied Psychological Measurement, 1994

The reliability coefficient is predicted from the test information function (TIF) or two modified TIF formulas and a specific trait distribution. Examples illustrate the variability of the reliability coefficient across different trait distributions, and results are compared with empirical reliability coefficients. (SLD)

Descriptors: Adaptive Testing, Error of Measurement, Estimation (Mathematics), Reliability

Linear Dependence of Gain Scores on Their Components Imposes Constraints on Their Use and Interpretation: Comment on "Are Simple Gain Scores Obsolete?"

Peer reviewed

Humphreys, Lloyd G. – Applied Psychological Measurement, 1996

The reliability of a gain is determined by the reliabilities of the components, the correlation between them, and their standard deviations. Reliability is not inherently low, but the components of gains in many investigations make low reliability likely and require caution in the use of gain scores. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement
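The dependence described in this abstract can be sketched with the familiar classical test theory formula for the reliability of a difference score. This is an illustration under the usual assumption of uncorrelated measurement errors, not code from the article; the function name and the input values are hypothetical.

```python
def gain_score_reliability(sd_x: float, sd_y: float,
                           rel_x: float, rel_y: float,
                           r_xy: float) -> float:
    """Reliability of the gain D = Y - X under classical test theory.

    Numerator: true-score variance of the gain.
    Denominator: observed-score variance of the gain.
    Assumes the measurement errors of X and Y are uncorrelated.
    """
    covariance = r_xy * sd_x * sd_y
    true_var = sd_x ** 2 * rel_x + sd_y ** 2 * rel_y - 2 * covariance
    observed_var = sd_x ** 2 + sd_y ** 2 - 2 * covariance
    if observed_var <= 0:
        raise ValueError("observed gain variance must be positive")
    return true_var / observed_var


# Hypothetical pretest/posttest with equal SDs and reliabilities of .80.
for r in (0.3, 0.5, 0.7):
    print(r, round(gain_score_reliability(1.0, 1.0, 0.80, 0.80, r), 3))
```

In this sketch the reliability of the gain falls as the pretest-posttest correlation rises, which is the kind of component configuration the abstract says makes low reliability likely.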

Commentary on the Commentaries of Collins and Humphreys.

Peer reviewed

Williams, Richard H.; Zimmerman, Donald W. – Applied Psychological Measurement, 1996

The critiques by L. Collins and L. Humphreys in this issue illustrate problems with the use of gain scores. Collins' examples show that familiar formulas for the reliability of differences do not reflect the precision of measures of change. Additional examples demonstrate flaws in the conventional approach to reliability. (SLD)

Descriptors: Achievement Gains, Change, Correlation, Error of Measurement

Accuracy of Two Procedures for Estimating Reliability of Mastery Tests. Research Memorandum 79-1.

Huynh, Huynh; Saunders, Joseph C. – 1979

Comparisons were made among various methods of estimating the reliability of pass-fail decisions based on mastery tests. The reliability indices that are considered are p, the proportion of agreements between two estimates, and kappa, the proportion of agreements corrected for chance. Estimates of these two indices were made on the basis of…

Descriptors: Cutting Scores, Error of Measurement, Mastery Tests, Reliability
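The two indices named in this abstract can be sketched from a cross-classification of pass-fail decisions on two administrations. This is only the textbook definition of raw agreement and Cohen's kappa; it does not reproduce the estimation procedures compared in the report, and the function name and counts are hypothetical.

```python
def agreement_indices(table):
    """Return (p, kappa) for a square table of classification counts.

    table[i][j] is the number of examinees placed in category i by the
    first estimate and in category j by the second.
    """
    k = len(table)
    n = sum(sum(row) for row in table)
    # Observed proportion of agreements (the diagonal cells).
    p_obs = sum(table[i][i] for i in range(k)) / n
    # Agreement expected by chance from the row and column marginals.
    row_marg = [sum(table[i]) / n for i in range(k)]
    col_marg = [sum(table[i][j] for i in range(k)) / n for j in range(k)]
    p_chance = sum(row_marg[i] * col_marg[i] for i in range(k))
    kappa = (p_obs - p_chance) / (1 - p_chance)
    return p_obs, kappa


# Hypothetical counts: rows = first decision, columns = second decision.
p, kappa = agreement_indices([[40, 10], [10, 40]])
print(round(p, 3), round(kappa, 3))
```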

When Can Subscores Have Value? Research Report. ETS RR-05-08

Peer reviewed

Haberman, Shelby J. – ETS Research Report Series, 2005

In educational tests, subscores are often generated from a portion of the items in a larger test. Guidelines based on mean-squared error are proposed to indicate whether subscores are worth reporting. Alternatives considered are direct reports of subscores, estimates of subscores based on total score, combined estimates based on subscores and…

Descriptors: Scores, Test Items, Error of Measurement, Computation
