ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	10

Descriptor

Error of Measurement	26
True Scores	26
Reliability	9
Estimation (Mathematics)	8
Item Response Theory	7
Scores	6
Computation	5
Mathematical Models	5
Raw Scores	5
Simulation	5
Test Theory	5
Correlation	4
Probability	4
Testing Problems	4
Achievement Tests	3
Bayesian Statistics	3
Classification	3
Comparative Analysis	3
Equations (Mathematics)	3
Evaluation Methods	3
Foreign Countries	3
Measurement Techniques	3
Models	3
Standardized Tests	3
Statistical Distributions	3
More ▼

Source

Journal of Educational…	6
Educational and Psychological…	3
Applied Psychological…	1
Canadian Journal of Program…	1
Educational Measurement:…	1
Educational Research	1
Journal of Consulting and…	1
Journal of Educational and…	1
Journal of Experimental…	1
Multivariate Behavioral…	1
National Center for Research…	1
Psychometrika	1
More ▼

Publication Type

Reports - Evaluative	26
Journal Articles	18
Speeches/Meeting Papers	3
Opinion Papers	1
Reports - Descriptive	1
Reports - Research	1

Education Level

Junior High Schools

Audience

Researchers

Location

Australia	1
Taiwan	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
Work Keys (ACT)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 26 results Save | Export

On the Relationship between Classical Test Theory and Item Response Theory: From One to the Other and Back

Peer reviewed

Direct link

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2016

The frequently neglected and often misunderstood relationship between classical test theory and item response theory is discussed for the unidimensional case with binary measures and no guessing. It is pointed out that popular item response models can be directly obtained from classical test theory-based models by accounting for the discrete…

Descriptors: Test Theory, Item Response Theory, Models, Correlation

Relationships of Measurement Error and Prediction Error in Observed-Score Regression

Peer reviewed

Direct link

Moses, Tim – Journal of Educational Measurement, 2012

The focus of this paper is assessing the impact of measurement errors on the prediction error of an observed-score regression. Measures are presented and described for decomposing the linear regression's prediction error variance into parts attributable to the true score variance and the error variances of the dependent variable and the predictor…

Descriptors: Error of Measurement, Prediction, Regression (Statistics), True Scores

A Response to an Article Published in "Educational Research"'s Special Issue on Assessment (June 2009). What Can Be Inferred about Classification Accuracy from Classification Consistency?

Peer reviewed

Direct link

Bramley, Tom – Educational Research, 2010

Background: A recent article published in "Educational Research" on the reliability of results in National Curriculum testing in England (Newton, "The reliability of results from national curriculum testing in England," "Educational Research" 51, no. 2: 181-212, 2009) suggested that: (1) classification accuracy can be…

Descriptors: National Curriculum, Educational Research, Testing, Measurement

An Alternative IRT Observed Score Equating Method. CRESST Report 751

Download full text

Kang, Taehoon; Chen, Troy T. – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2009

In this report, an alternative item response theory (IRT) observed score equating method was newly developed. The proposed equating method was illustrated with two real data sets and the equating results were compared to those of traditional IRT true score and IRT observed score equating methods. Using three loss indices, the new method appeared…

Descriptors: Equated Scores, Item Response Theory, True Scores, Methods

Posttraumatic Stress Disorder and Intimate Relationship Problems: A Meta-Analysis

Peer reviewed

Direct link

Taft, Casey T.; Watkins, Laura E.; Stafford, Jane; Street, Amy E.; Monson, Candice M. – Journal of Consulting and Clinical Psychology, 2011

Objective: The authors conducted a meta-analysis of empirical studies investigating associations between indices of posttraumatic stress disorder (PTSD) and intimate relationship problems to empirically synthesize this literature. Method: A literature search using PsycINFO, Medline, Published International Literature on Traumatic Stress (PILOTS),…

Descriptors: Aggression, Posttraumatic Stress Disorder, Doctoral Dissertations, Error of Measurement

A Modification to Angoff and Bookmarking Cut Scores to Account for the Imperfect Reliability of Test Scores

Peer reviewed

Direct link

MacCann, Robert G. – Educational and Psychological Measurement, 2008

It is shown that the Angoff and bookmarking cut scores are examples of true score equating that in the real world must be applied to observed scores. In the context of defining minimal competency, the percentage "failed" by such methods is a function of the length of the measuring instrument. It is argued that this length is largely…

Descriptors: True Scores, Cutting Scores, Minimum Competencies, Scores

Standard Errors of Estimated Latent Variable Scores with Estimated Structural Parameters

Peer reviewed

Direct link

Hoshino, Takahiro; Shigemasu, Kazuo – Applied Psychological Measurement, 2008

The authors propose a concise formula to evaluate the standard error of the estimated latent variable score when the true values of the structural parameters are not known and must be estimated. The formula can be applied to factor scores in factor analysis or ability parameters in item response theory, without bootstrap or Markov chain Monte…

Descriptors: Monte Carlo Methods, Markov Processes, Factor Analysis, Computation

Impact of Measurement Error on Statistical Power: Review of an Old Paradox.

Peer reviewed

Williams, Richard H.; And Others – Journal of Experimental Education, 1995

The paradox that a Student t-test based on pretest-posttest differences can attain its greatest power when the difference score reliability is zero was explained by demonstrating that power is not a mathematical function of reliability unless either true score variance or error score variance is constant. (SLD)

Descriptors: Error of Measurement, Power (Statistics), Pretests Posttests, Reliability

Shrinkage Estimation of Linear Combinations of True Scores.

Peer reviewed

Longford, Nicholas T. – Psychometrika, 1997

It is demonstrated that, in the presence of population information, a linear combination of true scores can be estimated more efficiently than by the same linear combination of the observed scores. Three criteria for optimality are discussed, but they yield the same solution, described as a multivariate shrinkage estimator. (Author/SLD)

Descriptors: Error of Measurement, Estimation (Mathematics), Multivariate Analysis, Population Distribution

On the Reliability of Categorically Scored Examinations

Peer reviewed

Direct link

Kupermintz, Haggai – Journal of Educational Measurement, 2004

A decision-theoretic approach to the question of reliability in categorically scored examinations is explored. The concepts of true scores and errors are discussed as they deviate from conventional psychometric definitions and measurement error in categorical scores is cast in terms of misclassifications. A reliability measure based on…

Descriptors: Test Reliability, Error of Measurement, Psychometrics, Test Theory

A Confirmatory Analysis of Item Reliability Trends (CAIRT): Differentiating True Score and Error Variance in the Analysis of Item Context Effects

Peer reviewed

Direct link

Hartig, Johannes; Holzel, Britta; Moosbrugger, Helfried – Multivariate Behavioral Research, 2007

Numerous studies have shown increasing item reliabilities as an effect of the item position in personality scales. Traditionally, these context effects are analyzed based on item-total correlations. This approach neglects that trends in item reliabilities can be caused either by an increase in true score variance or by a decrease in error…

Descriptors: True Scores, Error of Measurement, Structural Equation Models, Simulation

Interval Estimation for True Raw and Scale Scores under the Binomial Error Model

Peer reviewed

Direct link

Lee, Won-Chan; Brennan, Robert L.; Kolen, Michael J. – Journal of Educational and Behavioral Statistics, 2006

Assuming errors of measurement are distributed binomially, this article reviews various procedures for constructing an interval for an individual's true number-correct score; presents two general interval estimation procedures for an individual's true scale score (i.e., normal approximation and endpoints conversion methods); compares various…

Descriptors: Probability, Intervals, Guidelines, Computer Simulation

A Consideration for Variable Length Adaptive Tests.

Download full text

Wingersky, Marilyn S. – 1989

In a variable-length adaptive test with a stopping rule that relied on the asymptotic standard error of measurement of the examinee's estimated true score, M. S. Stocking (1987) discovered that it was sufficient to know the examinee's true score and the number of items administered to predict with some accuracy whether an examinee's true score was…

Descriptors: Adaptive Testing, Bayesian Statistics, Error of Measurement, Estimation (Mathematics)

On the Difference between Reliability of Measurement and Precision of Survey Instruments.

Peer reviewed

Evans, Brian – Canadian Journal of Program Evaluation/La Revue canadienne d'evaluation de programme, 1995

The distinction between two models of reliability is clarified. Reliability may be conceived of and estimated from a true score model or from the perspective of sampling precision. Basic models are developed and illustrated for each approach using data from the author's work on measuring organizational climate. (SLD)

Descriptors: Data Analysis, Error of Measurement, Evaluators, Models

The Use of Confidence Intervals When Interpreting Test Scores. EREAPA Publication Series No. 93-4.

Download full text

Wheeler, Patricia H. – 1993

A person's obtained score on a test provides an estimate of the individual's "true" score on that test. The obtained score is considered to have two parts, the true component and the error component. Classical test theory assumes that obtained scores for an individual over multiple administrations of the same test will lie symmetrically…

Descriptors: Cutting Scores, Error of Measurement, Scores, Statistical Distributions

Previous Page | Next Page »

Pages: 1 | 2

Kolen, Michael J.	2
Longford, Nicholas T.	2
Woodruff, David	2
Bramley, Tom	1
Brennan, Robert L.	1
Chang, Shun-Wen	1
Chen, Troy T.	1
Cizek, Gregory J.	1
Evans, Brian	1
Goldberg, Gail Lynn	1
Hartig, Johannes	1
Harvill, Leo M.	1
Holzel, Britta	1
Hoshino, Takahiro	1
Kang, Taehoon	1
Kupermintz, Haggai	1
Lee, Won-Chan	1
Li, Yuan H.	1
Lissitz, Robert W.	1
MacCann, Robert G.	1
Marcoulides, George A.	1
Monson, Candice M.	1
Moosbrugger, Helfried	1
Moses, Tim	1
More ▼