Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 8 |
Descriptor
Error of Measurement | 30 |
Reliability | 30 |
True Scores | 30 |
Statistical Analysis | 12 |
Correlation | 10 |
Analysis of Variance | 6 |
Mathematical Models | 6 |
Measurement Techniques | 6 |
Sampling | 5 |
Scores | 5 |
Measurement | 4 |
More ▼ |
Source
Author
Livingston, Samuel A. | 3 |
Edwards, Keith J. | 2 |
Moses, Tim | 2 |
Alonso, Ariel | 1 |
Bramley, Tom | 1 |
Brennan, Robert L. | 1 |
Cizek, Gregory J. | 1 |
Cunning, Leslie | 1 |
Dickinson, Terry L. | 1 |
Doppelt, Jerome E. | 1 |
Evans, Brian | 1 |
More ▼ |
Publication Type
Journal Articles | 17 |
Reports - Research | 12 |
Reports - Evaluative | 9 |
Speeches/Meeting Papers | 3 |
Guides - Non-Classroom | 1 |
Numerical/Quantitative Data | 1 |
Opinion Papers | 1 |
Reports - Descriptive | 1 |
Education Level
Higher Education | 1 |
Audience
Researchers | 1 |
Location
Australia | 1 |
United Kingdom (England) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
ACT Assessment | 1 |
National Longitudinal Study… | 1 |
Work Keys (ACT) | 1 |
What Works Clearinghouse Rating
Moses, Tim – Journal of Educational Measurement, 2012
The focus of this paper is assessing the impact of measurement errors on the prediction error of an observed-score regression. Measures are presented and described for decomposing the linear regression's prediction error variance into parts attributable to the true score variance and the error variances of the dependent variable and the predictor…
Descriptors: Error of Measurement, Prediction, Regression (Statistics), True Scores
Leue, Anja; Lange, Sebastian – Assessment, 2011
The assessment of positive affect (PA) and negative affect (NA) by means of the Positive Affect and Negative Affect Schedule has received a remarkable popularity in the social sciences. Using a meta-analytic tool--namely, reliability generalization (RG)--population reliability scores of both scales have been investigated on the basis of a random…
Descriptors: Social Sciences, True Scores, Generalization, Affective Behavior
Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert; Vangeneugden, Tony; Mallinckrodt, Craig H. – Applied Psychological Measurement, 2010
Longitudinal studies are permeating clinical trials in psychiatry. Therefore, it is of utmost importance to study the psychometric properties of rating scales, frequently used in these trials, within a longitudinal framework. However, intrasubject serial correlation and memory effects are problematic issues often encountered in longitudinal data.…
Descriptors: Psychiatry, Rating Scales, Memory, Psychometrics
Bramley, Tom – Educational Research, 2010
Background: A recent article published in "Educational Research" on the reliability of results in National Curriculum testing in England (Newton, "The reliability of results from national curriculum testing in England," "Educational Research" 51, no. 2: 181-212, 2009) suggested that: (1) classification accuracy can be…
Descriptors: National Curriculum, Educational Research, Testing, Measurement
MacCann, Robert G. – Educational and Psychological Measurement, 2008
It is shown that the Angoff and bookmarking cut scores are examples of true score equating that in the real world must be applied to observed scores. In the context of defining minimal competency, the percentage "failed" by such methods is a function of the length of the measuring instrument. It is argued that this length is largely…
Descriptors: True Scores, Cutting Scores, Minimum Competencies, Scores
Harasym, Peter H.; Woloschuk, Wayne; Cunning, Leslie – Advances in Health Sciences Education, 2008
Physician-patient communication is a clinical skill that can be learned and has a positive impact on patient satisfaction and health outcomes. A concerted effort at all medical schools is now directed at teaching and evaluating this core skill. Student communication skills are often assessed by an Objective Structure Clinical Examination (OSCE).…
Descriptors: Medical Schools, Family Practice (Medicine), Examiners, Error of Measurement

Green, Samuel B.; Hershberger, Scott L. – Structural Equation Modeling, 2000
Proposes true score models that can account for correlated errors and their effect on coefficient alpha. These models allow random measurement errors on earlier items to affect directly or indirectly the scores on later items. Conditions under which coefficient alpha may yield spuriously high estimates or reliability are discussed. (SLD)
Descriptors: Correlation, Error of Measurement, Reliability, True Scores
Moses, Tim; Kim, Sooyeon – ETS Research Report Series, 2007
This study evaluated the impact of unequal reliability on test equating methods in the nonequivalent groups with anchor test (NEAT) design. Classical true score-based models were compared in terms of their assumptions about how reliability impacts test scores. These models were related to treatment of population ability differences by different…
Descriptors: Reliability, Equated Scores, Test Items, Statistical Analysis

Williams, Richard H.; And Others – Journal of Experimental Education, 1995
The paradox that a Student t-test based on pretest-posttest differences can attain its greatest power when the difference score reliability is zero was explained by demonstrating that power is not a mathematical function of reliability unless either true score variance or error score variance is constant. (SLD)
Descriptors: Error of Measurement, Power (Statistics), Pretests Posttests, Reliability

Feldt, Leonard S.; Qualls, Audrey L. – Applied Measurement in Education, 1999
Examined the stability of the standard error of measurement and the relationship between the reliability coefficient and the variance of both true scores and error scores for 170 school districts in a state. As expected, reliability coefficients varied as a function of group variability, but the variation in split-half coefficients from school to…
Descriptors: Elementary Secondary Education, Error of Measurement, Reliability, School Districts
Hartig, Johannes; Holzel, Britta; Moosbrugger, Helfried – Multivariate Behavioral Research, 2007
Numerous studies have shown increasing item reliabilities as an effect of the item position in personality scales. Traditionally, these context effects are analyzed based on item-total correlations. This approach neglects that trends in item reliabilities can be caused either by an increase in true score variance or by a decrease in error…
Descriptors: True Scores, Error of Measurement, Structural Equation Models, Simulation
Doppelt, Jerome E. – Test Service Bulletin, 1956
The standard error of measurement as a means for estimating the margin of error that should be allowed for in test scores is discussed. The true score measures the performance that is characteristic of the person tested; the variations, plus and minus, around the true score describe a characteristic of the test. When the standard deviation is used…
Descriptors: Bulletins, Error of Measurement, Measurement Techniques, Reliability

Evans, Brian – Canadian Journal of Program Evaluation/La Revue canadienne d'evaluation de programme, 1995
The distinction between two models of reliability is clarified. Reliability may be conceived of and estimated from a true score model or from the perspective of sampling precision. Basic models are developed and illustrated for each approach using data from the author's work on measuring organizational climate. (SLD)
Descriptors: Data Analysis, Error of Measurement, Evaluators, Models

Kleven, Thor Arnfinn – Scandinavian Journal of Educational Research, 1979
Supposing different values of the standard measurement error, the relation of scale coarseness to the total amount of error is studied on the basis of probability distribution of error. The analyses are performed within two models of error and with two criteria of amount of error. (Editor/SJL)
Descriptors: Cutting Scores, Error of Measurement, Goodness of Fit, Grading
Livingston, Samuel A. – 1976
A distinction is made between reliability of measurement and reliability of classification; the "criterion-referenced reliability coefficient" describes the former. Application of this coefficient to the probability distribution of possible scores for a single student yields a meaningful way to describe the reliability of a single score. (Author)
Descriptors: Classification, Criterion Referenced Tests, Error of Measurement, Measurement
Previous Page | Next Page ยป
Pages: 1 | 2