ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	8

Descriptor

Error of Measurement	30
Reliability	30
True Scores	30
Statistical Analysis	12
Correlation	10
Analysis of Variance	6
Mathematical Models	6
Measurement Techniques	6
Sampling	5
Scores	5
Measurement	4
Models	4
Predictor Variables	4
Simulation	4
Classification	3
Computation	3
Cutting Scores	3
Data Analysis	3
Educational Research	3
Evaluation	3
Goodness of Fit	3
Item Analysis	3
Item Response Theory	3
Probability	3
Raw Scores	3
More ▼

Source

Journal of Educational…	5
Educational and Psychological…	2
Advances in Health Sciences…	1
Applied Measurement in…	1
Applied Psychological…	1
Assessment	1
Canadian Journal of Program…	1
ETS Research Report Series	1
Educational Research	1
Journal of Experimental…	1
Multivariate Behavioral…	1
Scandinavian Journal of…	1
Structural Equation Modeling	1
Test Service Bulletin	1
More ▼

Publication Type

Journal Articles	17
Reports - Research	12
Reports - Evaluative	9
Speeches/Meeting Papers	3
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Opinion Papers	1
Reports - Descriptive	1

Education Level

Higher Education

Audience

Researchers

Location

Australia	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

ACT Assessment	1
National Longitudinal Study…	1
Work Keys (ACT)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 30 results Save | Export

Relationships of Measurement Error and Prediction Error in Observed-Score Regression

Peer reviewed

Direct link

Moses, Tim – Journal of Educational Measurement, 2012

The focus of this paper is assessing the impact of measurement errors on the prediction error of an observed-score regression. Measures are presented and described for decomposing the linear regression's prediction error variance into parts attributable to the true score variance and the error variances of the dependent variable and the predictor…

Descriptors: Error of Measurement, Prediction, Regression (Statistics), True Scores

Reliability Generalization: An Examination of the Positive Affect and Negative Affect Schedule

Peer reviewed

Direct link

Leue, Anja; Lange, Sebastian – Assessment, 2011

The assessment of positive affect (PA) and negative affect (NA) by means of the Positive Affect and Negative Affect Schedule has received a remarkable popularity in the social sciences. Using a meta-analytic tool--namely, reliability generalization (RG)--population reliability scores of both scales have been investigated on the basis of a random…

Descriptors: Social Sciences, True Scores, Generalization, Affective Behavior

Coping with Memory Effect and Serial Correlation when Estimating Reliability in a Longitudinal Framework

Peer reviewed

Direct link

Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert; Vangeneugden, Tony; Mallinckrodt, Craig H. – Applied Psychological Measurement, 2010

Longitudinal studies are permeating clinical trials in psychiatry. Therefore, it is of utmost importance to study the psychometric properties of rating scales, frequently used in these trials, within a longitudinal framework. However, intrasubject serial correlation and memory effects are problematic issues often encountered in longitudinal data.…

Descriptors: Psychiatry, Rating Scales, Memory, Psychometrics

A Response to an Article Published in "Educational Research"'s Special Issue on Assessment (June 2009). What Can Be Inferred about Classification Accuracy from Classification Consistency?

Peer reviewed

Direct link

Bramley, Tom – Educational Research, 2010

Background: A recent article published in "Educational Research" on the reliability of results in National Curriculum testing in England (Newton, "The reliability of results from national curriculum testing in England," "Educational Research" 51, no. 2: 181-212, 2009) suggested that: (1) classification accuracy can be…

Descriptors: National Curriculum, Educational Research, Testing, Measurement

A Modification to Angoff and Bookmarking Cut Scores to Account for the Imperfect Reliability of Test Scores

Peer reviewed

Direct link

MacCann, Robert G. – Educational and Psychological Measurement, 2008

It is shown that the Angoff and bookmarking cut scores are examples of true score equating that in the real world must be applied to observed scores. In the context of defining minimal competency, the percentage "failed" by such methods is a function of the length of the measuring instrument. It is argued that this length is largely…

Descriptors: True Scores, Cutting Scores, Minimum Competencies, Scores

Undesired Variance Due to Examiner Stringency/Leniency Effect in Communication Skill Scores Assessed in OSCEs

Peer reviewed

Direct link

Harasym, Peter H.; Woloschuk, Wayne; Cunning, Leslie – Advances in Health Sciences Education, 2008

Physician-patient communication is a clinical skill that can be learned and has a positive impact on patient satisfaction and health outcomes. A concerted effort at all medical schools is now directed at teaching and evaluating this core skill. Student communication skills are often assessed by an Objective Structure Clinical Examination (OSCE).…

Descriptors: Medical Schools, Family Practice (Medicine), Examiners, Error of Measurement

Peer reviewed

Green, Samuel B.; Hershberger, Scott L. – Structural Equation Modeling, 2000

Proposes true score models that can account for correlated errors and their effect on coefficient alpha. These models allow random measurement errors on earlier items to affect directly or indirectly the scores on later items. Conditions under which coefficient alpha may yield spuriously high estimates or reliability are discussed. (SLD)

Descriptors: Correlation, Error of Measurement, Reliability, True Scores

Reliability and the Nonequivalent Groups with Anchor Test Design. Research Report. ETS RR-07-16

Peer reviewed
PDF on ERIC

Download full text

Moses, Tim; Kim, Sooyeon – ETS Research Report Series, 2007

This study evaluated the impact of unequal reliability on test equating methods in the nonequivalent groups with anchor test (NEAT) design. Classical true score-based models were compared in terms of their assumptions about how reliability impacts test scores. These models were related to treatment of population ability differences by different…

Descriptors: Reliability, Equated Scores, Test Items, Statistical Analysis

Impact of Measurement Error on Statistical Power: Review of an Old Paradox.

Peer reviewed

Williams, Richard H.; And Others – Journal of Experimental Education, 1995

The paradox that a Student t-test based on pretest-posttest differences can attain its greatest power when the difference score reliability is zero was explained by demonstrating that power is not a mathematical function of reliability unless either true score variance or error score variance is constant. (SLD)

Descriptors: Error of Measurement, Power (Statistics), Pretests Posttests, Reliability

Variability in Reliability Coefficients and the Standard Error of Measurement from School District to District.

Peer reviewed

Feldt, Leonard S.; Qualls, Audrey L. – Applied Measurement in Education, 1999

Examined the stability of the standard error of measurement and the relationship between the reliability coefficient and the variance of both true scores and error scores for 170 school districts in a state. As expected, reliability coefficients varied as a function of group variability, but the variation in split-half coefficients from school to…

Descriptors: Elementary Secondary Education, Error of Measurement, Reliability, School Districts

A Confirmatory Analysis of Item Reliability Trends (CAIRT): Differentiating True Score and Error Variance in the Analysis of Item Context Effects

Peer reviewed

Direct link

Hartig, Johannes; Holzel, Britta; Moosbrugger, Helfried – Multivariate Behavioral Research, 2007

Numerous studies have shown increasing item reliabilities as an effect of the item position in personality scales. Traditionally, these context effects are analyzed based on item-total correlations. This approach neglects that trends in item reliabilities can be caused either by an increase in true score variance or by a decrease in error…

Descriptors: True Scores, Error of Measurement, Structural Equation Models, Simulation

How Accurate Is a Test Score?

Download full text

Doppelt, Jerome E. – Test Service Bulletin, 1956

The standard error of measurement as a means for estimating the margin of error that should be allowed for in test scores is discussed. The true score measures the performance that is characteristic of the person tested; the variations, plus and minus, around the true score describe a characteristic of the test. When the standard deviation is used…

Descriptors: Bulletins, Error of Measurement, Measurement Techniques, Reliability

On the Difference between Reliability of Measurement and Precision of Survey Instruments.

Peer reviewed

Evans, Brian – Canadian Journal of Program Evaluation/La Revue canadienne d'evaluation de programme, 1995

The distinction between two models of reliability is clarified. Reliability may be conceived of and estimated from a true score model or from the perspective of sampling precision. Basic models are developed and illustrated for each approach using data from the author's work on measuring organizational climate. (SLD)

Descriptors: Data Analysis, Error of Measurement, Evaluators, Models

The Relation of the Scale Coarseness to the Dependability of Marks.

Peer reviewed

Kleven, Thor Arnfinn – Scandinavian Journal of Educational Research, 1979

Supposing different values of the standard measurement error, the relation of scale coarseness to the total amount of error is studied on the basis of probability distribution of error. The analyses are performed within two models of error and with two criteria of amount of error. (Editor/SJL)

Descriptors: Cutting Scores, Error of Measurement, Goodness of Fit, Grading

The Criterion-Referenced Reliability of a Single Score. Report 76-01.

Livingston, Samuel A. – 1976

A distinction is made between reliability of measurement and reliability of classification; the "criterion-referenced reliability coefficient" describes the former. Application of this coefficient to the probability distribution of possible scores for a single student yields a meaningful way to describe the reliability of a single score. (Author)

Descriptors: Classification, Criterion Referenced Tests, Error of Measurement, Measurement

Previous Page | Next Page »

Pages: 1 | 2

Livingston, Samuel A.	3
Edwards, Keith J.	2
Moses, Tim	2
Alonso, Ariel	1
Bramley, Tom	1
Brennan, Robert L.	1
Cizek, Gregory J.	1
Cunning, Leslie	1
Dickinson, Terry L.	1
Doppelt, Jerome E.	1
Evans, Brian	1
Feldt, Leonard S.	1
Green, Samuel B.	1
Gustafsson, Jan-Eric	1
Harasym, Peter H.	1
Harris, Chester W.	1
Hartig, Johannes	1
Hershberger, Scott L.	1
Holzel, Britta	1
Kim, Sooyeon	1
Kleven, Thor Arnfinn	1
Kolen, Michael J.	1
Laenen, Annouschka	1
Lange, Sebastian	1
More ▼