ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	1

Descriptor

Error of Measurement	12
Psychometrics	12
Scores	5
Latent Trait Theory	4
Test Items	4
Testing Problems	4
Evaluation Methods	3
Measurement Techniques	3
Reliability	3
Comparative Analysis	2
Difficulty Level	2
Educational Research	2
Goodness of Fit	2
Higher Education	2
Item Analysis	2
Sample Size	2
Scoring	2
Statistical Studies	2
Test Construction	2
Test Validity	2
Academic Ability	1
Administrator Evaluation	1
Administrators	1
Bayesian Statistics	1
Behavior Patterns	1
More ▼

Source

Educational and Psychological…	1
Grantee Submission	1

Author

Henson, Robin K.	2
Smith, Richard M.	2
Brookshire, William	1
Chang, Te-Sheng	1
Cook, Linda L.	1
Espelage, Dorothy L.	1
Kamps, Jodi	1
Kogan, Lori R.	1
Lance, Charles E.	1
Moomaw, Michael E.	1
Petersen, Nancy S.	1
Quittner, Alexandra L.	1
Thompson, Bruce	1
Tsui, Anne S.	1
Vacha-Haase, Tammi	1
W. Jake Thompson	1
Wangerin, Paul T.	1
Wise, Lauress L.	1
More ▼

Publication Type

Speeches/Meeting Papers	12
Reports - Research	7
Reports - Evaluative	2
Guides - Non-Classroom	1
Journal Articles	1
Opinion Papers	1
Reports - Descriptive	1

Education Level

Audience

Researchers

Location

Laws, Policies, & Programs

Assessments and Surveys

Medical College Admission Test	1
New Jersey College Basic…	1
Teacher Efficacy Scale	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Evaluating Methods for Assessing Model Fit in Diagnostic Classification Models

Peer reviewed

W. Jake Thompson – Grantee Submission, 2024

Diagnostic classification models (DCMs) are psychometric models that can be used to estimate the presence or absence of psychological traits, or proficiency on fine-grained skills. Critical to the use of any psychometric model in practice, including DCMs, is an evaluation of model fit. Traditionally, DCMs have been estimated with maximum…

Descriptors: Bayesian Statistics, Classification, Psychometrics, Goodness of Fit

A Reliability Generalization Study of the Teacher Efficacy Scale and Related Instruments.

Peer reviewed

Henson, Robin K.; Kogan, Lori R.; Vacha-Haase, Tammi – Educational and Psychological Measurement, 2001

Studied sources of measurement error variance in the Teacher Efficacy Scale (TES) (Gibson and Dembo, 1984). Used reliability generalization to characterize the typical score reliability for the TES and potential sources of measurement error variance across 43 studies. Also examined related instruments for measurement integrity. (SLD)

Descriptors: Error of Measurement, Generalization, Meta Analysis, Psychometrics

Characterizing Measurement Error in Test Scores across Studies: A Tutorial on Conducting "Reliability Generalization" Analyses.

Download full text

Henson, Robin K.; Thompson, Bruce – 2001

Given the potential value of reliability generalization (RG) studies in the development of cumulative psychometric knowledge, the purpose of this paper is to provide a tutorial on how to conduct such studies and to serve as a guide for researchers wishing to use this methodology. After some brief comments on classical test theory, the paper…

Descriptors: Coding, Error of Measurement, Psychometrics, Reliability

The Error of Accuracy for Two Regression Techniques: Does Psychometric Parallelism Matter?

Download full text

Chang, Te-Sheng; Brookshire, William – 1997

The question of least-squares weights versus equal weights has been a subject of great interest to researchers for over 60 years. Several researchers have compared the efficiency of equal weights and that of least-squares weights under different conditions. Recently, S. V. Paunonen and R. C. Gardner stressed that the necessary and sufficient…

Descriptors: Correlation, Error of Measurement, Least Squares Statistics, Predictor Variables

Qualities of Judgmental Ratings by Four Rater Sources.

Download full text

Tsui, Anne S. – 1983

Quality of performance data yielded by subjective judgment is of major concern to researchers in performance appraisal. However, some confusion exists in the analysis of quality on ratings obtained from different rating scale formats and from different raters. To clarify this confusion, a study was conducted to assess the quality of judgmental…

Descriptors: Administrator Evaluation, Administrators, Error of Measurement, Evaluation Methods

An Application of Generalizability Theory to the Validation of a Behaviorally Anchored Role-Play Measure.

Espelage, Dorothy L.; Quittner, Alexandra L.; Kamps, Jodi – 1998

Generalizability theory (g-theory) was used, as an alternative to classical test theory, to evaluate measurement error in a behaviorally anchored role-play measure, highlighting the usefulness of this theory in instrument development. G-theory partitions an observed score into the universe score and error scores associated with separate sources of…

Descriptors: Behavior Patterns, Eating Disorders, Error of Measurement, Females

Latent Trait Models for Partially Speeded Tests.

Wise, Lauress L. – 1986

A primary goal of this study was to determine the extent to which item difficulty was related to item position and, if a significant relationship was found, to suggest adjustments to predicted item difficulty that reflect differences in item position. Item response data from the Medical College Admission Test (MCAT) were analyzed. A data set was…

Descriptors: College Entrance Examinations, Difficulty Level, Educational Research, Error of Measurement

Assessing the Psychometric Quality of Performance Rating Scales: Comparisons among Evaluative Criteria.

Download full text

Lance, Charles E.; Moomaw, Michael E. – 1983

Direct assessments of the accuracy with which raters can use a rating instrument are presented. This study demonstrated how surplus behavioral incidents scaled during the development of Behaviorally Anchored Rating Scales (BARS) can be used effectively in the evaluation of the newly developed scales. Construction of scenarios of hypothetical…

Descriptors: Behavior Rating Scales, Comparative Analysis, Error of Measurement, Evaluation Criteria

A Comparison of Rasch Person Analysis and Robust Estimators.

Smith, Richard M. – 1983

Measurement disturbances, such as guessing, startup, and plodding, often result in an examinee's ability being either over- or under-estimated by the maximum likelihood estimation employed in latent trait psychometric models. Several authors have suggested methods to lessen the impact of unexpected responses on the ability estimation process. This…

Descriptors: Difficulty Level, Error of Measurement, Estimation (Mathematics), Goodness of Fit

Test Fairness Is a Personal Issue!

Smith, Richard M. – 1983

Previous studies of test item bias have investigated how different groups of examinees perform differently on a given set of items. These studies imply that examinees should be treated in a certain way because they are of a particular sex or race rather than as individuals in their own right, but it is unrealistic and unfair to assume such an…

Descriptors: Academic Ability, Error of Measurement, Error Patterns, Higher Education

Download full text

Cook, Linda L.; Petersen, Nancy S. – 1986

This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…

Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods

Lies; Damned Lies; Statistics; and Law School Grades. Grade Conferences from Hell: Measurement Error in Law School Grading.

Download full text

Wangerin, Paul T. – 1994

This paper addresses problems confronting law school teachers in grading law school exams and assigning letter grades. Using prototypical dialogue and scenarios, the paper examines mathematical and statistical issues that contribute to grading errors. Discussed in relation to real world data and the bar exam are: differential weighting, combining…

Descriptors: Civil Rights, Court Litigation, Educational Malpractice, Error of Measurement