Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 1 |
Descriptor
Error of Measurement | 12 |
Psychometrics | 12 |
Scores | 5 |
Latent Trait Theory | 4 |
Test Items | 4 |
Testing Problems | 4 |
Evaluation Methods | 3 |
Measurement Techniques | 3 |
Reliability | 3 |
Comparative Analysis | 2 |
Difficulty Level | 2 |
More ▼ |
Author
Henson, Robin K. | 2 |
Smith, Richard M. | 2 |
Brookshire, William | 1 |
Chang, Te-Sheng | 1 |
Cook, Linda L. | 1 |
Espelage, Dorothy L. | 1 |
Kamps, Jodi | 1 |
Kogan, Lori R. | 1 |
Lance, Charles E. | 1 |
Moomaw, Michael E. | 1 |
Petersen, Nancy S. | 1 |
More ▼ |
Publication Type
Speeches/Meeting Papers | 12 |
Reports - Research | 7 |
Reports - Evaluative | 2 |
Guides - Non-Classroom | 1 |
Journal Articles | 1 |
Opinion Papers | 1 |
Reports - Descriptive | 1 |
Education Level
Audience
Researchers | 3 |
Location
Laws, Policies, & Programs
Assessments and Surveys
Medical College Admission Test | 1 |
New Jersey College Basic… | 1 |
Teacher Efficacy Scale | 1 |
What Works Clearinghouse Rating

W. Jake Thompson – Grantee Submission, 2024
Diagnostic classification models (DCMs) are psychometric models that can be used to estimate the presence or absence of psychological traits, or proficiency on fine-grained skills. Critical to the use of any psychometric model in practice, including DCMs, is an evaluation of model fit. Traditionally, DCMs have been estimated with maximum…
Descriptors: Bayesian Statistics, Classification, Psychometrics, Goodness of Fit

Henson, Robin K.; Kogan, Lori R.; Vacha-Haase, Tammi – Educational and Psychological Measurement, 2001
Studied sources of measurement error variance in the Teacher Efficacy Scale (TES) (Gibson and Dembo, 1984). Used reliability generalization to characterize the typical score reliability for the TES and potential sources of measurement error variance across 43 studies. Also examined related instruments for measurement integrity. (SLD)
Descriptors: Error of Measurement, Generalization, Meta Analysis, Psychometrics
Henson, Robin K.; Thompson, Bruce – 2001
Given the potential value of reliability generalization (RG) studies in the development of cumulative psychometric knowledge, the purpose of this paper is to provide a tutorial on how to conduct such studies and to serve as a guide for researchers wishing to use this methodology. After some brief comments on classical test theory, the paper…
Descriptors: Coding, Error of Measurement, Psychometrics, Reliability
Chang, Te-Sheng; Brookshire, William – 1997
The question of least-squares weights versus equal weights has been a subject of great interest to researchers for over 60 years. Several researchers have compared the efficiency of equal weights and that of least-squares weights under different conditions. Recently, S. V. Paunonen and R. C. Gardner stressed that the necessary and sufficient…
Descriptors: Correlation, Error of Measurement, Least Squares Statistics, Predictor Variables
Tsui, Anne S. – 1983
Quality of performance data yielded by subjective judgment is of major concern to researchers in performance appraisal. However, some confusion exists in the analysis of quality on ratings obtained from different rating scale formats and from different raters. To clarify this confusion, a study was conducted to assess the quality of judgmental…
Descriptors: Administrator Evaluation, Administrators, Error of Measurement, Evaluation Methods
Espelage, Dorothy L.; Quittner, Alexandra L.; Kamps, Jodi – 1998
Generalizability theory (g-theory) was used, as an alternative to classical test theory, to evaluate measurement error in a behaviorally anchored role-play measure, highlighting the usefulness of this theory in instrument development. G-theory partitions an observed score into the universe score and error scores associated with separate sources of…
Descriptors: Behavior Patterns, Eating Disorders, Error of Measurement, Females
Wise, Lauress L. – 1986
A primary goal of this study was to determine the extent to which item difficulty was related to item position and, if a significant relationship was found, to suggest adjustments to predicted item difficulty that reflect differences in item position. Item response data from the Medical College Admission Test (MCAT) were analyzed. A data set was…
Descriptors: College Entrance Examinations, Difficulty Level, Educational Research, Error of Measurement
Lance, Charles E.; Moomaw, Michael E. – 1983
Direct assessments of the accuracy with which raters can use a rating instrument are presented. This study demonstrated how surplus behavioral incidents scaled during the development of Behaviorally Anchored Rating Scales (BARS) can be used effectively in the evaluation of the newly developed scales. Construction of scenarios of hypothetical…
Descriptors: Behavior Rating Scales, Comparative Analysis, Error of Measurement, Evaluation Criteria
Smith, Richard M. – 1983
Measurement disturbances, such as guessing, startup, and plodding, often result in an examinee's ability being either over- or under-estimated by the maximum likelihood estimation employed in latent trait psychometric models. Several authors have suggested methods to lessen the impact of unexpected responses on the ability estimation process. This…
Descriptors: Difficulty Level, Error of Measurement, Estimation (Mathematics), Goodness of Fit
Smith, Richard M. – 1983
Previous studies of test item bias have investigated how different groups of examinees perform differently on a given set of items. These studies imply that examinees should be treated in a certain way because they are of a particular sex or race rather than as individuals in their own right, but it is unrealistic and unfair to assume such an…
Descriptors: Academic Ability, Error of Measurement, Error Patterns, Higher Education
Cook, Linda L.; Petersen, Nancy S. – 1986
This paper examines how various equating methods are affected by: (1) sampling error; (2) sample characteristics; and (3) characteristics of anchor test items. It reviews empirical studies that investigated the invariance of equating transformations, and it discusses empirical and simulation studies that focus on how the properties of anchor tests…
Descriptors: Educational Research, Equated Scores, Error of Measurement, Evaluation Methods
Wangerin, Paul T. – 1994
This paper addresses problems confronting law school teachers in grading law school exams and assigning letter grades. Using prototypical dialogue and scenarios, the paper examines mathematical and statistical issues that contribute to grading errors. Discussed in relation to real world data and the bar exam are: differential weighting, combining…
Descriptors: Civil Rights, Court Litigation, Educational Malpractice, Error of Measurement