Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 1 |
Descriptor
Generalizability Theory | 7 |
Interrater Reliability | 7 |
Research Design | 7 |
Error of Measurement | 3 |
Higher Education | 3 |
Measurement Techniques | 2 |
Test Theory | 2 |
Analysis of Variance | 1 |
Behavior Rating Scales | 1 |
Cardiovascular System | 1 |
Classification | 1 |
More ▼ |
Author
Baldus, Robert | 1 |
Buhr, Dianne C. | 1 |
Camara, Wayne J. | 1 |
Dovell, Patricia | 1 |
Goodwin, Laura D. | 1 |
Goodwin, William L. | 1 |
Gugiu, Mihaiela R. | 1 |
Gugiu, Paul C. | 1 |
Lautenschlager, Gary | 1 |
Li, Mao-Neng Fred | 1 |
Naizer, Gilbert | 1 |
More ▼ |
Publication Type
Reports - Research | 4 |
Speeches/Meeting Papers | 4 |
Journal Articles | 3 |
Reports - Evaluative | 2 |
Guides - Non-Classroom | 1 |
Opinion Papers | 1 |
Education Level
Higher Education | 1 |
Audience
Researchers | 3 |
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Gugiu, Mihaiela R.; Gugiu, Paul C.; Baldus, Robert – Journal of MultiDisciplinary Evaluation, 2012
Background: Educational researchers have long espoused the virtues of writing with regard to student cognitive skills. However, research on the reliability of the grades assigned to written papers reveals a high degree of contradiction, with some researchers concluding that the grades assigned are very reliable whereas others suggesting that they…
Descriptors: Grades (Scholastic), Grading, Scoring Rubrics, Research Design

Li, Mao-Neng Fred; Lautenschlager, Gary – Educational and Psychological Measurement, 1997
lllustrates a link between the multiple-rater kappa of J. Fleiss (1971) or other analogues and the generalizability (G) coefficient for a single facet design, and discusses the use and interpretation of G theory in the study of interrater agreement when data are measured on a nominal scale. (SLD)
Descriptors: Classification, Generalizability Theory, Interrater Reliability, Research Design
Camara, Wayne J. – 1986
Previous efforts to investigate the equivalence of rating sources for job analysis ratings have reported conflicting results. In the present research, correlational and generalizability analyses were conducted to examine the equivalency of rating sources for over 70 state civil service job classifications. Incumbent and supervisor ratings (N=697)…
Descriptors: Evaluators, Generalizability Theory, Interrater Reliability, Job Analysis

Goodwin, Laura D.; Goodwin, William L. – Journal of Early Intervention, 1991
Four approaches to estimating interrater reliability in early childhood special education research are illustrated and compared: correlation, comparison of means, percentage of agreement, and generalizability theory techniques. Generalizability theory techniques are proposed as a method for estimating the amount of variance attributable to…
Descriptors: Analysis of Variance, Disabilities, Early Childhood Education, Educational Research
Naizer, Gilbert – 1992
A measurement approach called generalizability theory (G-theory) is an important alternative to the more familiar classical measurement theory that yields less useful coefficients such as alpha or the KR-20 coefficient. G-theory is a theory about the dependability of behavioral measurements that allows the simultaneous estimation of multiple…
Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Higher Education
Webber, Larry; And Others – 1986
Generalizability theory, which subsumes classical measurement theory as a special case, provides a general model for estimating the reliability of observational rating data by estimating the variance components of the measurement design. Research data from the "Heart Smart" health intervention program were analyzed as a heuristic tool.…
Descriptors: Behavior Rating Scales, Cardiovascular System, Error of Measurement, Generalizability Theory
Dovell, Patricia; Buhr, Dianne C. – 1986
This study examined the difficulty level of essay topics used in the large-scale assessment of writing in relation to five different scoring models, and sought to determine what effects the scoring models would have on passing rates. In model one, examinee's score is the direct result of a score assigned by the reader or the sum of scores assigned…
Descriptors: College Students, Difficulty Level, Essay Tests, Essays