Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients
Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022
The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…
Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory
Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018
The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…
Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability
Rantanen, Pekka – Assessment & Evaluation in Higher Education, 2013
A multilevel analysis approach was used to analyse students' evaluation of teaching (SET). The low value of inter-rater reliability stresses that any solid conclusions on teaching cannot be made on the basis of single feedbacks. To assess a teacher's general teaching effectiveness, one needs to evaluate four randomly chosen course implementations.…
Descriptors: Test Reliability, Feedback (Response), Generalizability Theory, Student Evaluation of Teacher Performance
Guler, Nese; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2010
In this study, classical test theory and generalizability theory were used to determine the reliability of scores obtained from a measurement tool of mathematics achievement. Twenty-four open-ended mathematics questions from TIMSS-1999 were administered to 203 students in the spring semester of 2007. The internal consistency of the scores was found to be 0.92. For…
Descriptors: Generalizability Theory, Test Theory, Test Reliability, Interrater Reliability

MacMillan, Peter D. – Journal of Experimental Education, 2000
Compared classical test theory (CTT), generalizability theory (GT), and multifaceted Rasch model (MFRM) approaches to detecting and correcting for rater variability using responses of 4,930 high school students graded by 3 raters on 9 scales. The MFRM approach identified far more raters as different than did the CTT analysis. GT and Rasch…
Descriptors: Generalizability Theory, High School Students, High Schools, Interrater Reliability
Arnold, Margery E. – 1996
It is incorrect to say "the test is reliable" because reliability is a function not only of the test itself, but of many factors. The present paper explains how different factors affect classical reliability estimates such as test-retest, interrater, internal consistency, and equivalent forms coefficients. Furthermore, the limits of classical test…
Descriptors: Estimation (Mathematics), Generalizability Theory, Heuristics, Interrater Reliability

Gresham, Frank M. – School Psychology Review, 1984
The evidence for the psychometric adequacy of behavioral interviews in terms of traditional psychometric theory and generalizability theory is reviewed. The review concluded that behavioral interviews have some evidence for interrater reliability, content validity, and criterion-related validity. Additional research in several…
Descriptors: Behavior Patterns, Behavior Problems, Functional Behavioral Assessment, Generalizability Theory
Naizer, Gilbert – 1992
A measurement approach called generalizability theory (G-theory) is an important alternative to the more familiar classical measurement theory that yields less useful coefficients such as alpha or the KR-20 coefficient. G-theory is a theory about the dependability of behavioral measurements that allows the simultaneous estimation of multiple…
Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Higher Education
Dovell, Patricia; Buhr, Dianne C. – 1986
This study examined the difficulty level of essay topics used in the large-scale assessment of writing in relation to five different scoring models, and sought to determine what effects the scoring models would have on passing rates. In model one, the examinee's score is the direct result of a score assigned by the reader or the sum of scores assigned…
Descriptors: College Students, Difficulty Level, Essay Tests, Essays