ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	4

Descriptor

Generalizability Theory	9
Interrater Reliability	9
Test Theory	9
Test Reliability	6
Foreign Countries	3
Error of Measurement	2
Estimation (Mathematics)	2
Evaluation Methods	2
Higher Education	2
Research Design	2
Research Methodology	2
Scores	2
Scoring	2
Test Interpretation	2
Behavior Patterns	1
Behavior Problems	1
College Students	1
Communication Skills	1
Data Analysis	1
Dentistry	1
Difficulty Level	1
Elective Courses	1
Essay Tests	1
Essays	1
Feedback (Response)	1
More ▼

Source

Asian Journal of Education…	1
Assessment & Evaluation in…	1
Educational Sciences: Theory…	1
International Journal of…	1
Journal of Experimental…	1
School Psychology Review	1

Author

Aksu, Gökhan	1
Aktas, Mehtap	1
Arnold, Margery E.	1
Asiret, Semih	1
Buhr, Dianne C.	1
Dovell, Patricia	1
Eser, Mehmet Taha	1
Gelbal, Selahattin	1
Gresham, Frank M.	1
Guler, Nese	1
MacMillan, Peter D.	1
Naizer, Gilbert	1
Rantanen, Pekka	1
Uzun, N. Bilge	1
Yormaz, Seha	1
More ▼

Publication Type

Journal Articles	6
Reports - Research	5
Reports - Evaluative	3
Speeches/Meeting Papers	3
Information Analyses	1
Opinion Papers	1

Education Level

Higher Education	2
Grade 8	1
Grade 9	1

Audience

Researchers

Location

Turkey (Ankara)	2
Finland (Helsinki)	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 9 results Save | Export

Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients

Peer reviewed
PDF on ERIC

Download full text

Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022

The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…

Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory

Using Generalizability Theory to Assess the Score Reliability of Communication Skills of Dentistry Students

Peer reviewed
PDF on ERIC

Download full text

Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018

The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…

Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability

The Number of Feedbacks Needed for Reliable Evaluation. A Multilevel Analysis of the Reliability, Stability and Generalisability of Students' Evaluation of Teaching

Peer reviewed

Direct link

Rantanen, Pekka – Assessment & Evaluation in Higher Education, 2013

A multilevel analysis approach was used to analyse students' evaluation of teaching (SET). The low value of inter-rater reliability stresses that any solid conclusions on teaching cannot be made on the basis of single feedbacks. To assess a teacher's general teaching effectiveness, one needs to evaluate four randomly chosen course implementations.…

Descriptors: Test Reliability, Feedback (Response), Generalizability Theory, Student Evaluation of Teacher Performance

Studying Reliability of Open Ended Mathematics Items According to the Classical Test Theory and Generalizability Theory

Peer reviewed
PDF on ERIC

Download full text

Guler, Nese; Gelbal, Selahattin – Educational Sciences: Theory and Practice, 2010

In this study, the Classical test theory and generalizability theory were used for determination to reliability of scores obtained from measurement tool of mathematics success. 24 open-ended mathematics question of the TIMSS-1999 was applied to 203 students in 2007-spring semester. Internal consistency of scores was found as 0.92. For…

Descriptors: Generalizability Theory, Test Theory, Test Reliability, Interrater Reliability

Classical, Generalizability, and Multifaceted Rasch Detection of Interrater Variability in Large, Sparse Data Sets.

Peer reviewed

MacMillan, Peter D. – Journal of Experimental Education, 2000

Compared classical test theory (CTT), generalizability theory (GT), and multifaceted Rasch model (MFRM) approaches to detecting and correcting for rater variability using responses of 4,930 high school students graded by 3 raters on 9 scales. The MFRM approach identified far more raters as different than did the CTT analysis. GT and Rasch…

Descriptors: Generalizability Theory, High School Students, High Schools, Interrater Reliability

Influences on and Limitations of Classical Test Theory Reliability Estimates.

Download full text

Arnold, Margery E. – 1996

It is incorrect to say "the test is reliable" because reliability is a function not only of the test itself, but of many factors. The present paper explains how different factors affect classical reliability estimates such as test-retest, interrater, internal consistency, and equivalent forms coefficients. Furthermore, the limits of classical test…

Descriptors: Estimation (Mathematics), Generalizability Theory, Heuristics, Interrater Reliability

Behavioral Interviews in School Psychology: Issues in Psychometric Adequacy and Research.

Peer reviewed

Gresham, Frank M. – School Psychology Review, 1984

The evidence for the psychometric adequacy of behavioral interviews in terms of traditional psychometric theory and generalizability theory are reviewed. The review resulted in the conclusion that behavioral interviews have some evidence for interrater reliability, content validity, and criterion-related validity. Additional research in several…

Descriptors: Behavior Patterns, Behavior Problems, Functional Behavioral Assessment, Generalizability Theory

Basic Concepts in Generalizability Theory: A More Powerful Approach to Evaluating Reliability.

Download full text

Naizer, Gilbert – 1992

A measurement approach called generalizability theory (G-theory) is an important alternative to the more familiar classical measurement theory that yields less useful coefficients such as alpha or the KR-20 coefficient. G-theory is a theory about the dependability of behavioral measurements that allows the simultaneous estimation of multiple…

Descriptors: Error of Measurement, Estimation (Mathematics), Generalizability Theory, Higher Education

Essay Topic Difficulty in Relation to Scoring Models.

Dovell, Patricia; Buhr, Dianne C. – 1986

This study examined the difficulty level of essay topics used in the large-scale assessment of writing in relation to five different scoring models, and sought to determine what effects the scoring models would have on passing rates. In model one, examinee's score is the direct result of a score assigned by the reader or the sum of scores assigned…

Descriptors: College Students, Difficulty Level, Essay Tests, Essays