ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	34
Since 2006 (last 20 years)	63

Descriptor

Generalizability Theory	120
Interrater Reliability	120
Test Reliability	39
Scoring	27
Scores	26
Error of Measurement	24
Performance Based Assessment	20
Foreign Countries	18
Evaluation Methods	15
Scoring Rubrics	15
Test Validity	15
Observation	14
Statistical Analysis	14
Classroom Observation…	13
Higher Education	12
Item Response Theory	12
Teacher Evaluation	12
Writing Evaluation	12
Evaluators	11
Elementary School Students	10
Reliability	9
Student Evaluation	9
Test Theory	9
Validity	9
Comparative Analysis	8
More ▼

Publication Type

Journal Articles	86
Reports - Research	78
Reports - Evaluative	32
Speeches/Meeting Papers	25
Reports - Descriptive	6
Tests/Questionnaires	6
Dissertations/Theses -…	2
Information Analyses	2
Numerical/Quantitative Data	2
Opinion Papers	2
Guides - Non-Classroom	1
More ▼

Education Level

Higher Education	21
Elementary Education	13
Postsecondary Education	12
Elementary Secondary Education	5
Grade 8	4
Adult Education	2
Early Childhood Education	2
Grade 1	2
Grade 4	2
Junior High Schools	2
Kindergarten	2
Middle Schools	2
Primary Education	2
Secondary Education	2
Grade 5	1
Grade 9	1
Preschool Education	1
More ▼

Audience

Researchers

Location

Turkey	3
California	2
Cyprus	2
Turkey (Ankara)	2
Alabama	1
Asia	1
Canada	1
Canada (Montreal)	1
China (Beijing)	1
Finland (Helsinki)	1
Idaho	1
Japan	1
Missouri	1
Oklahoma	1
Pennsylvania	1
South Korea	1
United Kingdom	1
West Germany	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Teacher Performance…	2
Medical College Admission Test	1
Students Evaluation of…	1
Texas Assessment of Academic…	1
Trends in International…	1
United States Medical…	1
Work Keys (ACT)	1

What Works Clearinghouse Rating

Showing 1 to 15 of 120 results Save | Export

Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients

Peer reviewed
PDF on ERIC

Download full text

Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022

The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…

Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory

Using Many-Facet Rasch Measurement and Generalizability Theory to Explore Rater Effects for Direct Behavior Rating--Multi-Item Scales

Peer reviewed

Direct link

Anthony, Christopher J.; Styck, Kara M.; Volpe, Robert J.; Robert, Christopher R. – School Psychology, 2023

Although originally conceived of as a marriage of direct behavioral observation and indirect behavior rating scales, recent research has indicated that Direct Behavior Ratings (DBRs) are affected by rater idiosyncrasies (rater effects) similar to other indirect forms of behavioral assessment. Most of this research has been conducted using…

Descriptors: Item Response Theory, Generalizability Theory, Interrater Reliability, Behavior Rating Scales

Evaluating an Explicit Instruction Teacher Observation Protocol through a Validity Argument Approach

Peer reviewed

Direct link

Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Journal of Experimental Education, 2022

In this study, we examined the scoring and generalizability assumptions of an explicit instruction (EI) special education teacher observation protocol using many-faceted Rasch measurement (MFRM). Video observations of classroom instruction from 48 special education teachers across four states were collected. External raters (n = 20) were trained…

Descriptors: Direct Instruction, Teacher Education, Classroom Observation Techniques, Validity

Evaluating Human Scoring Using Generalizability Theory

Peer reviewed

Direct link

Bimpeh, Yaw; Pointer, William; Smith, Ben Alexander; Harrison, Liz – Applied Measurement in Education, 2020

Many high-stakes examinations in the United Kingdom (UK) use both constructed-response items and selected-response items. We need to evaluate the inter-rater reliability for constructed-response items that are scored by humans. While there are a variety of methods for evaluating rater consistency across ratings in the psychometric literature, we…

Descriptors: Scoring, Generalizability Theory, Interrater Reliability, Foreign Countries

When Seeing Is Believing: Generalizability and Decision Studies for Observational Data in Evaluation and Research on Teaching

Peer reviewed

Direct link

Weston, Timothy J.; Hayward, Charles N.; Laursen, Sandra L. – American Journal of Evaluation, 2021

Observations are widely used in research and evaluation to characterize teaching and learning activities. Because conducting observations is typically resource intensive, it is important that inferences from observation data are made confidently. While attention focuses on interrater reliability, the reliability of a single-class measure over the…

Descriptors: Generalizability Theory, Observation, Inferences, Social Science Research

Not Just Generalizability: A Case for Multifaceted Latent Trait Models in Teacher Observation Systems

Peer reviewed

Direct link

Wind, Stefanie A.; Jones, Eli – Educational Researcher, 2019

Teacher evaluation systems often include classroom observations in which raters use rating scales to evaluate teachers' effectiveness. Recently, researchers have promoted the use of multifaceted approaches to investigating reliability using Generalizability theory, instead of rater reliability statistics. Generalizability theory allows analysts to…

Descriptors: Teacher Evaluation, Observation, Generalizability Theory, Item Response Theory

The Use of Open-Ended Questions in Large-Scale Tests for Selection: Generalizability and Dependability

Peer reviewed
PDF on ERIC

Download full text

Atilgan, Hakan; Demir, Elif Kübra; Ogretmen, Tuncay; Basokcu, Tahsin Oguz – International Journal of Progressive Education, 2020

It has become a critical question what the reliability level would be when open-ended questions are used in large-scale selection tests. One of the aims of the present study is to determine what the reliability would be in the event that the answers given by test-takers are scored by experts when open-ended short answer questions are used in…

Descriptors: Foreign Countries, Secondary School Students, Test Items, Test Reliability

Structural Validity, Internal Consistency, and Rater Reliability of the Modified Barium Swallow Impairment Profile: Breaking Ground on a 52,726-Patient, Clinical Data Set

Peer reviewed

Direct link

Clain, Alex E.; Alkhuwaiter, Munirah; Davidson, Kate; Martin-Harris, Bonnie – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The purpose of this study was to extend the assessment of the psychometric properties of the Modified Barium Swallow Impairment Profile (MBSImP). Here, we re-examined structural validity and internal consistency using a large clinical-registry data set and formally examined rater reliability in a smaller data set. Method: This study…

Descriptors: Diagnostic Tests, Disability Identification, Physical Disabilities, Eating Disorders

Reliability of Essay Ratings: A Study on Generalizability Theory

Peer reviewed
PDF on ERIC

Download full text

Atilgan, Hakan – Eurasian Journal of Educational Research, 2019

Purpose: This study intended to examine the generalizability and reliability of essay ratings within the scope of the generalizability (G) theory. Specifically, the effect of raters on the generalizability and reliability of students' essay ratings was examined. Furthermore, variations of the generalizability and reliability coefficients with…

Descriptors: Foreign Countries, Essay Tests, Test Reliability, Interrater Reliability

The Generalizability of Running Record Accuracy and Self-Correction Scores

Peer reviewed

Direct link

D'Agostino, Jerome V.; Rodgers, Emily; Winkler, Christa; Johnson, Tracy; Berenbon, Rebecca – Reading Psychology, 2021

Running Records provide a standardized method for recording and assessing students' oral reading behaviors and are excellent formative assessment tools to guide instructional decision-making. This study expands on prior Running Record reliability work by evaluating the extent to which external raters and teachers consistently assessed students'…

Descriptors: Accuracy, Oral Reading, Generalizability Theory, Error Correction

Evaluating an Explicit Instruction Teacher Observation Protocol through a Validity Argument Approach

Peer reviewed
PDF on ERIC

Download full text

Direct link

Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Grantee Submission, 2020

In this study, we examined the scoring and generalizability assumptions of an Explicit Instruction (EI) special education teacher observation protocol using many-faceted Rasch measurement (MFRM). Video observations of classroom instruction from 48 special education teachers across four states were collected. External raters (n = 20) were trained…

Descriptors: Direct Instruction, Teacher Evaluation, Classroom Observation Techniques, Validity

Examining the Reliability of Scores from a Performance Assessment of Practice-Based Competencies

Peer reviewed

Direct link

Roduta Roberts, Mary; Alves, Cecilia Brito; Werther, Karin; Bahry, Louise M. – Journal of Psychoeducational Assessment, 2019

The purpose of this study was to examine the reliability and sources of score variation from a performance assessment of practice competencies within an occupational therapy program. Data from 99 students who participated in a practical exam were examined. A generalizability analysis of analytic, total, and overall holistic scores was completed…

Descriptors: Performance Based Assessment, Test Reliability, Scores, Occupational Therapy

The Mathematical Quality of Instruction (MQI) in Kindergarten: An Evaluation of the Stability of the MQI Using Generalizability Theory

Peer reviewed
PDF on ERIC

Download full text

Direct link

Mantzicopoulos, Panayota; French, Brian F.; Patrick, Helen – Grantee Submission, 2018

Research Findings: We evaluated the score stability of the Mathematical Quality of Instruction (MQI), an observational measure of mathematics instruction. Three raters each scored, independently, 100 video-recorded lessons taught by 20 kindergarten teachers in the spring. Using generalizability theory analyses, we decomposed the MQI's score…

Descriptors: Kindergarten, Mathematics Instruction, Educational Quality, Classroom Observation Techniques

Reliability of the Analytic Rubric and Checklist for the Assessment of Story Writing Skills: G and Decision Study in Generalizability Theory

Peer reviewed
PDF on ERIC

Download full text

Uzun, N. Bilge; Alici, Devrim; Aktas, Mehtap – European Journal of Educational Research, 2019

The purpose of study is to examine the reliability of analytical rubrics and checklists developed for the assessment of story writing skills by means of generalizability theory. The study group consisted of 52 students attending the 5th grade at primary school and 20 raters in Mersin University. The G study was carried out with the fully crossed…

Descriptors: Foreign Countries, Scoring Rubrics, Check Lists, Writing Tests

The Mathematical Quality of Instruction (MQI) in Kindergarten: An Evaluation of the Stability of the MQI Using Generalizability Theory

Peer reviewed

Direct link

Mantzicopoulos, Panayota; French, Brian F.; Patrick, Helen – Early Education and Development, 2018

Descriptors: Kindergarten, Mathematics Instruction, Educational Quality, Classroom Observation Techniques

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8

Educational and Psychological…	6
Language Testing	4
Advances in Health Sciences…	3
Grantee Submission	3
Journal of Educational…	3
Applied Measurement in…	2
Assessment for Effective…	2
Educational Assessment	2
Educational Researcher	2
Educational Sciences: Theory…	2
International Journal of…	2
Journal of Experimental…	2
Language Assessment Quarterly	2
Multivariate Behavioral…	2
ProQuest LLC	2
Reading Psychology	2
School Psychology Review	2
Adapted Physical Activity…	1
Alberta Journal of…	1
American Journal of Evaluation	1
Applied Psychological…	1
Asian Journal of Education…	1
Assessing Writing	1
Assessment & Evaluation in…	1
Behavioral Disorders	1
More ▼

Johnson, Evelyn S.	4
Crawford, Angela R.	3
Moylan, Laura A.	3
Zheng, Yuzhu	3
Abedi, Jamal	2
Aktas, Mehtap	2
Atilgan, Hakan	2
Baker, Eva L.	2
Capie, William	2
Charalambous, Charalambos Y.	2
French, Brian F.	2
Goodwin, Laura D.	2
Li, Mao-Neng Fred	2
Linn, Robert L.	2
Mantzicopoulos, Panayota	2
Patrick, Helen	2
Shavelson, Richard J.	2
Uzun, N. Bilge	2
Ahmet Guven	1
Aksu, Gökhan	1
Aldrich, Jennifer	1
Alici, Devrim	1
Alkahtani, Saif F.	1
Alkhuwaiter, Munirah	1
More ▼