ERIC - Search Results

Showing all 9 results

Evaluating an Explicit Instruction Teacher Observation Protocol through a Validity Argument Approach

Peer reviewed

Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Journal of Experimental Education, 2022

In this study, we examined the scoring and generalizability assumptions of an explicit instruction (EI) special education teacher observation protocol using many-faceted Rasch measurement (MFRM). Video observations of classroom instruction from 48 special education teachers across four states were collected. External raters (n = 20) were trained…

Descriptors: Direct Instruction, Teacher Education, Classroom Observation Techniques, Validity

Evaluating an Explicit Instruction Teacher Observation Protocol through a Validity Argument Approach

Peer reviewed

Johnson, Evelyn S.; Zheng, Yuzhu; Crawford, Angela R.; Moylan, Laura A. – Grantee Submission, 2020

In this study, we examined the scoring and generalizability assumptions of an Explicit Instruction (EI) special education teacher observation protocol using many-faceted Rasch measurement (MFRM). Video observations of classroom instruction from 48 special education teachers across four states were collected. External raters (n = 20) were trained…

Descriptors: Direct Instruction, Teacher Evaluation, Classroom Observation Techniques, Validity

Investigating Score Dependability in English/Chinese Interpreter Certification Performance Testing: A Generalizability Theory Approach

Peer reviewed

Han, Chao – Language Assessment Quarterly, 2016

As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…

Descriptors: Foreign Countries, Scores, English, Chinese

Observer Ratings of Instructional Quality: Do They Fulfill What They Promise?

Peer reviewed

Praetorius, Anna-Katharina; Lenske, Gerlinde; Helmke, Andreas – Learning and Instruction, 2012

Despite considerable interest in the topic of instructional quality in research as well as practice, little is known about the quality of its assessment. Using generalizability analysis as well as content analysis, the present study investigates how reliably and validly instructional quality is measured by observer ratings. Twelve trained raters…

Descriptors: Student Teachers, Interrater Reliability, Content Analysis, Observation

Application of Generalizability Theory to Concept Map Assessment Research

Peer reviewed

Yin, Yue; Shavelson, Richard J. – Applied Measurement in Education, 2008

In the first part of this article, the use of Generalizability (G) theory in examining the dependability of concept map assessment scores and designing a concept map assessment for a particular practical application is discussed. In the second part, the application of G theory is demonstrated by comparing the technical qualities of two frequently…

Descriptors: Generalizability Theory, Concept Mapping, Validity, Reliability

The Equivalence of Rater Sources on Job Analysis Ratings.

Camara, Wayne J. – 1986

Previous efforts to investigate the equivalence of rating sources for job analysis ratings have reported conflicting results. In the present research, correlational and generalizability analyses were conducted to examine the equivalency of rating sources for over 70 state civil service job classifications. Incumbent and supervisor ratings (N=697)…

Descriptors: Evaluators, Generalizability Theory, Interrater Reliability, Job Analysis

Resolving Score Differences in the Rating of Writing Samples: Does Discussion Improve the Accuracy of Scores?

Peer reviewed

Johnson, Robert L.; Penny, James; Gordon, Belita; Shumate, Steven R.; Fisher, Steven P. – Language Assessment Quarterly, 2005

Many studies have indicated that at least 2 raters should score writing assessments to improve interrater reliability. However, even for assessments that characteristically demonstrate high levels of rater agreement, 2 raters of the same essay can occasionally report different, or discrepant, scores. If a single score, typically referred to as an…

Descriptors: Interrater Reliability, Scores, Evaluation, Reliability

Behavioral Interviews in School Psychology: Issues in Psychometric Adequacy and Research.

Peer reviewed

Gresham, Frank M. – School Psychology Review, 1984

The evidence for the psychometric adequacy of behavioral interviews in terms of traditional psychometric theory and generalizability theory is reviewed. The review resulted in the conclusion that behavioral interviews have some evidence for interrater reliability, content validity, and criterion-related validity. Additional research in several…

Descriptors: Behavior Patterns, Behavior Problems, Functional Behavioral Assessment, Generalizability Theory

A Generalizability Study of the Angoff Method Applied to Setting Cutoff Scores of Professional Certification Tests.

Cope, Ronald T. – 1987

This study used generalizability theory and other statistical concepts to assess the application of the Angoff method to setting cutoff scores on two professional certification tests. A panel of ten judges gave pre- and post-feedback Angoff probability ratings of items of two forms of a professional certification test, and another panel of nine…

Descriptors: Certification, Correlation, Cutting Scores, Error of Measurement