ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	3

Descriptor

Comparative Analysis	6
Generalizability Theory	6
Scoring	6
Reliability	3
Scoring Rubrics	3
Error of Measurement	2
Social Studies	2
Automation	1
Childrens Writing	1
Computer Assisted Testing	1
Decision Making	1
Grade 1	1
Higher Education	1
Inservice Teacher Education	1
Interrater Reliability	1
Islam	1
Item Response Theory	1
Junior High School Students	1
Junior High Schools	1
Kindergarten	1
Measurement	1
Medical Students	1
Multivariate Analysis	1
Physicians	1
Primary Education	1
More ▼

Source

Applied Measurement in…	2
ProQuest LLC	2
Asia Pacific Education Review	1
Reading Psychology	1

Author

Alkahtani, Saif F.	1
Chon, Kyong Hee	1
Clauser, Brian E.	1
Clyman, Stephen G.	1
Daniel, Cathy	1
Dellinger, Amy	1
Denny, R. Kenton	1
Lengh, Carolyn J.	1
Marzano, Robert J.	1
Noh, Eun Hee	1
Powers, Taylor	1
Stuhlmann, Janice	1
Sung, Kyung Hee	1
Swanson, David B.	1
More ▼

Publication Type

Journal Articles	4
Reports - Research	4
Dissertations/Theses -…	2

Education Level

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Multivariate Generalizability Analysis of Automated Scoring for Short Answer Items of Social Studies in Large-Scale Assessment

Peer reviewed

Direct link

Sung, Kyung Hee; Noh, Eun Hee; Chon, Kyong Hee – Asia Pacific Education Review, 2017

With increased use of constructed response items in large scale assessments, the cost of scoring has been a major consideration (Noh et al. in KICE Report RRE 2012-6, 2012; Wainer and Thissen in "Applied Measurement in Education" 6:103-118, 1993). In response to the scoring cost issues, various forms of automated system for scoring…

Descriptors: Automation, Scoring, Social Studies, Test Items

Oral Performace Scoring Using Generalizability Theory and Many-Facet Rasch Measurement: A Comparison Study

Direct link

Alkahtani, Saif F. – ProQuest LLC, 2012

The principal aim of the present study was to better guide the Quranic recitation appraisal practice by presenting an application of Generalizability theory and Many-facet Rasch Measurement Model for assessing the dependability and fit of two suggested rubrics. Recitations of 93 students were rated holistically and analytically by 3 independent…

Descriptors: Generalizability Theory, Item Response Theory, Verbal Tests, Islam

Generalizability Theory: Measuring the Dependability of Selected Methods for Scoring Classroom Assessments

Direct link

Lengh, Carolyn J. – ProQuest LLC, 2010

This study compares the dependability of four classroom assessment scoring methods. Generalizability theory (G) and alternative decision (D) are used to measure the results of students' classroom assessment scores and compare the results of the four scoring methods on variability of rater by person variance and the level of G and D coefficients…

Descriptors: Generalizability Theory, Scoring, Social Studies, Tests

A Comparison of the Generalizability of Scores Produced by Expert Raters and Automated Scoring Systems.

Peer reviewed

Clauser, Brian E.; Swanson, David B.; Clyman, Stephen G. – Applied Measurement in Education, 1999

Performed generalizability analyses of expert ratings and computer-produced scores for a computer-delivered performance assessment of physicians' patient management skills. The two automated scoring systems produced scores for the 200 medical students that were approximately as generalizable as those produced by the four expert raters. (SLD)

Descriptors: Comparative Analysis, Computer Assisted Testing, Generalizability Theory, Higher Education

A Comparison of Selected Methods of Scoring Classroom Assessments.

Peer reviewed

Marzano, Robert J. – Applied Measurement in Education, 2002

Two studies, each involving 10 eighth graders,compared the findings from generalizability (G) studies and alternative decision (D) studies for 4 approaches to scoring classroom assessments. In terms of less rater x person variability and higher G and D coefficients, the methods ranked in this order: topic-specific rubric, constrained point,…

Descriptors: Comparative Analysis, Decision Making, Generalizability Theory, Junior High School Students

A Generalizability Study of the Effects of Training on Teachers' Abilities to Rate Children's Writing Using a Rubric.

Peer reviewed

Stuhlmann, Janice; Daniel, Cathy; Dellinger, Amy; Denny, R. Kenton; Powers, Taylor – Reading Psychology, 1999

Investigates whether training raters to interpret the scoring dimensions on a rubric would increase reliability. Compares two groups of kindergarten and first-grade teachers: one group with training, one without. Finds that training increases raters' abilities to reliably interpret scoring items. (SC)

Descriptors: Childrens Writing, Comparative Analysis, Generalizability Theory, Grade 1