ERIC - Search Results

Publication Date

In 2026	0
Since 2025	0
Since 2022 (last 5 years)	0
Since 2017 (last 10 years)	0
Since 2007 (last 20 years)	2

Descriptor

Evaluation Methods	3
Generalizability Theory	3
Reliability	3
Second Language Learning	3
English (Second Language)	2
Scores	2
Writing Skills	2
College Students	1
Error of Measurement	1
Essays	1
Evaluators	1
Holistic Approach	1
Language Tests	1
Listening Skills	1
Oral Language	1
Reading Comprehension	1
Reading Skills	1
Scoring	1
Story Telling	1
Student Evaluation	1
Writing (Composition)	1
Writing Evaluation	1
Writing Tests	1
More ▼

Source

International Journal of…	1
Language Testing in Asia	1
Reading Research and…	1

Author

Glissmeyer, Connie B.	1
Kantor, Robert	1
Lee, Yong-Won	1
Luo, Juan	1
Morrison, Timothy G.	1
Sudweeks, Richard R.	1
Tanner, Mark W.	1
Wilcox, Bradley R.	1
Xiao, Yunnan	1
Zhang, Bo	1

Publication Type

Journal Articles	3
Reports - Research	3

Education Level

Higher Education

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 3 results Save | Export

Rater Reliability and Score Discrepancy under Holistic and Analytic Scoring of Second Language Writing

Peer reviewed

Direct link

Zhang, Bo; Xiao, Yunnan; Luo, Juan – Language Testing in Asia, 2015

Previous studies comparing holistic scoring to analytic scoring of second language writing have given mixed results. Some of them suffer from methodological drawbacks, such as limited writing sample size, limited number of raters, and lack of direct comparison of the two methods. Based on 300 writing samples graded by 14 raters, this research…

Descriptors: Evaluators, Reliability, Scores, Holistic Approach

Evaluating Prototype Tasks and Alternative Rating Schemes for a New ESL Writing Test through G-Theory

Peer reviewed

Direct link

Lee, Yong-Won; Kantor, Robert – International Journal of Testing, 2007

Possible integrated and independent tasks were pilot tested for the writing section of a new generation of the TOEFL[R] (Test of English as a Foreign Language[TM]). This study examines the impact of various rating designs and of the number of tasks and raters on the reliability of writing scores based on integrated and independent tasks from the…

Descriptors: Generalizability Theory, Writing Tests, English (Second Language), Second Language Learning

Establishing Reliable Procedures for Rating ELL Students' Reading Comprehension Using Oral Retellings

Peer reviewed

Direct link

Sudweeks, Richard R.; Glissmeyer, Connie B.; Morrison, Timothy G.; Wilcox, Bradley R.; Tanner, Mark W. – Reading Research and Instruction, 2004

Oral retellings are strongly recommended as a way to measure reading comprehension for second language learners (Bernhardt, 1985, 1990, 1991). However, the reliability of such ratings is a matter of concern for a variety of reasons (Aiken, 1996; Cooper, 1981; Saal, Downey, & Lahey, 1980). The purpose of this study was to establish reliable rating…

Descriptors: Error of Measurement, Generalizability Theory, Reading Comprehension, Second Language Learning