ERIC - Search Results

Showing all 10 results

The Consistency of TOEIC® Speaking Scores across Ratings and Tasks. Research Report. ETS RR-17-46

Peer reviewed

Schmidgall, Jonathan E. – ETS Research Report Series, 2017

This report briefly reviews the design and scoring procedure for the TOEIC® Speaking test and summarizes existing evidence about the consistency of TOEIC Speaking test scores. It then describes several analyses conducted using generalizability theory to provide additional information about the consistency of scores across different…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Speech Tests

Rater Reliability and Score Discrepancy under Holistic and Analytic Scoring of Second Language Writing

Peer reviewed

Zhang, Bo; Xiao, Yunnan; Luo, Juan – Language Testing in Asia, 2015

Previous studies comparing holistic scoring to analytic scoring of second language writing have given mixed results. Some of them suffer from methodological drawbacks, such as limited writing sample size, limited number of raters, and lack of direct comparison of the two methods. Based on 300 writing samples graded by 14 raters, this research…

Descriptors: Evaluators, Reliability, Scores, Holistic Approach

Who Is Given Tests in What Language by Whom, When, and Where? The Need for Probabilistic Views of Language in the Testing of English Language Learners

Peer reviewed

Solano-Flores, Guillermo – Educational Researcher, 2008

The testing of English language learners (ELLs) is, to a large extent, a random process because of poor implementation and factors that are uncertain or beyond control. Yet current testing practices and policies appear to be based on deterministic views of language and linguistic groups and erroneous assumptions about the capacity of assessment…

Descriptors: Generalizability Theory, Testing, Second Language Learning, Error of Measurement

The Score Reliability of the Test of Spoken English (TSE) from the Generalizability Theory Perspective: Validating the Current Procedure.

Lee, Yong-Won; Golub-Smith, Marna; Payton, Carmen; Carey, Jill – 2001

This study investigated the validity of the current reliability estimation procedure for the Test of Spoken English (TSE), a tape-mediated semi-performance test of 12 speaking tasks, from the perspective of generalizability theory and examined the feasibility of shortening the test without compromising the psychometric quality of the test. Data…

Descriptors: Adults, English (Second Language), Estimation (Mathematics), Generalizability Theory

How Accurate Are ESL Students' Holistic Writing Scores on Large-Scale Assessments?--A Generalizability Theory Approach

Peer reviewed

Huang, Jinyan – Assessing Writing, 2008

Using generalizability theory, this study examined both the rating variability and reliability of ESL students' writing in the provincial English examinations in Canada. Three years' data were used in order to complete the analyses and examine the stability of the results. The major research question that guided this study was: Are there any…

Descriptors: Generalizability Theory, Foreign Countries, English (Second Language), Writing Tests

Evaluating Prototype Tasks and Alternative Rating Schemes for a New ESL Writing Test through G-Theory

Peer reviewed

Lee, Yong-Won; Kantor, Robert – International Journal of Testing, 2007

Possible integrated and independent tasks were pilot tested for the writing section of a new generation of the TOEFL® (Test of English as a Foreign Language™). This study examines the impact of various rating designs and of the number of tasks and raters on the reliability of writing scores based on integrated and independent tasks from the…

Descriptors: Generalizability Theory, Writing Tests, English (Second Language), Second Language Learning

Score Reliability as an Essential Prerequisite for Validating New Writing and Speaking Tasks for TOEFL.

Lee, Yong-Won; Kantor, Robert; Mollaun, Pam – 2002

This paper reports the results of generalizability theory (G) analyses done for new writing and speaking tasks for the Test of English as a Foreign Language (TOEFL). For writing, a special focus was placed on evaluating the impact on the reliability of the number of raters (or ratings) per essay (one or two) and the number of tasks (one, two, or…

Descriptors: English (Second Language), Generalizability Theory, Reliability, Scores

Dependability of New ESL Writing Test Scores: Evaluating Prototype Tasks and Alternative Rating Schemes. TOEFL® Monograph Series. MS-31. ETS RR-05-14

Peer reviewed

Lee, Yong-Won; Kantor, Robert – ETS Research Report Series, 2005

Possible integrated and independent tasks were pilot tested for the writing section of a new generation of TOEFL® (Test of English as a Foreign Language™) examination. This study examines the impact of various rating designs as well as the impact of the number of tasks and raters on the reliability of writing scores based on integrated and…

Descriptors: Language Tests, English (Second Language), Second Language Learning, Writing Tests

Score Dependability of the Writing and Speaking Sections of New TOEFL.

Lee, Yong-Won; Kantor, Robert; Mollaun, Pam – 2002

This study examines the score dependability of writing and speaking assessments from the Test of English as a Foreign Language (TOEFL) from the perspectives of univariate and multivariate generalizability theory (G-theory) and presents the findings of three separate G-theory studies. For writing, the focus was on evaluating the impact on…

Descriptors: Ability, English (Second Language), Generalizability Theory, Item Bias

Establishing Reliable Procedures for Rating ELL Students' Reading Comprehension Using Oral Retellings

Peer reviewed

Sudweeks, Richard R.; Glissmeyer, Connie B.; Morrison, Timothy G.; Wilcox, Bradley R.; Tanner, Mark W. – Reading Research and Instruction, 2004

Oral retellings are strongly recommended as a way to measure reading comprehension for second language learners (Bernhardt, 1985, 1990, 1991). However, the reliability of such ratings is a matter of concern for a variety of reasons (Aiken, 1996; Cooper, 1981; Saal, Downey, & Lahey, 1980). The purpose of this study was to establish reliable rating…

Descriptors: Error of Measurement, Generalizability Theory, Reading Comprehension, Second Language Learning