Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 7 |
Descriptor
Generalizability Theory | 8 |
English (Second Language) | 4 |
Language Tests | 4 |
Scores | 4 |
Evaluators | 3 |
Interrater Reliability | 3 |
Language Proficiency | 3 |
Test Items | 3 |
Accuracy | 2 |
Data Analysis | 2 |
English Language Learners | 2 |
More ▼ |
Source
Language Testing | 8 |
Author
Lin, Chih-Kai | 2 |
Attali, Yigal | 1 |
Ewert, Doreen | 1 |
Kozaki ,Y. | 1 |
Oostdam, Ron | 1 |
Shin, Ji-young | 1 |
Shin, Sun-Young | 1 |
Zhang, Jinming | 1 |
Zhang, Su | 1 |
van Gelderen, Amos | 1 |
van Steensel, Roel | 1 |
More ▼ |
Publication Type
Journal Articles | 8 |
Reports - Research | 8 |
Education Level
Higher Education | 2 |
Postsecondary Education | 2 |
Elementary Secondary Education | 1 |
Grade 7 | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Location
Netherlands | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 1 |
Test of English for… | 1 |
What Works Clearinghouse Rating
Shin, Ji-young – Language Testing, 2022
With the present study I investigated the sources of score variance and dependability in a local oral English proficiency test for potential international teaching assistants (ITAs) across four first language (L1) groups, and suggested alternative test designs. Using generalizability theory, I examined the relative importance of L1s (i.e., Indian,…
Descriptors: Foreign Students, Language Tests, Language Proficiency, Oral Language
Lin, Chih-Kai – Language Testing, 2017
Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…
Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy
Attali, Yigal – Language Testing, 2016
A short training program for evaluating responses to an essay writing task consisted of scoring 20 training essays with immediate feedback about the correct score. The same scoring session also served as a certification test for trainees. Participants with little or no previous rating experience completed this session and 14 trainees who passed an…
Descriptors: Writing Evaluation, Writing Tests, Standardized Tests, Evaluators
Lin, Chih-Kai; Zhang, Jinming – Language Testing, 2014
Research on the relationship between English language proficiency standards and academic content standards serves to provide information about the extent to which English language learners (ELLs) are expected to encounter academic language use that facilitates their content learning, such as in mathematics and science. Standards-to-standards…
Descriptors: Language Proficiency, Academic Standards, Generalizability Theory, English Language Learners
Shin, Sun-Young; Ewert, Doreen – Language Testing, 2015
Reading-to-write (RTW) tasks are becoming increasingly popular and have already been used in several high-stakes English proficiency exams, either replacing or complementing a prompt-based essay test. However, it is still not clear that what accounts for successful or unsuccessful performance on an integrated reading-writing task is owing to the…
Descriptors: English (Second Language), Language Tests, Language Proficiency, Test Items
van Steensel, Roel; Oostdam, Ron; van Gelderen, Amos – Language Testing, 2013
On the basis of a validation study of a new test for assessing low-achieving adolescents' reading comprehension skills--the SALT-reading--we analyzed two issues relevant to the field of reading test development. Using the test results of 200 seventh graders, we examined the possibility of identifying reading comprehension subskills and the effects…
Descriptors: Adolescents, Low Achievement, Reading Comprehension, Reading Tests
Zhang, Su – Language Testing, 2006
This study applied generalizability theory to investigate the contributions of persons, items, sections, and language backgrounds to the score dependability of the Test of English for International Communication (TOEIC). I replicated and extended Brown's (1999) study of the Test of English as a Foreign Language (TOEFL), using data from two…
Descriptors: Communication (Thought Transfer), Generalizability Theory, English (Second Language), Scores
Kozaki ,Y. – Language Testing, 2004
This article presents a standard-setting procedure for performance assessment in a foreign language, through which some of the major problems facing performance assessment in criterion-referenced testing can be addressed. The procedure, which was geared to revealing and accommodating inter-judge variability, employed the synergy of multiple…
Descriptors: Data Analysis, Testing, Performance Tests, Generalizability Theory