Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 83 |
| Since 2017 (last 10 years) | 173 |
| Since 2007 (last 20 years) | 360 |
Descriptor
Source
| Language Testing | 539 |
Author
| Davies, Alan | 8 |
| Bachman, Lyle F. | 7 |
| Elder, Catherine | 7 |
| Cheng, Liying | 6 |
| Xi, Xiaoming | 6 |
| Yan, Xun | 6 |
| Alderson, J. Charles | 5 |
| Aryadoust, Vahid | 5 |
| Cho, Yeonsuk | 5 |
| Ginther, April | 5 |
| Knoch, Ute | 5 |
| More ▼ | |
Publication Type
Education Level
Audience
Location
| Japan | 33 |
| China | 30 |
| Australia | 23 |
| United Kingdom | 15 |
| Canada | 14 |
| South Korea | 13 |
| Europe | 7 |
| Germany | 6 |
| Hong Kong | 6 |
| Netherlands | 6 |
| New Zealand | 5 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Elementary and Secondary… | 1 |
| Lau v Nichols | 1 |
| Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Llosa, Lorena; Malone, Margaret E. – Language Testing, 2019
Investigating the comparability of students' performance on TOEFL writing tasks and actual academic writing tasks is essential to provide backing for the extrapolation inference in the TOEFL validity argument (Chapelle, Enright, & Jamieson, 2008). This study compared 103 international non-native-English-speaking undergraduate students'…
Descriptors: Computer Assisted Testing, Language Tests, English (Second Language), Second Language Learning
Li, Hongli; Hunter, C. Vincent; Lei, Pui-Wa – Language Testing, 2016
Cognitive diagnostic models (CDMs) have great promise for providing diagnostic information to aid learning and instruction, and a large number of CDMs have been proposed. However, the assumptions and performances of different CDMs and their applications in regard to reading comprehension tests are not fully understood. In the present study, we…
Descriptors: Reading Comprehension, Reading Tests, Models, Comparative Analysis
Trace, Jonathan; Janssen, Gerriet; Meier, Valerie – Language Testing, 2017
Previous research in second language writing has shown that when scoring performance assessments even trained raters can exhibit significant differences in severity. When raters disagree, using discussion to try to reach a consensus is one popular form of score resolution, particularly in contexts with limited resources, as it does not require…
Descriptors: Performance Based Assessment, Second Language Learning, Scoring, Evaluators
McCray, Gareth; Brunfaut, Tineke – Language Testing, 2018
This study investigates test-takers' processing while completing banked gap-fill tasks, designed to test reading proficiency, in order to test theoretically based expectations about the variation in cognitive processes of test-takers across levels of performance. Twenty-eight test-takers' eye traces on 24 banked gap-fill items (on six tasks) were…
Descriptors: Language Tests, Test Items, Item Analysis, Eye Movements
Kang, Okim; Rubin, Don; Kermad, Alyssa – Language Testing, 2019
As a result of the fact that judgments of non-native speech are closely tied to social biases, oral proficiency ratings are susceptible to error because of rater background and social attitudes. In the present study we seek first to estimate the variance attributable to rater background and attitudinal variables on novice raters' assessments of L2…
Descriptors: Evaluators, Second Language Learning, Language Tests, English (Second Language)
Beaudrie, Sara; Amezcua, Angelica; Loza, Sergio – Language Testing, 2019
Critical language awareness (CLA) is increasingly identified as a central component of the Spanish heritage language (SHL) classroom (Leeman, 2005; Martínez, 2003; among others). As a minority language, SHL is subject to sociopolitical, cultural, and economic forces that devalue its status. It is devalued in the eyes of the public, as a legitimate…
Descriptors: Metalinguistics, Heritage Education, Spanish, Second Language Learning
Macqueen, Susy; Knoch, Ute; Wigglesworth, Gillian; Nordlinger, Rachel; Singer, Ruth; McNamara, Tim; Brickle, Rhianna – Language Testing, 2019
All educational testing is intended to have consequences, which are assumed to be beneficial, but tests may also have unintended, negative consequences (Messick, 1989). The issue is particularly important in the case of large-scale standardized tests, such as Australia's "National Assessment Program--Literacy and Numeracy" (NAPLAN), the…
Descriptors: Numeracy, Standardized Tests, National Curriculum, Testing Programs
Poehner, Matthew E.; Zhang, Jie; Lu, Xiaofei – Language Testing, 2015
Dynamic assessment (DA) derives from the sociocultural theory of mind as elaborated by Russian psychologist L. S. Vygotsky. By offering mediation when individuals experience difficulties and carefully tracing their responsiveness, Vygotsky (1998) proposed that diagnoses may uncover abilities that have fully formed as well as those still in the…
Descriptors: Computer Assisted Testing, Second Language Learning, Reading Tests, Listening Comprehension Tests
Davis, Larry – Language Testing, 2016
Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…
Descriptors: Evaluators, Oral Language, Scores, Language Tests
Lee, Shinhye; Winke, Paula – Language Testing, 2018
We investigated how young language learners process their responses on and perceive a computer-mediated, timed speaking test. Twenty 8-, 9-, and 10-year-old non-native English-speaking children (NNSs) and eight same-aged, native English-speaking children (NSs) completed seven computerized sample TOEFL® Primary™ speaking test tasks. We investigated…
Descriptors: Elementary School Students, Second Language Learning, Responses, Computer Assisted Testing
Kuiken, Folkert; Vedder, Ineke – Language Testing, 2017
The importance of functional adequacy as an essential component of L2 proficiency has been observed by several authors (Pallotti, 2009; De Jong, Steinel, Florijn, Schoonen, & Hulstijn, 2012a, b). The rationale underlying the present study is that the assessment of writing proficiency in L2 is not fully possible without taking into account the…
Descriptors: Second Language Learning, Rating Scales, Computational Linguistics, Persuasive Discourse
Yoo, Hanwook; Manna, Venessa F. – Language Testing, 2017
This study assessed the factor structure of the Test of English for International Communication (TOEIC®) Listening and Reading test, and its invariance across subgroups of test-takers. The subgroups were defined by (a) gender, (b) age, (c) employment status, (d) time spent studying English, and (e) having lived in a country where English is the…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Second Language Learning
Beigman Klebanov, Beata; Ramineni, Chaitanya; Kaufer, David; Yeoh, Paul; Ishizaki, Suguru – Language Testing, 2019
Essay writing is a common type of constructed-response task used frequently in standardized writing assessments. However, the impromptu timed nature of the essay writing tests has drawn increasing criticism for the lack of authenticity for real-world writing in classroom and workplace settings. The goal of this paper is to contribute evidence to a…
Descriptors: Test Validity, Writing Tests, Writing Skills, Persuasive Discourse
Khabbazbashi, Nahal – Language Testing, 2017
This study explores the extent to which topic and background knowledge of topic affect spoken performance in a high-stakes speaking test. It is argued that evidence of a substantial influence may introduce construct-irrelevant variance and undermine test fairness. Data were collected from 81 non-native speakers of English who performed on 10…
Descriptors: Speech Tests, High Stakes Tests, English (Second Language), Language Proficiency
Cumming, Alister – Language Testing, 2015
The studies documented in the four articles in this special issue uniquely exemplify principles of design-based research as follows: by taking innovative approaches to significant problems in the contexts of real educational practices; by addressing fundamental pedagogical and policy issues related to language, learning, and teaching; and, in the…
Descriptors: Educational Research, Research Methodology, Instructional Design, Educational Assessment

Peer reviewed
Direct link
