Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 100 |
| Since 2017 (last 10 years) | 195 |
| Since 2007 (last 20 years) | 399 |
Source
| Language Testing | 606 |
Author
| Davies, Alan | 11 |
| Bachman, Lyle F. | 10 |
| Alderson, J. Charles | 8 |
| Elder, Catherine | 8 |
| Knoch, Ute | 8 |
| McNamara, Tim | 8 |
| Yan, Xun | 7 |
| Brunfaut, Tineke | 6 |
| Chapelle, Carol A. | 6 |
| Cho, Yeonsuk | 6 |
| Ginther, April | 6 |
Audience
| Researchers | 1 |
| Teachers | 1 |
Location
| Japan | 31 |
| China | 28 |
| Australia | 25 |
| United Kingdom | 15 |
| Canada | 14 |
| South Korea | 12 |
| Hong Kong | 9 |
| Netherlands | 9 |
| Germany | 8 |
| Europe | 7 |
| Taiwan | 6 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
| Elementary and Secondary… | 1 |
| Lau v Nichols | 1 |
| Race to the Top | 1 |
Warnby, Marcus; Malmström, Hans; Hansen, Kajsa Yang – Language Testing, 2023
The academic section of the Vocabulary Levels Test (VLT-Ac) and the Academic Vocabulary Test (AVT) both assess meaning-recognition knowledge of written receptive academic vocabulary, deemed central for engagement in academic activities. Depending on the purpose and context of the testing, either of the tests can be appropriate, but for research…
Descriptors: Foreign Countries, Scores, Written Language, Receptive Language
Allen, David; Nakamura, Keita – Language Testing, 2023
Although there is abundant evidence for the use of first-language (L1) knowledge by bilinguals when using a second language (L2), investigation into the impact of L1 knowledge in large-scale L2 language assessments and discussion of how such impact may be controlled has received little attention in the language assessment literature. This study…
Descriptors: Language Tests, Second Language Learning, Contrastive Linguistics, English (Second Language)
Liu, Tingting; Aryadoust, Vahid; Foo, Stacy – Language Testing, 2022
This study evaluated the validity of the Michigan English Test (MET) Listening Section by investigating its underlying factor structure and the replicability of its factor structure across multiple test forms. Data from 3255 test takers across four forms of the MET Listening Section were used. To investigate the factor structure, each form was…
Descriptors: Factor Structure, Language Tests, Second Language Learning, Second Language Instruction
Min, Shangchao; Zhang, Juan; Li, Yue; He, Lianzhen – Language Testing, 2022
Local language tests are an arena where national language standards can be operationalized to create a hub for integrating assessment results and language support. Few studies, however, have examined the operationalization of national standards in local language assessment contexts. In this study, we proposed a model to present the integration of…
Descriptors: Language Tests, Listening Comprehension Tests, Second Language Learning, English (Second Language)
Knoch, Ute; Huisman, Annemiek; Elder, Cathie; Kong, Xiaoxiao; McKenna, Angela – Language Testing, 2020
A key concern of washback research in language testing is with the value of test preparation for facilitating learning and improving test performance. Although test takers may draw on a wide range of preparation activities, the majority of research studies examining test preparation have taken place in classroom settings, leaving self-access…
Descriptors: Test Preparation, Repetition, Language Tests, English for Academic Purposes
Treadaway, Maria; Read, John – Language Testing, 2024
Standard-setting is an essential component of test development, supporting the meaningfulness and appropriate interpretation of test scores. However, in the high-stakes testing environment of aviation, standard-setting studies are underexplored. To address this gap, we document two stages in the standard-setting procedures for the Overseas Flight…
Descriptors: Standard Setting, Diagnostic Tests, High Stakes Tests, English for Special Purposes
Li, Minzi; Zhang, Xian – Language Testing, 2021
This meta-analysis explores the correlation between self-assessment (SA) and language performance. Sixty-seven studies with 97 independent samples involving more than 68,500 participants were included in our analysis. It was found that the overall correlation between SA and language performance was 0.466 (p < 0.01). Moderator analysis was…
Descriptors: Meta Analysis, Self Evaluation (Individuals), Likert Scales, Research Reports
Lin, You-Min; Chen, Michelle Y. – Language Testing, 2020
This study examined the writing score and writing feature changes of 562 repeat test takers who took the Canadian English Language Proficiency Index Program-General (CELPIP-General) test at least three times, with a short (30-40 day) interval between the first and second attempts and a longer (90-180 day) interval between the first and third…
Descriptors: Language Tests, Standardized Tests, Language Proficiency, Writing Tests
Yan, Xun; Staples, Shelley – Language Testing, 2020
The argument-based approach to validity (Kane, 2013) focuses on two steps: (1) making claims about the proposed interpretation and use of test scores as a coherent, interpretive argument; and (2) evaluating those claims based on theoretical and empirical evidence related to test performances and scores. This paper discusses the role of…
Descriptors: Writing Tests, Language Tests, Language Proficiency, Test Validity
Liao, Ray J. T. – Language Testing, 2023
Among the variety of selected response formats used in L2 reading assessment, multiple-choice (MC) is the most commonly adopted, primarily due to its efficiency and objectiveness. Given the impact of assessment results on teaching and learning, it is necessary to investigate the degree to which the MC format reliably measures learners' L2 reading…
Descriptors: Reading Tests, Language Tests, Second Language Learning, Second Language Instruction
Gokturk, Nazlinur; Chukharev-Hudilainen, Evgeny – Language Testing, 2023
With recent technological advances, researchers have begun to explore the potential use of spoken dialog systems (SDSs) for L2 oral communication assessment. While several studies support the feasibility of building these systems for various types of oral tasks, research on the construct validity of SDS-delivered tasks is still limited. Thus, this…
Descriptors: Oral Language, Dialogs (Language), Second Language Learning, Second Language Instruction
Peng, Yue; Yan, Wei; Cheng, Liying – Language Testing, 2021
This test review focuses on the current version (2009) of [Chinese characters omitted] (Hanyu Shuiping Kaoshi), literally translated as the Chinese Language Proficiency Test and abbreviated as HSK. Tailored to non-native speakers of the Chinese language, this test consists of six proficiency levels (Levels 1 and 2 as beginners, Levels 3 and 4 as…
Descriptors: Language Proficiency, Language Tests, Chinese, Decision Making
Youn, Soo Jung – Language Testing, 2020
This qualitative study reports an investigation of the nature of interactional competence at various levels of achievement in the context of role-play speaking assessment. The focal point of this study is on how examinees jointly accomplish the interactional work involved in proposal sequences in role-play interaction. Based on a conversation…
Descriptors: Role Playing, Interaction, Test Validity, Communicative Competence (Languages)
Christensen, Laurene L.; Shyyan, Vitaliy V.; MacMillan, Fabiana – Language Testing, 2023
In order to make assessments as widely accessible as possible, including to young learners from diverse backgrounds with a wide range of individual needs and characteristics, some developers of standardized tests have resorted to offering accessibility tools (e.g., magnifying/zoom) and accommodations (e.g., extended response time) to test takers.…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Proficiency
Aryadoust, Vahid; Ng, Li Ying; Sayama, Hiroki – Language Testing, 2021
Over the past decades, the application of Rasch measurement in language assessment has gradually increased. In the present study, we coded 215 papers using Rasch measurement published in 21 applied linguistics journals for multiple features. We found that seven Rasch models and 23 software packages were adopted in these papers, with many-facet…
Descriptors: Language Tests, Testing, Test Items, Network Analysis