Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 100 |
| Since 2017 (last 10 years) | 195 |
| Since 2007 (last 20 years) | 399 |
Source
| Language Testing | 606 |
Author
| Davies, Alan | 11 |
| Bachman, Lyle F. | 10 |
| Alderson, J. Charles | 8 |
| Elder, Catherine | 8 |
| Knoch, Ute | 8 |
| McNamara, Tim | 8 |
| Yan, Xun | 7 |
| Brunfaut, Tineke | 6 |
| Chapelle, Carol A. | 6 |
| Cho, Yeonsuk | 6 |
| Ginther, April | 6 |
Audience
| Researchers | 1 |
| Teachers | 1 |
Location
| Japan | 31 |
| China | 28 |
| Australia | 25 |
| United Kingdom | 15 |
| Canada | 14 |
| South Korea | 12 |
| Hong Kong | 9 |
| Netherlands | 9 |
| Germany | 8 |
| Europe | 7 |
| Taiwan | 6 |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
| Elementary and Secondary… | 1 |
| Lau v Nichols | 1 |
| Race to the Top | 1 |
Yufan Zhao; Vahid Aryadoust – Language Testing, 2025
This study examined the semantic features of the simulated mini-lectures in the listening sections of the International English Language Testing System (IELTS) and the Test of English as a Foreign Language (TOEFL) based on automatized semantic analysis to explore the content validity of the two tests. Two study corpora were utilized, the IELTS…
Descriptors: Semantics, Computational Linguistics, Academic Language, Second Language Learning
Baghaei, Purya; Christensen, Karl Bang – Language Testing, 2023
C-tests are gap-filling tests mainly used as rough and economical measures of second-language proficiency for placement and research purposes. A C-test usually consists of several short independent passages where the second half of every other word is deleted. Owing to their interdependent structure, C-test items violate the local independence…
Descriptors: Item Response Theory, Language Tests, Language Proficiency, Second Language Learning
Xiaoting Shi; Xiaomei Ma; Wenbo Du; Xuliang Gao – Language Testing, 2024
Cognitive diagnostic assessment (CDA) intends to identify learners' strengths and weaknesses in latent cognitive attributes to provide personalized remedial instructions. Previous CDA studies on English as a Foreign Language (EFL)/English as a Second Language (ESL) writing have adopted dichotomous cognitive diagnostic models (CDMs) to analyze data…
Descriptors: Writing Evaluation, Writing Tests, Diagnostic Tests, English (Second Language)
Takanori Sato – Language Testing, 2024
Assessing the content of learners' compositions is a common practice in second language (L2) writing assessment. However, the construct definition of content in L2 writing assessment potentially underrepresents the target competence in content and language integrated learning (CLIL), which aims to foster not only L2 proficiency but also critical…
Descriptors: Language Tests, Content and Language Integrated Learning, Writing Evaluation, Writing Tests
Junlan Pan; Emma Marsden – Language Testing, 2024
"Tests of Aptitude for Language Learning" (TALL) is an openly accessible internet-based battery to measure the multifaceted construct of foreign language aptitude, using language domain-specific instruments and L1-sensitive instructions and stimuli. This brief report introduces the components of this theory-informed battery and…
Descriptors: Language Tests, Aptitude Tests, Second Language Learning, Test Construction
Knoch, Ute; Deygers, Bart; Khamboonruang, Apichat – Language Testing, 2021
Rating scale development in the field of language assessment is often considered in dichotomous ways: It is assumed to be guided either by expert intuition or by drawing on performance data. Even though quite a few authors have argued that rating scale development is rarely so easily classifiable, this dyadic view has dominated language testing…
Descriptors: Rating Scales, Test Construction, Language Tests, Test Use
Daniel R. Isbell; Dustin Crowther; Hitoshi Nishizawa – Language Testing, 2024
The extrapolation of test scores to a target domain - that is, association between test performances and relevant real-world outcomes - is critical to valid score interpretation and use. This study examined the relationship between Duolingo English Test (DET) speaking scores and university stakeholders' evaluation of DET speaking performances. A…
Descriptors: Language Proficiency, Language Tests, Higher Education, Stakeholders
Yu-Tzu Chang; Ann Tai Choe; Daniel Holden; Daniel R. Isbell – Language Testing, 2024
In this Brief Report, we describe an evaluation of and revisions to a rubric adapted from Jacobs et al.'s (1981) ESL COMPOSITION PROFILE, with four rubric categories and 20-point rating scales, in the context of an intensive English program writing placement test. Analysis of 4 years of rating data (2016-2021, including 434 essays) using…
Descriptors: Language Tests, Rating Scales, Second Language Learning, English (Second Language)
Pearson, William S. – Language Testing, 2023
Many candidates undertaking high-stakes English language proficiency tests for academic enrolment do not achieve the results they need for reasons including linguistic unreadiness, test unpreparedness, illness, an unfavourable configuration of tasks, or administrative and marking errors. Owing to the importance of meeting goals or out of a belief…
Descriptors: High Stakes Tests, English (Second Language), Language Proficiency, Language Tests
Nishizawa, Hitoshi – Language Testing, 2023
In this study, I investigate the construct validity and fairness pertaining to the use of a variety of Englishes in listening test input. I obtained data from a post-entry English language placement test administered at a public university in the United States. In addition to expectedly familiar American English, the test features Hawai'i,…
Descriptors: Construct Validity, Listening Comprehension Tests, Language Tests, English (Second Language)
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
Jeffrey Stewart; Henrik Gyllstad; Christopher Nicklin; Stuart McLean – Language Testing, 2024
The purpose of this paper is to (a) establish whether meaning recall and meaning recognition item formats test psychometrically distinct constructs of vocabulary knowledge which measure separate skills, and, if so, (b) determine whether each construct possesses unique properties predictive of L2 reading proficiency. Factor analyses and…
Descriptors: Vocabulary Development, Psychometrics, Language Tests, Recall (Psychology)
Beverly Baker; Angel Arias; Louis-David Bibeau; Yiwei Qin; Margret Norenberg; Jennifer St-John – Language Testing, 2024
Placement tests are used to support a particular need in a local context--to determine the best starting place for a student entering a specific programme of language study. This brief report will focus on the development of an innovative placement test with self-directed elements for our local needs at a university in Canada for students studying…
Descriptors: Student Placement, Placement Tests, Personal Autonomy, Test Construction
Burton, J. Dylan – Language Testing, 2023
In its 40th year, "Language Testing" journal has served as the flagship journal for scholars, researchers, and practitioners in the field of language testing and assessment. This viewpoint piece, written from the perspective of an emerging scholar, discusses two possible future trends based on evidence going back to the very first issue…
Descriptors: Language Tests, Testing, Futures (of Society), Periodicals
Dongil Shin – Language Testing, 2024
This paper addresses the intersection of testing and policy, situating test-driven impact and validation within the context of policy-led educational reform in Korea. I will briefly review the existing validation models. Then, arguing for an expansion of the conventional conceptualization of consequential validity research, I use Fairclough's…
Descriptors: Educational Policy, Discourse Analysis, Test Validity, Educational Change

