Publication Date
In 2025 | 3 |
Since 2024 | 17 |
Since 2021 (last 5 years) | 35 |
Since 2016 (last 10 years) | 66 |
Since 2006 (last 20 years) | 103 |
Descriptor
Source
Language Testing | 156 |
Author
Chapelle, Carol A. | 4 |
Yan, Xun | 4 |
Aryadoust, Vahid | 3 |
Bachman, Lyle F. | 3 |
Davies, Alan | 3 |
Fulcher, Glenn | 3 |
Shohamy, Elana | 3 |
Alderson, J. Charles | 2 |
August, Diane | 2 |
Beglar, David | 2 |
Brown, James Dean | 2 |
More ▼ |
Publication Type
Journal Articles | 156 |
Reports - Research | 93 |
Reports - Evaluative | 31 |
Opinion Papers | 17 |
Reports - Descriptive | 14 |
Information Analyses | 8 |
Tests/Questionnaires | 5 |
Speeches/Meeting Papers | 2 |
Education Level
Audience
Researchers | 1 |
Teachers | 1 |
Location
Japan | 8 |
China | 6 |
United Kingdom | 5 |
Australia | 4 |
Brazil | 3 |
South Korea | 3 |
United Kingdom (England) | 3 |
Canada | 2 |
Germany | 2 |
Israel | 2 |
New Zealand | 2 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025
This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…
Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests
Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025
The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…
Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis
Maria Treadaway; John Read – Language Testing, 2024
Standard-setting is an essential component of test development, supporting the meaningfulness and appropriate interpretation of test scores. However, in the high-stakes testing environment of aviation, standard-setting studies are underexplored. To address this gap, we document two stages in the standard-setting procedures for the Overseas Flight…
Descriptors: Standard Setting, Diagnostic Tests, High Stakes Tests, English for Special Purposes
Junlan Pan; Emma Marsden – Language Testing, 2024
"Tests of Aptitude for Language Learning" (TALL) is an openly accessible internet-based battery to measure the multifaceted construct of foreign language aptitude, using language domain-specific instruments and L1-sensitive instructions and stimuli. This brief report introduces the components of this theory-informed battery and…
Descriptors: Language Tests, Aptitude Tests, Second Language Learning, Test Construction
Emma Bruce; Karen Dunn; Tony Clark – Language Testing, 2025
Several high-stakes English proficiency tests including but not limited to IELTS, PTE Academic, and TOEFL iBT recommend a 2-year time limit on validity for score usage. Although this timeframe provides a useful rule-of-thumb for the recency of testing, it can have far-reaching consequences. In response to stakeholder queries around IELTS validity…
Descriptors: High Stakes Tests, Language Tests, Test Validity, Scores
Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…
Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods
Development of the American Sign Language Fingerspelling and Numbers Comprehension Test (ASL FaN-CT)
Corrine Occhino; Ryan Lidster; Leah C. Geer; Jason Listman; Peter C. Hauser – Language Testing, 2024
We describe the development and initial validation of the "ASL Fingerspelling and Number Comprehension Test" (ASL FaN-CT), a test of recognition proficiency for fingerspelled words in American Sign Language (ASL). Despite the relative frequency of fingerspelling in ASL discourse, learners commonly struggle to produce and perceive…
Descriptors: Language Tests, Test Construction, Finger Spelling, Test Validity
Read, John – Language Testing, 2023
Published work on vocabulary assessment has grown substantially in the last 10 years, but it is still somewhat outside the mainstream of the field. There has been a recent call for those developing vocabulary tests to apply professional standards to their work, especially in validating their instruments for specified purposes before releasing them…
Descriptors: Language Tests, Vocabulary Development, Second Language Learning, Test Format
Yunwen Su; Sun-Young Shin – Language Testing, 2024
Rating scales that language testers design should be tailored to the specific test purpose and score use as well as reflect the target construct. Researchers have long argued for the value of data-driven scales for classroom performance assessment, because they are specific to pedagogical tasks and objectives, have rich descriptors to offer useful…
Descriptors: Rating Scales, Language Tests, Test Construction, Performance Based Assessment
Amber Dudley; Emma Marsden; Giulia Bovolenta – Language Testing, 2024
Vocabulary knowledge strongly predicts second language reading, listening, writing, and speaking. Yet, few tests have been developed to assess vocabulary knowledge in French. The primary aim of this pilot study was to design and initially validate the Context-Aligned Two Thousand Test (CA-TTT), following open research practices. The CA-TTT is a…
Descriptors: French, Vocabulary Development, Secondary School Students, Language Tests
Ramsey L. Cardwell; Steven W. Nydick; J.R. Lockwood; Alina A. von Davier – Language Testing, 2024
Applicants must often demonstrate adequate English proficiency when applying to postsecondary institutions by taking an English language proficiency test, such as the TOEFL iBT, IELTS Academic, or Duolingo English Test (DET). Concordance tables aim to provide equivalent scores across multiple assessments, helping admissions officers to make fair…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Language Proficiency
Jennifer Randall; Mya Poe; David Slomp; Maria Elena Oliveri – Language Testing, 2024
Educational assessments, from kindergarden to 12th grade (K-12) to licensure, have a long, well-documented history of oppression and marginalization. In this paper, we (the authors) ask the field of educational assessment/measurement to actively disrupt the White supremacist and racist logics that fuel this marginalization and re-orient itself…
Descriptors: Language Tests, Test Validity, Justice, Kindergarten
Ute Knoch; Jason Fan – Language Testing, 2024
While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publically available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…
Descriptors: Language Tests, English, Test Validity, Item Analysis
Daniel R. Isbell; Dustin Crowther; Hitoshi Nishizawa – Language Testing, 2024
The extrapolation of test scores to a target domain - that is, association between test performances and relevant real-world outcomes - is critical to valid score interpretation and use. This study examined the relationship between Duolingo English Test (DET) speaking scores and university stakeholders' evaluation of DET speaking performances. A…
Descriptors: Language Proficiency, Language Tests, Higher Education, Stakeholders
Jeffrey Stewart; Henrik Gyllstad; Christopher Nicklin; Stuart McLean – Language Testing, 2024
The purpose of this paper is to (a) establish whether meaning recall and meaning recognition item formats test psychometrically distinct constructs of vocabulary knowledge which measure separate skills, and, if so, (b) determine whether each construct possesses unique properties predictive of L2 reading proficiency. Factor analyses and…
Descriptors: Vocabulary Development, Psychometrics, Language Tests, Recall (Psychology)