Publication Date
In 2025 | 4 |
Since 2024 | 19 |
Descriptor
Language Tests | 18 |
Test Validity | 17 |
Second Language Learning | 10 |
Foreign Countries | 9 |
English (Second Language) | 7 |
Language Proficiency | 7 |
Test Construction | 6 |
Scores | 5 |
High Stakes Tests | 4 |
Test Items | 4 |
Test Reliability | 4 |
More ▼ |
Source
Language Testing | 19 |
Author
Emma Marsden | 2 |
Vahid Aryadoust | 2 |
Alina A. von Davier | 1 |
Amber Dudley | 1 |
Averil Coxhead | 1 |
Christopher Nicklin | 1 |
Corrine Occhino | 1 |
Daniel R. Isbell | 1 |
David Slomp | 1 |
Dongil Shin | 1 |
Dustin Crowther | 1 |
More ▼ |
Publication Type
Journal Articles | 19 |
Reports - Research | 15 |
Reports - Descriptive | 2 |
Information Analyses | 1 |
Reports - Evaluative | 1 |
Tests/Questionnaires | 1 |
Education Level
Audience
Location
Australia | 2 |
China | 2 |
New Zealand | 2 |
United Kingdom | 2 |
Canada | 1 |
Japan | 1 |
South Korea | 1 |
United Kingdom (England) | 1 |
United States | 1 |
Vietnam | 1 |
Laws, Policies, & Programs
Assessments and Surveys
International English… | 4 |
Test of English as a Foreign… | 2 |
ACT Assessment | 1 |
Test of English for… | 1 |
What Works Clearinghouse Rating
Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024
This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…
Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods
Jennifer Randall; Mya Poe; David Slomp; Maria Elena Oliveri – Language Testing, 2024
Educational assessments, from kindergarden to 12th grade (K-12) to licensure, have a long, well-documented history of oppression and marginalization. In this paper, we (the authors) ask the field of educational assessment/measurement to actively disrupt the White supremacist and racist logics that fuel this marginalization and re-orient itself…
Descriptors: Language Tests, Test Validity, Justice, Kindergarten
Ute Knoch; Jason Fan – Language Testing, 2024
While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publically available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…
Descriptors: Language Tests, English, Test Validity, Item Analysis
Emma Bruce; Karen Dunn; Tony Clark – Language Testing, 2025
Several high-stakes English proficiency tests including but not limited to IELTS, PTE Academic, and TOEFL iBT recommend a 2-year time limit on validity for score usage. Although this timeframe provides a useful rule-of-thumb for the recency of testing, it can have far-reaching consequences. In response to stakeholder queries around IELTS validity…
Descriptors: High Stakes Tests, Language Tests, Test Validity, Scores
Development of the American Sign Language Fingerspelling and Numbers Comprehension Test (ASL FaN-CT)
Corrine Occhino; Ryan Lidster; Leah C. Geer; Jason Listman; Peter C. Hauser – Language Testing, 2024
We describe the development and initial validation of the "ASL Fingerspelling and Number Comprehension Test" (ASL FaN-CT), a test of recognition proficiency for fingerspelled words in American Sign Language (ASL). Despite the relative frequency of fingerspelling in ASL discourse, learners commonly struggle to produce and perceive…
Descriptors: Language Tests, Test Construction, Finger Spelling, Test Validity
Xueliang Chen; Vahid Aryadoust; Wenxin Zhang – Language Testing, 2025
The growing diversity among test takers in second or foreign language (L2) assessments makes the importance of fairness front and center. This systematic review aimed to examine how fairness in L2 assessments was evaluated through differential item functioning (DIF) analysis. A total of 83 articles from 27 journals were included in a systematic…
Descriptors: Second Language Learning, Language Tests, Test Items, Item Analysis
Yunwen Su; Sun-Young Shin – Language Testing, 2024
Rating scales that language testers design should be tailored to the specific test purpose and score use as well as reflect the target construct. Researchers have long argued for the value of data-driven scales for classroom performance assessment, because they are specific to pedagogical tasks and objectives, have rich descriptors to offer useful…
Descriptors: Rating Scales, Language Tests, Test Construction, Performance Based Assessment
Ramsey L. Cardwell; Steven W. Nydick; J.R. Lockwood; Alina A. von Davier – Language Testing, 2024
Applicants must often demonstrate adequate English proficiency when applying to postsecondary institutions by taking an English language proficiency test, such as the TOEFL iBT, IELTS Academic, or Duolingo English Test (DET). Concordance tables aim to provide equivalent scores across multiple assessments, helping admissions officers to make fair…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Language Proficiency
Louise Palmour – Language Testing, 2024
This article explores the nature of the construct underlying classroom-based English for academic purpose (EAP) oral presentation assessments, which are used, in part, to determine admission to programmes of study at UK universities. Through analysis of qualitative data (from questionnaires, interviews, rating discussions, and fieldnotes), the…
Descriptors: English for Academic Purposes, Public Speaking, College Students, Foreign Countries
Thi My Hang Nguyen; Peter Gu; Averil Coxhead – Language Testing, 2024
Despite extensive research on assessing collocational knowledge, valid measures of academic collocations remain elusive. With the present study, we employ an argument-based approach to validate two Academic Collocation Tests (ACTs) that assess the ability to recognize and produce academic collocations (i.e., two-word units such as "key…
Descriptors: Foreign Countries, College Students, College Entrance Examinations, English (Second Language)
Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025
This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…
Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests
Yufan Zhao; Vahid Aryadoust – Language Testing, 2025
This study examined the semantic features of the simulated mini-lectures in the listening sections of the International English Language Testing System (IELTS) and the Test of English as a Foreign Language (TOEFL) based on automatized semantic analysis to explore the content validity of the two tests. Two study corpora were utilized, the IELTS…
Descriptors: Semantics, Computational Linguistics, Academic Language, Second Language Learning
Junlan Pan; Emma Marsden – Language Testing, 2024
"Tests of Aptitude for Language Learning" (TALL) is an openly accessible internet-based battery to measure the multifaceted construct of foreign language aptitude, using language domain-specific instruments and L1-sensitive instructions and stimuli. This brief report introduces the components of this theory-informed battery and…
Descriptors: Language Tests, Aptitude Tests, Second Language Learning, Test Construction
Daniel R. Isbell; Dustin Crowther; Hitoshi Nishizawa – Language Testing, 2024
The extrapolation of test scores to a target domain - that is, association between test performances and relevant real-world outcomes - is critical to valid score interpretation and use. This study examined the relationship between Duolingo English Test (DET) speaking scores and university stakeholders' evaluation of DET speaking performances. A…
Descriptors: Language Proficiency, Language Tests, Higher Education, Stakeholders
Jeffrey Stewart; Henrik Gyllstad; Christopher Nicklin; Stuart McLean – Language Testing, 2024
The purpose of this paper is to (a) establish whether meaning recall and meaning recognition item formats test psychometrically distinct constructs of vocabulary knowledge which measure separate skills, and, if so, (b) determine whether each construct possesses unique properties predictive of L2 reading proficiency. Factor analyses and…
Descriptors: Vocabulary Development, Psychometrics, Language Tests, Recall (Psychology)
Previous Page | Next Page ยป
Pages: 1 | 2