Stefan O'Grady – TESOL Journal, 2025
Task-based language assessment represents a major component of task-based language teaching syllabi. Current perspectives emphasise the importance of tasks in the assessment process, suggesting that adherence to influential models of language production during task design yields predictable test outcomes. The current study contends that the…
Descriptors: Task Analysis, Language Tests, Evaluators, Rating Scales
Emma Bruce; Karen Dunn; Tony Clark – Language Testing, 2025
Several high-stakes English proficiency tests including but not limited to IELTS, PTE Academic, and TOEFL iBT recommend a 2-year time limit on validity for score usage. Although this timeframe provides a useful rule-of-thumb for the recency of testing, it can have far-reaching consequences. In response to stakeholder queries around IELTS validity…
Descriptors: High Stakes Tests, Language Tests, Test Validity, Scores
Paula Elosua – Language Assessment Quarterly, 2024
In sociolinguistic contexts where standardized languages coexist with regional dialects, the study of differential item functioning is a valuable tool for examining certain linguistic uses or varieties as threats to score validity. From an ecological perspective, this paper describes three stages in the study of differential item functioning…
Descriptors: Reading Tests, Reading Comprehension, Scores, Test Validity
Hakyung Sung; Sooyeon Cho; Kristopher Kyle – Language Assessment Quarterly, 2024
Lexical diversity (LD) is an important indicator of second language lexical development. Much research has investigated LD indices, with a focus on learners of English. However, further research is needed in languages that are typologically distinct from English, such as Korean. In this study, we evaluated the reliability and validity of LD…
Descriptors: Second Language Learning, Korean, Persuasive Discourse, Language Tests
Apichat Khamboonruang – Language Testing in Asia, 2025
The Chulalongkorn University Language Institute (CULI) test was developed as a local standardised test of English for professional and international communication. To ensure that the CULI test fulfils its intended purposes, this study employed Kane's argument-based validation and Rasch measurement approaches to construct the validity argument for the…
Descriptors: Universities, Second Language Learning, Second Language Instruction, Language Tests
Ramsey L. Cardwell; Steven W. Nydick; J.R. Lockwood; Alina A. von Davier – Language Testing, 2024
Applicants must often demonstrate adequate English proficiency when applying to postsecondary institutions by taking an English language proficiency test, such as the TOEFL iBT, IELTS Academic, or Duolingo English Test (DET). Concordance tables aim to provide equivalent scores across multiple assessments, helping admissions officers to make fair…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Language Proficiency
Schmitt, Norbert; Nation, Paul; Kremmel, Benjamin – Language Teaching, 2020
Recently, a large number of vocabulary tests have been made available to language teachers, testers, and researchers. Unfortunately, most of them have been launched with inadequate validation evidence. The field of language testing has become increasingly more rigorous in the area of test validation, but developers of vocabulary tests have…
Descriptors: Test Construction, Test Validity, Language Tests, Test Use
Choi, Yun Deok – Language Testing in Asia, 2022
A much-debated question in the L2 assessment field is if computer familiarity should be considered a potential source of construct-irrelevant variance in computer-based writing (CBW) tests. This study aims to make a partial validity argument for an online source-based writing test (OSWT) designed for English placement testing (EPT), focusing on…
Descriptors: Test Validity, Scores, Computer Assisted Testing, English (Second Language)
Prentza, Alexandra; Tafiadis, Dionysios; Chondrogianni, Vasiliki; Tsimpli, Ianthi-Maria – Journal of Psycholinguistic Research, 2022
This study provides a preliminary validation of a Greek Sentence Repetition Task (SRT) with a sample of 110 monolingual and bilingual typically developing (TLD) children and examines the test's ability to distinguish between Greek monolingual children and age-matched Albanian-Greek bilinguals using a Receiver Operating Characteristics (ROC)…
Descriptors: Greek, Sentences, Repetition, Monolingualism
Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025
This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…
Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests
Pauline Frizelle; Ana Oliveira-Buckley; Tricia Biancone; Jorge Oliveira; Paul Fletcher; Dorothy V. M. Bishop; Cristina McKean – International Journal of Language & Communication Disorders, 2025
Introduction: The present study investigated English-speaking 5-9 year olds' (n = 600, normative sample) comprehension of relative, adverbial and complement clauses using the Test of Complex Syntax-Electronic (TECS-E), an online interactive assessment with strong test-retest reliability, concurrent validity and internal consistency. Method: Using…
Descriptors: Syntax, Child Language, Young Children, Language Tests
Daniel R. Isbell; Dustin Crowther; Hitoshi Nishizawa – Language Testing, 2024
The extrapolation of test scores to a target domain - that is, association between test performances and relevant real-world outcomes - is critical to valid score interpretation and use. This study examined the relationship between Duolingo English Test (DET) speaking scores and university stakeholders' evaluation of DET speaking performances. A…
Descriptors: Language Proficiency, Language Tests, Higher Education, Stakeholders
Jeffrey Stewart; Henrik Gyllstad; Christopher Nicklin; Stuart McLean – Language Testing, 2024
The purpose of this paper is to (a) establish whether meaning recall and meaning recognition item formats test psychometrically distinct constructs of vocabulary knowledge which measure separate skills, and, if so, (b) determine whether each construct possesses unique properties predictive of L2 reading proficiency. Factor analyses and…
Descriptors: Vocabulary Development, Psychometrics, Language Tests, Recall (Psychology)
Jieun Kim; Daniel Richard Isbell – Language Assessment Quarterly, 2024
The ACTFL Assessment of Performance Toward Proficiency in Languages (AAPPL, https://www.actfl.n.d.org/assessments/k-12-assessments/aappl) assesses proficiency in 11 languages for students in grades 3 to 12 and is often used to award the Seal of Biliteracy. While arguments for the valid interpretation and uses of the AAPPL have previously been…
Descriptors: Language Tests, Second Language Learning, Second Language Instruction, Language Proficiency
Hoeve, Karen B. – Language Testing in Asia, 2022
High stakes test-based accountability systems primarily rely on aggregates and derivatives of scores from tests that were originally developed to measure individual student proficiency in subject areas such as math, reading/language arts, and now English language proficiency. Current validity models do not explicitly address this use of aggregate…
Descriptors: High Stakes Tests, Language Tests, Accountability, Educational Assessment