Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 83 |
| Since 2017 (last 10 years) | 173 |
| Since 2007 (last 20 years) | 360 |
Descriptor
Source
| Language Testing | 539 |
Author
| Davies, Alan | 8 |
| Bachman, Lyle F. | 7 |
| Elder, Catherine | 7 |
| Cheng, Liying | 6 |
| Xi, Xiaoming | 6 |
| Yan, Xun | 6 |
| Alderson, J. Charles | 5 |
| Aryadoust, Vahid | 5 |
| Cho, Yeonsuk | 5 |
| Ginther, April | 5 |
| Knoch, Ute | 5 |
| More ▼ | |
Publication Type
Education Level
Audience
Location
| Japan | 33 |
| China | 30 |
| Australia | 23 |
| United Kingdom | 15 |
| Canada | 14 |
| South Korea | 13 |
| Europe | 7 |
| Germany | 6 |
| Hong Kong | 6 |
| Netherlands | 6 |
| New Zealand | 5 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Elementary and Secondary… | 1 |
| Lau v Nichols | 1 |
| Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
In'nami, Yo; Koizumi, Rie – Language Testing, 2016
We addressed Deville and Chalhoub-Deville's (2006), Schoonen's (2012), and Xi and Mollaun's (2006) call for research into the contextual features that are considered related to person-by-task interactions in the framework of generalizability theory in two ways. First, we quantitatively synthesized the generalizability studies to determine the…
Descriptors: Evaluators, Second Language Learning, Writing Skills, Oral Language
Römer, Ute – Language Testing, 2017
This paper aims to connect recent corpus research on phraseology with current language testing practice. It discusses how corpora and corpus-analytic techniques can illuminate central aspects of speech and help in conceptualizing the notion of lexicogrammar in second language speaking assessment. The description of speech and some of its core…
Descriptors: Language Tests, Grammar, English (Second Language), Second Language Learning
Cheng, Junyu; Matthews, Joshua – Language Testing, 2018
This study explores the constructs that underpin three different measures of vocabulary knowledge and investigates the degree to which these three measures correlate with, and are able to predict, measures of second language (L2) listening and reading. Word frequency structured vocabulary tests tapping "receptive/orthographic (RecOrth)…
Descriptors: Listening Comprehension, Reading Comprehension, Reading Tests, Correlation
Yi, Yeon-Sook – Language Testing, 2017
The present study examines the relative importance of attributes within and across items by applying four cognitive diagnostic assessment models. The current study utilizes the function of the models that can indicate inter-attribute relationships that reflect the response behaviors of examinees to analyze scored test-taker responses to four forms…
Descriptors: Second Language Learning, Reading Comprehension, Listening Comprehension, Language Tests
Hoekje, Barbara – Language Testing, 2016
This commentary argues that the OET research raises inescapable contradictions in trying to separate "language" from "communication" within a weak performance test and advocates for reconceptualizing the legitimate domain of "language" more widely, reclaiming the full potential of the communicative competence…
Descriptors: Language Tests, Languages for Special Purposes, Second Language Learning, Communicative Competence (Languages)
Elicited Imitation as a Measure of Second Language Proficiency: A Narrative Review and Meta-Analysis
Yan, Xun; Maeda, Yukiko; Lv, Jing; Ginther, April – Language Testing, 2016
Elicited imitation (EI) has been widely used to examine second language (L2) proficiency and development and was an especially popular method in the 1970s and early 1980s. However, as the field embraced more communicative approaches to both instruction and assessment, the use of EI diminished, and the construct-related validity of EI scores as a…
Descriptors: Second Language Learning, Language Proficiency, Meta Analysis, Effect Size
Kyle, Kristopher; Crossley, Scott – Language Testing, 2017
Over the past 45 years, the construct of syntactic sophistication has been assessed in L2 writing using what Bulté and Housen (2012) refer to as absolute complexity (Lu, 2011; Ortega, 2003; Wolfe-Quintero, Inagaki, & Kim, 1998). However, it has been argued that making inferences about learners based on absolute complexity indices (e.g., mean…
Descriptors: Syntax, Verbs, Second Language Learning, Word Frequency
Suvorov, Ruslan – Language Testing, 2015
Investigating how visuals affect test takers' performance on video-based L2 listening tests has been the focus of many recent studies. While most existing research has been based on test scores and self-reported verbal data, few studies have examined test takers' viewing behavior (Ockey, 2007; Wagner, 2007, 2010a). To address this gap, in the…
Descriptors: Eye Movements, Second Language Learning, Listening Comprehension, Video Technology
LaFlair, Geoffrey T.; Staples, Shelley – Language Testing, 2017
Investigations of the validity of a number of high-stakes language assessments are conducted using an argument-based approach, which requires evidence for inferences that are critical to score interpretation (Chapelle, Enright, & Jamieson, 2008b; Kane, 2013). The current study investigates the extrapolation inference for a high-stakes test of…
Descriptors: Computational Linguistics, Language Tests, Test Validity, Inferences
Isaacs, Talia; Trofimovich, Pavel; Foote, Jennifer Ann – Language Testing, 2018
There is growing research on the linguistic features that most contribute to making second language (L2) speech easy or difficult to understand. Comprehensibility, which is usually captured through listener judgments, is increasingly viewed as integral to the L2 speaking construct. However, there are shortcomings in how this construct is…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Language of Instruction
Ginther, April; Yan, Xun – Language Testing, 2018
This study examines the predictive validity of the TOEFL iBT with respect to academic achievement as measured by the first-year grade point average (GPA) of Chinese students at Purdue University, a large, public, Research I institution in Indiana, USA. Correlations between GPA, TOEFL iBT total and subsection scores were examined on 1990 mainland…
Descriptors: Correlation, Computer Assisted Testing, Profiles, English (Second Language)
Harding, Luke; Alderson, J. Charles; Brunfaut, Tineke – Language Testing, 2015
Alderson, Brunfaut and Harding (2014) recently investigated how diagnosis is practised across a range of professions in order to develop a tentative framework for a theory of diagnosis in second or foreign language (SFL) assessment. In articulating this framework, a set of five broad principles were proposed, encompassing the entire enterprise of…
Descriptors: Diagnostic Tests, Language Tests, Reading Tests, Second Language Learning
Trace, Jonathan; Brown, James Dean; Janssen, Gerriet; Kozhevnikova, Liudmila – Language Testing, 2017
Cloze tests have been the subject of numerous studies regarding their function and use in both first language and second language contexts (e.g., Jonz & Oller, 1994; Watanabe & Koyama, 2008). From a validity standpoint, one area of investigation has been the extent to which cloze tests measure reading ability beyond the sentence level.…
Descriptors: Cloze Procedure, Language Tests, Test Items, Item Analysis
Piper, Benjamin; Zuilkowski, Stephanie Simmons – Language Testing, 2016
Despite rapid growth in literacy-related programmes and evaluation in sub-Saharan Africa, little critical attention has been paid to the relevance of assumptions that underlie existing assessment methods. This study focuses on the issue of timing in the assessment of oral reading fluency, a critical component of successful reading (Chard, Vaughn,…
Descriptors: Role, Language Tests, Timed Tests, Foreign Countries
Kyle, Kristopher; Crossley, Scott A.; McNamara, Danielle S. – Language Testing, 2016
This study explores the construct validity of speaking tasks included in the TOEFL iBT (e.g., integrated and independent speaking tasks). Specifically, advanced natural language processing (NLP) tools, MANOVA difference statistics, and discriminant function analyses (DFA) are used to assess the degree to which and in what ways responses to these…
Descriptors: Construct Validity, Natural Language Processing, Speech Skills, Speech Acts

Peer reviewed
Direct link
