Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 8 |
Since 2006 (last 20 years) | 14 |
Source
Language Testing | 15 |
Author
Kyle, Kristopher | 2 |
Choe, Ann Tai | 1 |
Crossley, Scott | 1 |
Crossley, Scott A. | 1 |
Davidson, Fred | 1 |
Dimova, Slobodanka | 1 |
Eguchi, Masaki | 1 |
Fulcher, Glenn | 1 |
Ginther, April | 1 |
Gyllstad, Henrik | 1 |
Hitoshi Nishizawa | 1 |
Publication Type
Journal Articles | 15 |
Reports - Research | 10 |
Reports - Evaluative | 4 |
Opinion Papers | 2 |
Education Level
Higher Education | 3 |
Postsecondary Education | 3 |
Elementary Secondary Education | 1 |
Grade 9 | 1 |
High Schools | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Location
California | 1 |
Japan | 1 |
South Korea | 1 |
Assessments and Surveys
Test of English as a Foreign… | 5 |
International English… | 1 |
Michigan Test of English… | 1 |
Test of English for… | 1 |
Hitoshi Nishizawa – Language Testing, 2024
Corpus-based studies have offered the domain definition inference for test developers. Yet, corpus-based studies on temporal fluency measures (e.g., speech rate) have been limited, especially in the context of academic lecture settings. This made it difficult for test developers to sample representative fluency features to create authentic…
Descriptors: High Stakes Tests, Language Tests, Second Language Learning, Computer Assisted Testing
Kyle, Kristopher; Eguchi, Masaki; Choe, Ann Tai; LaFlair, Geoff – Language Testing, 2022
In the realm of language proficiency assessments, the domain description inference and the extrapolation inference are key components of a validity argument. Biber et al.'s description of the lexicogrammatical features of the spoken and written registers in the T2K-SWAL corpus has served as support for the TOEFL iBT test's domain description and…
Descriptors: Language Variation, Written Language, Speech Communication, Inferences
Kim, Minkyung; Nam, Yunjung; Crossley, Scott A. – Language Testing, 2022
This study investigated the effects of working memory capacity (WMC), first language (L1) syllogistic inferencing ability, and second-language (L2) linguistic knowledge on L2 listening comprehension for passages of different lengths. Participants were 193 Korean ninth-grade learners of English. A path analysis was used to examine multivariate…
Descriptors: Native Language, Short Term Memory, Listening Comprehension, Second Language Learning
Gyllstad, Henrik; McLean, Stuart; Stewart, Jeffrey – Language Testing, 2021
The last three decades have seen an increase of tests aimed at measuring an individual's vocabulary level or size. The target words used in these tests are typically sampled from word frequency lists, which are in turn based on language corpora. Conventionally, test developers sample items from frequency bands of 1000 words; different tests employ…
Descriptors: Vocabulary Development, Sample Size, Language Tests, Test Items
Xi, Xiaoming – Language Testing, 2017
In recent years, continuing advances in technology have increased the capacity to automate the extraction of a range of linguistic features of texts and thus have provided the impetus for the substantial growth of corpus linguistics. While corpus linguistic tools and methods have been used extensively in second language learning research, they…
Descriptors: Computational Linguistics, Second Language Learning, Language Tests, Evaluation Methods
Löwenadler, John – Language Testing, 2019
This study aims to investigate patterns of variation in the interplay of L2 language ability and general reading comprehension skills in L2 reading, by comparing item-level effects of test-takers' results on L1 and L2 reading comprehension tests. The material comes from more than 500,000 people tested on L1 (Swedish) and L2 (English) in the…
Descriptors: Swedish, English (Second Language), Second Language Learning, Second Language Instruction
Kyle, Kristopher; Crossley, Scott – Language Testing, 2017
Over the past 45 years, the construct of syntactic sophistication has been assessed in L2 writing using what Bulté and Housen (2012) refer to as absolute complexity (Lu, 2011; Ortega, 2003; Wolfe-Quintero, Inagaki, & Kim, 1998). However, it has been argued that making inferences about learners based on absolute complexity indices (e.g., mean…
Descriptors: Syntax, Verbs, Second Language Learning, Word Frequency
LaFlair, Geoffrey T.; Staples, Shelley – Language Testing, 2017
Investigations of the validity of a number of high-stakes language assessments are conducted using an argument-based approach, which requires evidence for inferences that are critical to score interpretation (Chapelle, Enright, & Jamieson, 2008b; Kane, 2013). The current study investigates the extrapolation inference for a high-stakes test of…
Descriptors: Computational Linguistics, Language Tests, Test Validity, Inferences
Fulcher, Glenn; Davidson, Fred; Kemp, Jenny – Language Testing, 2011
Rating scale design and development for testing speaking is generally conducted using one of two approaches: the measurement-driven approach or the performance data-driven approach. The measurement-driven approach prioritizes the ordering of descriptors onto a single scale. Meaning is derived from the scaling methodology and the agreement of…
Descriptors: Speech Communication, Rating Scales, Inferences, English (Second Language)
Kunnan, Antony John – Language Testing, 2010
This paper presents the author's response to Xiaoming Xi's article titled "How do we go about investigating test fairness?" In this response, the author focuses on test fairness and Toulmin's model of argument structure, Xi's proposal, and the challenges the proposal brings. Xi proposes an approach to investigating test fairness to guide…
Descriptors: Persuasive Discourse, Inferences, Test Bias, Models
Ginther, April; Dimova, Slobodanka; Yang, Rui – Language Testing, 2010
Information provided by examination of the skills that underlie holistic scores can be used not only as supporting evidence for the validity of inferences associated with performance tests but also as a way to improve the scoring rubrics, descriptors, and benchmarks associated with scoring scales. As fluency is considered a critical, perhaps…
Descriptors: Performance Tests, Scoring Rubrics, Measures (Individuals), Scoring
Walters, F. Scott – Language Testing, 2007
Speech act theory-based, second language pragmatics testing (SLPT) poses problems for validation due to a lack of correspondence with empirical conversational data. Since conversation analysis (CA) provides a richer and more accurate account of language behavior, it may be preferred as a basis for SLPT development. However, applying CA methodology…
Descriptors: Inferences, Testing, Speech Acts, Language Tests
Song, Min-Young – Language Testing, 2008
This paper concerns the divisibility of comprehension subskills measured in L2 listening and reading tests. Motivated by the administration of the new Web-based English as a Second Language Placement Exam (WB-ESLPE) at UCLA, this study addresses the following research questions: first, to what extent do the WB-ESLPE listening and reading items…
Descriptors: Structural Equation Models, Second Language Learning, Reading Tests, Inferences
Llosa, Lorena – Language Testing, 2007
The use of standards-based classroom assessments to test English learners' language proficiency is increasingly prevalent in the United States and many other countries. In a large urban school district in California, for example, a classroom assessment is used to make high-stakes decisions about English learners' progress from one level to the…
Descriptors: Urban Schools, Multitrait Multimethod Techniques, Standardized Tests, Construct Validity

Swain, Merrill – Language Testing, 2001
Examines one aspect of the many interfaces between second language (L2) learning and L2 testing: the oral interaction, or dialogue, that occurs within small groups. Argues, from within a sociocultural theory of mind, that performance in a group is jointly constructed and distributed across the participants. (Author/VWL)
Descriptors: Dialogs (Language), Inferences, Interaction, Language Tests