Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 11 |
Since 2006 (last 20 years) | 20 |
Descriptor
Inferences | 20 |
Language Tests | 18 |
English (Second Language) | 14 |
Second Language Learning | 14 |
Scores | 9 |
Language Proficiency | 8 |
Computational Linguistics | 6 |
Comparative Analysis | 5 |
Correlation | 5 |
Test Validity | 5 |
Validity | 5 |
More ▼ |
Source
Language Testing | 20 |
Author
Kyle, Kristopher | 2 |
Staples, Shelley | 2 |
Bonifacci, Paola | 1 |
Bouwer, Renske | 1 |
Béguin, Anton | 1 |
Chapelle, Carol A. | 1 |
Choe, Ann Tai | 1 |
Ciancaleoni, Matteo | 1 |
Crossley, Scott | 1 |
Crossley, Scott A. | 1 |
Davidson, Fred | 1 |
More ▼ |
Publication Type
Journal Articles | 20 |
Reports - Research | 12 |
Reports - Evaluative | 6 |
Opinion Papers | 2 |
Information Analyses | 1 |
Reports - Descriptive | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 3 |
Postsecondary Education | 3 |
Elementary Education | 2 |
Elementary Secondary Education | 1 |
Grade 9 | 1 |
High Schools | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Location
California | 1 |
Italy | 1 |
Japan | 1 |
Netherlands | 1 |
South Korea | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 5 |
International English… | 1 |
Michigan Test of English… | 1 |
Test of English for… | 1 |
What Works Clearinghouse Rating
Hitoshi Nishizawa – Language Testing, 2024
Corpus-based studies have offered the domain definition inference for test developers. Yet, corpus-based studies on temporal fluency measures (e.g., speech rate) have been limited, especially in the context of academic lecture settings. This made it difficult for test developers to sample representative fluency features to create authentic…
Descriptors: High Stakes Tests, Language Tests, Second Language Learning, Computer Assisted Testing
Yan, Xun; Staples, Shelley – Language Testing, 2020
The argument-based approach to validity (Kane, 2013) focuses on two steps: (1) making claims about the proposed interpretation and use of test scores as a coherent, interpretive argument; and (2) evaluating those claims based on theoretical and empirical evidence related to test performances and scores. This paper discusses the role of…
Descriptors: Writing Tests, Language Tests, Language Proficiency, Test Validity
Knoch, Ute; Chapelle, Carol A. – Language Testing, 2018
Argument-based validation requires test developers and researchers to specify what is entailed in test interpretation and use. Doing so has been shown to yield advantages (Chapelle, Enright, & Jamieson, 2010), but it also requires an analysis of how the concerns of language testers can be conceptualized in the terms used to construct a…
Descriptors: Test Validity, Language Tests, Evaluation Research, Rating Scales
Kyle, Kristopher; Eguchi, Masaki; Choe, Ann Tai; LaFlair, Geoff – Language Testing, 2022
In the realm of language proficiency assessments, the domain description inference and the extrapolation inference are key components of a validity argument. Biber et al.'s description of the lexicogrammatical features of the spoken and written registers in the T2K-SWAL corpus has served as support for the TOEFL iBT test's domain description and…
Descriptors: Language Variation, Written Language, Speech Communication, Inferences
Kim, Minkyung; Nam, Yunjung; Crossley, Scott A. – Language Testing, 2022
This study investigated the effects of working memory capacity (WMC), first language (L1) syllogistic inferencing ability, and second-language (L2) linguistic knowledge on L2 listening comprehension for passages of different lengths. Participants were 193 Korean ninth-grade learners of English. A path analysis was used to examine multivariate…
Descriptors: Native Language, Short Term Memory, Listening Comprehension, Second Language Learning
Gyllstad, Henrik; McLean, Stuart; Stewart, Jeffrey – Language Testing, 2021
The last three decades have seen an increase of tests aimed at measuring an individual's vocabulary level or size. The target words used in these tests are typically sampled from word frequency lists, which are in turn based on language corpora. Conventionally, test developers sample items from frequency bands of 1000 words; different tests employ…
Descriptors: Vocabulary Development, Sample Size, Language Tests, Test Items
Xi, Xiaoming – Language Testing, 2017
In recent years, continuing advances in technology have increased the capacity to automate the extraction of a range of linguistic features of texts and thus have provided the impetus for the substantial growth of corpus linguistics. While corpus linguistic tools and methods have been used extensively in second language learning research, they…
Descriptors: Computational Linguistics, Second Language Learning, Language Tests, Evaluation Methods
Löwenadler, John – Language Testing, 2019
This study aims to investigate patterns of variation in the interplay of L2 language ability and general reading comprehension skills in L2 reading, by comparing item-level effects of test-takers' results on L1 and L2 reading comprehension tests. The material comes from more than 500,000 people tested on L1 (Swedish) and L2 (English) in the…
Descriptors: Swedish, English (Second Language), Second Language Learning, Second Language Instruction
Tobia, Valentina; Ciancaleoni, Matteo; Bonifacci, Paola – Language Testing, 2017
In this study, two alternative theoretical models were compared, in order to analyze which of them best explains primary school children's text comprehension skills. The first one was based on the distinction between two types of answers requested by the comprehension test: local or global. The second model involved texts' input modality: written…
Descriptors: Elementary School Students, Language Tests, Foreign Countries, Comparative Analysis
Oller, John W., Jr. – Language Testing, 2012
Kane's argument-based framework is summarized and examined. He implicitly appeals to the backgrounded concepts of fairness and justice. From there it is a short distance to grounding the whole system in the mundane notion of truth. In fact, valid argument systems must depend on representations that are "true" by virtue of agreement with purported…
Descriptors: Scores, Validity, Test Interpretation, Cutting Scores
Kyle, Kristopher; Crossley, Scott – Language Testing, 2017
Over the past 45 years, the construct of syntactic sophistication has been assessed in L2 writing using what Bulté and Housen (2012) refer to as absolute complexity (Lu, 2011; Ortega, 2003; Wolfe-Quintero, Inagaki, & Kim, 1998). However, it has been argued that making inferences about learners based on absolute complexity indices (e.g., mean…
Descriptors: Syntax, Verbs, Second Language Learning, Word Frequency
LaFlair, Geoffrey T.; Staples, Shelley – Language Testing, 2017
Investigations of the validity of a number of high-stakes language assessments are conducted using an argument-based approach, which requires evidence for inferences that are critical to score interpretation (Chapelle, Enright, & Jamieson, 2008b; Kane, 2013). The current study investigates the extrapolation inference for a high-stakes test of…
Descriptors: Computational Linguistics, Language Tests, Test Validity, Inferences
Bouwer, Renske; Béguin, Anton; Sanders, Ted; van den Bergh, Huub – Language Testing, 2015
In the present study, aspects of the measurement of writing are disentangled in order to investigate the validity of inferences made on the basis of writing performance and to describe implications for the assessment of writing. To include genre as a facet in the measurement, we obtained writing scores of 12 texts in four different genres for each…
Descriptors: Writing Tests, Generalization, Scores, Writing Instruction
Kane, Michael – Language Testing, 2012
The argument-based approach to validation involves two steps; specification of the proposed interpretations and uses of the test scores as an interpretive argument, and the evaluation of the plausibility of the proposed interpretive argument. More ambitious interpretations and uses tend to involve an extended network of inferences and assumptions…
Descriptors: Testing, Language Tests, Inferences, Test Validity
Fulcher, Glenn; Davidson, Fred; Kemp, Jenny – Language Testing, 2011
Rating scale design and development for testing speaking is generally conducted using one of two approaches: the measurement-driven approach or the performance data-driven approach. The measurement-driven approach prioritizes the ordering of descriptors onto a single scale. Meaning is derived from the scaling methodology and the agreement of…
Descriptors: Speech Communication, Rating Scales, Inferences, English (Second Language)
Previous Page | Next Page »
Pages: 1 | 2