Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 14 |
Descriptor
Source
Language Testing | 17 |
Author
Alderson, J. Charles | 1 |
Blood, Ian A. | 1 |
Brown, James Dean | 1 |
Brunfaut, Tineke | 1 |
Campfield, Dorota E. | 1 |
Cho, Yeonsuk | 1 |
David, Gergely | 1 |
Dunlea, Jamie | 1 |
Duyen Thi Bich Nguyen | 1 |
Eberharter, Kathrin | 1 |
Filipi, Anna | 1 |
More ▼ |
Publication Type
Journal Articles | 17 |
Reports - Research | 15 |
Tests/Questionnaires | 4 |
Reports - Evaluative | 2 |
Audience
Location
Japan | 3 |
Europe | 2 |
Australia | 1 |
Hungary | 1 |
Iran (Tehran) | 1 |
Poland | 1 |
Russia | 1 |
Slovenia | 1 |
Turkey | 1 |
United Kingdom | 1 |
Vietnam | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 4 |
International English… | 1 |
What Works Clearinghouse Rating
Holzknecht, Franz; McCray, Gareth; Eberharter, Kathrin; Kremmel, Benjamin; Zehentner, Matthias; Spiby, Richard; Dunlea, Jamie – Language Testing, 2021
Studies from various disciplines have reported that spatial location of options in relation to processing order impacts the ultimate choice of the option. A large number of studies have found a primacy effect, that is, the tendency to prefer the first option. In this paper we report on evidence that position of the key in four-option…
Descriptors: Language Tests, Test Items, Multiple Choice Tests, Listening Comprehension Tests
Hung Tan Ha; Duyen Thi Bich Nguyen; Tim Stoeckel – Language Testing, 2024
Word frequency has a long history of being considered the most important predictor of word difficulty and has served as a guideline for several aspects of second language vocabulary teaching, learning, and assessment. However, recent empirical research has challenged the supremacy of frequency as a predictor of word difficulty. Accordingly,…
Descriptors: Word Frequency, Vocabulary Skills, Second Language Learning, Second Language Instruction
Powers, Donald; Schedl, Mary; Papageorgiou, Spiros – Language Testing, 2017
The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…
Descriptors: English (Second Language), Second Language Learning, Language Proficiency, Scores
Rukthong, Anchana; Brunfaut, Tineke – Language Testing, 2020
Integrated test tasks, such as listening-to-speak or reading-to-write, are increasingly used in second language assessment despite relatively limited empirical insights into what they assess. Most research on integrated tasks has primarily focused on the productive skills involved; studies exploring the receptive skills mostly investigated tasks…
Descriptors: Listening Comprehension Tests, Recall (Psychology), Oral Language, Linguistic Input
Cho, Yeonsuk; Blood, Ian A. – Language Testing, 2020
In this study, we examined how much change in "TOEFL® Primary™" listening and reading scores can be expected in relation to the time interval between test administrations. The test records of 5213 young learners of English (aged 8-13 years) in Japan and Turkey who repeated the tests were analyzed to examine test scores as a function of…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores
Košak-Babuder, Milena; Kormos, Judit; Ratajczak, Michael; Pižorn, Karmen – Language Testing, 2019
One of the special arrangements in testing contexts is to allow dyslexic students to listen to the text while they read. In our study, we investigated the effect of read-aloud assistance on young English learners' language comprehension scores. We also examined whether students with dyslexia identification benefit from this assistance differently…
Descriptors: Dyslexia, Identification, Scores, English (Second Language)
Khabbazbashi, Nahal – Language Testing, 2017
This study explores the extent to which topic and background knowledge of topic affect spoken performance in a high-stakes speaking test. It is argued that evidence of a substantial influence may introduce construct-irrelevant variance and undermine test fairness. Data were collected from 81 non-native speakers of English who performed on 10…
Descriptors: Speech Tests, High Stakes Tests, English (Second Language), Language Proficiency
Trace, Jonathan; Brown, James Dean; Janssen, Gerriet; Kozhevnikova, Liudmila – Language Testing, 2017
Cloze tests have been the subject of numerous studies regarding their function and use in both first language and second language contexts (e.g., Jonz & Oller, 1994; Watanabe & Koyama, 2008). From a validity standpoint, one area of investigation has been the extent to which cloze tests measure reading ability beyond the sentence level.…
Descriptors: Cloze Procedure, Language Tests, Test Items, Item Analysis
Campfield, Dorota E. – Language Testing, 2017
This paper reports a post-hoc analysis of the influence of lexical difficulty of cue sentences on performance in an elicited imitation (EI) task to assess oral production skills for 645 child L2 English learners in instructional settings. This formed part of a large-scale investigation into effectiveness of foreign language teaching in Polish…
Descriptors: Difficulty Level, Second Language Learning, Second Language Instruction, Elementary School Students
Suzuki, Yuichi – Language Testing, 2015
Self-assessment has been used to assess second language proficiency; however, as sources of measurement errors vary, they may threaten the validity and reliability of the tools. The present paper investigated the role of experiences in using Japanese as a second language in the naturalistic acquisition context on the accuracy of the…
Descriptors: Self Evaluation (Individuals), Error of Measurement, Japanese, Second Language Learning
Kuiken, Folkert; Vedder, Ineke – Language Testing, 2014
This study investigates the relationship in L2 writing between raters' judgments of communicative adequacy and linguistic complexity by means of six-point Likert scales, and general measures of linguistic performance. The participants were 39 learners of Italian and 32 of Dutch, who wrote two short argumentative essays. The same writing tasks…
Descriptors: Writing Evaluation, Second Language Learning, Evaluators, Native Language
Gao, Lingyun; Rogers, W. Todd – Language Testing, 2011
The purpose of this study was to explore whether the results of Tree Based Regression (TBR) analyses, informed by a validated cognitive model, would enhance the interpretation of item difficulties in terms of the cognitive processes involved in answering the reading items included in two forms of the Michigan English Language Assessment Battery…
Descriptors: Test Items, Reading Tests, Item Analysis, Reading Processes
Filipi, Anna – Language Testing, 2012
The Assessment of Language Competence (ALC) certificates is an annual, international testing program developed by the Australian Council for Educational Research to test the listening and reading comprehension skills of lower to middle year levels of secondary school. The tests are developed for three levels in French, German, Italian and…
Descriptors: Listening Comprehension Tests, Item Response Theory, Statistical Analysis, Foreign Countries
David, Gergely – Language Testing, 2007
Some educational contexts almost mandate the application of multiple-choice (MC) testing techniques, even if they are deplored by many practitioners in the field. In such contexts especially, research into how well these types of item perform and how their performance may be characterised is both appropriate and desirable. The focus of this paper…
Descriptors: Student Evaluation, Grammar, Language Tests, Test Items

Perkins, Kyle; And Others – Language Testing, 1995
This article reports the results of using a three-layer back propagation artificial neural network to predict item difficulty in a reading comprehension test. Three classes of variables were examined: text structure, propositional analysis, and cognitive demand. Results demonstrate that the networks can consistently predict item difficulty. (JL)
Descriptors: Artificial Intelligence, Difficulty Level, English (Second Language), Language Tests
Previous Page | Next Page »
Pages: 1 | 2