Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 22 |
Descriptor
Source
Language Testing | 32 |
Author
Cohen, Andrew D. | 2 |
Coombe, Christine | 2 |
Liu, Jianda | 2 |
Ai, Haiyang | 1 |
Al-Hamly, Mashael | 1 |
Allan, Alastair I. C. G. | 1 |
Allan, Alistair | 1 |
Baghaei, Purya | 1 |
Batty, Aaron Olaf | 1 |
Bridgeman, Brent | 1 |
Chapelle, Carol | 1 |
More ▼ |
Publication Type
Journal Articles | 32 |
Reports - Research | 25 |
Reports - Evaluative | 7 |
Speeches/Meeting Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 12 |
Postsecondary Education | 6 |
Secondary Education | 4 |
High Schools | 2 |
Grade 12 | 1 |
Audience
Location
China | 2 |
Japan | 2 |
South Korea | 2 |
Canada | 1 |
Europe | 1 |
Hungary | 1 |
Kuwait | 1 |
Russia | 1 |
Thailand | 1 |
United Arab Emirates | 1 |
United States | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 4 |
International English… | 1 |
Test of English for… | 1 |
What Works Clearinghouse Rating
Baghaei, Purya; Christensen, Karl Bang – Language Testing, 2023
C-tests are gap-filling tests mainly used as rough and economical measures of second-language proficiency for placement and research purposes. A C-test usually consists of several short independent passages where the second half of every other word is deleted. Owing to their interdependent structure, C-test items violate the local independence…
Descriptors: Item Response Theory, Language Tests, Language Proficiency, Second Language Learning
Liao, Ray J. T. – Language Testing, 2023
Among the variety of selected response formats used in L2 reading assessment, multiple-choice (MC) is the most commonly adopted, primarily due to its efficiency and objectiveness. Given the impact of assessment results on teaching and learning, it is necessary to investigate the degree to which the MC format reliably measures learners' L2 reading…
Descriptors: Reading Tests, Language Tests, Second Language Learning, Second Language Instruction
Van Moere, Alistair; Hanlon, Sean – Language Testing, 2020
In language assessment and in educational measurement more broadly, there is a tendency to interpret scores from single-administration tests as accurate indicators of a latent trait (e.g., reading ability). Even in contexts where learners receive multiple formative assessments throughout the year, estimates of student ability are determined based…
Descriptors: Bayesian Statistics, Measurement, Accuracy, English (Second Language)
Wind, Stefanie A. – Language Testing, 2023
Researchers frequently evaluate rater judgments in performance assessments for evidence of differential rater functioning (DRF), which occurs when rater severity is systematically related to construct-irrelevant student characteristics after controlling for student achievement levels. However, researchers have observed that methods for detecting…
Descriptors: Evaluators, Decision Making, Student Characteristics, Performance Based Assessment
Holzknecht, Franz; McCray, Gareth; Eberharter, Kathrin; Kremmel, Benjamin; Zehentner, Matthias; Spiby, Richard; Dunlea, Jamie – Language Testing, 2021
Studies from various disciplines have reported that spatial location of options in relation to processing order impacts the ultimate choice of the option. A large number of studies have found a primacy effect, that is, the tendency to prefer the first option. In this paper we report on evidence that position of the key in four-option…
Descriptors: Language Tests, Test Items, Multiple Choice Tests, Listening Comprehension Tests
Zhang, Xian; Liu, Jianda; Ai, Haiyang – Language Testing, 2020
The main purpose of this study is to investigate guessing in the Yes/No (YN) format vocabulary test. One-hundred-and-five university students took a YN test, a translation task and a multiple-choice vocabulary size test (MC VST). With matched lexical properties between the real words and the pseudowords, pseudowords could index guessing in the YN…
Descriptors: Vocabulary Development, Language Tests, Test Format, College Students
Löwenadler, John – Language Testing, 2019
This study aims to investigate patterns of variation in the interplay of L2 language ability and general reading comprehension skills in L2 reading, by comparing item-level effects of test-takers' results on L1 and L2 reading comprehension tests. The material comes from more than 500,000 people tested on L1 (Swedish) and L2 (English) in the…
Descriptors: Swedish, English (Second Language), Second Language Learning, Second Language Instruction
Poehner, Matthew E.; Zhang, Jie; Lu, Xiaofei – Language Testing, 2015
Dynamic assessment (DA) derives from the sociocultural theory of mind as elaborated by Russian psychologist L. S. Vygotsky. By offering mediation when individuals experience difficulties and carefully tracing their responsiveness, Vygotsky (1998) proposed that diagnoses may uncover abilities that have fully formed as well as those still in the…
Descriptors: Computer Assisted Testing, Second Language Learning, Reading Tests, Listening Comprehension Tests
Batty, Aaron Olaf – Language Testing, 2015
The rise in the affordability of quality video production equipment has resulted in increased interest in video-mediated tests of foreign language listening comprehension. Although research on such tests has continued fairly steadily since the early 1980s, studies have relied on analyses of raw scores, despite the growing prevalence of item…
Descriptors: Listening Comprehension Tests, Comparative Analysis, Video Technology, Audio Equipment
Coombe, Christine; Davidson, Peter – Language Testing, 2014
The Common Educational Proficiency Assessment (CEPA) is a large-scale, high-stakes, English language proficiency/placement test administered in the United Arab Emirates to Emirati nationals in their final year of secondary education or Grade 12. The purpose of the CEPA is to place students into English classes at the appropriate government…
Descriptors: Language Tests, High Stakes Tests, English (Second Language), Second Language Learning
Elgort, Irina – Language Testing, 2013
This study examines the development and evaluation of a bilingual Vocabulary Size Test (VST, Nation, 2006). A bilingual (English-Russian) test was developed and administered to 121 intermediate proficiency EFL learners (native speakers of Russian), alongside the original monolingual (English-only) version of the test. A comparison of the bilingual…
Descriptors: Test Construction, Vocabulary, Language Tests, English
Stubbe, Raymond – Language Testing, 2012
"Pseudowords", or non-real words, were introduced to the Yes/No (YN) vocabulary test format to provide a means of checking for overestimation of word knowledge by test takers. The purpose of this study is to assess the assumption that more pseudoword checks (false alarms) indicate more instances of overestimation of word knowledge in YN…
Descriptors: Academic Ability, English (Second Language), Multiple Choice Tests, Test Results
Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela – Language Testing, 2012
Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…
Descriptors: Undergraduate Students, Speech Communication, Rating Scales, Scoring
Currie, Michael; Chiramanee, Thanyapa – Language Testing, 2010
Noting the widespread use of multiple-choice items in tests in English language education in Thailand, this study compared their effect against that of constructed-response items. One hundred and fifty-two university undergraduates took a test of English structure first in constructed-response format, and later in three, stem-equivalent…
Descriptors: Experimental Groups, Multiple Choice Tests, Foreign Countries, Language Tests
In'nami, Yo; Koizumi, Rie – Language Testing, 2009
A meta-analysis was conducted on the effects of multiple-choice and open-ended formats on L1 reading, L2 reading, and L2 listening test performance. Fifty-six data sources located in an extensive search of the literature were the basis for the estimates of the mean effect sizes of test format effects. The results using the mixed effects model of…
Descriptors: Test Format, Listening Comprehension Tests, Multiple Choice Tests, Program Effectiveness