Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 6 |
Since 2006 (last 20 years) | 11 |
Descriptor
Source
Language Testing | 11 |
Author
Alvarez, Marta E. | 1 |
Han, Chao | 1 |
Hitoshi Nishizawa | 1 |
John Pill | 1 |
Kang, Okim | 1 |
Kermad, Alyssa | 1 |
Kim, Youn-Hee | 1 |
Li, Hongli | 1 |
Lidster, Ryan | 1 |
Mizumoto, Atsushi | 1 |
Munoz, Ana P. | 1 |
More ▼ |
Publication Type
Journal Articles | 11 |
Reports - Research | 11 |
Education Level
Higher Education | 6 |
Postsecondary Education | 3 |
Secondary Education | 1 |
Audience
Location
Australia | 1 |
Austria | 1 |
Canada (Montreal) | 1 |
China | 1 |
Japan | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 2 |
International English… | 1 |
What Works Clearinghouse Rating
Rebecca Sickinger; Tineke Brunfaut; John Pill – Language Testing, 2025
Comparative Judgement (CJ) is an evaluation method, typically conducted online, whereby a rank order is constructed, and scores calculated, from judges' pairwise comparisons of performances. CJ has been researched in various educational contexts, though only rarely in English as a Foreign Language (EFL) writing settings, and is generally agreed to…
Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction
Hitoshi Nishizawa – Language Testing, 2024
Corpus-based studies have offered the domain definition inference for test developers. Yet, corpus-based studies on temporal fluency measures (e.g., speech rate) have been limited, especially in the context of academic lecture settings. This made it difficult for test developers to sample representative fluency features to create authentic…
Descriptors: High Stakes Tests, Language Tests, Second Language Learning, Computer Assisted Testing
Han, Chao; Xiao, Xiaoyan – Language Testing, 2022
The quality of sign language interpreting (SLI) is a gripping construct among practitioners, educators and researchers, calling for reliable and valid assessment. There has been a diverse array of methods in the extant literature to measure SLI quality, ranging from traditional error analysis to recent rubric scoring. In this study, we want to…
Descriptors: Comparative Analysis, Sign Language, Deaf Interpreting, Evaluators
Mizumoto, Atsushi; Sasao, Yosuke; Webb, Stuart A. – Language Testing, 2019
The knowledge about affix plays a vital role in the development of word knowledge and vocabulary acquisition. A test for diagnostic information on the level of affix knowledge would be useful in order to inform the test users of what learners have gained or lacked in this integral component of vocabulary knowledge. This paper reports the…
Descriptors: Computer Assisted Testing, Adaptive Testing, College Students, English (Second Language)
Kang, Okim; Rubin, Don; Kermad, Alyssa – Language Testing, 2019
As a result of the fact that judgments of non-native speech are closely tied to social biases, oral proficiency ratings are susceptible to error because of rater background and social attitudes. In the present study we seek first to estimate the variance attributable to rater background and attitudinal variables on novice raters' assessments of L2…
Descriptors: Evaluators, Second Language Learning, Language Tests, English (Second Language)
Shin, Sun-Young; Lidster, Ryan – Language Testing, 2017
In language programs, it is crucial to place incoming students into appropriate levels to ensure that course curriculum and materials are well targeted to their learning needs. Deciding how and where to set cutscores on placement tests is thus of central importance to programs, but previous studies in educational measurement disagree as to which…
Descriptors: Language Tests, English (Second Language), Standard Setting (Scoring), Student Placement
Li, Hongli; Suen, Hoi K. – Language Testing, 2013
Differential skill functioning (DSF) exists when examinees from different groups have different probabilities of successful performance in a certain subskill underlying the measured construct, given that they have the same ability on the overall construct. Using a DSF approach, this study examined the differences between two native language…
Descriptors: Native Language, Differences, Reading Skills, Reading Tests
Kim, Youn-Hee – Language Testing, 2009
This study used a mixed methods research approach to examine how native English-speaking (NS) and non-native English-speaking (NNS) teachers assess students' oral English performance. The evaluation behaviors of two groups of teachers (12 Canadian NS teachers and 12 Korean NNS teachers) were compared with regard to internal consistency, severity,…
Descriptors: Methods Research, Evaluation Criteria, Oral English, English (Second Language)
Wagner, Elvis – Language Testing, 2010
Video is widely used in the teaching of L2 listening, and SLA researchers have argued that the visual components of spoken texts are useful for the listener in comprehending aural information. Yet video texts are rarely used on tests of L2 listening ability, perhaps in part due to the belief that including the visual channel involves assessing…
Descriptors: Experimental Groups, Control Groups, Listening Comprehension, Quasiexperimental Design
Wigglesworth, Gillian; Storch, Neomy – Language Testing, 2009
The assessment of oral language is now quite commonly done in pairs or groups, and there is a growing body of research which investigates the related issues (e.g. May, 2007). Writing generally tends to be thought of as an individual activity, although a small number of studies have documented the advantages of collaboration in writing in the…
Descriptors: Formative Evaluation, Second Language Learning, Oral Language, Collaborative Writing
Munoz, Ana P.; Alvarez, Marta E. – Language Testing, 2010
This article reports the results of a research study to determine the washback effect of an oral assessment system on some areas of the teaching and learning of English as a Foreign Language (EFL). The research combined quantitative and qualitative research methods within a comparative study between an experimental group and a comparison group.…
Descriptors: Experimental Groups, Qualitative Research, Student Surveys, Program Effectiveness