van Batenburg, Eline S. L.; Oostdam, Ron J.; van Gelderen, Amos J. S.; de Jong, Nivja H. – Language Testing, 2018
This article explores ways to assess interactional performance, and reports on the use of a test format that standardizes the interlocutor's linguistic and interactional contributions to the exchange. It describes the construction and administration of six scripted speech tasks (instruction, advice, and sales tasks) with pre-vocational learners (n…
Descriptors: Second Language Learning, Speech Tests, Interaction, Test Reliability
Davis, Larry – Language Testing, 2016
Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…
Descriptors: Evaluators, Oral Language, Scores, Language Tests
Lee, Shinhye; Winke, Paula – Language Testing, 2018
We investigated how young language learners process their responses on and perceive a computer-mediated, timed speaking test. Twenty 8-, 9-, and 10-year-old non-native English-speaking children (NNSs) and eight same-aged, native English-speaking children (NSs) completed seven computerized sample TOEFL® Primary™ speaking test tasks. We investigated…
Descriptors: Elementary School Students, Second Language Learning, Responses, Computer Assisted Testing
In'nami, Yo; Koizumi, Rie – Language Testing, 2016
We addressed Deville and Chalhoub-Deville's (2006), Schoonen's (2012), and Xi and Mollaun's (2006) call for research into the contextual features that are considered related to person-by-task interactions in the framework of generalizability theory in two ways. First, we quantitatively synthesized the generalizability studies to determine the…
Descriptors: Evaluators, Second Language Learning, Writing Skills, Oral Language
LaFlair, Geoffrey T.; Staples, Shelley – Language Testing, 2017
Investigations of the validity of a number of high-stakes language assessments are conducted using an argument-based approach, which requires evidence for inferences that are critical to score interpretation (Chapelle, Enright, & Jamieson, 2008b; Kane, 2013). The current study investigates the extrapolation inference for a high-stakes test of…
Descriptors: Computational Linguistics, Language Tests, Test Validity, Inferences
Campfield, Dorota E. – Language Testing, 2017
This paper reports a post-hoc analysis of the influence of lexical difficulty of cue sentences on performance in an elicited imitation (EI) task to assess oral production skills for 645 child L2 English learners in instructional settings. This formed part of a large-scale investigation into effectiveness of foreign language teaching in Polish…
Descriptors: Difficulty Level, Second Language Learning, Second Language Instruction, Elementary School Students
O'Hagan, Sally; Pill, John; Zhang, Ying – Language Testing, 2016
Criticism of specific-purpose language (LSP) tests is often directed at their limited ability to represent fully the demands of the target language use situation. Such criticisms extend to the criteria used to assess test performance, which may fail to capture what matters to participants in the domain of interest. This paper reports on the…
Descriptors: Health Personnel, Language Tests, English for Special Purposes, Criticism
Babaii, Esmat; Taghaddomi, Shahin; Pashmforoosh, Roya – Language Testing, 2016
Perceptual (mis)matches between teachers and learners are said to affect learning success or failure. Self-assessment, as a formative assessment tool, may, inter alia, be considered a means to minimize such mismatches. Therefore, the present study investigated the extent to which learners' assessment of their own speaking performance, before and…
Descriptors: Self Evaluation (Individuals), Evaluation Criteria, Oral Language, Second Language Learning
Leaper, David A.; Riazi, Mehdi – Language Testing, 2014
This paper reports an investigation into how the prompt may influence the discourse of group oral tests. The group oral test, in which three or four participants are rated on their ability to discuss a prompt, is a format for assessing the spoken ability of language learners. In this study, 141 Japanese university students were videoed in 41 group…
Descriptors: Oral Language, Language Tests, Second Language Learning, Prompting
Nakatsuhara, Fumiyo – Language Testing, 2011
This study explores the nature of co-constructed interaction in group oral tests by examining whether a test-taker's own and his or her group members' extraversion levels and oral proficiency levels have different influences on conversational styles between two group sizes: groups of three and groups of four. Data were collected from 269 Japanese…
Descriptors: Video Technology, Language Tests, Oral Language, Secondary School Students
Xi, Xiaoming – Language Testing, 2010
Motivated by cognitive theories of graph comprehension, this study systematically manipulated characteristics of a line graph description task in a speaking test in ways to mitigate the influence of graph familiarity, a potential source of construct-irrelevant variance. It extends Xi (2005), which found that the differences in holistic scores on…
Descriptors: Familiarity, Graphs, Scoring, Task Analysis
Munoz, Ana P.; Alvarez, Marta E. – Language Testing, 2010
This article reports the results of a research study to determine the washback effect of an oral assessment system on some areas of the teaching and learning of English as a Foreign Language (EFL). The research combined quantitative and qualitative research methods within a comparative study between an experimental group and a comparison group.…
Descriptors: Experimental Groups, Qualitative Research, Student Surveys, Program Effectiveness

