Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 7 |
Descriptor
Source
Language Testing | 8 |
Author
Schmitt, Norbert | 2 |
Garras, John | 1 |
Hopp, Holger | 1 |
Jarvis, Scott | 1 |
Kang, Okim | 1 |
Kermad, Alyssa | 1 |
Kim, Youn-Hee | 1 |
Knoch, Ute | 1 |
Lee, Shinhye | 1 |
Ng, Janice Wun Ching | 1 |
Pellicer-Sanchez, Ana | 1 |
More ▼ |
Publication Type
Journal Articles | 8 |
Reports - Research | 6 |
Reports - Evaluative | 2 |
Education Level
Higher Education | 3 |
Elementary Education | 1 |
Audience
Location
United Kingdom | 2 |
Canada (Montreal) | 1 |
China | 1 |
Japan | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 2 |
What Works Clearinghouse Rating
Kang, Okim; Rubin, Don; Kermad, Alyssa – Language Testing, 2019
As a result of the fact that judgments of non-native speech are closely tied to social biases, oral proficiency ratings are susceptible to error because of rater background and social attitudes. In the present study we seek first to estimate the variance attributable to rater background and attitudinal variables on novice raters' assessments of L2…
Descriptors: Evaluators, Second Language Learning, Language Tests, English (Second Language)
Lee, Shinhye; Winke, Paula – Language Testing, 2018
We investigated how young language learners process their responses on and perceive a computer-mediated, timed speaking test. Twenty 8-, 9-, and 10-year-old non-native English-speaking children (NNSs) and eight same-aged, native English-speaking children (NSs) completed seven computerized sample TOEFL® Primary™ speaking test tasks. We investigated…
Descriptors: Elementary School Students, Second Language Learning, Responses, Computer Assisted Testing
Schmid, Monika S.; Hopp, Holger – Language Testing, 2014
This study examines the methodology of global foreign accent ratings in studies on L2 speech production. In three experiments, we test how variation in raters, range within speech samples, as well as instructions and procedures affects ratings of accent in predominantly monolingual speakers of German, non-native speakers of German, as well as…
Descriptors: Comparative Analysis, Second Language Learning, Pronunciation, Native Speakers
Pellicer-Sanchez, Ana; Schmitt, Norbert – Language Testing, 2012
Despite a number of research studies investigating the Yes-No vocabulary test format, one main question remains unanswered: What is the best scoring procedure to adjust for testee overestimation of vocabulary knowledge? Different scoring methodologies have been proposed based on the inclusion and selection of nonwords in the test. However, there…
Descriptors: Language Tests, Scoring, Reaction Time, Vocabulary Development
Schmitt, Norbert; Ng, Janice Wun Ching; Garras, John – Language Testing, 2011
Although the Word Associates Format (WAF) is becoming more frequently used as a depth-of-knowledge measure, relatively little validation has been carried out on it. This report of two validation studies tackles various important WAF issues yet to be satisfactorily resolved. Study 1 conducted introspective interviews regarding students' WAF…
Descriptors: Scoring, Vocabulary Development, Associative Learning, Validity
Kim, Youn-Hee – Language Testing, 2009
This study used a mixed methods research approach to examine how native English-speaking (NS) and non-native English-speaking (NNS) teachers assess students' oral English performance. The evaluation behaviors of two groups of teachers (12 Canadian NS teachers and 12 Korean NNS teachers) were compared with regard to internal consistency, severity,…
Descriptors: Methods Research, Evaluation Criteria, Oral English, English (Second Language)
Knoch, Ute – Language Testing, 2009
Alderson (2005) suggests that diagnostic tests should identify strengths and weaknesses in learners' use of language and focus on specific elements rather than global abilities. However, rating scales used in performance assessment have been repeatedly criticized for being imprecise and therefore often resulting in holistic marking by raters…
Descriptors: Feedback (Response), Language Usage, Performance Based Assessment, Performance Tests

Jarvis, Scott – Language Testing, 2002
Compares accuracy of five formulae in terms of their ability to model the type-token curves of written texts produced by learners and native speakers. The most accurate models are then used to consider unresolved issues of past research on lexical diversity: the relationship between lexical diversity and age, second language instruction (L2), L2…
Descriptors: Age, Comparative Analysis, Language Tests, Native Speakers