Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 8 |
Since 2006 (last 20 years) | 12 |
Descriptor
Source
Language Testing | 12 |
Author
Manna, Venessa F. | 2 |
McLean, Stuart | 2 |
Powers, Donald E. | 2 |
Stewart, Jeffrey | 2 |
Yoo, Hanwook | 2 |
Batty, Aaron Olaf | 1 |
Cheng, Liying | 1 |
Christopher Nicklin | 1 |
Gyllstad, Henrik | 1 |
Henrik Gyllstad | 1 |
Im, Gwan-Hyeok | 1 |
More ▼ |
Publication Type
Journal Articles | 12 |
Reports - Research | 10 |
Reports - Descriptive | 1 |
Reports - Evaluative | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 6 |
Postsecondary Education | 5 |
Audience
Location
Japan | 7 |
Europe | 2 |
South Korea | 2 |
United Kingdom | 2 |
Asia | 1 |
Australia | 1 |
Middle East | 1 |
Netherlands | 1 |
South America | 1 |
United Kingdom (England) | 1 |
United States | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Test of English for… | 12 |
Test of English as a Foreign… | 3 |
International English… | 2 |
What Works Clearinghouse Rating
Miao, Yongzhi – Language Testing, 2023
Scholars have argued for the inclusion of different spoken varieties of English in high-stakes listening tests to better represent the global use of English. However, doing so may introduce additional construct-irrelevant variance due to accent familiarity and the shared first language (L1) advantage, which could threaten test fairness. However,…
Descriptors: Pronunciation, Metalinguistics, Native Language, Intelligibility
Jeffrey Stewart; Henrik Gyllstad; Christopher Nicklin; Stuart McLean – Language Testing, 2024
The purpose of this paper is to (a) establish whether meaning recall and meaning recognition item formats test psychometrically distinct constructs of vocabulary knowledge which measure separate skills, and, if so, (b) determine whether each construct possesses unique properties predictive of L2 reading proficiency. Factor analyses and…
Descriptors: Vocabulary Development, Psychometrics, Language Tests, Recall (Psychology)
Schmidgall, Jonathan; Powers, Donald E. – Language Testing, 2021
In this study we examined the extent to which "TOEIC"® Speaking test scores relate to evaluations by professionals in the international workplace, the target language use domain of TOEIC tests. Linguistic laypersons in 10 countries were invited to participate in an online research survey. The survey incorporated a stratified sample of…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Scores
Gyllstad, Henrik; McLean, Stuart; Stewart, Jeffrey – Language Testing, 2021
The last three decades have seen an increase of tests aimed at measuring an individual's vocabulary level or size. The target words used in these tests are typically sampled from word frequency lists, which are in turn based on language corpora. Conventionally, test developers sample items from frequency bands of 1000 words; different tests employ…
Descriptors: Vocabulary Development, Sample Size, Language Tests, Test Items
McLean, Stuart; Stewart, Jeffrey; Batty, Aaron Olaf – Language Testing, 2020
Vocabulary's relationship to reading proficiency is frequently cited as a justification for the assessment of L2 written receptive vocabulary knowledge. However, to date, there has been relatively little research regarding which modalities of vocabulary knowledge have the strongest correlations to reading proficiency, and observed differences have…
Descriptors: Prediction, Reading Tests, Language Proficiency, Test Items
Yoo, Hanwook; Manna, Venessa F.; Monfils, Lora F.; Oh, Hyeon-Joo – Language Testing, 2019
This study illustrates the use of score equity assessment (SEA) for evaluating the fairness of reported test scores from assessments intended for test takers from diverse cultural, linguistic, and educational backgrounds, using a workplace English proficiency test. Subgroups were defined by test-taker background characteristics that research has…
Descriptors: English (Second Language), Second Language Learning, Culture Fair Tests, Test Validity
Im, Gwan-Hyeok; Cheng, Liying – Language Testing, 2019
The primary purpose of the Test of English for International Communication (TOEIC®) is to measure the everyday English skills of individuals, who speak a first language other than English, working in an international environment (ETS, 2015a, 2016a; Powers & Powers, 2015). The TOEIC also has six secondary purposes: (1) to verify the current…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Language Proficiency
Yoo, Hanwook; Manna, Venessa F. – Language Testing, 2017
This study assessed the factor structure of the Test of English for International Communication (TOEIC®) Listening and Reading test, and its invariance across subgroups of test-takers. The subgroups were defined by (a) gender, (b) age, (c) employment status, (d) time spent studying English, and (e) having lived in a country where English is the…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Second Language Learning
Powers, Donald E.; Powers, Andrew – Language Testing, 2015
Typically, English language proficiency tests yield multiple scores--usually for each of the four traditional language domains. In order to maximize the usefulness of test scores, they may need to be accompanied by information concerning how they complement one another. Using self-assessments by some 2300 TOEIC test takers, this study aimed to…
Descriptors: Language Tests, English (Second Language), Language Proficiency, Prediction
Stubbe, Raymond – Language Testing, 2012
"Pseudowords", or non-real words, were introduced to the Yes/No (YN) vocabulary test format to provide a means of checking for overestimation of word knowledge by test takers. The purpose of this study is to assess the assumption that more pseudoword checks (false alarms) indicate more instances of overestimation of word knowledge in YN…
Descriptors: Academic Ability, English (Second Language), Multiple Choice Tests, Test Results
McNamara, Tim; Knoch, Ute – Language Testing, 2012
This paper examines the uptake of Rasch measurement in language testing through a consideration of research published in language testing research journals in the period 1984 to 2009. Following the publication of the first papers on this topic, exploring the potential of the simple Rasch model for the analysis of dichotomous language test data, a…
Descriptors: Language Tests, Testing, English (Second Language), Item Response Theory
Zhang, Su – Language Testing, 2006
This study applied generalizability theory to investigate the contributions of persons, items, sections, and language backgrounds to the score dependability of the Test of English for International Communication (TOEIC). I replicated and extended Brown's (1999) study of the Test of English as a Foreign Language (TOEFL), using data from two…
Descriptors: Communication (Thought Transfer), Generalizability Theory, English (Second Language), Scores