Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 8 |
Since 2006 (last 20 years) | 11 |
Source
Language Testing | 12 |
Author
Allen, David | 1 |
Bachman, Lyle F. | 1 |
Blood, Ian A. | 1 |
Brunfaut, Tineke | 1 |
Chan, Sathena | 1 |
Cho, Yeonsuk | 1 |
Choi, Ikkyu | 1 |
Choi, Inn-Chull | 1 |
Cohen, Andrew D. | 1 |
Getman, Edward P. | 1 |
Gu, Lin | 1 |
Publication Type
Journal Articles | 12 |
Reports - Research | 11 |
Reports - Evaluative | 1 |
Tests/Questionnaires | 1 |
Education Level
Elementary Education | 3 |
Secondary Education | 2 |
Adult Education | 1 |
Grade 6 | 1 |
Grade 7 | 1 |
Higher Education | 1 |
Intermediate Grades | 1 |
Junior High Schools | 1 |
Middle Schools | 1 |
Assessments and Surveys
Test of English as a Foreign… | 12 |
Allen, David; Nakamura, Keita – Language Testing, 2023
Although there is abundant evidence for the use of first-language (L1) knowledge by bilinguals when using a second language (L2), investigation into the impact of L1 knowledge in large-scale L2 language assessments and discussion of how such impact may be controlled has received little attention in the language assessment literature. This study…
Descriptors: Language Tests, Second Language Learning, Contrastive Linguistics, English (Second Language)
Chan, Sathena; May, Lyn – Language Testing, 2023
Despite the increased use of integrated tasks in high-stakes academic writing assessment, research on rating criteria which reflect the unique construct of integrated summary writing skills is comparatively rare. Using a mixed-method approach of expert judgement, text analysis, and statistical analysis, this study examines writing features that…
Descriptors: Scoring, Writing Evaluation, Reading Tests, Listening Skills
Brunfaut, Tineke; Kormos, Judit; Michel, Marije; Ratajczak, Michael – Language Testing, 2021
Extensive research has demonstrated the impact of working memory (WM) on first language (L1) reading comprehension across age groups (Peng et al., 2018), and on foreign language (FL) reading comprehension of adults and older adolescents (Linck et al., 2014). Comparatively little is known about the effect of WM on young FL readers' comprehension,…
Descriptors: Second Language Learning, Second Language Instruction, Reading Comprehension, Accuracy
Schmidgall, Jonathan E.; Getman, Edward P.; Zu, Jiyun – Language Testing, 2018
In this study, we define the term "screener test," elaborate key considerations in test design, and describe how to incorporate the concepts of practicality and argument-based validation to drive an evaluation of screener tests for language assessment. A screener test is defined as a brief assessment designed to identify an examinee as a…
Descriptors: Test Validity, Test Use, Test Construction, Language Tests
Choi, Ikkyu; Papageorgiou, Spiros – Language Testing, 2020
Stakeholders of language tests are often interested in subscores. However, reporting a subscore is not always justified; a subscore should provide reliable and distinct information to be worth reporting. When a subscore is used for decisions across multiple levels (e.g., individual test takers and schools), it needs to be justified for its…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores
Cho, Yeonsuk; Blood, Ian A. – Language Testing, 2020
In this study, we examined how much change in "TOEFL® Primary™" listening and reading scores can be expected in relation to the time interval between test administrations. The test records of 5213 young learners of English (aged 8-13 years) in Japan and Turkey who repeated the tests were analyzed to examine test scores as a function of…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores
Sawaki, Yasuyo; Sinharay, Sandip – Language Testing, 2018
The present study examined the reliability of the reading, listening, speaking, and writing section scores for the TOEFL iBT® test and their interrelationship in order to collect empirical evidence to support, respectively, the "generalization" inference and the "explanation" inference in the TOEFL iBT validity argument…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing
Yi, Yeon-Sook – Language Testing, 2017
The present study examines the relative importance of attributes within and across items by applying four cognitive diagnostic assessment models. The current study utilizes the function of the models that can indicate inter-attribute relationships that reflect the response behaviors of examinees to analyze scored test-taker responses to four forms…
Descriptors: Second Language Learning, Reading Comprehension, Listening Comprehension, Language Tests
Gu, Lin – Language Testing, 2015
In this study I examined the dimensionality of the latent ability underlying language use that is needed to fulfill the demands young learners face in English-medium instructional environments, where English is used as the means of instruction for teaching subject matters. Previous research on English language use by school-age children provided…
Descriptors: Language Aptitude, Language Proficiency, English (Second Language), English Language Learners
Kim, Ah-Young – Language Testing, 2015
Previous research in cognitive diagnostic assessment (CDA) of L2 reading ability has been frequently conducted using large-scale English proficiency exams (e.g., TOEFL, MELAB). Using CDA, it is possible to analyze individual learners' strengths and weaknesses in multiple attributes (i.e., knowledge, skill, strategy) measured at the item level.…
Descriptors: Language Tests, Diagnostic Tests, Cognitive Measurement, Reading Ability
Cohen, Andrew D.; Upton, Thomas A. – Language Testing, 2007
This study describes the reading and test-taking strategies that test takers used on the "Reading" section of the "LanguEdge Courseware" (2002) materials developed to familiarize prospective respondents with the new TOEFL. The investigation focused on strategies used to respond to more traditional "single selection"…
Descriptors: Courseware, Language Tests, Test Wiseness, Language Teachers

Choi, Inn-Chull; Bachman, Lyle F. – Language Testing, 1992
This study is part of a larger one examining the comparability of the First Certificate in English and the Test of English as a Foreign Language. The general assumption of unidimensionality and goodness-of-fit were tested. Findings raise questions about the consequences of rejecting or retaining misfitting items. (60 references) (LB)
Descriptors: Comparative Analysis, English (Second Language), Goodness of Fit, Item Response Theory