Publication Date
| In 2026 | 0 |
| Since 2025 | 12 |
| Since 2022 (last 5 years) | 83 |
| Since 2017 (last 10 years) | 173 |
| Since 2007 (last 20 years) | 360 |
Descriptor
Source
| Language Testing | 539 |
Author
| Davies, Alan | 8 |
| Bachman, Lyle F. | 7 |
| Elder, Catherine | 7 |
| Cheng, Liying | 6 |
| Xi, Xiaoming | 6 |
| Yan, Xun | 6 |
| Alderson, J. Charles | 5 |
| Aryadoust, Vahid | 5 |
| Cho, Yeonsuk | 5 |
| Ginther, April | 5 |
| Knoch, Ute | 5 |
| More ▼ | |
Publication Type
Education Level
Audience
Location
| Japan | 33 |
| China | 30 |
| Australia | 23 |
| United Kingdom | 15 |
| Canada | 14 |
| South Korea | 13 |
| Europe | 7 |
| Germany | 6 |
| Hong Kong | 6 |
| Netherlands | 6 |
| New Zealand | 5 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Elementary and Secondary… | 1 |
| Lau v Nichols | 1 |
| Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Watanabe, Yoshinori – Language Testing, 2013
This article describes the National Center Test for University Admissions, a unified national test in Japan, which is taken by 500,000 students every year. It states that implementation of the Center Test began in 1990, with the English component consisting only of the written section until 2005, when the listening section was first implemented…
Descriptors: College Admission, Foreign Countries, College Entrance Examinations, English (Second Language)
Pae, Tae-Il – Language Testing, 2012
This study tracked gender differential item functioning (DIF) on the English subtest of the Korean College Scholastic Aptitude Test (KCSAT) over a nine-year period across three data points, using both the Mantel-Haenszel (MH) and item response theory likelihood ratio (IRT-LR) procedures. Further, the study identified two factors (i.e. reading…
Descriptors: Aptitude Tests, Academic Aptitude, Language Tests, Test Items
Hulstijn, Jan H.; Schoonen, Rob; de Jong, Nivja H.; Steinel, Margarita P.; Florijn, Arjen – Language Testing, 2012
This study examines the associations between the speaking proficiency of 181 adult learners of Dutch as a second language and their linguistic competences. Performance in eight speaking tasks was rated on a scale of communicative adequacy. After extrapolation of these ratings to the Overall Oral Production scale of the Common European Framework of…
Descriptors: Linguistic Competence, Speech Communication, Grammar, Indo European Languages
Barkaoui, Khaled – Language Testing, 2010
This study adopted a multilevel modeling (MLM) approach to examine the contribution of rater and essay factors to variability in ESL essay holistic scores. Previous research aiming to explain variability in essay holistic scores has focused on either rater or essay factors. The few studies that have examined the contribution of more than one…
Descriptors: Performance Based Assessment, English (Second Language), Second Language Learning, Holistic Approach
Weigle, Sara Cushing – Language Testing, 2010
Automated scoring has the potential to dramatically reduce the time and costs associated with the assessment of complex skills such as writing, but its use must be validated against a variety of criteria for it to be accepted by test users and stakeholders. This study approaches validity by comparing human and automated scores on responses to…
Descriptors: Correlation, Validity, Writing Ability, English (Second Language)
Kunnan, Antony John – Language Testing, 2010
This paper presents the author's response to Xiaoming Xi's article titled "How do we go about investigating test fairness?" In this response, the author focuses on test fairness and Toulmin's model of argument structure, Xi's proposal, and the challenges the proposal brings. Xi proposes an approach to investigating test fairness to guide…
Descriptors: Persuasive Discourse, Inferences, Test Bias, Models
Crossley, Scott A.; Salsbury, Tom; McNamara, Danielle S. – Language Testing, 2012
This study explores how second language (L2) texts written by learners at various proficiency levels can be classified using computational indices that characterize lexical competence. For this study, 100 writing samples taken from 100 L2 learners were analyzed using lexical indices reported by the computational tool Coh-Metrix. The L2 writing…
Descriptors: Semantics, Familiarity, Discriminant Analysis, Vocabulary Development
Gan, Zhengdong – Language Testing, 2010
This article examines the interactional work in which two groups of secondary ESL students engaged to achieve and sustain participation in group oral assessment, which is designed to assess a student's interactive communication skills in a school-based assessment context. The in-depth observation of the ways in which participants co-constructed…
Descriptors: Group Discussion, Oral Language, Scoring, Case Studies
Zhang, Bo – Language Testing, 2010
This article investigates how measurement models and statistical procedures can be applied to estimate the accuracy of proficiency classification in language testing. The paper starts with a concise introduction of four measurement models: the classical test theory (CTT) model, the dichotomous item response theory (IRT) model, the testlet response…
Descriptors: Language Tests, Classification, Item Response Theory, Statistical Analysis
Goodwin, Amanda P.; Huggins, A. Corinne; Carlo, Maria; Malabonga, Valerie; Kenyon, Dorry; Louguit, Mohammed; August, Diane – Language Testing, 2012
This study describes the development and validation of the Extract the Base test (ETB), which assesses derivational morphological awareness. Scores on this test were validated for 580 monolingual students and 373 Spanish-speaking English language learners (ELLs) in third through fifth grade. As part of the validation of the internal structure,…
Descriptors: Reading Comprehension, Speech Communication, Second Language Learning, Scoring
Jin, Yan; Fan, Jinsong – Language Testing, 2011
The purpose of the Test for English Majors (TEM) is to measure the English proficiency of Chinese university undergraduates majoring in English Language and Literature and to examine whether these students meet the required levels of English language abilities as specified in the National College English Teaching Syllabus for English Majors…
Descriptors: Majors (Students), Advisory Committees, Individual Testing, Examiners
Zhang, Ying; Elder, Catherine – Language Testing, 2011
This paper reports the findings of an empirical study on ESL/EFL teachers' evaluation and interpretation of oral English proficiency as elicited by the national College English Test-Spoken English Test (CET-SET) of China. Informed by debates on the issue of native speaker (NS) norms which have become the focus of attention in recent years, this…
Descriptors: Language Tests, College English, Foreign Countries, Native Speakers
Fitzpatrick, Tess; Clenton, Jon – Language Testing, 2010
This paper assesses the performance of a vocabulary test designed to measure second language productive vocabulary knowledge.The test, Lex30, uses a word association task to elicit vocabulary, and uses word frequency data to measure the vocabulary produced. Here we report firstly on the reliability of the test as measured by a test-retest study, a…
Descriptors: Language Tests, Construct Validity, Vocabulary Development, Word Frequency
Alderson, J. Charles – Language Testing, 2010
The Lancaster Language Testing Research Group was commissioned in 2006 by the European Organisation for the Safety of Air Navigation (Eurocontrol) to conduct a validation study of the development of a test called ELPAC (English Language Proficiency for Aeronautical Communication), intended to assess the language proficiency of air traffic…
Descriptors: Testing, Language Tests, Language Proficiency, Aviation Education
Wilson, Mark; Moore, Stephen – Language Testing, 2011
This paper provides a summary of a novel and integrated way to think about the item response models (most often used in measurement applications in social science areas such as psychology, education, and especially testing of various kinds) from the viewpoint of the statistical theory of generalized linear and nonlinear mixed models. In addition,…
Descriptors: Reading Comprehension, Testing, Social Sciences, Item Response Theory

Peer reviewed
Direct link
