Publication Date
| In 2026 | 0 |
| Since 2025 | 3 |
| Since 2022 (last 5 years) | 29 |
| Since 2017 (last 10 years) | 61 |
| Since 2007 (last 20 years) | 98 |
Descriptor
Source
| Language Testing | 156 |
Author
| Chapelle, Carol A. | 4 |
| Yan, Xun | 4 |
| Aryadoust, Vahid | 3 |
| Bachman, Lyle F. | 3 |
| Davies, Alan | 3 |
| Fulcher, Glenn | 3 |
| Shohamy, Elana | 3 |
| Alderson, J. Charles | 2 |
| August, Diane | 2 |
| Beglar, David | 2 |
| Brown, James Dean | 2 |
| More ▼ | |
Publication Type
| Journal Articles | 156 |
| Reports - Research | 93 |
| Reports - Evaluative | 31 |
| Opinion Papers | 17 |
| Reports - Descriptive | 14 |
| Information Analyses | 8 |
| Tests/Questionnaires | 5 |
| Speeches/Meeting Papers | 2 |
Education Level
Audience
| Researchers | 1 |
| Teachers | 1 |
Location
| Japan | 8 |
| China | 6 |
| United Kingdom | 5 |
| Australia | 4 |
| Brazil | 3 |
| South Korea | 3 |
| United Kingdom (England) | 3 |
| Canada | 2 |
| Germany | 2 |
| Israel | 2 |
| New Zealand | 2 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Huhta, Ari; Alanen, Riikka; Tarnanen, Mirja; Martin, Maisa; Hirvelä, Tuija – Language Testing, 2014
There is still relatively little research on how well the CEFR and similar holistic scales work when they are used to rate L2 texts. Using both multifaceted Rasch analyses and qualitative data from rater comments and interviews, the ratings obtained by using a CEFR-based writing scale and the Finnish National Core Curriculum scale for L2 writing…
Descriptors: Foreign Countries, Writing Skills, Second Language Learning, Finno Ugric Languages
McNamara, Tim – Language Testing, 2011
The paper by Wilson and Moore (this volume), based on the Messick Lecture delivered in 2006 at the annual Language Testing Research Colloquium in Melbourne, may present a familiar challenge to some language testers: of reading outside one's comfort zone. The distinctive character of language testing lies in its combination of two primary fields of…
Descriptors: Expertise, Applied Linguistics, Testing, Language Tests
Haug, Tobias – Language Testing, 2012
Despite the current need for reliable and valid test instruments in different countries in order to monitor the sign language acquisition of deaf children, very few tests are commercially available that offer strong evidence for their psychometric properties. This mirrors the current state of affairs for many sign languages, where very little…
Descriptors: Evidence, Sign Language, Language Tests, Construct Validity
Wang, Huan; Choi, Ikkyu; Schmidgall, Jonathan; Bachman, Lyle F. – Language Testing, 2012
This review departs from current practice in reviewing tests in that it employs an "argument-based approach" to test validation to guide the review (e.g. Bachman, 2005; Kane, 2006; Mislevy, Steinberg, & Almond, 2002). Specifically, it follows an approach to test development and use that Bachman and Palmer (2010) call the process of "assessment…
Descriptors: Evidence, Stakeholders, Test Construction, Test Use
Davies, Alan – Language Testing, 2010
This article presents the author's response to Xiaoming Xi's paper titled "How do we go about investigating test fairness?" In the paper, Xi offers "a means to fully integrate fairness investigations and practice". Given the current importance accorded to fairness in the language testing community, Xi makes a case for viewing fairness as an aspect…
Descriptors: Investigations, Testing, Language Tests, Validity
Kane, Michael – Language Testing, 2010
This paper presents the author's critique on Xiaoming Xi's article, "How do we go about investigating test fairness?," which lays out a broad framework for studying fairness as comparable validity across groups within the population of interest. Xi proposes to develop a fairness argument that would identify and evaluate potential fairness-based…
Descriptors: Test Bias, Test Validity, Language Tests, Testing
Sasaki, Miyuki – Language Testing, 2012
The Modern Language Aptitude Test (Paper-and-Pencil Version, henceforth, the MLAT) measures "an individual's ability to learn a foreign language." It targets English-speaking adults (over Grade 9) who are literate. The test has only one form, which has not changed since it was first published by the Psychological Corporation in 1959. The test can…
Descriptors: Aptitude Tests, Test Reviews, Rewards, Acoustics
Xi, Xiaoming – Language Testing, 2010
Previous test fairness frameworks have greatly expanded the scope of fairness, but do not provide a means to fully integrate fairness investigations and set priorities. This article proposes an approach to guide practitioners on fairness research and practices. This approach treats fairness as an aspect of validity and conceptualizes it as…
Descriptors: Test Results, Language Tests, Test Validity, English (Second Language)
Watanabe, Yoshinori – Language Testing, 2013
This article describes the National Center Test for University Admissions, a unified national test in Japan, which is taken by 500,000 students every year. It states that implementation of the Center Test began in 1990, with the English component consisting only of the written section until 2005, when the listening section was first implemented…
Descriptors: College Admission, Foreign Countries, College Entrance Examinations, English (Second Language)
Goodwin, Amanda P.; Huggins, A. Corinne; Carlo, Maria; Malabonga, Valerie; Kenyon, Dorry; Louguit, Mohammed; August, Diane – Language Testing, 2012
This study describes the development and validation of the Extract the Base test (ETB), which assesses derivational morphological awareness. Scores on this test were validated for 580 monolingual students and 373 Spanish-speaking English language learners (ELLs) in third through fifth grade. As part of the validation of the internal structure,…
Descriptors: Reading Comprehension, Speech Communication, Second Language Learning, Scoring
Alderson, J. Charles – Language Testing, 2010
The Lancaster Language Testing Research Group was commissioned in 2006 by the European Organisation for the Safety of Air Navigation (Eurocontrol) to conduct a validation study of the development of a test called ELPAC (English Language Proficiency for Aeronautical Communication), intended to assess the language proficiency of air traffic…
Descriptors: Testing, Language Tests, Language Proficiency, Aviation Education
Fulcher, Glenn; Davidson, Fred – Language Testing, 2009
Just like buildings, tests are designed and built for specific purposes, people, and uses. However, both buildings and tests grow and change over time as the needs of their users change. Sometimes, they are also both used for purposes other than those intended in the original designs. This paper explores architecture as a metaphor for language…
Descriptors: Figurative Language, Language Tests, Measurement Techniques, Test Validity
Bae, Jungok; Lee, Yae-Sheik – Language Testing, 2011
Pictures are widely used to elicit expressive language skills, and pictures must be established as parallel before changes in ability can be demonstrated by assessment using pictures prompts. Why parallel prompts are required and what it is necessary to do to ensure that prompts are in fact parallel is not widely known. To date, evidence of…
Descriptors: Second Language Learning, Test Format, Language Tests, Factor Analysis
Bernstein, Jared; Van Moere, Alistair; Cheng, Jian – Language Testing, 2010
This paper presents evidence that supports the valid use of scores from fully automatic tests of spoken language ability to indicate a person's effectiveness in spoken communication. The paper reviews the constructs, scoring, and the concurrent validity evidence of "facility-in-L2" tests, a family of automated spoken language tests in Spanish,…
Descriptors: Speech, Oral Language, Language Tests, Test Validity
Lee-Ellis, Sunyoung – Language Testing, 2009
Despite the importance of having a reliable and valid measure of Second Language (L2) proficiency, L2 researchers of less commonly taught languages rarely have such a tool. Existing proficiency measures (e.g., DLPT, OPI) are often costly, labor-intensive, time-consuming, or unavailable to the public. With the intent to provide a practical and…
Descriptors: Uncommonly Taught Languages, Test Validity, Korean, Test Construction

Peer reviewed
Direct link
