Publication Date
| In 2026 | 0 |
| Since 2025 | 2 |
| Since 2022 (last 5 years) | 30 |
| Since 2017 (last 10 years) | 66 |
| Since 2007 (last 20 years) | 143 |
Descriptor
Source
| Language Testing | 289 |
Author
Publication Type
Education Level
Audience
Location
| Australia | 14 |
| China | 14 |
| Japan | 9 |
| Hong Kong | 5 |
| United Kingdom | 5 |
| Canada | 4 |
| United States | 3 |
| Brazil | 2 |
| France | 2 |
| Germany | 2 |
| Indiana | 2 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 2 |
| Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Peer reviewedPhakiti, Aek – Language Testing, 2003
Investigates the relationship of test-takers' use of cognitive and metacognitive strategies to English-as-a-Foreign-Language (EFL) reading test performance. Results suggest that use of cognitive and metacognitive strategies had a positive relationship to reading test performance; highly successful test-takers reported significantly higher…
Descriptors: Cognitive Processes, Language Tests, Learning Strategies, Metacognition
The Effect of Rate Variables in the Development of an Occupation-Specific Language Performance Test.
Peer reviewedBrown, Anne – Language Testing, 1995
This article explores the effect of raters' background on assessments made in an occupation-specific oral language test, the Japanese Language Test for Tour Guides. Assessments of 51 test candidates made by 33 assessors were compared in order to determine what effect background has on assessments made on both linguistic and "real-world"…
Descriptors: Comparative Analysis, Evaluators, Japanese, Language Tests
Peer reviewedTakala, Sauli; Kaftandjieva, Felianka – Language Testing, 2000
Analyzes gender-uniform differential item functioning (DIF) in a second language vocabulary test with the tools of item response theory to study potential gender impact on the test performance measured by different item composites. Results show that while there are test items with indications of DIF in favor of either females or males, the test as…
Descriptors: English (Second Language), Foreign Countries, Item Analysis, Language Tests
Peer reviewedEdelenbos, Peter; Vinje, Marja P. – Language Testing, 2000
Reports the outcomes of two English-as-Foreign-Language assessments for Dutch primary school students with regard to listening, reading, and word knowledge. The assessments have provided important insights into the possibilities and limitations of foreign language teaching and learning. (Author/VWL)
Descriptors: Elementary Education, English (Second Language), Foreign Countries, Language Tests
Peer reviewedMcKay, Penny – Language Testing, 2000
Presents principles behind the construction of English-as-Second-Language (ESL) standards for schools, drawing on examples of ESL standards developed in Australia, England, Wales, and the United States. Examines how differences in purposes in these standards--planning, professional understanding, and reporting--influence how ESL standards might…
Descriptors: Academic Standards, Elementary Education, English (Second Language), Foreign Countries
Peer reviewedHuibregtse, Ineke; Admiraal, Wilfried; Meara, Paul – Language Testing, 2002
Discusses how to tackle the problem of determining a meaningful score for yes-no tests used to measure the size of receptive vocabulary. Signal Detection Theory is applied, and a new more accurate index is suggested. (Author/VWL)
Descriptors: English (Second Language), Language Tests, Receptive Language, Scores
Peer reviewedHamp-Lyons, Liz – Language Testing, 1997
Links the theory of washback with the broader concept of impact in educational measurement and to the recent debate on construct validity associated with Messick. Notes that for many years it was asserted that language tests negatively impacted teaching and learning, an impact known as washback. (25 references) (Author/CK)
Descriptors: Ethics, Higher Education, Language Tests, Measurement Techniques
Peer reviewedMessick, Samuel – Language Testing, 1996
Examines the concept of washback as an instance of the consequential aspect of construct validity, linking positive washback to direct assessments and the need to minimize construct underrepresentation and construct-irrelevant difficulty in the test. The article explains washback as referring to the extent to which test use influences language…
Descriptors: Applied Linguistics, Construct Validity, Content Validity, Language Tests
Peer reviewedO'Loughlin, Kieran – Language Testing, 1995
This article examines the effects of test format and task type on candidate output in direct and semidirect versions of the oral interaction subtest of the Australian Assessment of Communicative English Skills. Results are discussed in relation to the degree of interactiveness and other factors that appear to influence lexical density and to the…
Descriptors: English (Second Language), Language Proficiency, Language Tests, Oral Language
Peer reviewedElder, Catherine – Language Testing, 2001
Discusses issues identified by Douglas (2000) as problematic for language for specific purposes testing, making reference to a number of performance-based instruments designed to assess the language proficiency of teachers or intending teachers. Addresses the problems of specificity and authenticity. (Author)
Descriptors: English for Special Purposes, Language Proficiency, Language Teachers, Language Tests
Peer reviewedDouglas, Dan – Language Testing, 2001
Discusses criteria used in assessing language for specific purposes tests. Examines the issue of separability of language and content and reinforces points made by Jacoby and McNamara (1999) that second language assessments based entirely on linguistic criteria may fail to satisfy the purpose of the test user, whereas the use of indigenous…
Descriptors: Evaluation Criteria, Language Tests, Languages for Special Purposes, Native Speakers
Peer reviewedKunnan, Antony John – Language Testing, 1998
Provides an introduction to structural equation modelling (SEM) for language research, including: general objectives of SEM applications relevant to language assessment; methodology and statistical assumptions about data that must be met; commonly-used SEM steps and concepts; application matters, with sample models; and recent critical discussions…
Descriptors: Language Research, Language Tests, Mathematical Formulas, Models
Peer reviewedGuerrero, Michael D. – Language Testing, 2000
Seventeen states in the United States use Spanish-language proficiency tests to ensure that bilingual education teachers are able to deliver academic instruction in Spanish to school-age students. The unified validity of the Four Skills Exam (FSE), used in New Mexico for nearly 18 years, was evaluated using Messick's framework (1989). (Author/VWL)
Descriptors: Bilingual Education, Bilingual Teachers, Elementary Secondary Education, Language Proficiency
Peer reviewedLumley, Tom – Language Testing, 2002
Investigates the process by which raters of texts written by English-as-a-Second-Language learners make their scoring decisions using an analytic rating scale designed for multiple test forms. Demonstrates that the task raters face is to reconcile their impression of the text, the specific features of the text, and the wordings of the rating…
Descriptors: English (Second Language), Evaluation Criteria, Language Tests, Rating Scales
Peer reviewedPatri, Mrudula – Language Testing, 2002
Investigates agreement among teacher-, self-, and peer-assessments of students in the presence of peer feedback. This is done in the context of oral presentation skills of first year undergraduate students of ethnic Chinese background. Findings how that when assessment criteria are firmly set, peer feedback enables students to judge the…
Descriptors: College Students, Higher Education, Language Tests, Oral Language


