NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 13 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Latifi, Syed; Gierl, Mark – Language Testing, 2021
An automated essay scoring (AES) program is a software system that uses techniques from corpus and computational linguistics and machine learning to grade essays. In this study, we aimed to describe and evaluate particular language features of Coh-Metrix for a novel AES program that would score junior and senior high school students' essays from…
Descriptors: Writing Evaluation, Computer Assisted Testing, Scoring, Essays
Peer reviewed Peer reviewed
Direct linkDirect link
Wind, Stefanie A. – Language Testing, 2023
Researchers frequently evaluate rater judgments in performance assessments for evidence of differential rater functioning (DRF), which occurs when rater severity is systematically related to construct-irrelevant student characteristics after controlling for student achievement levels. However, researchers have observed that methods for detecting…
Descriptors: Evaluators, Decision Making, Student Characteristics, Performance Based Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Ray J. T. Liao; Renka Ohta; Kwangmin Lee – Language Testing, 2024
As integrated writing tasks in large-scale and classroom-based writing assessments have risen in popularity, research studies have increasingly concentrated on providing validity evidence. Given the fact that most of these studies focus on adult second language learners rather than younger ones, this study examined the relationship between written…
Descriptors: Writing (Composition), Writing Evaluation, English Language Learners, Discourse Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Shi, Bibing; Huang, Liyan; Lu, Xiaofei – Language Testing, 2020
The continuation task, a new form of reading-writing integrated task in which test-takers read an incomplete story and then write the continuation and ending of the story, has been increasingly used in writing assessment, especially in China. However, language-test developers' understanding of the effects of important task-related factors on…
Descriptors: Cues, Writing Tests, Writing Evaluation, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
Koo, Jin; Becker, Betsy Jane; Kim, Young-Suk – Language Testing, 2014
In this study, differential item functioning (DIF) trends were examined for English language learners (ELLs) versus non-ELL students in third and tenth grades on a large-scale reading assessment. To facilitate the analyses, a meta-analytic DIF technique was employed. The results revealed that items requiring knowledge of words and phrases in…
Descriptors: Test Bias, Reading Tests, English Language Learners, Native Speakers
Peer reviewed Peer reviewed
Direct linkDirect link
Elgort, Irina – Language Testing, 2013
This study examines the development and evaluation of a bilingual Vocabulary Size Test (VST, Nation, 2006). A bilingual (English-Russian) test was developed and administered to 121 intermediate proficiency EFL learners (native speakers of Russian), alongside the original monolingual (English-only) version of the test. A comparison of the bilingual…
Descriptors: Test Construction, Vocabulary, Language Tests, English
Peer reviewed Peer reviewed
Direct linkDirect link
Watanabe, Yoshinori – Language Testing, 2013
This article describes the National Center Test for University Admissions, a unified national test in Japan, which is taken by 500,000 students every year. It states that implementation of the Center Test began in 1990, with the English component consisting only of the written section until 2005, when the listening section was first implemented…
Descriptors: College Admission, Foreign Countries, College Entrance Examinations, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
East, Martin – Language Testing, 2015
Implementing assessment reform can be challenging. Proposed new assessments must be seen by stakeholders to be fit for purpose, and sometimes the perceptions of key stakeholders, such as teachers and students, may differ from the assessment developers. This article considers the recent introduction of a new high-stakes assessment of spoken…
Descriptors: High Stakes Tests, Teacher Attitudes, High School Students, Secondary School Teachers
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, HyeSun; Winke, Paula – Language Testing, 2013
We adapted three practice College Scholastic Ability Tests (CSAT) of English listening, each with five-option items, to create four- and three-option versions by asking 73 Korean speakers or learners of English to eliminate the least plausible options in two rounds. Two hundred and sixty-four Korean high school English-language learners formed…
Descriptors: Academic Ability, Stakeholders, Reliability, Listening Comprehension Tests
Peer reviewed Peer reviewed
Hunt, Alan; Beglar, David – Language Testing, 1999
Analyzed and validated revised versions of the 2000 Word Level and the University Word Level of Nation's Vocabulary Levels Test, which can be administered for course planning and placement in language programs. Japanese high school and university students (n=496) completed four forms of the 2000 Word Test, and 464 completed four forms of the…
Descriptors: College Students, Construct Validity, Content Validity, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
Saida, Chisato; Hattori, Tamaki – Language Testing, 2008
Despite growing concerns about declining scholastic abilities of Japanese students throughout Japan prior to the implementation of the revised Courses of Study in 2002, little empirical evidence was available at that time to support this perceived decline in academic performance. This research describes post-hoc IRT equating of previously…
Descriptors: Language Tests, Measures (Individuals), Foreign Countries, Item Response Theory
Peer reviewed Peer reviewed
Spolsky, Bernard – Language Testing, 1995
Discusses attempts made in the United States and elsewhere to develop prognostic tests that would justify decisions to exclude unqualified students from high school foreign-language classes. After the Second World War, U.S. government language programs supported research in the assessment of language aptitude to improve selection techniques. (36…
Descriptors: Armed Forces, High School Students, Language Aptitude, Language Research
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Y-W. – Language Testing, 2004
The purpose of the study reported in this article is to empirically examine passage-related local item dependence (LID) by using an IRT (item response theory) based LID index called Q3 in an EFL reading comprehension test, with a special focus on item types as a potentially competing source of LID with passages. In this article, definitions and…
Descriptors: Psychometrics, Item Response Theory, Content Analysis, Reading Comprehension