Showing all 6 results
Peer reviewed
McNamara, Tim; Knoch, Ute – Language Testing, 2012
This paper examines the uptake of Rasch measurement in language testing through a consideration of research published in language testing research journals in the period 1984 to 2009. Following the publication of the first papers on this topic, exploring the potential of the simple Rasch model for the analysis of dichotomous language test data, a…
Descriptors: Language Tests, Testing, English (Second Language), Item Response Theory
Peer reviewed
Barkaoui, Khaled – Language Assessment Quarterly, 2013
This article critiques traditional single-level statistical approaches (e.g., multiple regression analysis) to examining relationships between language test scores and variables in the assessment setting. It highlights the conceptual, methodological, and statistical problems associated with these techniques in dealing with multilevel or nested…
Descriptors: Hierarchical Linear Modeling, Statistical Analysis, Multiple Regression Analysis, Generalizability Theory
Peer reviewed
Kunnan, Antony John – Language Testing, 2010
This paper presents the author's response to Xiaoming Xi's article titled "How do we go about investigating test fairness?" In this response, the author focuses on test fairness and Toulmin's model of argument structure, Xi's proposal, and the challenges the proposal brings. Xi proposes an approach to investigating test fairness to guide…
Descriptors: Persuasive Discourse, Inferences, Test Bias, Models
Quinlan, Thomas; Higgins, Derrick; Wolff, Susanne – Educational Testing Service, 2009
This report evaluates the construct coverage of the e-rater[R] scoring engine. The matter of construct coverage depends on whether one defines writing skill in terms of process or product. Originally, the e-rater engine consisted of a large set of components with a proven ability to predict human holistic scores. By organizing these capabilities…
Descriptors: Guides, Writing Skills, Factor Analysis, Writing Tests
Peer reviewed
Wall, Dianne; Horak, Tania – Assessment in Education: Principles, Policy & Practice, 2007
The purpose of this article is to discuss the role of "baseline studies" in investigations of test impact and to illustrate the type of thinking underlying the design and implementation of such studies by reference to a recent study relating to a high-stakes test of English language proficiency. Baseline studies are used to describe an…
Descriptors: Second Language Learning, Language Proficiency, Language Tests, English (Second Language)
Peer reviewed
Cumming, Alister; Kantor, Robert; Baba, Kyoko; Erdosy, Usman; Eouanzoui, Keanre; James, Mark – Assessing Writing, 2005
We assessed whether and how the discourse written for prototype integrated tasks (involving writing in response to print or audio source texts) field tested for Next Generation TOEFL[R] differs from the discourse written for independent essays (i.e., the TOEFL Essay[R]). We selected 216 compositions written for six tasks by 36 examinees in a field…
Descriptors: Grammar, Field Tests, English (Second Language), Pragmatics