ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	5

Descriptor

Evaluation Research	6
English (Second Language)	5
Language Tests	5
Essay Tests	3
Discourse Analysis	2
Item Response Theory	2
Second Language Learning	2
Statistical Analysis	2
Writing Tests	2
Automation	1
Change	1
Classification	1
Computer Assisted Testing	1
Construct Validity	1
Differences	1
Educational Assessment	1
Educational Testing	1
Essays	1
Factor Analysis	1
Field Tests	1
Foreign Countries	1
Generalizability Theory	1
Grammar	1
Guides	1
Hierarchical Linear Modeling	1

Source

Language Testing	2
Assessing Writing	1
Assessment in Education:…	1
Educational Testing Service	1
Language Assessment Quarterly	1

Author

Baba, Kyoko	1
Barkaoui, Khaled	1
Cumming, Alister	1
Eouanzoui, Keanre	1
Erdosy, Usman	1
Higgins, Derrick	1
Horak, Tania	1
James, Mark	1
Kantor, Robert	1
Knoch, Ute	1
Kunnan, Antony John	1
McNamara, Tim	1
Quinlan, Thomas	1
Wall, Dianne	1
Wolff, Susanne	1

Publication Type

Journal Articles	5
Reports - Evaluative	4
Opinion Papers	1
Reports - Research	1

Education Level

Higher Education	2
Postsecondary Education	2
Elementary Secondary Education	1

Location

Australia	1
Netherlands	1
United Kingdom	1
United States	1

Assessments and Surveys

Test of English as a Foreign…	6
Graduate Record Examinations	1
International English…	1
Test of English for…	1

Showing all 6 results

The Rasch Wars: The Emergence of Rasch Measurement in Language Testing

Peer reviewed

McNamara, Tim; Knoch, Ute – Language Testing, 2012

This paper examines the uptake of Rasch measurement in language testing through a consideration of research published in language testing research journals in the period 1984 to 2009. Following the publication of the first papers on this topic, exploring the potential of the simple Rasch model for the analysis of dichotomous language test data, a…

Descriptors: Language Tests, Testing, English (Second Language), Item Response Theory

Using Multilevel Modeling in Language Assessment Research: A Conceptual Introduction

Peer reviewed

Barkaoui, Khaled – Language Assessment Quarterly, 2013

This article critiques traditional single-level statistical approaches (e.g., multiple regression analysis) to examining relationships between language test scores and variables in the assessment setting. It highlights the conceptual, methodological, and statistical problems associated with these techniques in dealing with multilevel or nested…

Descriptors: Hierarchical Linear Modeling, Statistical Analysis, Multiple Regression Analysis, Generalizability Theory

Test Fairness and Toulmin's Argument Structure

Peer reviewed

Kunnan, Antony John – Language Testing, 2010

This paper presents the author's response to Xiaoming Xi's article titled "How do we go about investigating test fairness?" In this response, the author focuses on test fairness and Toulmin's model of argument structure, Xi's proposal, and the challenges the proposal brings. Xi proposes an approach to investigating test fairness to guide…

Descriptors: Persuasive Discourse, Inferences, Test Bias, Models

Evaluating the Construct-Coverage of the e-rater[R] Scoring Engine. Research Report. ETS RR-09-01

Quinlan, Thomas; Higgins, Derrick; Wolff, Susanne – Educational Testing Service, 2009

This report evaluates the construct coverage of the e-rater[R] scoring engine. The matter of construct coverage depends on whether one defines writing skill in terms of process or product. Originally, the e-rater engine consisted of a large set of components with a proven ability to predict human holistic scores. By organizing these capabilities…

Descriptors: Guides, Writing Skills, Factor Analysis, Writing Tests

Using Baseline Studies in the Investigation of Test Impact

Peer reviewed

Wall, Dianne; Horak, Tania – Assessment in Education: Principles, Policy & Practice, 2007

The purpose of this article is to discuss the role of "baseline studies" in investigations of test impact and to illustrate the type of thinking underlying the design and implementation of such studies by reference to a recent study relating to a high-stakes test of English language proficiency. Baseline studies are used to describe an…

Descriptors: Second Language Learning, Language Proficiency, Language Tests, English (Second Language)

Differences in Written Discourse in Independent and Integrated Prototype Tasks for Next Generation TOEFL

Peer reviewed

Cumming, Alister; Kantor, Robert; Baba, Kyoko; Erdosy, Usman; Eouanzoui, Keanre; James, Mark – Assessing Writing, 2005

We assessed whether and how the discourse written for prototype integrated tasks (involving writing in response to print or audio source texts) field tested for Next Generation TOEFL[R] differs from the discourse written for independent essays (i.e., the TOEFL Essay[R]). We selected 216 compositions written for six tasks by 36 examinees in a field…

Descriptors: Grammar, Field Tests, English (Second Language), Pragmatics