Showing all 13 results
Peer reviewed
Chan, Sathena; May, Lyn – Language Testing, 2023
Despite the increased use of integrated tasks in high-stakes academic writing assessment, research on rating criteria which reflect the unique construct of integrated summary writing skills is comparatively rare. Using a mixed-method approach of expert judgement, text analysis, and statistical analysis, this study examines writing features that…
Descriptors: Scoring, Writing Evaluation, Reading Tests, Listening Skills
Peer reviewed
Schmidgall, Jonathan E.; Getman, Edward P.; Zu, Jiyun – Language Testing, 2018
In this study, we define the term "screener test," elaborate key considerations in test design, and describe how to incorporate the concepts of practicality and argument-based validation to drive an evaluation of screener tests for language assessment. A screener test is defined as a brief assessment designed to identify an examinee as a…
Descriptors: Test Validity, Test Use, Test Construction, Language Tests
Peer reviewed
Frost, Kellie; Clothier, Josh; Huisman, Annemiek; Wigglesworth, Gillian – Language Testing, 2020
Integrated speaking tasks requiring test takers to read and/or listen to stimulus texts and to incorporate their content into oral performances are now used in large-scale, high-stakes tests, including the TOEFL iBT. These tasks require test takers to identify, select, and combine relevant source text information to recognize key relationships…
Descriptors: Discourse Analysis, Scoring Rubrics, Speech Communication, English (Second Language)
Peer reviewed
Llosa, Lorena; Malone, Margaret E. – Language Testing, 2019
Investigating the comparability of students' performance on TOEFL writing tasks and actual academic writing tasks is essential to provide backing for the extrapolation inference in the TOEFL validity argument (Chapelle, Enright, & Jamieson, 2008). This study compared 103 international non-native-English-speaking undergraduate students'…
Descriptors: Computer Assisted Testing, Language Tests, English (Second Language), Second Language Learning
Peer reviewed
Davis, Larry – Language Testing, 2016
This study investigated two factors thought to contribute to consistency in rater scoring judgments: rater training and scoring experience. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…
Descriptors: Evaluators, Oral Language, Scores, Language Tests
Peer reviewed
Römer, Ute – Language Testing, 2017
This paper aims to connect recent corpus research on phraseology with current language testing practice. It discusses how corpora and corpus-analytic techniques can illuminate central aspects of speech and help in conceptualizing the notion of lexicogrammar in second language speaking assessment. The description of speech and some of its core…
Descriptors: Language Tests, Grammar, English (Second Language), Second Language Learning
Peer reviewed
Ling, Guangming; Mollaun, Pamela; Xi, Xiaoming – Language Testing, 2014
The scoring of constructed responses may introduce construct-irrelevant factors to a test score and affect its validity and fairness. Fatigue is one of the factors that could negatively affect human performance in general, yet little is known about its effects on a human rater's scoring quality on constructed responses. In this study, we compared…
Descriptors: Evaluators, Fatigue (Biology), Scoring, Performance
Peer reviewed
Zhao, Cecilia Guanfang – Language Testing, 2013
Although a key concept in various writing textbooks, learning standards, and writing rubrics, voice remains a construct that is only loosely defined in the literature and impressionistically assessed in practice. Few attempts have been made to formally investigate whether and how the strength of an author's voice in written texts can be reliably…
Descriptors: Factor Analysis, Writing Instruction, Teaching Methods, Protocol Analysis
Peer reviewed
Xi, Xiaoming; Higgins, Derrick; Zechner, Klaus; Williamson, David – Language Testing, 2012
This paper compares two alternative scoring methods--multiple regression and classification trees--for an automated speech scoring system used in a practice environment. The two methods were evaluated on two criteria: construct representation and empirical performance in predicting human scores. The empirical performance of the two scoring models…
Descriptors: Scoring, Classification, Weighted Scores, Comparative Analysis
Peer reviewed
Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela – Language Testing, 2012
Scores assigned by trained raters and by an automated scoring system (SpeechRater™) on the speaking section of the TOEFL iBT™ were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…
Descriptors: Undergraduate Students, Speech Communication, Rating Scales, Scoring
Peer reviewed
Crossley, Scott A.; Salsbury, Tom; McNamara, Danielle S.; Jarvis, Scott – Language Testing, 2011
The authors present a model of lexical proficiency based on lexical indices related to vocabulary size, depth of lexical knowledge, and accessibility to core lexical items. The lexical indices used in this study come from the computational tool Coh-Metrix and include word length scores, lexical diversity values, word frequency counts, hypernymy…
Descriptors: Semantics, Familiarity, Second Language Learning, Word Frequency
Peer reviewed
Xi, Xiaoming – Language Testing, 2007
This study explores the utility of analytic scoring for TAST in providing reliable diagnostic information for operational use across three aspects of candidates' performance: delivery, language use, and topic development. One hundred and forty examinees' responses to six TAST tasks were scored analytically on these three aspects of speech. G…
Descriptors: Scoring, Profiles, Performance Based Assessment, Academic Discourse
Peer reviewed
Oltman, Phillip K.; Stricker, Lawrence J. – Language Testing, 1990
A recent multidimensional scaling analysis of Test of English as a Foreign Language (TOEFL) item response data identified clusters of items in the test sections that, being more homogeneous than their parent sections, might be better for diagnostic use. The analysis was repeated using different scoring techniques. Results diverged only for…
Descriptors: English (Second Language), Item Analysis, Language Tests, Scaling