ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	22

Descriptor

Correlation	22
Language Tests	20
Second Language Learning	19
Scoring	17
English (Second Language)	13
Language Proficiency	8
Oral Language	8
Computer Assisted Testing	7
Evaluators	7
Scores	7
Scoring Rubrics	7
Grammar	6
Chinese	4
Interrater Reliability	4
Statistical Analysis	4
Writing Evaluation	4
Classification	3
Comparative Analysis	3
Foreign Countries	3
Foreign Students	3
Indo European Languages	3
Language Fluency	3
Language Skills	3
Native Speakers	3
Second Language Instruction	3
More ▼

Source

Language Testing

Publication Type

Journal Articles	22
Reports - Research	17
Reports - Evaluative	5
Information Analyses	1
Tests/Questionnaires	1

Education Level

Higher Education	5
Postsecondary Education	3
Elementary Education	2
Adult Education	1
Early Childhood Education	1
Kindergarten	1
Primary Education	1

Audience

Location

China	2
Hong Kong	1
Japan	1
United Kingdom	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	7
Graduate Record Examinations	1
Peabody Picture Vocabulary…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 22 results Save | Export

Comparing Holistic and Analytic Marking Methods in Assessing Speech Act Production in L2 Chinese

Peer reviewed

Direct link

Li, Shuai; Wen, Ting; Li, Xian; Feng, Yali; Lin, Chuan – Language Testing, 2023

This study compared holistic and analytic marking methods for their effects on parameter estimation (of examinees, raters, and items) and rater cognition in assessing speech act production in L2 Chinese. Seventy American learners of Chinese completed an oral Discourse Completion Test assessing requests and refusals. Four first-language (L1)…

Descriptors: Speech Acts, Second Language Learning, Second Language Instruction, Chinese

Responding to a TOEFL iBT Integrated Speaking Task: Mapping Task Demands and Test Takers' Use of Stimulus Content

Peer reviewed

Direct link

Frost, Kellie; Clothier, Josh; Huisman, Annemiek; Wigglesworth, Gillian – Language Testing, 2020

Integrated speaking tasks requiring test takers to read and/or listen to stimulus texts and to incorporate their content into oral performances are now used in large-scale, high-stakes tests, including the TOEFL iBT. These tasks require test takers to identify, select, and combine relevant source text information to recognize key relationships…

Descriptors: Discourse Analysis, Scoring Rubrics, Speech Communication, English (Second Language)

Corpus Linguistics and Language Testing: Navigating Uncharted Waters

Peer reviewed

Direct link

Egbert, Jesse – Language Testing, 2017

The use of corpora and corpus linguistic methods in language testing research is increasing at an accelerated pace. The growing body of language testing research that uses corpus linguistic data is a testament to their utility in test development and validation. Although there are many reasons to be optimistic about the future of using corpus data…

Descriptors: Language Tests, Second Language Learning, Computational Linguistics, Best Practices

Development and Validation of a Chinese Character Acquisition Assessment for Second-Language Kindergarteners

Peer reviewed

Direct link

Chan, Stephanie W. Y.; Cheung, Wai Ming; Huang, Yanli; Lam, Wai-Ip; Lin, Chin-Hsi – Language Testing, 2020

Demand for second-language (L2) Chinese education for kindergarteners has grown rapidly, but little is known about these kindergarteners' L2 skills, with existing studies focusing on school-age populations and alphabetic languages. Accordingly, we developed a six-subtest Chinese character acquisition assessment to measure L2 kindergarteners'…

Descriptors: Chinese, Second Language Learning, Second Language Instruction, Written Language

Scoring with the Computer: Alternative Procedures for Improving the Reliability of Holistic Essay Scoring

Peer reviewed

Direct link

Attali, Yigal; Lewis, Will; Steier, Michael – Language Testing, 2013

Automated essay scoring can produce reliable scores that are highly correlated with human scores, but is limited in its evaluation of content and other higher-order aspects of writing. The increased use of automated essay scoring in high-stakes testing underscores the need for human scoring that is focused on higher-order aspects of writing. This…

Descriptors: Scoring, Essay Tests, Reliability, High Stakes Tests

Comparability of Students' Writing Performance on TOEFL iBT and in Required University Writing Courses

Peer reviewed

Direct link

Llosa, Lorena; Malone, Margaret E. – Language Testing, 2019

Investigating the comparability of students' performance on TOEFL writing tasks and actual academic writing tasks is essential to provide backing for the extrapolation inference in the TOEFL validity argument (Chapelle, Enright, & Jamieson, 2008). This study compared 103 international non-native-English-speaking undergraduate students'…

Descriptors: Computer Assisted Testing, Language Tests, English (Second Language), Second Language Learning

The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

Peer reviewed

Direct link

Davis, Larry – Language Testing, 2016

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…

Descriptors: Evaluators, Oral Language, Scores, Language Tests

Task and Rater Effects in L2 Speaking and Writing: A Synthesis of Generalizability Studies

Peer reviewed

Direct link

In'nami, Yo; Koizumi, Rie – Language Testing, 2016

We addressed Deville and Chalhoub-Deville's (2006), Schoonen's (2012), and Xi and Mollaun's (2006) call for research into the contextual features that are considered related to person-by-task interactions in the framework of generalizability theory in two ways. First, we quantitatively synthesized the generalizability studies to determine the…

Descriptors: Evaluators, Second Language Learning, Writing Skills, Oral Language

What Accounts for Integrated Reading-to-Write Task Scores?

Peer reviewed

Direct link

Shin, Sun-Young; Ewert, Doreen – Language Testing, 2015

Reading-to-write (RTW) tasks are becoming increasingly popular and have already been used in several high-stakes English proficiency exams, either replacing or complementing a prompt-based essay test. However, it is still not clear that what accounts for successful or unsuccessful performance on an integrated reading-writing task is owing to the…

Descriptors: English (Second Language), Language Tests, Language Proficiency, Test Items

Confidence Scoring of Speaking Performance: How Does Fuzziness become Exact?

Peer reviewed

Direct link

Jin, Tan; Mak, Barley; Zhou, Pei – Language Testing, 2012

The fuzziness of assessing second language speaking performance raises two difficulties in scoring speaking performance: "indistinction between adjacent levels" and "overlap between scales". To address these two problems, this article proposes a new approach, "confidence scoring", to deal with such fuzziness, leading to "confidence" scores between…

Descriptors: Speech Communication, Scoring, Test Interpretation, Second Language Learning

Rating Written Performance: What Do Raters Do and Why?

Peer reviewed

Direct link

Kuiken, Folkert; Vedder, Ineke – Language Testing, 2014

This study investigates the relationship in L2 writing between raters' judgments of communicative adequacy and linguistic complexity by means of six-point Likert scales, and general measures of linguistic performance. The participants were 39 learners of Italian and 32 of Dutch, who wrote two short argumentative essays. The same writing tasks…

Descriptors: Writing Evaluation, Second Language Learning, Evaluators, Native Language

Conceptual and Empirical Relationships between Temporal Measures of Fluency and Oral English Proficiency with Implications for Automated Scoring

Peer reviewed

Direct link

Ginther, April; Dimova, Slobodanka; Yang, Rui – Language Testing, 2010

Information provided by examination of the skills that underlie holistic scores can be used not only as supporting evidence for the validity of inferences associated with performance tests but also as a way to improve the scoring rubrics, descriptors, and benchmarks associated with scoring scales. As fluency is considered a critical, perhaps…

Descriptors: Performance Tests, Scoring Rubrics, Measures (Individuals), Scoring

Hebrew Language Assessment Measure for Preschool Children: A Comparison between Typically Developing Children and Children with Specific Language Impairment

Peer reviewed

Direct link

Katzenberger, Irit; Meilijson, Sara – Language Testing, 2014

The Katzenberger Hebrew Language Assessment for Preschool Children (henceforth: the KHLA) is the first comprehensive, standardized language assessment tool developed in Hebrew specifically for older preschoolers (4;0-5;11 years). The KHLA is a norm-referenced, Hebrew specific assessment, based on well-established psycholinguistic principles, as…

Descriptors: Semitic Languages, Preschool Children, Language Impairments, Language Tests

A Comparison of Two Scoring Methods for an Automated Speech Scoring System

Peer reviewed

Direct link

Xi, Xiaoming; Higgins, Derrick; Zechner, Klaus; Williamson, David – Language Testing, 2012

This paper compares two alternative scoring methods--multiple regression and classification trees--for an automated speech scoring system used in a practice environment. The two methods were evaluated on two criteria: construct representation and empirical performance in predicting human scores. The empirical performance of the two scoring models…

Descriptors: Scoring, Classification, Weighted Scores, Comparative Analysis

TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

Peer reviewed

Direct link

Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela – Language Testing, 2012

Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…

Descriptors: Undergraduate Students, Speech Communication, Rating Scales, Scoring

Previous Page | Next Page »

Pages: 1 | 2

Xi, Xiaoming	2
Attali, Yigal	1
Bachman, Lyle F.	1
Bae, Jungok	1
Barkaoui, Khaled	1
Bernstein, Jared	1
Bridgeman, Brent	1
Chan, Stephanie W. Y.	1
Chapelle, Carol A.	1
Cheng, Jian	1
Cheung, Wai Ming	1
Chung, Yoo-Ree	1
Clothier, Josh	1
Crossley, Scott A.	1
Davis, Larry	1
Dimova, Slobodanka	1
Egbert, Jesse	1
Ewert, Doreen	1
Feng, Yali	1
Frost, Kellie	1
Garras, John	1
Ginther, April	1
Hegelheimer, Volker	1
Higgins, Derrick	1
Huang, Yanli	1
More ▼