ERIC - Search Results

Publication Date

In 2025	2
Since 2024	2
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	18
Since 2006 (last 20 years)	25

Descriptor

Foreign Countries	26
Scoring	23
Language Tests	19
Second Language Learning	19
English (Second Language)	15
Comparative Analysis	11
Scores	10
Second Language Instruction	10
Item Response Theory	9
Oral Language	6
Chinese	5
Grammar	5
Secondary School Students	5
Statistical Analysis	5
Evaluators	4
Interrater Reliability	4
Item Analysis	4
Language Fluency	4
Language Proficiency	4
Language Teachers	4
Rating Scales	4
Reading Comprehension	4
Test Items	4
Testing	4
Validity	4
More ▼

Source

Language Testing

Publication Type

Journal Articles	26
Reports - Research	21
Reports - Evaluative	3
Reports - Descriptive	2
Tests/Questionnaires	2

Education Level

Higher Education	9
Postsecondary Education	6
Secondary Education	5
Elementary Education	4
Early Childhood Education	1
Grade 6	1
Intermediate Grades	1
Kindergarten	1
Primary Education	1

Audience

Location

China	5
Japan	5
Iran	3
Netherlands	3
United Kingdom	3
Brazil	2
Colombia	2
France	2
Germany	2
Poland	2
United States	2
Austria	1
Burma	1
Cambodia	1
Cameroon	1
Canada	1
China (Shanghai)	1
Hong Kong	1
Indonesia	1
Kenya	1
New Zealand	1
Singapore	1
South Korea	1
Sweden	1
Switzerland	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Peabody Picture Vocabulary…	1
Test of English as a Foreign…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 26 results Save | Export

Comparative Judgement for Evaluating Young Learners' EFL Writing Performances: Reliability and Teacher Perceptions of Holistic and Dimension-Based Judgements

Peer reviewed

Direct link

Rebecca Sickinger; Tineke Brunfaut; John Pill – Language Testing, 2025

Comparative Judgement (CJ) is an evaluation method, typically conducted online, whereby a rank order is constructed, and scores calculated, from judges' pairwise comparisons of performances. CJ has been researched in various educational contexts, though only rarely in English as a Foreign Language (EFL) writing settings, and is generally agreed to…

Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Direct link

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

Assessing the Speaking Proficiency of L2 Chinese Learners: Review of the Hanyu Shuiping Kouyu Kaoshi

Peer reviewed

Direct link

Li, Albert W. – Language Testing, 2023

The Hanyu Shuiping Kaoshi (HSK) is a multi-level, multi-purpose Chinese proficiency test developed by the Center for Language Education and Cooperation (previously the Office of Chinese Language Council International and, henceforth, referred to by its colloquial name "Hanban"). It assesses reading, writing, and listening skills of…

Descriptors: Language Tests, Language Proficiency, Chinese, Second Language Learning

Linking Scores from Two Written Receptive English Academic Vocabulary Tests--The VLT-Ac and the AVT

Peer reviewed

Direct link

Warnby, Marcus; Malmström, Hans; Hansen, Kajsa Yang – Language Testing, 2023

The academic section of the Vocabulary Levels Test (VLT-Ac) and the Academic Vocabulary Test (AVT) both assess meaning-recognition knowledge of written receptive academic vocabulary, deemed central for engagement in academic activities. Depending on the purpose and context of the testing, either of the tests can be appropriate, but for research…

Descriptors: Foreign Countries, Scores, Written Language, Receptive Language

Do Experience and Text Quality Matter for Raters' Decision-Making Behaviors?

Peer reviewed

Direct link

Sahan, Özgür; Razi, Salim – Language Testing, 2020

This study examines the decision-making behaviors of raters with varying levels of experience while assessing EFL essays of distinct qualities. The data were collected from 28 raters with varying levels of rating experience and working at the English language departments of different universities in Turkey. Using a 10-point analytic rubric, each…

Descriptors: Decision Making, Essays, Writing Evaluation, Evaluators

Comparing Holistic and Analytic Marking Methods in Assessing Speech Act Production in L2 Chinese

Peer reviewed

Direct link

Li, Shuai; Wen, Ting; Li, Xian; Feng, Yali; Lin, Chuan – Language Testing, 2023

This study compared holistic and analytic marking methods for their effects on parameter estimation (of examinees, raters, and items) and rater cognition in assessing speech act production in L2 Chinese. Seventy American learners of Chinese completed an oral Discourse Completion Test assessing requests and refusals. Four first-language (L1)…

Descriptors: Speech Acts, Second Language Learning, Second Language Instruction, Chinese

Validity Evidence for a Sentence Repetition Test of Swiss German Sign Language

Peer reviewed

Direct link

Haug, Tobias; Batty, Aaron Olaf; Venetz, Martin; Notter, Christa; Girard-Groeber, Simone; Knoch, Ute; Audeoud, Mireille – Language Testing, 2020

In this study we seek evidence of validity according to the socio-cognitive framework (Weir, 2005) for a new sentence repetition test (SRT) for young Deaf L1 Swiss German Sign Language (DSGS) users. SRTs have been developed for various purposes for both spoken and sign languages to assess language development in children. In order to address the…

Descriptors: Foreign Countries, Language Tests, Sentences, Repetition

Cloze Testing for Comprehension Assessment: The HyTeC-cloze

Peer reviewed

Direct link

Kleijn, Suzanne; Pander Maat, Henk; Sanders, Ted – Language Testing, 2019

Although there are many methods available for assessing text comprehension, the cloze test is not widely acknowledged as one of them. Critiques on cloze testing center on its supposedly limited ability to measure comprehension beyond the sentence. However, these critiques do not hold for all types of cloze tests; the particular configuration of a…

Descriptors: Cloze Procedure, Language Tests, Semantics, Scoring

Proficiency at the Lexis-Grammar Interface: Comparing Oral versus Written French Exam Tasks

Peer reviewed

Direct link

Vandeweerd, Nathan; Housen, Alex; Paquot, Magali – Language Testing, 2023

This study investigates whether re-thinking the separation of lexis and grammar in language testing could lead to more valid inferences about proficiency across modes. As argued by Römer, typical scoring rubrics ignore important information about proficiency encoded at the lexis-grammar interface, in particular how the co-selection of lexical and…

Descriptors: French, Language Tests, Grammar, Second Language Learning

A Comparative Judgment Approach to Assessing Chinese Sign Language Interpreting

Peer reviewed

Direct link

Han, Chao; Xiao, Xiaoyan – Language Testing, 2022

The quality of sign language interpreting (SLI) is a gripping construct among practitioners, educators and researchers, calling for reliable and valid assessment. There has been a diverse array of methods in the extant literature to measure SLI quality, ranging from traditional error analysis to recent rubric scoring. In this study, we want to…

Descriptors: Comparative Analysis, Sign Language, Deaf Interpreting, Evaluators

Screener Tests Need Validation Too: Weighing an Argument for Test Use against Practical Concerns

Peer reviewed

Direct link

Schmidgall, Jonathan E.; Getman, Edward P.; Zu, Jiyun – Language Testing, 2018

In this study, we define the term "screener test," elaborate key considerations in test design, and describe how to incorporate the concepts of practicality and argument-based validation to drive an evaluation of screener tests for language assessment. A screener test is defined as a brief assessment designed to identify an examinee as a…

Descriptors: Test Validity, Test Use, Test Construction, Language Tests

The Test of English for International Communication (TOEIC®)

Peer reviewed

Direct link

Im, Gwan-Hyeok; Cheng, Liying – Language Testing, 2019

The primary purpose of the Test of English for International Communication (TOEIC®) is to measure the everyday English skills of individuals, who speak a first language other than English, working in an international environment (ETS, 2015a, 2016a; Powers & Powers, 2015). The TOEIC also has six secondary purposes: (1) to verify the current…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Language Proficiency

Development and Validation of a Chinese Character Acquisition Assessment for Second-Language Kindergarteners

Peer reviewed

Direct link

Chan, Stephanie W. Y.; Cheung, Wai Ming; Huang, Yanli; Lam, Wai-Ip; Lin, Chin-Hsi – Language Testing, 2020

Demand for second-language (L2) Chinese education for kindergarteners has grown rapidly, but little is known about these kindergarteners' L2 skills, with existing studies focusing on school-age populations and alphabetic languages. Accordingly, we developed a six-subtest Chinese character acquisition assessment to measure L2 kindergarteners'…

Descriptors: Chinese, Second Language Learning, Second Language Instruction, Written Language

Setting Cut Scores on an EFL Placement Test Using the Prototype Group Method: A Receiver Operating Characteristic (ROC) Analysis

Peer reviewed

Direct link

Eckes, Thomas – Language Testing, 2017

This paper presents an approach to standard setting that combines the prototype group method (PGM; Eckes, 2012) with a receiver operating characteristic (ROC) analysis. The combined PGM-ROC approach is applied to setting cut scores on a placement test of English as a foreign language (EFL). To implement the PGM, experts first named learners whom…

Descriptors: English (Second Language), Language Tests, Cutting Scores, Standard Setting (Scoring)

Measuring the Impact of Rater Negotiation in Writing Performance Assessment

Peer reviewed

Direct link

Trace, Jonathan; Janssen, Gerriet; Meier, Valerie – Language Testing, 2017

Previous research in second language writing has shown that when scoring performance assessments even trained raters can exhibit significant differences in severity. When raters disagree, using discussion to try to reach a consensus is one popular form of score resolution, particularly in contexts with limited resources, as it does not require…

Descriptors: Performance Based Assessment, Second Language Learning, Scoring, Evaluators

Previous Page | Next Page »

Pages: 1 | 2

Schmitt, Norbert	2
Audeoud, Mireille	1
Babaii, Esmat	1
Batty, Aaron Olaf	1
Campfield, Dorota E.	1
Chan, Stephanie W. Y.	1
Cheng, Liying	1
Cheung, Wai Ming	1
Deygers, Bart	1
Eckes, Thomas	1
Esmat Babaii	1
Farshad Effatpanah	1
Feng, Yali	1
Garras, John	1
Getman, Edward P.	1
Girard-Groeber, Simone	1
Han, Chao	1
Hansen, Kajsa Yang	1
Harsch, Claudia	1
Hartig, Johannes	1
Haug, Tobias	1
Housen, Alex	1
Hsieh, Mingchuan	1
Huang, Yanli	1
More ▼