Showing all 11 results
Peer reviewed
Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024
This article focuses on rater severity and consistency and their relation to major changes in the rating system in a high-stakes testing context. The study is based on longitudinal data collected from 2009 to 2019 from the second language (L2) Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. We investigated…
Descriptors: Foreign Countries, Interrater Reliability, Evaluators, Item Response Theory
Peer reviewed
Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025
This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…
Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests
Peer reviewed
Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025
This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…
Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction
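The sentence-level scoring approach this entry describes (aggregating correctly reformulated gaps within each sentence into a polytomous sentence score) can be sketched as follows; the function name, data layout, and sample values are illustrative assumptions, not taken from the study itself.

```python
# Minimal sketch of sentence-level C-Test scoring: dichotomous gap
# outcomes (1 = gap correctly reformulated, 0 = not) are grouped by
# sentence, and each sentence's score is the count of correct gaps,
# yielding one polytomous item per sentence.

def sentence_scores(gap_results, gaps_per_sentence):
    """gap_results: list of 0/1 gap outcomes in text order.
    gaps_per_sentence: number of gaps in each successive sentence."""
    scores, i = [], 0
    for n in gaps_per_sentence:
        scores.append(sum(gap_results[i:i + n]))
        i += n
    return scores

# Example: a passage whose three sentences contain 4, 3, and 5 gaps.
print(sentence_scores([1, 1, 0, 1, 0, 1, 1, 1, 1, 1, 0, 1],
                      [4, 3, 5]))  # → [3, 2, 4]
```

Each resulting sentence score could then be entered into an IRT analysis as a polytomous item, rather than analyzing individual gaps or whole passages.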
Peer reviewed
Shin, Jinnie; Gierl, Mark J. – Language Testing, 2021
Automated essay scoring (AES) has emerged as a secondary or as a sole marker for many high-stakes educational assessments, in native and non-native testing, owing to remarkable advances in feature engineering using natural language processing, machine learning, and deep-neural algorithms. The purpose of this study is to compare the effectiveness…
Descriptors: Scoring, Essays, Writing Evaluation, Computer Software
Peer reviewed
Maria Treadaway; John Read – Language Testing, 2024
Standard-setting is an essential component of test development, supporting the meaningfulness and appropriate interpretation of test scores. However, in the high-stakes testing environment of aviation, standard-setting studies are underexplored. To address this gap, we document two stages in the standard-setting procedures for the Overseas Flight…
Descriptors: Standard Setting, Diagnostic Tests, High Stakes Tests, English for Special Purposes
Peer reviewed
Lamprianou, Iasonas; Tsagari, Dina; Kyriakou, Nansia – Language Testing, 2021
This longitudinal study (2002-2014) investigates the stability of rating characteristics of a large group of raters over time in the context of the writing paper of a national high-stakes examination. The study uses one measure of rater severity and two measures of rater consistency. The results suggest that the rating characteristics of…
Descriptors: Longitudinal Studies, Evaluators, High Stakes Tests, Writing Evaluation
Peer reviewed
Attali, Yigal; Lewis, Will; Steier, Michael – Language Testing, 2013
Automated essay scoring can produce reliable scores that are highly correlated with human scores, but is limited in its evaluation of content and other higher-order aspects of writing. The increased use of automated essay scoring in high-stakes testing underscores the need for human scoring that is focused on higher-order aspects of writing. This…
Descriptors: Scoring, Essay Tests, Reliability, High Stakes Tests
Peer reviewed
Coombe, Christine; Davidson, Peter – Language Testing, 2014
The Common Educational Proficiency Assessment (CEPA) is a large-scale, high-stakes, English language proficiency/placement test administered in the United Arab Emirates to Emirati nationals in their final year of secondary education or Grade 12. The purpose of the CEPA is to place students into English classes at the appropriate government…
Descriptors: Language Tests, High Stakes Tests, English (Second Language), Second Language Learning
Peer reviewed
Lee, HyeSun; Winke, Paula – Language Testing, 2013
We adapted three practice College Scholastic Ability Tests (CSAT) of English listening, each with five-option items, to create four- and three-option versions by asking 73 Korean speakers or learners of English to eliminate the least plausible options in two rounds. Two hundred and sixty-four Korean high school English-language learners formed…
Descriptors: Academic Ability, Stakeholders, Reliability, Listening Comprehension Tests
Peer reviewed
Malone, Margaret E. – Language Testing, 2010
This article presents a review of the Canadian Academic English Language (CAEL) Assessment, a high-stakes standardized test of English. It is a topic-based test that integrates listening, reading, writing, and speaking. The test is designed to describe the level of English language proficiency of test takers planning to study at…
Descriptors: Test Reliability, Language Tests, Standardized Tests, Test Validity
Peer reviewed
Van Moere, Alistair – Language Testing, 2006
This article investigates a group oral test administered at a university in Japan to determine whether its scores are appropriate for higher-stakes decision making. It is one component of an in-house English proficiency test used for placing students, evaluating their progress, and making informed decisions for the development of the English…
Descriptors: Foreign Countries, Generalizability Theory, Achievement Tests, English (Second Language)