ERIC - Search Results

Publication Date

In 2025	2
Since 2024	2
Since 2021 (last 5 years)	9
Since 2016 (last 10 years)	23
Since 2006 (last 20 years)	30

Source

Language Testing

Publication Type

Journal Articles	32
Reports - Research	29
Reports - Evaluative	3
Tests/Questionnaires	1

Education Level

Higher Education	11
Postsecondary Education	7
Secondary Education	7
Elementary Education	3
Adult Education	1
High Schools	1

Audience

Location

Japan	3
China	2
Germany	2
Turkey	2
United Kingdom	2
Austria	1
Belgium	1
Canada	1
Europe	1
France	1
Iran	1
Israel	1
Netherlands	1
Russia	1
Slovenia	1
Sweden	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	6
International English…	2
ACTFL Oral Proficiency…	1
Edinburgh Handedness Inventory	1
Michigan Test of English…	1
Peabody Picture Vocabulary…	1
Test of English for…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 32 results Save | Export

Comparative Judgement for Evaluating Young Learners' EFL Writing Performances: Reliability and Teacher Perceptions of Holistic and Dimension-Based Judgements

Peer reviewed

Direct link

Rebecca Sickinger; Tineke Brunfaut; John Pill – Language Testing, 2025

Comparative Judgement (CJ) is an evaluation method, typically conducted online, whereby a rank order is constructed, and scores calculated, from judges' pairwise comparisons of performances. CJ has been researched in various educational contexts, though only rarely in English as a Foreign Language (EFL) writing settings, and is generally agreed to…

Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction

Test Review: High-Stakes English Language Proficiency Tests--Enquiry, Resit, and Retake Policies

Peer reviewed

Direct link

Pearson, William S. – Language Testing, 2023

Many candidates undertaking high-stakes English language proficiency tests for academic enrolment do not achieve the results they need for reasons including linguistic unreadiness, test unpreparedness, illness, an unfavourable configuration of tasks, or administrative and marking errors. Owing to the importance of meeting goals or out of a belief…

Descriptors: High Stakes Tests, English (Second Language), Language Proficiency, Language Tests

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Direct link

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

More Efficient Processes for Creating Automated Essay Scoring Frameworks: A Demonstration of Two Algorithms

Peer reviewed

Direct link

Shin, Jinnie; Gierl, Mark J. – Language Testing, 2021

Automated essay scoring (AES) has emerged as a secondary or as a sole marker for many high-stakes educational assessments, in native and non-native testing, owing to remarkable advances in feature engineering using natural language processing, machine learning, and deep-neural algorithms. The purpose of this study is to compare the effectiveness…

Descriptors: Scoring, Essays, Writing Evaluation, Computer Software

Linking Scores from Two Written Receptive English Academic Vocabulary Tests--The VLT-Ac and the AVT

Peer reviewed

Direct link

Warnby, Marcus; Malmström, Hans; Hansen, Kajsa Yang – Language Testing, 2023

The academic section of the Vocabulary Levels Test (VLT-Ac) and the Academic Vocabulary Test (AVT) both assess meaning-recognition knowledge of written receptive academic vocabulary, deemed central for engagement in academic activities. Depending on the purpose and context of the testing, either of the tests can be appropriate, but for research…

Descriptors: Foreign Countries, Scores, Written Language, Receptive Language

Monitoring the Performance of Human and Automated Scores for Spoken Responses

Peer reviewed

Direct link

Wang, Zhen; Zechner, Klaus; Sun, Yu – Language Testing, 2018

As automated scoring systems for spoken responses are increasingly used in language assessments, testing organizations need to analyze their performance, as compared to human raters, across several dimensions, for example, on individual items or based on subgroups of test takers. In addition, there is a need in testing organizations to establish…

Descriptors: Automation, Scoring, Speech Tests, Language Tests

Gauging the Impact of Literacy and Educational Background on Receptive Vocabulary Test Scores

Peer reviewed

Direct link

Deygers, Bart; Vanbuel, Marieke – Language Testing, 2022

The Peabody Picture Vocabulary Test (PPVT) is a widely used test of receptive vocabulary, but no researchers to date have examined the performance of low-educated, low-literate L2 adults, or compared these individuals' performances to their more highly educated peers. In this study, we used many-facet Rasch analysis and mixed-effects linear…

Descriptors: Literacy, Educational Background, Verbal Ability, Intelligence Tests

A Comparison of Holistic, Analytic, and Part Marking Models in Speaking Assessment

Peer reviewed

Direct link

Khabbazbashi, Nahal; Galaczi, Evelina D. – Language Testing, 2020

This mixed methods study examined holistic, analytic, and part marking models (MMs) in terms of their measurement properties and impact on candidate CEFR classifications in a semi-direct online speaking test. Speaking performances of 240 candidates were first marked holistically and by part (phase 1). On the basis of phase 1 findings--which…

Descriptors: Holistic Approach, Classification, Grading, Language Tests

Proficiency at the Lexis-Grammar Interface: Comparing Oral versus Written French Exam Tasks

Peer reviewed

Direct link

Vandeweerd, Nathan; Housen, Alex; Paquot, Magali – Language Testing, 2023

This study investigates whether re-thinking the separation of lexis and grammar in language testing could lead to more valid inferences about proficiency across modes. As argued by Römer, typical scoring rubrics ignore important information about proficiency encoded at the lexis-grammar interface, in particular how the co-selection of lexical and…

Descriptors: French, Language Tests, Grammar, Second Language Learning

Measuring the Development of General Language Skills in English as a Foreign Language--Longitudinal Invariance of the C-Test

Peer reviewed

Direct link

Schnoor, Birger; Hartig, Johannes; Klinger, Thorsten; Naumann, Alexander; Usanova, Irina – Language Testing, 2023

Research on assessing English as a foreign language (EFL) development has been growing recently. However, empirical evidence from longitudinal analyses based on substantial samples is still needed. In such settings, tests for measuring language development must meet high standards of test quality such as validity, reliability, and objectivity, as…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Longitudinal Studies

What Can Gaze Behaviors, Neuroimaging Data, and Test Scores Tell Us about Test Method Effects and Cognitive Load in Listening Assessments?

Peer reviewed

Direct link

Aryadoust, Vahid; Foo, Stacy; Ng, Li Ying – Language Testing, 2022

The aim of this study was to investigate how test methods affect listening test takers' performance and cognitive load. Test methods were defined and operationalized as while-listening performance (WLP) and post-listening performance (PLP) formats. To achieve the goal of the study, we examined test takers' (N = 80) brain activity patterns…

Descriptors: Listening Comprehension Tests, Language Tests, Eye Movements, Brain Hemisphere Functions

ACTFL Oral Proficiency Interview -- Computer (OPIc)

Peer reviewed

Direct link

Isbell, Dan; Winke, Paula – Language Testing, 2019

The American Council on the Teaching of Foreign Languages (ACTFL) oral proficiency interview -- computer (OPIc) testing system represents an ambitious effort in language assessment: Assessing oral proficiency in over a dozen languages, on the same scale, from virtually anywhere at any time. Especially for users in contexts where multiple foreign…

Descriptors: Oral Language, Language Tests, Language Proficiency, Second Language Learning

Mapping the Fluctuating Effect of Strategy Use Ability on English Reading Performance for Nursing Students: A Multi-Layered Moderation Analysis Approach

Peer reviewed

Direct link

Cai, Yuyang; Kunnan, Antony John – Language Testing, 2020

An essential hypothesis of modern language assessment theory pertains to the interaction between strategy use ability (strategic competence) and second language knowledge. However, how they interact with each other is rarely explored. Drawing on relevant research in the literature, in this paper we proposed three interaction patterns (i.e.,…

Descriptors: English (Second Language), Second Language Learning, Nursing Education, Reading Tests

An Analysis of "TOEFL® Primary™" Repeaters: How Much Score Change Occurs?

Peer reviewed

Direct link

Cho, Yeonsuk; Blood, Ian A. – Language Testing, 2020

In this study, we examined how much change in "TOEFL® Primary™" listening and reading scores can be expected in relation to the time interval between test administrations. The test records of 5213 young learners of English (aged 8-13 years) in Japan and Turkey who repeated the tests were analyzed to examine test scores as a function of…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

Do Experience and Text Quality Matter for Raters' Decision-Making Behaviors?

Peer reviewed

Direct link

Sahan, Özgür; Razi, Salim – Language Testing, 2020

This study examines the decision-making behaviors of raters with varying levels of experience while assessing EFL essays of distinct qualities. The data were collected from 28 raters with varying levels of rating experience and working at the English language departments of different universities in Turkey. Using a 10-point analytic rubric, each…

Descriptors: Decision Making, Essays, Writing Evaluation, Evaluators

Previous Page | Next Page »

Pages: 1 | 2 | 3

Comparative Analysis	32
Scores	32
Language Tests	27
English (Second Language)	26
Second Language Learning	26
Foreign Countries	18
Language Proficiency	9
Second Language Instruction	9
Scoring	8
Computer Assisted Testing	6
Correlation	6
Evaluators	6
Oral Language	6
Secondary School Students	6
Statistical Analysis	6
Test Validity	6
Testing	6
College Students	5
Essays	5
High Stakes Tests	5
Native Language	5
Writing Evaluation	5
Bilingualism	4
Foreign Students	4
Rating Scales	4
More ▼

Hartig, Johannes	2
Kunnan, Antony John	2
Winke, Paula	2
Alvarez, Marta E.	1
Aryadoust, Vahid	1
Attali, Yigal	1
Blood, Ian A.	1
Brooks, Lindsay	1
Cai, Yuyang	1
Cho, Yeonsuk	1
Crossley, Scott	1
Deygers, Bart	1
Elgort, Irina	1
Esmat Babaii	1
Farshad Effatpanah	1
Foo, Stacy	1
Galaczi, Evelina D.	1
Garras, John	1
Gierl, Mark J.	1
Hansen, Kajsa Yang	1
Harsch, Claudia	1
Hopp, Holger	1
Housen, Alex	1
Isbell, Dan	1
John Pill	1
More ▼