ERIC - Search Results

Publication Date

In 2026	0
Since 2025	1
Since 2022 (last 5 years)	4
Since 2017 (last 10 years)	6
Since 2007 (last 20 years)	12

Source

Language Testing

Publication Type

Journal Articles	12
Reports - Research	8
Reports - Descriptive	2
Reports - Evaluative	2
Tests/Questionnaires	1

Education Level

Higher Education	4
Postsecondary Education	2
Secondary Education	2
Elementary Education	1
Grade 5	1
Intermediate Grades	1

Audience

Location

Iran	1
Japan	1
Saudi Arabia	1
Sweden	1

Laws, Policies, & Programs

Assessments and Surveys

Early Childhood Longitudinal…	1
International English…	1
Test of English as a Foreign…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

More Efficient Processes for Creating Automated Essay Scoring Frameworks: A Demonstration of Two Algorithms

Peer reviewed

Direct link

Shin, Jinnie; Gierl, Mark J. – Language Testing, 2021

Automated essay scoring (AES) has emerged as a secondary or as a sole marker for many high-stakes educational assessments, in native and non-native testing, owing to remarkable advances in feature engineering using natural language processing, machine learning, and deep-neural algorithms. The purpose of this study is to compare the effectiveness…

Descriptors: Scoring, Essays, Writing Evaluation, Computer Software

Exploring Potential Biases in GPT-4o's Ratings of English Language Learners' Essays

Peer reviewed

Direct link

Taichi Yamashita – Language Testing, 2025

With the rapid development of generative artificial intelligence (AI) frameworks (e.g., the generative pre-trained transformer [GPT]), a growing number of researchers have started to explore its potential as an automated essay scoring (AES) system. While previous studies have investigated the alignment between human ratings and GPT ratings, few…

Descriptors: Artificial Intelligence, English (Second Language), Second Language Learning, Second Language Instruction

A Comprehensive Review of Rasch Measurement in Language Assessment: Recommendations and Guidelines for Research

Peer reviewed

Direct link

Aryadoust, Vahid; Ng, Li Ying; Sayama, Hiroki – Language Testing, 2021

Over the past decades, the application of Rasch measurement in language assessment has gradually increased. In the present study, we coded 215 papers using Rasch measurement published in 21 applied linguistics journals for multiple features. We found that seven Rasch models and 23 software packages were adopted in these papers, with many-facet…

Descriptors: Language Tests, Testing, Test Items, Network Analysis

Evaluating the Impact of Nonverbal Behavior on Language Ability Ratings

Peer reviewed

Direct link

J. Dylan Burton – Language Testing, 2024

Nonverbal behavior can impact language proficiency scores in speaking tests, but there is little empirical information of the size or consistency of its effects or whether language proficiency may be a moderating variable. In this study, 100 novice raters watched and scored 30 recordings of test takers taking an international, high stakes…

Descriptors: Nonverbal Ability, Language Fluency, Second Language Learning, Language Proficiency

Critical Language Assessment Literacy of EFL Teachers: Scale Construction and Validation

Peer reviewed

Direct link

Tajeddin, Zia; Khatib, Mohammad; Mahdavi, Mohsen – Language Testing, 2022

Critical language assessment (CLA) has been addressed in numerous studies. However, the majority of the studies have overlooked the need for a practical framework to measure the CLA dimension of teachers' language assessment literacy (LAL). This gap prompted us to develop and validate a critical language assessment literacy (CLAL) scale to further…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Language Tests

Assessing Rasch Measurement Estimation Methods across R Packages with Yes/No Vocabulary Test Data

Peer reviewed

Direct link

Nicklin, Christopher; Vitta, Joseph P. – Language Testing, 2022

Instrument measurement conducted with Rasch analysis is a common process in language assessment research. A recent systematic review of 215 studies involving Rasch analysis in language testing and applied linguistics research reported that 23 different software packages had been utilized. However, none of the analyses were conducted with one of…

Descriptors: Programming Languages, Vocabulary Development, Language Tests, Computer Software

SLA Developmental Stages and Teachers' Assessment of Written French: Exploring Direkt Profil as a Diagnostic Assessment Tool

Peer reviewed

Direct link

Granfeldt, Jonas; Ågren, Malin – Language Testing, 2014

One core area of research in Second Language Acquisition is the identification and definition of developmental stages in different L2s. For L2 French, Bartning and Schlyter (2004) presented a model of six morphosyntactic stages of development in the shape of grammatical profiles. The model formed the basis for the computer program Direkt Profil…

Descriptors: Second Language Learning, Language Tests, French, Language Teachers

Principles of Quantile Regression and an Application

Peer reviewed

Direct link

Chen, Fang; Chalhoub-Deville, Micheline – Language Testing, 2014

Newer statistical procedures are typically introduced to help address the limitations of those already in practice or to deal with emerging research needs. Quantile regression (QR) is introduced in this paper as a relatively new methodology, which is intended to overcome some of the limitations of least squares mean regression (LMR). QR is more…

Descriptors: Regression (Statistics), Language Tests, Language Proficiency, Mathematics Achievement

TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

Peer reviewed

Direct link

Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela – Language Testing, 2012

Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…

Descriptors: Undergraduate Students, Speech Communication, Rating Scales, Scoring

Complementing Human Judgment of Essays Written by English Language Learners with E-Rater[R] Scoring

Peer reviewed

Direct link

Enright, Mary K.; Quinlan, Thomas – Language Testing, 2010

E-rater[R] is an automated essay scoring system that uses natural language processing techniques to extract features from essays and to model statistically human holistic ratings. Educational Testing Service has investigated the use of e-rater, in conjunction with human ratings, to score one of the two writing tasks on the TOEFL-iBT[R] writing…

Descriptors: Second Language Learning, Scoring, Essays, Language Processing

EduSpeak[R]: A Speech Recognition and Pronunciation Scoring Toolkit for Computer-Aided Language Learning Applications

Peer reviewed

Direct link

Franco, Horacio; Bratt, Harry; Rossier, Romain; Rao Gadde, Venkata; Shriberg, Elizabeth; Abrash, Victor; Precoda, Kristin – Language Testing, 2010

SRI International's EduSpeak[R] system is a software development toolkit that enables developers of interactive language education software to use state-of-the-art speech recognition and pronunciation scoring technology. Automatic pronunciation scoring allows the computer to provide feedback on the overall quality of pronunciation and to point to…

Descriptors: Feedback (Response), Sentences, Oral Language, Predictor Variables

The Utility of Article and Preposition Error Correction Systems for English Language Learners: Feedback and Assessment

Peer reviewed

Direct link

Chodorow, Martin; Gamon, Michael; Tetreault, Joel – Language Testing, 2010

In this paper, we describe and evaluate two state-of-the-art systems for identifying and correcting writing errors involving English articles and prepositions. Criterion[superscript SM], developed by Educational Testing Service, and "ESL Assistant", developed by Microsoft Research, both use machine learning techniques to build models of article…

Descriptors: Grammar, Feedback (Response), Form Classes (Languages), Second Language Learning

Computer Software	12
Second Language Learning	9
Language Tests	8
English (Second Language)	7
Second Language Instruction	6
Evaluators	5
Scoring	5
Essays	4
Grammar	4
Language Proficiency	4
Comparative Analysis	3
Foreign Countries	3
Scores	3
Writing Evaluation	3
Artificial Intelligence	2
Computer Assisted Instruction	2
Computer Assisted Testing	2
Correlation	2
Culture Fair Tests	2
Error Correction	2
Feedback (Response)	2
Goodness of Fit	2
High Stakes Tests	2
Item Analysis	2
Language Teachers	2
More ▼

Abrash, Victor	1
Aryadoust, Vahid	1
Bratt, Harry	1
Bridgeman, Brent	1
Chalhoub-Deville, Micheline	1
Chen, Fang	1
Chodorow, Martin	1
Enright, Mary K.	1
Franco, Horacio	1
Gamon, Michael	1
Gierl, Mark J.	1
Granfeldt, Jonas	1
J. Dylan Burton	1
Khatib, Mohammad	1
Mahdavi, Mohsen	1
Mollaun, Pamela	1
Ng, Li Ying	1
Nicklin, Christopher	1
Powers, Donald	1
Precoda, Kristin	1
Quinlan, Thomas	1
Rao Gadde, Venkata	1
Rossier, Romain	1
Sayama, Hiroki	1
Shin, Jinnie	1
More ▼