NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 7 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Osama Koraishi – Language Teaching Research Quarterly, 2024
This study conducts a comprehensive quantitative evaluation of OpenAI's language model, ChatGPT 4, for grading Task 2 writing of the IELTS exam. The objective is to assess the alignment between ChatGPT's grading and that of official human raters. The analysis encompassed a multifaceted approach, including a comparison of means and reliability…
Descriptors: Second Language Learning, English (Second Language), Language Tests, Artificial Intelligence
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Qiao Wang; Ralph L. Rose; Ayaka Sugawara; Naho Orita – Vocabulary Learning and Instruction, 2025
VocQGen is an automated tool designed to generate multiple-choice cloze (MCC) questions for vocabulary assessment in second language learning contexts. It leverages several natural language processing (NLP) tools and OpenAI's GPT-4 model to produce MCC items quickly from user-specified word lists. To evaluate its effectiveness, we used the first…
Descriptors: Vocabulary Skills, Artificial Intelligence, Computer Software, Multiple Choice Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kim, Byungsoo; Yu, Hangyeol; Shin, Dongmin; Choi, Youngduck – International Educational Data Mining Society, 2021
The needs for precisely estimating a student's academic performance have been emphasized with an increasing amount of attention paid to Intelligent Tutoring System (ITS). However, since labels for academic performance, such as test scores, are collected from outside of ITS, obtaining the labels is costly, leading to label-scarcity problem which…
Descriptors: Academic Achievement, Intelligent Tutoring Systems, Prediction, Scores
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Chukharev-Hudilainen, Evgeny; Ockey, Gary J. – ETS Research Report Series, 2021
This paper describes the development and evaluation of Interaction Competence Elicitor (ICE), a spoken dialog system (SDS) for the delivery of a paired oral discussion task in the context of language assessment. The purpose of ICE is to sustain a topic-specific conversation with a test taker in order to elicit discourse that can be later judged to…
Descriptors: Intercultural Communication, Oral Language, Communicative Competence (Languages), Error Analysis (Language)
Peer reviewed Peer reviewed
Direct linkDirect link
Sinclair, Jeanne; Jang, Eunice Eunhee; Rudzicz, Frank – Journal of Educational Psychology, 2021
Advances in machine learning (ML) are poised to contribute to our understanding of the linguistic processes associated with successful reading comprehension, which is a critical aspect of children's educational success. We used ML techniques to investigate and compare associations between children's reading comprehension and 260 linguistic…
Descriptors: Prediction, Reading Comprehension, Natural Language Processing, Speech Communication
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Zesch, Torsten; Horbach, Andrea; Melanie Goggin, Melanie; Wrede-Jackes, Jennifer – Research-publishing.net, 2018
We present a tool for the creation and curation of C-tests. C-tests are an established tool in language proficiency testing and language learning. They require examinees to complete a text in which the second half of every second word is replaced by a gap. We support teachers and test designers in creating such tests through a web-based system…
Descriptors: Language Tests, Language Proficiency, Second Language Learning, Second Language Instruction
Peer reviewed Peer reviewed
Direct linkDirect link
Enright, Mary K.; Quinlan, Thomas – Language Testing, 2010
E-rater[R] is an automated essay scoring system that uses natural language processing techniques to extract features from essays and to model statistically human holistic ratings. Educational Testing Service has investigated the use of e-rater, in conjunction with human ratings, to score one of the two writing tasks on the TOEFL-iBT[R] writing…
Descriptors: Second Language Learning, Scoring, Essays, Language Processing