ERIC Number: EJ1465742
Record Type: Journal
Publication Date: 2025-Apr
Pages: 27
Abstractor: As Provided
ISBN: N/A
ISSN: ISSN-1360-2357
EISSN: EISSN-1573-7608
Available Date: 2024-09-17
Efficient Topic Identification for Urgent MOOC Forum Posts Using BERTopic and Traditional Topic Modeling Techniques
Nabila Khodeir1; Fatma Elghannam1
Education and Information Technologies, v30 n5 p5501-5527 2025
MOOC platforms provide a means of communication through forums, allowing learners to express their difficulties and challenges while studying various courses. Within these forums, some posts require urgent attention from instructors. Failing to respond promptly to these posts can contribute to higher dropout rates and lower course completion rates. While existing research primarily focuses on identifying urgent posts through various classification techniques, it has not adequately addressed the underlying reasons behind them. This research aims to delve into these reasons and assess the extent to which they vary. By understanding the root causes of urgency, instructors can effectively address these issues and provide appropriate support and solutions. BERTopic utilizes the advanced language capabilities of transformer models and represents an advanced approach in topic modeling. In this study, a comparison was conducted to evaluate the performance of BERTopic in topic modeling on MOOCs discussion forums, alongside traditional topic models such as LDA, LSI, and NMF. The experimental results revealed that the NMF and BERTopic models outperformed the other models. Specifically, the NMF model demonstrated superior performance when a lower number of topics was required, whereas the BERTopic model excelled in generating topics with higher coherence when a larger number of topics was needed. The results considering all urgent posts from the dataset were as follows: Optimal number of topics is 6 for NMF and 50 for BERTopic; coherence scores is 0.66 for NMF and 0.616 for BERTopic; and IRBO scores is 1 for both models. This highlights the BERTopic model capability to distinguish and extract diverse topics comprehensively and coherently, aiding in the identification of various reasons behind MOOC Forum posts.
Descriptors: MOOCs, Computer Mediated Communication, Conferences (Gatherings), Models, Educational Technology, Electronic Learning, Artificial Intelligence
Springer. Available from: Springer Nature. One New York Plaza, Suite 4600, New York, NY 10004. Tel: 800-777-4643; Tel: 212-460-1500; Fax: 212-460-1700; e-mail: customerservice@springernature.com; Web site: https://link.springer.com/
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A
Author Affiliations: 1Electronics Research Institute, Elbahth Elemy, Informatics Research Department, Cairo, Egypt