ERIC Number: ED624059
Record Type: Non-Journal
Publication Date: 2022
Pages: 13
Abstractor: As Provided
ISBN: N/A
ISSN: N/A
EISSN: N/A
Available Date: N/A
Building a Reinforcement Learning Environment from Limited Data to Optimize Teachable Robot Interventions
Maidment, Tristan; Yu, Mingzhi; Lobczowski, Nikki; Kovashka, Adriana; Walker, Erin; Litman, Diane; Nokes-Malach, Timothy
International Educational Data Mining Society, Paper presented at the International Conference on Educational Data Mining (EDM) (15th, Durham, United Kingdom, Jul 24-27, 2022)
Working collaboratively in groups can positively impact performance and student engagement. Intelligent social agents can provide a source of personalized support for students, and their benefits likely extend to collaborative settings, but it is difficult to determine how these agents should interact with students. Reinforcement learning (RL) offers an opportunity for adapting the interactions between the social agent and the students to better support collaboration and learning. However, using RL in education with social agents typically involves training using real students. In this work, we train an RL agent in a high-quality simulated environment to learn how to improve students' collaboration. Data was collected during a pilot study with dyads of students who worked together to tutor an intelligent teachable robot. We explore the process of building an environment from the data, training a policy, and the impact of the policy on different students, compared to various baselines. [For the full proceedings, see ED623995.]
Descriptors: Robotics, Cooperative Learning, Artificial Intelligence, Training, Reinforcement, Undergraduate Students, Student Attitudes, Simulation
International Educational Data Mining Society. e-mail: admin@educationaldatamining.org; Web site: https://educationaldatamining.org/conferences/
Publication Type: Speeches/Meeting Papers; Reports - Research
Education Level: Higher Education; Postsecondary Education
Audience: N/A
Language: English
Sponsor: National Science Foundation (NSF)
Authoring Institution: N/A
Grant or Contract Numbers: 2024645
Author Affiliations: N/A