Benchmarking Offline Reinforcement Learning for Emotion-Adaptive Social Robotics
By: Soon Jynn Chu, Raju Gottumukkala, Alan Barhorst
Potential Business Impact:
Teaches robots to understand feelings from old data.
The ability of social robots to respond to human emotions is crucial for building trust and acceptance in human-robot collaborative environments. However, developing such capabilities through online reinforcement learning is sometimes impractical due to the prohibitive cost of data collection and the risk of generating unsafe behaviors. In this paper, we study offline reinforcement learning (RL) as a practical and efficient alternative that uses pre-collected data to enable emotion-adaptive social robots. We present a system architecture that integrates multimodal sensing and recognition, decision-making, and adaptive responses. Using a limited dataset from a human-robot game-playing scenario, we establish a benchmark for comparing offline RL algorithms that do not require an online environment. Our results show that BCQ and CQL are more robust to data sparsity, achieving higher state-action values than NFQ, DQN, and DDQN. Our findings provide empirical insight into the performance of offline RL algorithms in data-constrained human-robot interaction (HRI). This work establishes a foundation for benchmarking offline RL in emotion-adaptive robotics and informs its future deployment in real-world HRI applications, such as conversational agents, educational partners, and personal assistants, that require reliable emotional responsiveness.
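To illustrate the kind of conservatism that separates CQL from plain DQN-style methods when data are sparse, here is a minimal sketch of the discrete-action CQL regularizer (log-sum-exp of the Q-values minus the Q-value of the logged action). It is not the authors' implementation: the function name cql_penalty, the array shapes, and the toy numbers are assumptions made for this example only.

```python
import numpy as np

def cql_penalty(q_values, actions):
    """Mean over the batch of logsumexp_a Q(s, a) - Q(s, a_logged).

    q_values: array of shape (batch, n_actions) with Q estimates (hypothetical).
    actions:  array of shape (batch,) with the actions stored in the dataset.
    """
    q_values = np.asarray(q_values, dtype=float)
    actions = np.asarray(actions, dtype=int)
    # Numerically stable log-sum-exp over the action axis.
    q_max = q_values.max(axis=1, keepdims=True)
    lse = q_max[:, 0] + np.log(np.exp(q_values - q_max).sum(axis=1))
    # Q-value of the action that was actually logged for each state.
    q_logged = q_values[np.arange(len(actions)), actions]
    return float((lse - q_logged).mean())

# Toy batch: one state, two actions, Q strongly favours action 0.
q = [[10.0, 0.0]]
in_data = cql_penalty(q, [0])      # logged action is the high-Q one
out_of_data = cql_penalty(q, [1])  # high Q sits on an action not in the data
```

In this sketch the penalty is small when the highest Q-value belongs to the logged action and large when it belongs to an action absent from the data. In CQL this term is added, with a weighting coefficient, to the usual temporal-difference loss, which pushes down value estimates for actions the dataset does not support.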
Similar Papers
Behavior-Adaptive Q-Learning: A Unifying Framework for Offline-to-Online RL
Machine Learning (CS)
Helps robots learn safely from past mistakes.
From Imitation to Optimization: A Comparative Study of Offline Learning for Autonomous Driving
Machine Learning (CS)
Teaches self-driving cars to avoid crashes.
Investigating Adaptive Tuning of Assistive Exoskeletons Using Offline Reinforcement Learning: Challenges and Insights
Robotics
Helps robot arms move better with less setup.