Nightmare Dreamer: Dreaming About Unsafe States And Planning Ahead
By: Oluwatosin Oseni , Shengjie Wang , Jun Zhu and more
Potential Business Impact:
Robots learn to do tasks safely and fast.
Reinforcement Learning (RL) has shown remarkable success in real-world applications, particularly in robotics control. However, RL adoption remains limited due to insufficient safety guarantees. We introduce Nightmare Dreamer, a model-based Safe RL algorithm that addresses safety concerns by leveraging a learned world model to predict potential safety violations and plan actions accordingly. Nightmare Dreamer achieves nearly zero safety violations while maximizing rewards. Nightmare Dreamer outperforms model-free baselines on Safety Gymnasium tasks using only image observations, achieving nearly a 20x improvement in efficiency.
Similar Papers
DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models
Computation and Language
Makes AI safer by understanding pictures and words.
medDreamer: Model-Based Reinforcement Learning with Latent Imagination on Complex EHRs for Clinical Decision Support
Machine Learning (CS)
Helps doctors pick best treatments for sick people.
Dreaming Falcon: Physics-Informed Model-Based Reinforcement Learning for Quadcopters
Robotics
Teaches drones to fly safely in wind.