Barrier Function Overrides For Non-Convex Fixed Wing Flight Control and Self-Driving Cars
By: Eric Squires, Phillip Odom, Zsolt Kira
Potential Business Impact:
Keeps robots safe while learning new tasks.
Reinforcement Learning (RL) has enabled vast performance improvements for robotics systems. To achieve these results, however, the agent often must randomly explore the environment, which presents a significant challenge for safety-critical systems. Barrier functions can address this challenge by enabling an override that approximates the RL control input as closely as possible without violating a safety constraint. Unfortunately, this override can be computationally intractable when the dynamics are not convex in the control input or when time is discrete, as is often the case when training RL systems. We therefore consider these cases, developing novel barrier functions in discrete time for two non-convex systems: fixed wing aircraft, and self-driving cars performing lane merging with adaptive cruise control. Although solving for an online and optimal override is in general intractable when the dynamics are non-convex in the control input, we investigate approximate solutions and find that they enable performance commensurate with baseline RL methods with zero safety violations. In particular, even without attempting to solve for the optimal override at all, performance is still competitive with baseline RL performance. We discuss the tradeoffs of the approximate override solutions, including performance and computational tractability.
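To make the override idea concrete, here is a minimal Python sketch on a toy problem. It is not the paper's method, nor its aircraft or driving models. It assumes a one-dimensional cart approaching a wall, a discrete-time barrier defined by rolling out full braking, and a coarse grid search as the approximate override. All names and constants (`A_MAX`, `DT`, `P_MAX`, `step`, `barrier`, `override`, `simulate`) are hypothetical and chosen only for illustration.

```python
A_MAX = 1.0   # assumed control bound: |u| <= A_MAX
DT = 0.1      # assumed discrete time step
P_MAX = 10.0  # assumed wall position; safety means p <= P_MAX

def step(x, u):
    """Toy discrete-time dynamics: x = (position, velocity)."""
    p, v = x
    return (p + v * DT, v + u * DT)

def barrier(x):
    """h(x) >= 0 iff full braking from x stops before the wall."""
    while x[1] > 0:
        x = step(x, -A_MAX)
    return P_MAX - x[0]

def override(x, u_rl, search=True, n_candidates=21):
    """Return a control close to u_rl whose next state satisfies h >= 0.

    With search=False, no attempt is made to find the closest safe
    control; the known-safe fallback (full braking) is used instead.
    """
    u_rl = max(-A_MAX, min(A_MAX, u_rl))
    if barrier(step(x, u_rl)) >= 0:
        return u_rl  # the RL action is already safe, so keep it
    if search:
        # Approximate the closest safe control with a coarse grid.
        grid = [-A_MAX + 2 * A_MAX * i / (n_candidates - 1)
                for i in range(n_candidates)]
        safe = [u for u in grid if barrier(step(x, u)) >= 0]
        if safe:
            return min(safe, key=lambda u: abs(u - u_rl))
    return -A_MAX  # fallback: full braking keeps h from decreasing

def simulate(steps=300, u_rl=A_MAX, search=True):
    """Drive toward the wall with a constant 'RL' action, overridden."""
    x = (0.0, 0.0)
    traj = [x]
    for _ in range(steps):
        x = step(x, override(x, u_rl, search))
        traj.append(x)
    return traj
```

Calling `simulate(search=True)` and `simulate(search=False)` compares the grid-search override with the fallback-only variant. In this toy setting both keep the cart behind the wall, and the grid search trades extra barrier evaluations per step for controls that stay closer to the requested action.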
Similar Papers
A Review On Safe Reinforcement Learning Using Lyapunov and Barrier Functions
Systems and Control
Keeps smart robots from making dangerous mistakes.
Control Barrier Function for Unknown Systems: An Approximation-free Approach
Systems and Control
Keeps robots safe from moving obstacles.