V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions
By: Mumuksh Tayal , Manan Tayal , Aditya Singh and more
Potential Business Impact:
Teaches robots to be safe without watching them.
Ensuring safety in autonomous systems requires controllers that satisfy hard, state-wise constraints without relying on online interaction. While existing Safe Offline RL methods typically enforce soft expected-cost constraints, they do not guarantee forward invariance. Conversely, Control Barrier Functions (CBFs) provide rigorous safety guarantees but usually depend on expert-designed barrier functions or full knowledge of the system dynamics. We introduce Value-Guided Offline Control Barrier Functions (V-OCBF), a framework that learns a neural CBF entirely from offline demonstrations. Unlike prior approaches, V-OCBF does not assume access to the dynamics model; instead, it derives a recursive finite-difference barrier update, enabling model-free learning of a barrier that propagates safety information over time. Moreover, V-OCBF incorporates an expectile-based objective that avoids querying the barrier on out-of-distribution actions and restricts updates to the dataset-supported action set. The learned barrier is then used with a Quadratic Program (QP) formulation to synthesize real-time safe control. Across multiple case studies, V-OCBF yields substantially fewer safety violations than baseline methods while maintaining strong task performance, highlighting its scalability for offline synthesis of safety-critical controllers without online interaction or hand-engineered barriers.
Similar Papers
Online Learning-Enhanced High Order Adaptive Safety Control
Robotics
Keeps drones safe from crashes, even in wind.
Learning Conservative Neural Control Barrier Functions from Offline Data
Machine Learning (CS)
Keeps robots safe by learning from past mistakes.
CBF-RL: Safety Filtering Reinforcement Learning in Training with Control Barrier Functions
Robotics
Teaches robots to be safe while learning.