Score: 0

V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions

Published: December 11, 2025 | arXiv ID: 2512.10822v1

By: Mumuksh Tayal , Manan Tayal , Aditya Singh and more

Potential Business Impact:

Teaches robots to be safe without watching them.

Business Areas:
Autonomous Vehicles Transportation

Ensuring safety in autonomous systems requires controllers that satisfy hard, state-wise constraints without relying on online interaction. While existing Safe Offline RL methods typically enforce soft expected-cost constraints, they do not guarantee forward invariance. Conversely, Control Barrier Functions (CBFs) provide rigorous safety guarantees but usually depend on expert-designed barrier functions or full knowledge of the system dynamics. We introduce Value-Guided Offline Control Barrier Functions (V-OCBF), a framework that learns a neural CBF entirely from offline demonstrations. Unlike prior approaches, V-OCBF does not assume access to the dynamics model; instead, it derives a recursive finite-difference barrier update, enabling model-free learning of a barrier that propagates safety information over time. Moreover, V-OCBF incorporates an expectile-based objective that avoids querying the barrier on out-of-distribution actions and restricts updates to the dataset-supported action set. The learned barrier is then used with a Quadratic Program (QP) formulation to synthesize real-time safe control. Across multiple case studies, V-OCBF yields substantially fewer safety violations than baseline methods while maintaining strong task performance, highlighting its scalability for offline synthesis of safety-critical controllers without online interaction or hand-engineered barriers.

Country of Origin
🇮🇳 India

Page Count
23 pages

Category
Computer Science:
Artificial Intelligence