DiffAD: A Unified Diffusion Modeling Approach for Autonomous Driving
By: Tao Wang , Cong Zhang , Xingguang Qu and more
Potential Business Impact:
Cars drive themselves by creating driving pictures.
End-to-end autonomous driving (E2E-AD) has rapidly emerged as a promising approach toward achieving full autonomy. However, existing E2E-AD systems typically adopt a traditional multi-task framework, addressing perception, prediction, and planning tasks through separate task-specific heads. Despite being trained in a fully differentiable manner, they still encounter issues with task coordination, and the system complexity remains high. In this work, we introduce DiffAD, a novel diffusion probabilistic model that redefines autonomous driving as a conditional image generation task. By rasterizing heterogeneous targets onto a unified bird's-eye view (BEV) and modeling their latent distribution, DiffAD unifies various driving objectives and jointly optimizes all driving tasks in a single framework, significantly reducing system complexity and harmonizing task coordination. The reverse process iteratively refines the generated BEV image, resulting in more robust and realistic driving behaviors. Closed-loop evaluations in Carla demonstrate the superiority of the proposed method, achieving a new state-of-the-art Success Rate and Driving Score.
Similar Papers
HiP-AD: Hierarchical and Multi-Granularity Planning with Deformable Attention for Autonomous Driving in a Single Decoder
Robotics
Makes self-driving cars better at driving themselves.
Diffusion Models for Safety Validation of Autonomous Driving Systems
Robotics
Creates realistic car mistakes for testing.
SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration
Robotics
Teaches self-driving cars to drive safer.