Score: 1

Sample from What You See: Visuomotor Policy Learning via Diffusion Bridge with Observation-Embedded Stochastic Differential Equation

Published: December 8, 2025 | arXiv ID: 2512.07212v1

By: Zhaoyang Liu , Mokai Pan , Zhongyi Wang and more

Potential Business Impact:

Robots learn to move better by watching and copying.

Business Areas:

Autonomous Vehicles Transportation

Imitation learning with diffusion models has advanced robotic control by capturing multi-modal action distributions. However, existing approaches typically treat observations as high-level conditioning inputs to the denoising network, rather than integrating them into the stochastic dynamics of the diffusion process itself. As a result, sampling must begin from random Gaussian noise, weakening the coupling between perception and control and often yielding suboptimal performance. We introduce BridgePolicy, a generative visuomotor policy that explicitly embeds observations within the stochastic differential equation via a diffusion-bridge formulation. By constructing an observation-informed trajectory, BridgePolicy enables sampling to start from a rich, informative prior rather than random noise, substantially improving precision and reliability in control. A key challenge is that classical diffusion bridges connect distributions with matched dimensionality, whereas robotic observations are heterogeneous and multi-modal and do not naturally align with the action space. To address this, we design a multi-modal fusion module and a semantic aligner that unify visual and state inputs and align observation and action representations, making the bridge applicable to heterogeneous robot data. Extensive experiments across 52 simulation tasks on three benchmarks and five real-world tasks demonstrate that BridgePolicy consistently outperforms state-of-the-art generative policies.

Delay-Aware Diffusion Policy: Bridging the Observation-Execution Gap in Dynamic Tasks

Robotics

Makes robots react faster to changing worlds.

8 Dec 2025 0

89%

Hybrid-Diffusion Models: Combining Open-loop Routines with Visuomotor Diffusion Policies

Robotics

Robots learn to do tricky jobs faster and better.

4 Dec 2025 0

89%

Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models

Robotics

Robots learn to navigate better and faster.

14 Apr 2025 1

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Page Count

18 pages

Sample from What You See: Visuomotor Policy Learning via Diffusion Bridge with Observation-Embedded Stochastic Differential Equation

Robots learn to move better by watching and copying.

Technical Abstract

Delay-Aware Diffusion Policy: Bridging the Observation-Execution Gap in Dynamic Tasks

Hybrid-Diffusion Models: Combining Open-loop Routines with Visuomotor Diffusion Policies

Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models