Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning
By: Yijun Liu, Yuwei Liu, Yuan Meng, and more
Potential Business Impact:
Robots learn to move objects by seeing.
Vision-centric hierarchical embodied models have demonstrated strong potential for long-horizon robotic control. However, existing methods lack spatial awareness, which limits their ability to bridge visual plans to actionable control in complex environments. To address this, we propose Spatial Policy (SP), a unified spatial-aware visuomotor robotic manipulation framework built on explicit spatial modeling and reasoning. Specifically, we first design a spatial-conditioned embodied video generation module that models spatially guided predictions through a spatial plan table. We then propose a spatial-based action prediction module that infers executable actions in coordination with the generated visual plan. Finally, we propose a spatial reasoning feedback policy that refines the spatial plan table via dual-stage replanning. Extensive experiments show that SP significantly outperforms state-of-the-art baselines, achieving a 33.0% average improvement over the best baseline and an 86.7% average success rate across 11 diverse tasks, substantially enhancing the practicality of embodied models for robotic control. Code and checkpoints are maintained at https://plantpotatoonmoon.github.io/SpatialPolicy/.
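The abstract describes a three-module pipeline: spatially conditioned video generation, spatial action prediction, and a feedback policy that replans. Below is a minimal, hypothetical Python sketch of how such a control loop could be wired together; every class, function, and parameter name here (SpatialPlanTable, spatial_policy_step, and so on) is an illustrative assumption, not the authors' actual API, which lives in their released code.

```python
# Hypothetical sketch of the Spatial Policy (SP) control loop described in
# the abstract. All names and signatures are illustrative assumptions.

from dataclasses import dataclass


@dataclass
class SpatialPlanTable:
    """Tabular spatial plan: one target waypoint per future step."""
    waypoints: list  # e.g., [(x, y, z), ...] in the robot's workspace frame

    def refine(self, feedback):
        """Dual-stage replanning: patch the waypoints the reasoner flagged."""
        for step, new_waypoint in feedback.items():
            self.waypoints[step] = new_waypoint


def spatial_policy_step(obs, plan, video_gen, action_head, reasoner):
    """One control step of the sketched SP loop.

    obs:         current visual observation
    plan:        SpatialPlanTable guiding the prediction modules
    video_gen:   spatial-conditioned embodied video generation module
    action_head: spatial-based action prediction module
    reasoner:    spatial reasoning feedback policy
    """
    # 1) Generate a spatially guided visual plan conditioned on the table.
    video_plan = video_gen(obs, plan.waypoints)

    # 2) Infer an executable action in coordination with the visual plan.
    action = action_head(obs, video_plan, plan.waypoints)

    # 3) Spatial reasoning feedback: replan any steps judged inconsistent.
    feedback = reasoner(obs, video_plan, plan.waypoints)
    if feedback:
        plan.refine(feedback)

    return action
```

The key design point this sketch tries to capture is that the spatial plan table is a shared, mutable artifact: both prediction modules read from it, while the feedback policy writes back to it, closing the loop between visual planning and executable control.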
Similar Papers
AutoSpatial: Visual-Language Reasoning for Social Robot Navigation through Efficient Spatial Reasoning Learning
Robotics
Helps robots understand where things are and move.
mindmap: Spatial Memory in Deep Feature Maps for 3D Action Policies
Robotics
Robot remembers where things are to do jobs.
Embodied Spatial Intelligence: from Implicit Scene Modeling to Spatial Reasoning
Robotics
Robots understand and follow spoken directions.