Score: 0

Eq.Bot: Enhance Robotic Manipulation Learning via Group Equivariant Canonicalization

Published: November 19, 2025 | arXiv ID: 2511.15194v1

By: Jian Deng , Yuandong Wang , Yangfu Zhu and more

Potential Business Impact:

Robots learn to move objects more accurately.

Business Areas:
Robotics Hardware, Science and Engineering, Software

Robotic manipulation systems are increasingly deployed across diverse domains. Yet existing multi-modal learning frameworks lack inherent guarantees of geometric consistency, struggling to handle spatial transformations such as rotations and translations. While recent works attempt to introduce equivariance through bespoke architectural modifications, these methods suffer from high implementation complexity, computational cost, and poor portability. Inspired by human cognitive processes in spatial reasoning, we propose Eq.Bot, a universal canonicalization framework grounded in SE(2) group equivariant theory for robotic manipulation learning. Our framework transforms observations into a canonical space, applies an existing policy, and maps the resulting actions back to the original space. As a model-agnostic solution, Eq.Bot aims to endow models with spatial equivariance without requiring architectural modifications. Extensive experiments demonstrate the superiority of Eq.Bot under both CNN-based (e.g., CLIPort) and Transformer-based (e.g., OpenVLA-OFT) architectures over existing methods on various robotic manipulation tasks, where the most significant improvement can reach 50.0%.

Country of Origin
🇨🇳 China

Page Count
12 pages

Category
Computer Science:
Robotics