Eq.Bot: Enhance Robotic Manipulation Learning via Group Equivariant Canonicalization
By: Jian Deng , Yuandong Wang , Yangfu Zhu and more
Potential Business Impact:
Robots learn to move objects more accurately.
Robotic manipulation systems are increasingly deployed across diverse domains. Yet existing multi-modal learning frameworks lack inherent guarantees of geometric consistency, struggling to handle spatial transformations such as rotations and translations. While recent works attempt to introduce equivariance through bespoke architectural modifications, these methods suffer from high implementation complexity, computational cost, and poor portability. Inspired by human cognitive processes in spatial reasoning, we propose Eq.Bot, a universal canonicalization framework grounded in SE(2) group equivariant theory for robotic manipulation learning. Our framework transforms observations into a canonical space, applies an existing policy, and maps the resulting actions back to the original space. As a model-agnostic solution, Eq.Bot aims to endow models with spatial equivariance without requiring architectural modifications. Extensive experiments demonstrate the superiority of Eq.Bot under both CNN-based (e.g., CLIPort) and Transformer-based (e.g., OpenVLA-OFT) architectures over existing methods on various robotic manipulation tasks, where the most significant improvement can reach 50.0%.
Similar Papers
Multi-Group Equivariant Augmentation for Reinforcement Learning in Robot Manipulation
Robotics
Teaches robots to learn tasks faster.
SE(3)-Equivariant Robot Learning and Control: A Tutorial Survey
Robotics
Robots learn faster by understanding shapes.
Local-Canonicalization Equivariant Graph Neural Networks for Sample-Efficient and Generalizable Swarm Robot Control
Robotics
Helps robot teams work together better, even when things change.