Stable Offline Hand-Eye Calibration for any Robot with Just One Mark
By: Sicheng Xie, Lingchen Meng, Zhiying Du and more
Potential Business Impact:
Helps robots learn to do tasks better.
Imitation learning has achieved remarkable success in a variety of robotic tasks by learning a mapping function from camera-space observations to robot-space actions. Recent work indicates that robot-to-camera transformation information (i.e., camera extrinsics) benefits the learning process and produces better results. However, camera extrinsics are often unavailable, and estimation methods usually suffer from local minima and poor generalization. In this paper, we present CalibAll, a simple yet effective method that requires only a single mark and performs training-free, stable, and accurate camera extrinsic estimation across diverse robots and datasets through a coarse-to-fine calibration pipeline. In particular, we annotate a single mark on an end-effector (EEF) and leverage the correspondence ability that emerges from vision foundation models (VFMs) to automatically localize the corresponding mark across robots in diverse datasets. Using this mark, together with point tracking and the 3D EEF trajectory, we obtain a coarse camera extrinsic estimate via temporal Perspective-n-Point (PnP). This estimate is further refined through a rendering-based optimization that aligns rendered and ground-truth masks, yielding accurate and stable camera extrinsics. Experimental results demonstrate that our method outperforms state-of-the-art approaches, showing strong robustness and general effectiveness across three robot platforms. It also produces useful auxiliary annotations such as depth maps, link-wise masks, and end-effector 2D trajectories, which can further support downstream tasks.
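To make the coarse stage more concrete, the sketch below illustrates how 2D-3D correspondences gathered over time can determine a camera extrinsic. It is not the paper's implementation: it swaps in a plain linear DLT solver, and it assumes known intrinsics `K`, noise-free pixel tracks of the mark, and 3D EEF positions expressed in the robot base frame. The function names `project` and `estimate_extrinsic_dlt` and the synthetic trajectory are hypothetical, chosen only for illustration.

```python
import numpy as np


def project(points_base, R, t, K):
    """Project 3D points (robot base frame) to pixels with a pinhole model."""
    cam = points_base @ R.T + t          # base frame -> camera frame
    uv = cam @ K.T                       # apply intrinsics
    return uv[:, :2] / uv[:, 2:3]        # perspective division


def estimate_extrinsic_dlt(points_base, pixels, K):
    """Estimate base-to-camera (R, t) from 2D-3D pairs with a linear DLT.

    Assumption: a stand-in for a PnP solver, not the paper's method.
    Needs at least 6 correspondences that are not all coplanar.
    """
    points_base = np.asarray(points_base, dtype=float)
    pixels = np.asarray(pixels, dtype=float)
    n = points_base.shape[0]
    if n < 6:
        raise ValueError("need at least 6 correspondences")

    # Remove the intrinsics so the unknown 3x4 matrix is proportional to [R | t].
    pix_h = np.hstack([pixels, np.ones((n, 1))])
    norm = pix_h @ np.linalg.inv(K).T
    X_h = np.hstack([points_base, np.ones((n, 1))])

    # Two linear equations per correspondence in the 12 entries of P.
    A = np.zeros((2 * n, 12))
    for i in range(n):
        x = norm[i, 0] / norm[i, 2]
        y = norm[i, 1] / norm[i, 2]
        A[2 * i, 0:4] = -X_h[i]
        A[2 * i, 8:12] = x * X_h[i]
        A[2 * i + 1, 4:8] = -X_h[i]
        A[2 * i + 1, 8:12] = y * X_h[i]

    # The solution is the right singular vector of the smallest singular value.
    _, _, Vt = np.linalg.svd(A)
    P = Vt[-1].reshape(3, 4)
    if np.linalg.det(P[:, :3]) < 0:      # resolve the global sign ambiguity
        P = -P

    # Project the left 3x3 block onto the nearest rotation and recover scale.
    U, S, Vt_r = np.linalg.svd(P[:, :3])
    R = U @ Vt_r
    t = P[:, 3] / S.mean()
    return R, t


# Synthetic check: a smooth, non-planar EEF trajectory seen by a known camera.
K = np.array([[600.0, 0.0, 320.0], [0.0, 600.0, 240.0], [0.0, 0.0, 1.0]])
a, b = 0.2, 0.3
R_x = np.array([[1, 0, 0], [0, np.cos(a), -np.sin(a)], [0, np.sin(a), np.cos(a)]])
R_y = np.array([[np.cos(b), 0, np.sin(b)], [0, 1, 0], [-np.sin(b), 0, np.cos(b)]])
R_true = R_x @ R_y
t_true = np.array([0.1, -0.05, 1.5])

s = np.linspace(0.0, 2.0 * np.pi, 40, endpoint=False)
eef_traj = np.stack([0.3 * np.cos(s), 0.3 * np.sin(2 * s), 0.2 * np.sin(3 * s)], axis=1)
mark_pixels = project(eef_traj, R_true, t_true, K)

R_est, t_est = estimate_extrinsic_dlt(eef_traj, mark_pixels, K)
reproj = np.linalg.norm(project(eef_traj, R_est, t_est, K) - mark_pixels, axis=1)
print("max reprojection error (px):", reproj.max())
```

With real, noisy tracks a linear solve like this would typically serve only as an initial guess; a robust PnP routine with outlier rejection, followed by a refinement step such as the mask alignment described above, would be the more realistic choice.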