PoseStreamer: A Multi-modal Framework for 6DoF Pose Estimation of Unseen Moving Objects
By: Huiming Yang , Linglin Liao , Fei Ding and more
Potential Business Impact:
Helps robots see fast-moving things in the dark.
Six degree of freedom (6DoF) pose estimation for novel objects is a critical task in computer vision, yet it faces significant challenges in high-speed and low-light scenarios where standard RGB cameras suffer from motion blur. While event cameras offer a promising solution due to their high temporal resolution, current 6DoF pose estimation methods typically yield suboptimal performance in high-speed object moving scenarios. To address this gap, we propose PoseStreamer, a robust multi-modal 6DoF pose estimation framework designed specifically on high-speed moving scenarios. Our approach integrates three core components: an Adaptive Pose Memory Queue that utilizes historical orientation cues for temporal consistency, an Object-centric 2D Tracker that provides strong 2D priors to boost 3D center recall, and a Ray Pose Filter for geometric refinement along camera rays. Furthermore, we introduce MoCapCube6D, a novel multi-modal dataset constructed to benchmark performance under rapid motion. Extensive experiments demonstrate that PoseStreamer not only achieves superior accuracy in high-speed moving scenarios, but also exhibits strong generalizability as a template-free framework for unseen moving objects.
Similar Papers
DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects
CV and Pattern Recognition
Tracks moving things even when camera and object zoom.
Optical Flow-Guided 6DoF Object Pose Tracking with an Event Camera
CV and Pattern Recognition
Tracks objects better in tricky situations.
6-DoF Object Tracking with Event-based Optical Flow and Frames
CV and Pattern Recognition
Tracks fast-moving objects with special cameras.