Attention-guided reference point shifting for Gaussian-mixture-based partial point set registration
By: Mizuki Kikkawa , Tatsuya Yatagawa , Yutaka Ohtake and more
Potential Business Impact:
Helps computers match 3D shapes even when they're tilted.
This study investigates the impact of the invariance of feature vectors for partial-to-partial point set registration under translation and rotation of input point sets, particularly in the realm of techniques based on deep learning and Gaussian mixture models (GMMs). We reveal both theoretical and practical problems associated with such deep-learning-based registration methods using GMMs, with a particular focus on the limitations of DeepGMR, a pioneering study in this line, to the partial-to-partial point set registration. Our primary goal is to uncover the causes behind such methods and propose a comprehensible solution for that. To address this, we introduce an attention-based reference point shifting (ARPS) layer, which robustly identifies a common reference point of two partial point sets, thereby acquiring transformation-invariant features. The ARPS layer employs a well-studied attention module to find a common reference point rather than the overlap region. Owing to this, it significantly enhances the performance of DeepGMR and its recent variant, UGMMReg. Furthermore, these extension models outperform even prior deep learning methods using attention blocks and Transformer to extract the overlap region or common reference points. We believe these findings provide deeper insights into registration methods using deep learning and GMMs.
Similar Papers
Gaussian Alignment for Relative Camera Pose Estimation via Single-View Reconstruction
CV and Pattern Recognition
Helps computers understand 3D space from pictures.
Gaussian Primitive Optimized Deformable Retinal Image Registration
CV and Pattern Recognition
Makes eye scans match perfectly for better health.
Enhancing Rotation-Invariant 3D Learning with Global Pose Awareness and Attention Mechanisms
CV and Pattern Recognition
Helps computers tell apart similar 3D shapes.