LLHA-Net: A Hierarchical Attention Network for Two-View Correspondence Learning
By: Shuyuan Lin, Yu Guo, Xiao Chen and more
Potential Business Impact:
Finds matching points in pictures better.
Establishing correct correspondences between feature points is a fundamental task in computer vision. However, numerous outliers among the feature points can significantly degrade the matching results, reducing both accuracy and robustness. A further challenge arises when the proportion of outliers is large: how to extract high-quality information while reducing the errors caused by negative samples. To address these issues, we propose the Layer-by-Layer Hierarchical Attention Network (LLHA-Net), which improves the precision of feature point matching by handling outliers more effectively. Our method combines stage fusion, hierarchical extraction, and an attention mechanism to strengthen the network's representation capability by emphasizing the rich semantic information of feature points. Specifically, we introduce a layer-by-layer channel fusion module, which preserves the semantic feature information from each stage and fuses it as a whole, thereby enhancing the representation of the feature points. Additionally, we design a hierarchical attention module that uses an attention mechanism to adaptively capture and fuse global perception and structural semantic information. Finally, we propose two architectures to extract and integrate features, thereby improving the adaptability of our network. We conduct experiments on two public datasets, YFCC100M and SUN3D, and the results demonstrate that our method outperforms several state-of-the-art techniques in both outlier removal and camera pose estimation. Source code is available at http://www.linshuyuan.com.
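The abstract describes two ideas in words only: keeping the features from every stage and fusing them across channels, and reweighting features with an attention mechanism driven by global context. The sketch below is an illustration of those two ideas in NumPy, not the authors' implementation. It assumes that stage fusion is done by channel concatenation followed by a linear projection, and that attention is a squeeze-and-excitation style channel gate. All function names, shapes, and weights (`fuse_stages`, `channel_attention`, `demo`, the random matrices) are hypothetical.

```python
import numpy as np


def fuse_stages(stage_features, w_fuse):
    """Fuse per-stage features (assumed: concatenate channels, then project)."""
    # Each stage contributes an (N, C) array for N putative correspondences.
    stacked = np.concatenate(stage_features, axis=1)   # (N, S * C)
    return stacked @ w_fuse                            # (N, C)


def channel_attention(features, w_down, w_up):
    """Gate channels using global context (assumed: SE-style attention)."""
    # Global context: average every channel over all correspondences.
    context = features.mean(axis=0)                    # (C,)
    hidden = np.maximum(context @ w_down, 0.0)         # ReLU bottleneck
    gate = 1.0 / (1.0 + np.exp(-(hidden @ w_up)))      # sigmoid, values in (0, 1)
    return features * gate                             # broadcast over N rows


def demo(seed=0, n=8, c=4, stages=3):
    """Run both steps on random data; weights are random, not trained."""
    rng = np.random.default_rng(seed)
    stage_features = [rng.standard_normal((n, c)) for _ in range(stages)]
    w_fuse = rng.standard_normal((stages * c, c)) / np.sqrt(stages * c)
    w_down = rng.standard_normal((c, c // 2))
    w_up = rng.standard_normal((c // 2, c))
    fused = fuse_stages(stage_features, w_fuse)
    attended = channel_attention(fused, w_down, w_up)
    return fused, attended


if __name__ == "__main__":
    fused, attended = demo()
    print(fused.shape, attended.shape)
```

Because the gate lies strictly between 0 and 1, each attended feature is a scaled-down copy of the fused feature. In a trained network the gate would learn which channels to keep, whereas here the weights are random and only the data flow is meaningful.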
Similar Papers
LAHNet: Local Attentive Hashing Network for Point Cloud Registration
CV and Pattern Recognition
Helps 3D scans match up better.
Hierarchical MLANet: Multi-level Attention for 3D Face Reconstruction From Single Images
CV and Pattern Recognition
Makes 3D faces from regular photos.