MGCA-Net: Multi-Graph Contextual Attention Network for Two-View Correspondence Learning

Published: December 29, 2025 | arXiv ID: 2512.23369v1

By: Shuyuan Lin, Mengtin Lo, Haosheng Chen, and more

Potential Business Impact:

Helps computers see matching objects in different pictures.

Business Areas:

Image Recognition Data and Analytics, Software

Two-view correspondence learning is a key task in computer vision that aims to establish reliable matches between two images for applications such as camera pose estimation and 3D reconstruction. However, existing methods are limited in local geometric modeling and cross-stage information optimization, which makes it difficult to accurately capture the geometric constraints of matched pairs and thus reduces the robustness of the model. To address these challenges, we propose a Multi-Graph Contextual Attention Network (MGCA-Net), which consists of a Contextual Geometric Attention (CGA) module and a Cross-Stage Multi-Graph Consensus (CSMGC) module. Specifically, CGA dynamically integrates spatial position and feature information via an adaptive attention mechanism, enhancing its ability to capture both local and global geometric relationships. Meanwhile, CSMGC establishes geometric consensus via a cross-stage sparse graph network, ensuring that geometric information remains consistent across stages. Experimental results on two representative datasets, YFCC100M and SUN3D, show that MGCA-Net significantly outperforms existing state-of-the-art (SOTA) methods on the outlier rejection and camera pose estimation tasks. Source code is available at http://www.linshuyuan.com.
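To make the outlier rejection task concrete, the following is a minimal sketch of how a putative match can be checked against the geometric constraint between two views. It is not the MGCA-Net method. It only illustrates the classical epipolar test that correspondence learning pipelines are commonly evaluated against. The essential matrix `E_SIDEWAYS`, the helper names, and the `1e-4` threshold are assumptions chosen for illustration, and coordinates are assumed to be normalized by the camera intrinsics.

```python
# Illustrative sketch only: classical epipolar check, NOT the paper's network.
# A putative match is (x1, y1, x2, y2) in normalized image coordinates.

def mat_vec(m, v):
    """Multiply a 3x3 matrix (list of rows) by a 3-vector."""
    return [sum(m[i][j] * v[j] for j in range(3)) for i in range(3)]


def transpose(m):
    """Transpose a 3x3 matrix."""
    return [[m[j][i] for j in range(3)] for i in range(3)]


def sampson_distance(E, match):
    """First-order approximation of the geometric error of p2^T E p1 = 0.

    Degenerate inputs that make the denominator zero are not handled.
    """
    x1, y1, x2, y2 = match
    p1 = [x1, y1, 1.0]
    p2 = [x2, y2, 1.0]
    Ep1 = mat_vec(E, p1)               # epipolar line of p1 in image 2
    Etp2 = mat_vec(transpose(E), p2)   # epipolar line of p2 in image 1
    residual = sum(a * b for a, b in zip(p2, Ep1))
    denom = Ep1[0] ** 2 + Ep1[1] ** 2 + Etp2[0] ** 2 + Etp2[1] ** 2
    return residual ** 2 / denom


def label_inliers(E, matches, threshold=1e-4):
    """Mark a match as an inlier when its Sampson distance is below threshold.

    The threshold value is an assumption, not taken from the paper.
    """
    return [sampson_distance(E, m) < threshold for m in matches]


# Hypothetical essential matrix for a pure sideways translation t = (1, 0, 0)
# with no rotation: E = [t]_x. Under this motion, true matches keep the same y.
E_SIDEWAYS = [
    [0.0, 0.0, 0.0],
    [0.0, 0.0, -1.0],
    [0.0, 1.0, 0.0],
]

matches = [
    (0.1, 0.2, 0.3, 0.2),  # same y in both views: consistent with the motion
    (0.1, 0.2, 0.3, 0.5),  # y changes: violates the epipolar constraint
]
labels = label_inliers(E_SIDEWAYS, matches)
```

In this toy setup the first match satisfies the constraint and the second does not. A learned method such as the one described above predicts these inlier labels directly from the set of putative matches, without being given the essential matrix.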

Context-Aware Network Based on Multi-scale Spatio-temporal Attention for Action Recognition in Videos

CV and Pattern Recognition

Helps computers understand what's happening in videos.

21 Dec 2025 0

89%

SC-Net: Robust Correspondence Learning via Spatial and Cross-Channel Context

CV and Pattern Recognition

Finds matching points in pictures better.

29 Dec 2025 1

89%

Improving Cross-view Object Geo-localization: A Dual Attention Approach with Cross-view Interaction and Multi-Scale Spatial Features

CV and Pattern Recognition

Helps find objects using pictures from different angles.

31 Oct 2025 1

Country of Origin

🇨🇳 🇭🇰 Hong Kong, China

Page Count

9 pages
