Multi-Scale Correlation-Aware Transformer for Maritime Vessel Re-Identification
By: Yunhe Liu
Potential Business Impact:
Helps computers tell ships apart, even if parts are missing.
Maritime vessel re-identification (Re-ID) plays a crucial role in advancing maritime monitoring and intelligent situational awareness systems. However, some existing vessel Re-ID methods are directly adapted from pedestrian-focused algorithms, making them ill-suited for mitigating the unique problems present in vessel images, particularly the greater intra-identity variations and more severe missing of local parts, which lead to the emergence of outlier samples within the same identity. To address these challenges, we propose the Multi-scale Correlation-aware Transformer Network (MCFormer), which explicitly models multi-scale correlations across the entire input set to suppress the adverse effects of outlier samples with intra-identity variations or local missing, incorporating two novel modules, the Global Correlation Module (GCM), and the Local Correlation Module (LCM). Specifically, GCM constructs a global similarity affinity matrix across all input images to model global correlations through feature aggregation based on inter-image consistency, rather than solely learning features from individual images as in most existing approaches. Simultaneously, LCM mines and aligns local features of positive samples with contextual similarity to extract local correlations by maintaining a dynamic memory bank, effectively compensating for missing or occluded regions in individual images. To further enhance feature robustness, MCFormer integrates global and local features that have been respectively correlated across multiple scales, effectively capturing latent relationships among image features. Experiments on three benchmarks demonstrate that MCFormer achieves state-of-the-art performance.
Similar Papers
MOS: Mitigating Optical-SAR Modality Gap for Cross-Modal Ship Re-Identification
CV and Pattern Recognition
Helps cameras and radar find the same ship.
Identity Clue Refinement and Enhancement for Visible-Infrared Person Re-Identification
CV and Pattern Recognition
Helps cameras find people in different light.
3D-Aware Multi-Task Learning with Cross-View Correlations for Dense Scene Understanding
CV and Pattern Recognition
Helps computers understand 3D scenes from many pictures.