SegMASt3R: Geometry Grounded Segment Matching
By: Rohit Jayanti , Swayam Agrawal , Vansh Garg and more
Potential Business Impact:
Matches parts of pictures from far away.
Segment matching is an important intermediate task in computer vision that establishes correspondences between semantically or geometrically coherent regions across images. Unlike keypoint matching, which focuses on localized features, segment matching captures structured regions, offering greater robustness to occlusions, lighting variations, and viewpoint changes. In this paper, we leverage the spatial understanding of 3D foundation models to tackle wide-baseline segment matching, a challenging setting involving extreme viewpoint shifts. We propose an architecture that uses the inductive bias of these 3D foundation models to match segments across image pairs with up to 180 degree view-point change. Extensive experiments show that our approach outperforms state-of-the-art methods, including the SAM2 video propagator and local feature matching methods, by upto 30% on the AUPRC metric, on ScanNet++ and Replica datasets. We further demonstrate benefits of the proposed model on relevant downstream tasks, including 3D instance segmentation and image-goal navigation. Project Page: https://segmast3r.github.io/
Similar Papers
Dense Semantic Matching with VGGT Prior
CV and Pattern Recognition
Matches objects in pictures, even if mirrored.
Split Matching for Inductive Zero-shot Semantic Segmentation
CV and Pattern Recognition
Teaches computers to identify new things without training.
CoMatcher: Multi-View Collaborative Feature Matching
CV and Pattern Recognition
Helps computers see objects from many angles.