Robust Scene Coordinate Regression via Geometrically-Consistent Global Descriptors
By: Son Tung Nguyen , Tobias Fischer , Alejandro Fontan and more
Recent learning-based visual localization methods use global descriptors to disambiguate visually similar places, but existing approaches often derive these descriptors from geometric cues alone (e.g., covisibility graphs), limiting their discriminative power and reducing robustness in the presence of noisy geometric constraints. We propose an aggregator module that learns global descriptors consistent with both geometrical structure and visual similarity, ensuring that images are close in descriptor space only when they are visually similar and spatially connected. This corrects erroneous associations caused by unreliable overlap scores. Using a batch-mining strategy based solely on the overlap scores and a modified contrastive loss, our method trains without manual place labels and generalizes across diverse environments. Experiments on challenging benchmarks show substantial localization gains in large-scale environments while preserving computational and memory efficiency. Code is available at \href{https://github.com/sontung/robust\_scr}{github.com/sontung/robust\_scr}.
Similar Papers
$A^2$GC: $A$symmetric $A$ggregation with Geometric Constraints for Locally Aggregated Descriptors
CV and Pattern Recognition
Helps robots find places using pictures.
R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization
CV and Pattern Recognition
Helps robots find their way better in new places.
Geometry-Aware Scene-Consistent Image Generation
CV and Pattern Recognition
Adds objects to pictures while keeping scene real.