GECO: Geometrically Consistent Embedding with Lightspeed Inference
By: Regine Hartwig, Dominik Muhle, Riccardo Marin and more
Potential Business Impact:
Teaches computers to see 3D shapes like humans.
Recent advances in feature learning have shown that self-supervised vision foundation models can capture semantic correspondences but often lack awareness of underlying 3D geometry. GECO addresses this gap by producing geometrically coherent features that semantically distinguish parts based on geometry (e.g., left/right eyes, front/back legs). We propose a training framework based on optimal transport, enabling supervision beyond keypoints, even under occlusions and disocclusions. With a lightweight architecture, GECO runs at 30 fps, 98.2% faster than prior methods, while achieving state-of-the-art performance on PFPascal, APK, and CUB, improving PCK by 6.0%, 6.2%, and 4.1%, respectively. Finally, we show that PCK alone is insufficient to capture geometric quality and introduce new metrics and insights for more geometry-aware feature learning.
Link to project page: https://reginehartwig.github.io/publications/geco/
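For readers unfamiliar with the PCK (Percentage of Correct Keypoints) metric cited above, the following is a minimal sketch of how it is commonly computed. It is not taken from the GECO code base: the function name `pck`, the bounding-box normalization, and the default `alpha=0.1` are assumptions chosen for illustration, and benchmarks differ on whether the threshold is scaled by the object bounding box or by the image size.

```python
import math

def pck(pred, gt, bbox_size, alpha=0.1):
    """Fraction of predicted keypoints within alpha * max(bbox_size) of the ground truth.

    pred, gt:   equal-length lists of (x, y) keypoint coordinates
    bbox_size:  (width, height) of the object bounding box (assumed normalization)
    alpha:      tolerance as a fraction of the larger bounding-box side
    """
    if len(pred) != len(gt):
        raise ValueError("pred and gt must contain the same number of keypoints")
    if not gt:
        return 0.0
    threshold = alpha * max(bbox_size)
    correct = sum(
        1
        for (px, py), (gx, gy) in zip(pred, gt)
        if math.hypot(px - gx, py - gy) <= threshold
    )
    return correct / len(gt)

# Illustrative (made-up) coordinates: two of three predictions fall within 10 px.
pred = [(10, 10), (50, 52), (90, 40)]
gt = [(12, 11), (50, 50), (60, 40)]
score = pck(pred, gt, bbox_size=(100, 80), alpha=0.1)
```

Because the score only counts annotated keypoints that land within a distance threshold, it says nothing about unannotated regions, which is consistent with the abstract's point that PCK alone does not capture geometric quality.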
Similar Papers
LiteGE: Lightweight Geodesic Embedding for Efficient Geodesics Computation and Non-Isometric Shape Correspondence
CV and Pattern Recognition
Makes 3D shape matching super fast and easy.
GACO-CAD: Geometry-Augmented and Conciseness-Optimized CAD Model Generation from Single Image
CV and Pattern Recognition
Turns drawings into 3D models for building.
Robust Scene Coordinate Regression via Geometrically-Consistent Global Descriptors
CV and Pattern Recognition
Helps robots find their way better in new places.