Geodesic Prototype Matching via Diffusion Maps for Interpretable Fine-Grained Recognition
By: Junhao Jia , Yunyou Liu , Yifei Sun and more
Potential Business Impact:
Teaches computers to see tiny differences in pictures.
Nonlinear manifolds are widespread in deep visual features, where Euclidean distances often fail to capture true similarity. This limitation becomes particularly severe in prototype-based interpretable fine-grained recognition, where subtle semantic distinctions are essential. To address this challenge, we propose a novel paradigm for prototype-based recognition that anchors similarity within the intrinsic geometry of deep features. Specifically, we distill the latent manifold structure of each class into a diffusion space and introduce a differentiable Nystr\"om interpolation, making the geometry accessible to both unseen samples and learnable prototypes. To ensure efficiency, we employ compact per-class landmark sets with periodic updates. This design keeps the embedding aligned with the evolving backbone, enabling fast and scalable inference. Extensive experiments on the CUB-200-2011 and Stanford Cars datasets show that our GeoProto framework produces prototypes focusing on semantically aligned parts, significantly outperforming Euclidean prototype networks.
Similar Papers
GeoDM: Geometry-aware Distribution Matching for Dataset Distillation
CV and Pattern Recognition
Makes small data sets work like big ones.
Learning functions through Diffusion Maps
Machine Learning (CS)
Makes computers learn from scattered data points.
Enabling Probabilistic Learning on Manifolds through Double Diffusion Maps
Machine Learning (Stat)
Creates realistic data from few examples.