Score: 0

Geodesic Prototype Matching via Diffusion Maps for Interpretable Fine-Grained Recognition

Published: September 21, 2025 | arXiv ID: 2509.17050v1

By: Junhao Jia , Yunyou Liu , Yifei Sun and more

Potential Business Impact:

Teaches computers to see tiny differences in pictures.

Business Areas:

Indoor Positioning Navigation and Mapping

Nonlinear manifolds are widespread in deep visual features, where Euclidean distances often fail to capture true similarity. This limitation becomes particularly severe in prototype-based interpretable fine-grained recognition, where subtle semantic distinctions are essential. To address this challenge, we propose a novel paradigm for prototype-based recognition that anchors similarity within the intrinsic geometry of deep features. Specifically, we distill the latent manifold structure of each class into a diffusion space and introduce a differentiable Nystr\"om interpolation, making the geometry accessible to both unseen samples and learnable prototypes. To ensure efficiency, we employ compact per-class landmark sets with periodic updates. This design keeps the embedding aligned with the evolving backbone, enabling fast and scalable inference. Extensive experiments on the CUB-200-2011 and Stanford Cars datasets show that our GeoProto framework produces prototypes focusing on semantically aligned parts, significantly outperforming Euclidean prototype networks.