Topological Metric for Unsupervised Embedding Quality Evaluation
By: Aleksei Shestov, Anton Klenitskiy, Daria Denisova and more
Modern representation learning increasingly relies on unsupervised and self-supervised methods trained on large-scale unlabeled data. While these approaches achieve impressive generalization across tasks and domains, evaluating embedding quality without labels remains an open challenge. In this work, we propose Persistence, a topology-aware metric based on persistent homology that quantifies the geometric structure and topological richness of embedding spaces in a fully unsupervised manner. Unlike metrics that assume linear separability or rely on covariance structure, Persistence captures global and multi-scale organization. Empirical results across diverse domains show that Persistence consistently achieves top-tier correlations with downstream performance, outperforming existing unsupervised metrics and enabling reliable model and hyperparameter selection.
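To make the idea of a persistent-homology summary of an embedding cloud concrete, here is a minimal sketch of its simplest case, 0-dimensional persistence. It is not the Persistence metric proposed in the paper, whose exact definition is not given in this abstract. It relies on a standard fact: in a Vietoris-Rips filtration, every point is born as its own component at scale 0, and the scales at which components merge are the edge lengths of a Euclidean minimum spanning tree. The function names `h0_persistence` and `total_persistence` are hypothetical, and taking the filtration scale to be the full pairwise distance is an assumption (some libraries use half the distance).

```python
import math


def h0_persistence(points):
    """Return sorted death scales of 0-dimensional features.

    Assumption: Vietoris-Rips filtration on Euclidean distance, with the
    scale equal to the pairwise distance. The deaths then coincide with
    the minimum spanning tree edge lengths (Prim's algorithm, O(n^2)).
    """
    n = len(points)
    if n < 2:
        return []
    in_tree = [False] * n
    best = [math.inf] * n  # cheapest known link from each point to the tree
    best[0] = 0.0          # start the tree at the first point
    deaths = []
    for step in range(n):
        # Pick the point outside the tree that is closest to it.
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best[i])
        in_tree[u] = True
        if step > 0:
            # Joining u merges two components at scale best[u].
            deaths.append(best[u])
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best[v]:
                    best[v] = d
    return sorted(deaths)


def total_persistence(points):
    """Sum of 0-dimensional lifetimes (birth is 0 for every component)."""
    return sum(h0_persistence(points))


# Toy "embeddings": a tight group of three points plus one far outlier.
embeddings = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (10.0, 0.0)]
print(h0_persistence(embeddings))     # [1.0, 1.0, 9.0]
print(total_persistence(embeddings))  # 11.0
```

The single long-lived feature (death at 9.0) reflects the gap between the group and the outlier. This is the kind of multi-scale structure that covariance-based summaries do not expose directly. Higher-dimensional features such as loops require a dedicated persistent homology library and are not covered by this sketch.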
Similar Papers
From Topology to Retrieval: Decoding Embedding Spaces with Unified Signatures
Machine Learning (CS)
Maps text meaning to help computers find information.
Stability of 0-dimensional persistent homology in enriched and sparsified point clouds
Algebraic Topology
Helps understand animal homes with math.