Score: 0

Neighbor Embeddings Using Unbalanced Optimal Transport Metrics

Published: September 23, 2025 | arXiv ID: 2509.19226v1

By: Muhammad Rana, Keaton Hamm

Potential Business Impact:

Makes computers learn better from messy data.

Business Areas:
Last Mile Transportation Transportation

This paper proposes the use of the Hellinger--Kantorovich metric from unbalanced optimal transport (UOT) in a dimensionality reduction and learning (supervised and unsupervised) pipeline. The performance of UOT is compared to that of regular OT and Euclidean-based dimensionality reduction methods on several benchmark datasets including MedMNIST. The experimental results demonstrate that, on average, UOT shows improvement over both Euclidean and OT-based methods as verified by statistical hypothesis tests. In particular, on the MedMNIST datasets, UOT outperforms OT in classification 81\% of the time. For clustering MedMNIST, UOT outperforms OT 83\% of the time and outperforms both other metrics 58\% of the time.

Page Count
9 pages

Category
Statistics:
Machine Learning (Stat)