Score: 0

On the cosine similarity and orthogonality between persistence diagrams

Published: April 6, 2025 | arXiv ID: 2504.04361v1

By: Azmeer Nordin , Mohd Salmi Md Noorani , Nurulkamal Masseran and more

Potential Business Impact:

Compares data shapes better to find differences.

Business Areas:
Data Visualization Data and Analytics, Design, Information Technology, Software

Topological data analysis is an approach to study shape of a data set by means of topology. Its main object of study is the persistence diagram, which represents the topological features of the data set at different spatial resolutions. Multiple data sets can be compared by the similarity of their diagrams to understand their behaviors in relative to each other. The bottleneck and Wasserstein distances are often used as a tool to indicate the similarity. In this paper, we introduce cosine similarity as a new indicator for the similarity between persistence diagrams and investigate its properties. Furthermore, it leads to the new notion of orthogonality between persistence diagrams. It turns out that the orthogonality refers to perfect dissimilarity between persistence diagrams under the cosine similarity. Through data demonstration, the cosine similarity is shown to be more accurate than the standard distances to measure the similarity between persistence diagrams.

Page Count
25 pages

Category
Mathematics:
Algebraic Topology