On the cosine similarity and orthogonality between persistence diagrams
By: Azmeer Nordin , Mohd Salmi Md Noorani , Nurulkamal Masseran and more
Potential Business Impact:
Compares data shapes better to find differences.
Topological data analysis is an approach to study shape of a data set by means of topology. Its main object of study is the persistence diagram, which represents the topological features of the data set at different spatial resolutions. Multiple data sets can be compared by the similarity of their diagrams to understand their behaviors in relative to each other. The bottleneck and Wasserstein distances are often used as a tool to indicate the similarity. In this paper, we introduce cosine similarity as a new indicator for the similarity between persistence diagrams and investigate its properties. Furthermore, it leads to the new notion of orthogonality between persistence diagrams. It turns out that the orthogonality refers to perfect dissimilarity between persistence diagrams under the cosine similarity. Through data demonstration, the cosine similarity is shown to be more accurate than the standard distances to measure the similarity between persistence diagrams.
Similar Papers
Towards Scalable Topological Regularizers
Machine Learning (CS)
Makes computers understand shapes and pictures better.
Topological Correlation
Algebraic Topology
Finds patterns by comparing data in new ways.
Topological Metric for Unsupervised Embedding Quality Evaluation
Machine Learning (CS)
Measures how well computer "brains" learn without teachers.