Higher-Order Network Structure Inference: A Topological Approach to Network Selection
By: Adam Schroeder , Russell Funk , Jingyi Guan and more
Potential Business Impact:
Finds the best way to connect ideas in data.
Thresholding--the pruning of nodes or edges based on their properties or weights--is an essential preprocessing tool for extracting interpretable structure from complex network data, yet existing methods face several key limitations. Threshold selection often relies on heuristic methods or trial and error due to large parameter spaces and unclear optimization criteria, leading to sensitivity where small parameter variations produce significant changes in network structure. Moreover, most approaches focus on pairwise relationships between nodes, overlooking critical higher-order interactions involving three or more nodes. We introduce a systematic thresholding algorithm that leverages topological data analysis to identify optimal network parameters by accounting for higher-order structural relationships. Our method uses persistent homology to compute the stability of homological features across the parameter space, identifying parameter choices that are robust to small variations while preserving meaningful topological structure. Hyperparameters allow users to specify minimum requirements for topological features, effectively constraining the parameter search to avoid spurious solutions. We demonstrate the approach with an application in the Science of Science, where networks of scientific concepts are extracted from research paper abstracts, and concepts are connected when they co-appear in the same abstract. The flexibility of our approach allows researchers to incorporate domain-specific constraints and extends beyond network thresholding to general parameterization problems in data analysis.
Similar Papers
Enhancing Graph Representation Learning with Localized Topological Features
Machine Learning (CS)
Helps computers understand complex connections better.
Hierarchical biomarker thresholding: a model-agnostic framework for stability
Methodology
Fixes how computers judge health tests.
Dynamical System Parameter Path Optimization using Persistent Homology
Dynamical Systems
Finds best settings for complex machines.