Asymmetric graph alignment and the phase transition for asymmetric tree correlation testing
By: Jakob Maier, Laurent Massoulié
Potential Business Impact:
Finds matching parts in different computer networks.
Graph alignment - identifying node correspondences between two graphs - is a fundamental problem with applications in network analysis, biology, and privacy research. While substantial progress has been made in aligning correlated Erd\H{o}s-R\'enyi graphs under symmetric settings, real-world networks often exhibit asymmetry in both node numbers and edge densities. In this work, we introduce a novel framework for asymmetric correlated Erd\H{o}s-R\'enyi graphs, generalizing existing models to account for these asymmetries. We conduct a rigorous theoretical analysis of graph alignment in the sparse regime, where local neighborhoods exhibit tree-like structures. Our approach leverages tree correlation testing as the central tool in our polynomial-time algorithm, MPAlign, which achieves one-sided partial alignment under certain conditions. A key contribution of our work is characterizing these conditions under which asymmetric tree correlation testing is feasible: If two correlated graphs $G$ and $G'$ have average degrees $\lambda s$ and $\lambda s'$ respectively, where $\lambda$ is their common density and $s,s'$ are marginal correlation parameters, their tree neighborhoods can be aligned if $ss' > \alpha$, where $\alpha$ denotes Otter's constant and $\lambda$ is supposed large enough. The feasibility of this tree comparison problem undergoes a sharp phase transition since $ss' \leq \alpha$ implies its impossibility. These new results on tree correlation testing allow us to solve a class of random subgraph isomorphism problems, resolving an open problem in the field.
Similar Papers
Statistical-computational gap in multiple Gaussian graph alignment
Machine Learning (Stat)
Makes computers solve hard graph puzzles faster.
Testing Correlation in Graphs by Counting Bounded Degree Motifs
Social and Information Networks
Finds hidden connections between two sets of data.
Sparse Graph Reconstruction and Seriation for Large-Scale Image Stacks
Data Structures and Algorithms
Sorts messy data with fewer questions.