Cluster expansion of the log-likelihood ratio: Optimal detection of planted matchings
By: Timothy L. H. Wee, Cheng Mao
To understand how hidden information can be extracted from statistical networks, planted models in random graphs have been the focus of intensive study in recent years. In this work, we consider the detection of a planted matching, i.e., an independent edge set, hidden in an Erdős-Rényi random graph, which is formulated as a hypothesis testing problem. We identify the critical regime for this testing problem and prove that the log-likelihood ratio is asymptotically normal. Via analyses of computationally efficient edge or wedge count test statistics that attain the optimal limits of detection, our results also reveal the absence of a statistical-to-computational gap. Our main technical tool is the cluster expansion from statistical physics, which allows us to prove a precise, non-asymptotic characterization of the log-likelihood ratio. Our analyses rely on a careful reorganization and cancellation of terms that occur in the difference between monomer-dimer log partition functions on the complete and Erdős-Rényi graphs. This combinatorial and statistical physics approach represents a significant departure from the more established methods such as orthogonal decompositions, and positions the cluster expansion as a viable technique in the study of log-likelihood ratios for planted models in general.
Similar Papers
Robust Detection of Planted Subgraphs in Semi-Random Models
Information Theory
Find hidden groups even when some connections are hidden.
Likelihood Ratio Tests by Kernel Gaussian Embedding
Machine Learning (Stat)
Finds if two groups of data are truly different.
Likelihood Ratio Tests by Kernel Gaussian Embedding
Machine Learning (Stat)
Finds differences between two groups of data.