A Unified Framework for Community Detection and Model Selection in Blockmodels
By: Subhankar Bhadra, Minh Tang, Srijan Sengupta
Potential Business Impact:
Finds the best way to group people in a network and picks the model that best fits it.
Blockmodels are a foundational tool for modeling community structure in networks, with the stochastic blockmodel (SBM), degree-corrected blockmodel (DCBM), and popularity-adjusted blockmodel (PABM) forming a natural hierarchy of increasing generality. While community detection under these models has been extensively studied, much less attention has been paid to the model selection problem, i.e., determining which model best fits a given network. Building on recent theoretical insights about the spectral geometry of these models, we propose a unified framework for simultaneous community detection and model selection across the full blockmodel hierarchy. A key innovation is the use of loss functions that serve a dual role: they act as objective functions for community detection and as test statistics for hypothesis testing. We develop a greedy algorithm to minimize these loss functions and establish theoretical guarantees for exact label recovery and model selection consistency under each model. Extensive simulation studies demonstrate that our method achieves high accuracy in both tasks, outperforming or matching state-of-the-art alternatives. Applications to five real-world networks further illustrate the interpretability and practical utility of our approach.
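The hierarchy named in the abstract can be made concrete with the standard edge-probability definitions of the three models. The sketch below is illustrative only and is not the paper's algorithm or loss functions. The function names (`sbm_prob`, `dcbm_prob`, `pabm_prob`), the parameter values, and the parameterization `lam[i, r] = theta[i] * sqrt(B[z[i], r])` used to embed a DCBM in a PABM are assumptions chosen for the example, and the embedding relies on `B` being symmetric.

```python
import numpy as np

def sbm_prob(z, B):
    # SBM: P[i, j] = B[z_i, z_j]
    return B[np.ix_(z, z)]

def dcbm_prob(z, B, theta):
    # DCBM: P[i, j] = theta_i * theta_j * B[z_i, z_j]
    return np.outer(theta, theta) * B[np.ix_(z, z)]

def pabm_prob(z, lam):
    # PABM: P[i, j] = lam[i, z_j] * lam[j, z_i],
    # where lam[i, r] is the popularity of node i in community r.
    a = lam[:, z]          # a[i, j] = lam[i, z_j]
    return a * a.T         # a.T[i, j] = lam[j, z_i]

rng = np.random.default_rng(0)
n, K = 12, 3
z = rng.integers(0, K, size=n)           # hypothetical community labels
B = np.array([[0.5, 0.1, 0.1],
              [0.1, 0.4, 0.1],
              [0.1, 0.1, 0.3]])          # symmetric block matrix (assumed values)
theta = rng.uniform(0.5, 1.0, size=n)    # degree parameters (assumed values)

P_sbm = sbm_prob(z, B)
P_dcbm = dcbm_prob(z, B, theta)

# SBM is the DCBM with all degree parameters equal to 1.
sbm_as_dcbm = np.allclose(P_sbm, dcbm_prob(z, B, np.ones(n)))

# DCBM is the PABM with lam[i, r] = theta_i * sqrt(B[z_i, r]).
lam = theta[:, None] * np.sqrt(B[z, :])
dcbm_as_pabm = np.allclose(P_dcbm, pabm_prob(z, lam))

print(sbm_as_dcbm, dcbm_as_pabm)
```

Both checks compare full probability matrices, so they confirm the nesting SBM ⊂ DCBM ⊂ PABM for this one parameter choice rather than proving it in general.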
Similar Papers
Collapsed Structured Block Models for Community Detection in Complex Networks
Statistics Theory
Finds hidden groups in connected data.
SubSearch: Robust Estimation and Outlier Detection for Stochastic Block Models via Subgraph Search
Machine Learning (Stat)
Finds hidden groups in messy data networks.
Improved Community Detection using Stochastic Block Models
Social and Information Networks
Finds groups of friends that actually talk to each other.