BASIC: Bipartite Assisted Spectral-clustering for Identifying Communities in Large-scale Networks
By: Tianchen Gao , Jingyuan Liu , Rui Pan and more
Potential Business Impact:
Finds hidden groups in connected data.
Community detection, which focuses on recovering the group structure within networks, is a crucial and fundamental task in network analysis. However, the detection process can be quite challenging and unstable when community signals are weak. Motivated by a newly collected large-scale academic network dataset from the Web of Science, which includes multi-layer network information, we propose a Bipartite Assisted Spectral-clustering approach for Identifying Communities (BASIC), which incorporates the bipartite network information into the community structure learning of the primary network. The accuracy and stability enhancement of BASIC is validated theoretically on the basis of the degree-corrected stochastic block model framework, as well as numerically through extensive simulation studies. We rigorously study the convergence rate of BASIC even under weak signal scenarios and prove that BASIC yields a tighter upper error bound than that based on the primary network information alone. We utilize the proposed BASIC method to analyze the newly collected large-scale academic network dataset from statistical papers. During the author collaboration network structure learning, we incorporate the bipartite network information from author-paper, author-institution, and author-region relationships. From both statistical and interpretative perspectives, these bipartite networks greatly aid in identifying communities within the primary collaboration network.
Similar Papers
Bi-SCORE for Weighted Bipartite Networks with Application in Knowledge Source Discovery
Methodology
Finds main ideas in science papers.
Spectral Clustering on Multilayer Networks with Covariates
Methodology
Finds hidden groups in connected information.
Multi-Community Spectral Clustering for Geometric Graphs
Social and Information Networks
Finds hidden groups in online networks.