Efficient Maintenance of Leiden Communities in Large Dynamic Graphs
By: Chunxu Lin , Yumao Xie , Yixiang Fang and more
As a well-known community detection algorithm, Leiden has been widely used in various scenarios such as large language model generation (e.g., Graph-RAG), anomaly detection, and biological analysis. In these scenarios, the graphs are often large and dynamic, where vertices and edges are inserted and deleted frequently, so it is costly to obtain the updated communities by Leiden from scratch when the graph has changed. Recently, one work has attempted to study how to maintain Leiden communities in the dynamic graph, but it lacks a detailed theoretical analysis, and its algorithms are inefficient for large graphs. To address these issues, in this paper, we first theoretically show that the existing algorithms are relatively unbounded via the boundedness analysis (a powerful tool for analyzing incremental algorithms on dynamic graphs), and also analyze the memberships of vertices in communities when the graph changes. Based on theoretical analysis, we develop a novel efficient maintenance algorithm, called Hierarchical Incremental Tree Leiden (HIT-Leiden), which effectively reduces the range of affected vertices by maintaining the connected components and hierarchical community structures. Comprehensive experiments in various datasets demonstrate the superior performance of HIT-Leiden. In particular, it achieves speedups of up to five orders of magnitude over existing methods.
Similar Papers
A Parallel Hierarchical Approach for Community Detection on Large-scale Dynamic Networks
Social and Information Networks
Finds groups in changing online networks faster.
Discovering Communities in Continuous-Time Temporal Networks by Optimizing L-Modularity
Social and Information Networks
Finds changing groups in networks over time.
LLMs Between the Nodes: Community Discovery Beyond Vectors
Social and Information Networks
Finds friend groups in online networks better.