Score: 0

Scalable Varied-Density Clustering via Graph Propagation

Published: August 5, 2025 | arXiv ID: 2508.02989v1

By: Ninh Pham, Yingtao Zheng, Hugo Phibbs

Potential Business Impact:

Finds hidden groups in huge, messy data fast.

We propose a novel perspective on varied-density clustering for high-dimensional data by framing it as a label propagation process in neighborhood graphs that adapt to local density variations. Our method formally connects density-based clustering with graph connectivity, enabling the use of efficient graph propagation techniques developed in network science. To ensure scalability, we introduce a density-aware neighborhood propagation algorithm and leverage advanced random projection methods to construct approximate neighborhood graphs. Our approach significantly reduces computational cost while preserving clustering quality. Empirically, it scales to datasets with millions of points in minutes and achieves competitive accuracy compared to existing baselines.

Country of Origin
🇳🇿 New Zealand

Page Count
12 pages

Category
Computer Science:
Machine Learning (CS)