Score: 1

Neural Normalized Cut: A Differential and Generalizable Approach for Spectral Clustering

Published: March 12, 2025 | arXiv ID: 2503.09260v1

By: Wei He , Shangzhi Zhang , Chun-Guang Li and more

Potential Business Impact:

Finds groups in huge amounts of data faster.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Spectral clustering, as a popular tool for data clustering, requires an eigen-decomposition step on a given affinity to obtain the spectral embedding. Nevertheless, such a step suffers from the lack of generalizability and scalability. Moreover, the obtained spectral embeddings can hardly provide a good approximation to the ground-truth partition and thus a k-means step is adopted to quantize the embedding. In this paper, we propose a simple yet effective scalable and generalizable approach, called Neural Normalized Cut (NeuNcut), to learn the clustering membership for spectral clustering directly. In NeuNcut, we properly reparameterize the unknown cluster membership via a neural network, and train the neural network via stochastic gradient descent with a properly relaxed normalized cut loss. As a result, our NeuNcut enjoys a desired generalization ability to directly infer clustering membership for out-of-sample unseen data and hence brings us an efficient way to handle clustering task with ultra large-scale data. We conduct extensive experiments on both synthetic data and benchmark datasets and experimental results validate the effectiveness and the superiority of our approach. Our code is available at: https://github.com/hewei98/NeuNcut.

Repos / Data Links

Page Count
27 pages

Category
Computer Science:
Machine Learning (CS)