DiEC: Diffusion Embedded Clustering
By: Haidong Hu
Deep clustering critically depends on representations that expose clear cluster structure, yet most prior methods learn a single embedding with an autoencoder or a self-supervised encoder and treat it as the primary representation for clustering. In contrast, a pretrained diffusion model induces a rich representation trajectory over network layers and noise timesteps, along which clusterability varies substantially. We propose Diffusion Embedded Clustering (DiEC), an unsupervised clustering framework that exploits this trajectory by directly leveraging intermediate activations of a pretrained diffusion U-Net. DiEC formulates representation selection over the layer × timestep grid and adopts a practical two-stage procedure: it fixes the U-Net bottleneck as the Clustering Middle Layer (CML, l*) and identifies the Clustering-Optimal Timestep (COT, t*) via an efficient subset-based, noise-averaged search. Conditioned on (l*, t*), DiEC learns clustering embeddings through a lightweight residual mapping, optimized with a DEC-style KL self-training objective and structural regularization, while a parallel random-timestep denoising-consistency loss stabilizes training and preserves diffusion behavior. Experiments on standard benchmarks show that DiEC achieves strong clustering performance and underscore the importance of representation selection within diffusion models for clustering.
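Two components of the abstract lend themselves to a compact illustration: the DEC-style KL self-training objective (Student's t soft assignments with a sharpened target distribution, as in Xie et al.'s DEC) and the subset-based, noise-averaged search for the Clustering-Optimal Timestep. The sketch below is a minimal PyTorch rendering under stated assumptions, not the paper's implementation: the bottleneck feature hook `unet_bottleneck`, the use of the silhouette score as the clusterability criterion, and all hyperparameters are illustrative choices not specified in the abstract.

```python
import torch
import torch.nn.functional as F
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score


def soft_assign(z, mu, alpha=1.0):
    # Student's t-kernel soft assignments q_ij between embeddings z
    # and cluster centers mu (standard DEC formulation).
    d2 = torch.cdist(z, mu).pow(2)
    q = (1.0 + d2 / alpha).pow(-(alpha + 1.0) / 2.0)
    return q / q.sum(dim=1, keepdim=True)


def target_distribution(q):
    # Sharpened targets p_ij = q_ij^2 / f_j (f_j = soft cluster frequency),
    # renormalized per sample, as in DEC's self-training objective.
    w = q.pow(2) / q.sum(dim=0, keepdim=True)
    return w / w.sum(dim=1, keepdim=True)


def dec_kl_loss(z, mu):
    # KL(P || Q) self-training loss; targets are held fixed per step.
    q = soft_assign(z, mu)
    p = target_distribution(q).detach()
    return F.kl_div(q.log(), p, reduction="batchmean")


@torch.no_grad()
def find_cot(unet_bottleneck, scheduler, x_subset, timesteps, n_clusters, n_noise=4):
    # Subset-based, noise-averaged search for the clustering-optimal timestep t*.
    # `unet_bottleneck(x_t, t)` is a hypothetical hook returning pooled CML
    # activations; silhouette score as the clusterability proxy is an assumption.
    best_t, best_score = None, -float("inf")
    for t in timesteps:
        t_batch = torch.full((x_subset.shape[0],), t,
                             dtype=torch.long, device=x_subset.device)
        scores = []
        for _ in range(n_noise):  # average the score over injected noise draws
            eps = torch.randn_like(x_subset)
            x_t = scheduler.add_noise(x_subset, eps, t_batch)
            h = unet_bottleneck(x_t, t).flatten(1).cpu().numpy()
            labels = KMeans(n_clusters=n_clusters, n_init=10).fit_predict(h)
            scores.append(silhouette_score(h, labels))
        avg = sum(scores) / len(scores)
        if avg > best_score:
            best_t, best_score = t, avg
    return best_t
```

In this reading, `find_cot` fixes l* at the bottleneck and scans only over t, which is what makes the abstract's two-stage procedure cheap relative to a full layer × timestep search; the DEC-style loss then operates on embeddings extracted at the selected (l*, t*).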