DiEC: Diffusion Embedded Clustering
Haidong Hu, Xiaoyu Zheng, Jin Zhou, Yingxu Wang, Rui Wang, Pei Dong, Shiyuan Han, Lin Wang, C. L. Philip Chen, Tong Zhang, Yuehui Chen

TL;DR
DiEC introduces an unsupervised clustering framework that leverages optimal intermediate representations from pretrained diffusion models, systematically identifying the most clustering-friendly layer and timestep to enhance performance without additional data augmentation.
Contribution
The paper proposes a novel method to select optimal diffusion model representations for clustering, improving efficiency and effectiveness over traditional single-representation approaches.
Findings
Achieves state-of-the-art clustering results on multiple benchmarks.
Effectively identifies clustering-optimal layer and timestep in diffusion models.
Reduces computational overhead compared to existing methods.
Abstract
Deep clustering methods typically rely on a single, well-defined representation for clustering. In contrast, pretrained diffusion models provide abundant and diverse multi-scale representations across network layers and noise timesteps. However, a key challenge is how to efficiently identify the most clustering-friendly representation in the layer*timestep space. To address this issue, we propose Diffusion Embedded Clustering (DiEC), an unsupervised framework that performs clustering by leveraging optimal intermediate representations from pretrained diffusion models. DiEC systematically evaluates the clusterability of representations along the trajectory of network depth and noise timesteps. Meanwhile, an unsupervised search strategy is designed for recognizing the Clustering-optimal Layer (COL) and Clustering-optimal Timestep (COT) in the layer*timestep space of pretrained diffusion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis
