Diffusion Model as Representation Learner

Xingyi Yang; Xinchao Wang

arXiv:2308.10916·cs.CV·August 23, 2023

Diffusion Model as Representation Learner

Xingyi Yang, Xinchao Wang

PDF

Open Access 1 Repo

TL;DR

This paper explores the representation capabilities of Diffusion Probabilistic Models (DPMs) and introduces RepFusion, a novel knowledge transfer method that leverages DPMs for recognition tasks, outperforming existing methods.

Contribution

The paper provides an in-depth analysis of DPMs as autoencoders and proposes RepFusion, a new paradigm for transferring knowledge from DPMs to recognition models using reinforcement learning.

Findings

01

DPMs inherently function as denoising autoencoders.

02

RepFusion improves recognition performance across multiple benchmarks.

03

Knowledge transfer from DPMs enhances recognition tasks.

Abstract

Diffusion Probabilistic Models (DPMs) have recently demonstrated impressive results on various generative tasks.Despite its promises, the learned representations of pre-trained DPMs, however, have not been fully understood. In this paper, we conduct an in-depth investigation of the representation power of DPMs, and propose a novel knowledge transfer method that leverages the knowledge acquired by generative DPMs for recognition tasks. Our study begins by examining the feature space of DPMs, revealing that DPMs are inherently denoising autoencoders that balance the representation learning with regularizing model capacity. To this end, we introduce a novel knowledge transfer paradigm named RepFusion. Our paradigm extracts representations at different time steps from off-the-shelf DPMs and dynamically employs them as supervision for student networks, in which the optimal time is determined…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adamdad/repfusion
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Topic Modeling · Computational and Text Analysis Methods