Variational Distillation of Diffusion Policies into Mixture of Experts
Hongyi Zhou, Denis Blessing, Ge Li, Onur Celik, Xiaogang Jia, Gerhard Neumann, Rudolf Lioutikov

TL;DR
This paper presents Variational Diffusion Distillation (VDD), a novel approach that efficiently distills diffusion policies into Mixture of Experts models, combining the strengths of both to improve behavior learning and inference speed.
Contribution
VDD is the first method to distill pre-trained diffusion models into MoE models, enabling efficient training and deployment while maintaining complex distribution representation.
Findings
VDD accurately distills complex diffusion-based distributions.
VDD outperforms existing distillation methods.
VDD surpasses traditional MoE training approaches.
Abstract
This work introduces Variational Diffusion Distillation (VDD), a novel method that distills denoising diffusion policies into Mixtures of Experts (MoE) through variational inference. Diffusion Models are the current state-of-the-art in generative modeling due to their exceptional ability to accurately learn and represent complex, multi-modal distributions. This ability allows Diffusion Models to replicate the inherent diversity in human behavior, making them the preferred models in behavior learning such as Learning from Human Demonstrations (LfD). However, diffusion models come with some drawbacks, including the intractability of likelihoods and long inference times due to their iterative sampling process. The inference times, in particular, pose a significant challenge to real-time applications such as robot control. In contrast, MoEs effectively address the aforementioned issues…
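To illustrate the two drawbacks named in the abstract (intractable likelihoods and iterative sampling), here is a minimal sketch of a Gaussian Mixture of Experts over actions. It is not the VDD training procedure and not the authors' code. The function names `moe_log_likelihood` and `moe_sample`, the diagonal-Gaussian experts, and the fixed arrays standing in for state-conditioned network outputs are all assumptions made for illustration.

```python
import numpy as np


def moe_log_likelihood(action, gate_logits, means, stds):
    """Exact log p(action) under a mixture of diagonal Gaussians.

    Hypothetical shapes: action (d,), gate_logits (k,), means (k, d), stds (k, d).
    In a real policy these parameters would be produced from the state.
    """
    # Log gating weights via a numerically stable log-softmax.
    log_gate = gate_logits - np.logaddexp.reduce(gate_logits)
    # Per-expert diagonal Gaussian log-density, shape (k,).
    z = (action - means) / stds
    log_comp = -0.5 * np.sum(z**2 + 2.0 * np.log(stds) + np.log(2.0 * np.pi), axis=1)
    # Mixture log-density: log sum_k gate_k * N(action; mean_k, std_k).
    return np.logaddexp.reduce(log_gate + log_comp)


def moe_sample(rng, gate_logits, means, stds):
    """Draw one action in a single pass: pick an expert, then sample from it."""
    probs = np.exp(gate_logits - np.logaddexp.reduce(gate_logits))
    k = rng.choice(len(probs), p=probs)
    return means[k] + stds[k] * rng.standard_normal(means.shape[1])


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    gate_logits = np.array([0.0, 0.0])             # two equally weighted experts
    means = np.array([[-1.0, 0.0], [1.0, 0.0]])    # two modes in a 2-D action space
    stds = np.full((2, 2), 0.1)
    a = moe_sample(rng, gate_logits, means, stds)
    print(a, moe_log_likelihood(a, gate_logits, means, stds))
```

Both functions run in closed form with no denoising loop, which is the contrast the abstract draws with diffusion policies.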
Taxonomy
Topics: Opinion Dynamics and Social Influence · Expert finding and Q&A systems · Multi-Criteria Decision Making
Methods: Mixture of Experts · Diffusion
