Meta-Learning via Classifier(-free) Diffusion Guidance
Elvis Nava, Seijin Kobayashi, Yifei Yin, Robert K. Katzschmann,, Benjamin F. Grewe

TL;DR
This paper presents novel meta-learning algorithms that leverage diffusion models and natural language guidance to perform zero-shot neural network weight adaptation for unseen tasks, outperforming existing methods.
Contribution
It introduces a new approach combining hypernetworks and diffusion models with natural language guidance for zero-shot task adaptation.
Findings
Outperforms existing multi-task and meta-learning methods in zero-shot experiments
Utilizes classifier-free guidance to improve task-specific weight generation
Demonstrates effectiveness on the Meta-VQA dataset
Abstract
We introduce meta-learning algorithms that perform zero-shot weight-space adaptation of neural network models to unseen tasks. Our methods repurpose the popular generative image synthesis techniques of natural language guidance and diffusion models to generate neural network weights adapted for tasks. We first train an unconditional generative hypernetwork model to produce neural network weights; then we train a second "guidance" model that, given a natural language task description, traverses the hypernetwork latent space to find high-performance task-adapted weights in a zero-shot manner. We explore two alternative approaches for latent space guidance: "HyperCLIP"-based classifier guidance and a conditional Hypernetwork Latent Diffusion Model ("HyperLDM"), which we show to benefit from the classifier-free guidance technique common in image generation. Finally, we demonstrate that our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications
MethodsLatent Diffusion Model · Diffusion · HyperNetwork
