Continual Learning of Generative Models with Limited Data: From Wasserstein-1 Barycenter to Adaptive Coalescence
Mehmet Dedeoglu, Sen Lin, Zhaofeng Zhang, Junshan Zhang

TL;DR
This paper proposes a novel framework for continual learning of generative models at edge nodes with limited data, leveraging Wasserstein-1 barycenters of pre-trained models for efficient adaptation and model compression.
Contribution
It introduces a systematic Wasserstein-1 barycenter approach for adaptive coalescence of pre-trained models in continual learning at the edge.
Findings
Effective Wasserstein-1 barycenter computation for model aggregation
Fast adaptation of meta-models to local data at edge nodes
Model compression via weight ternarization improves efficiency
Abstract
Learning generative models is challenging for a network edge node with limited data and computing power. Since tasks in similar environments share model similarity, it is plausible to leverage pre-trained generative models from the cloud or other edge nodes. Appealing to optimal transport theory tailored towards Wasserstein-1 generative adversarial networks (WGAN), this study aims to develop a framework which systematically optimizes continual learning of generative models using local data at the edge node while exploiting adaptive coalescence of pre-trained generative models. Specifically, by treating the knowledge transfer from other nodes as Wasserstein balls centered around their pre-trained models, continual learning of generative models is cast as a constrained optimization problem, which is further reduced to a Wasserstein-1 barycenter problem. A two-stage approach is devised…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning
MethodsConvolution · Wasserstein GAN
