Continual Learning of Generative Models with Limited Data: From   Wasserstein-1 Barycenter to Adaptive Coalescence

Mehmet Dedeoglu; Sen Lin; Zhaofeng Zhang; Junshan Zhang

arXiv:2101.09225·cs.LG·January 25, 2021

Continual Learning of Generative Models with Limited Data: From Wasserstein-1 Barycenter to Adaptive Coalescence

Mehmet Dedeoglu, Sen Lin, Zhaofeng Zhang, Junshan Zhang

PDF

Open Access

TL;DR

This paper proposes a novel framework for continual learning of generative models at edge nodes with limited data, leveraging Wasserstein-1 barycenters of pre-trained models for efficient adaptation and model compression.

Contribution

It introduces a systematic Wasserstein-1 barycenter approach for adaptive coalescence of pre-trained models in continual learning at the edge.

Findings

01

Effective Wasserstein-1 barycenter computation for model aggregation

02

Fast adaptation of meta-models to local data at edge nodes

03

Model compression via weight ternarization improves efficiency

Abstract

Learning generative models is challenging for a network edge node with limited data and computing power. Since tasks in similar environments share model similarity, it is plausible to leverage pre-trained generative models from the cloud or other edge nodes. Appealing to optimal transport theory tailored towards Wasserstein-1 generative adversarial networks (WGAN), this study aims to develop a framework which systematically optimizes continual learning of generative models using local data at the edge node while exploiting adaptive coalescence of pre-trained generative models. Specifically, by treating the knowledge transfer from other nodes as Wasserstein balls centered around their pre-trained models, continual learning of generative models is cast as a constrained optimization problem, which is further reduced to a Wasserstein-1 barycenter problem. A two-stage approach is devised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Domain Adaptation and Few-Shot Learning

MethodsConvolution · Wasserstein GAN