Towards Better Data Augmentation using Wasserstein Distance in Variational Auto-encoder

Zichuan Chen; Peng Liu

arXiv:2109.14795 · cs.LG · August 17, 2022 · 1 cites

TL;DR

This paper introduces Wasserstein distance into variational auto-encoders to improve data augmentation, resulting in better convergence and more effective synthetic data for image classification.

Contribution

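The paper proposes replacing KL divergence with Wasserstein distance as the measure of distributional similarity for the latent attributes in a VAE, showing an improved theoretical lower bound (ELBO) under mild conditions and better practical performance for data augmentation.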

Findings

01

Under mild conditions, the Wasserstein-based VAE has a superior theoretical lower bound (ELBO) compared with the KL-based VAE.

02

The new loss function exhibits better convergence across multiple experiments.

03

Generated images better support image classification tasks.

Abstract

A VAE, or variational auto-encoder, compresses data into latent attributes and generates new data of different varieties. The VAE based on KL divergence has been considered an effective technique for data augmentation. In this paper, we propose the use of Wasserstein distance as a measure of distributional similarity for the latent attributes, and show its superior theoretical lower bound (ELBO) compared with that of KL divergence under mild conditions. Using multiple experiments, we demonstrate that the new loss function exhibits better convergence properties and generates artificial images that could better aid image classification tasks.
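To make the swap of similarity measures concrete, the sketch below compares the two regularizers for a single encoder output. It is an illustration under stated assumptions, not the paper's implementation: it assumes a diagonal Gaussian posterior N(mu, diag(sigma^2)), a standard normal prior, and the closed-form squared 2-Wasserstein distance between Gaussians. The abstract does not say which form of Wasserstein distance the authors use, and the function names are hypothetical.

```python
import math

def kl_to_standard_normal(mu, sigma):
    """KL( N(mu, diag(sigma^2)) || N(0, I) ): the usual VAE latent regularizer."""
    return 0.5 * sum(
        m * m + s * s - 1.0 - math.log(s * s) for m, s in zip(mu, sigma)
    )

def w2_squared_to_standard_normal(mu, sigma):
    """Squared 2-Wasserstein distance between N(mu, diag(sigma^2)) and N(0, I).

    Assumed form for illustration only; for diagonal Gaussians it reduces to
    ||mu||^2 + sum_i (sigma_i - 1)^2.
    """
    return sum(m * m + (s - 1.0) ** 2 for m, s in zip(mu, sigma))

# Hypothetical encoder output for a two-dimensional latent space.
mu = [0.5, -1.0]
sigma = [1.0, 0.5]

kl = kl_to_standard_normal(mu, sigma)
w2 = w2_squared_to_standard_normal(mu, sigma)
print(f"KL term:   {kl:.4f}")
print(f"W2^2 term: {w2:.4f}")
```

Both terms vanish when the posterior equals the prior (mu = 0, sigma = 1). One visible difference is that the KL term contains a log(sigma^2) factor that grows without bound as sigma approaches zero, while the Wasserstein term stays finite.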

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Image and Signal Denoising Methods