DynamicVAE: Decoupling Reconstruction Error and Disentangled Representation Learning
Huajie Shao, Haohong Lin, Qinmin Yang, Shuochao Yao, Han Zhao, Tarek, Abdelzaher

TL;DR
DynamicVAE introduces a novel approach that dynamically adjusts the KL-divergence weight during training to decouple disentanglement from reconstruction accuracy, leading to improved performance in both aspects.
Contribution
The paper proposes DynamicVAE, which uses a stage-wise, adaptive control of the KL-divergence weight to separate disentanglement from reconstruction, overcoming limitations of existing methods.
Findings
Significantly improves reconstruction accuracy
Achieves comparable disentanglement to state-of-the-art methods
Successfully decouples disentanglement from reconstruction trade-off
Abstract
This paper challenges the common assumption that the weight , in -VAE, should be larger than in order to effectively disentangle latent factors. We demonstrate that -VAE, with , can not only attain good disentanglement but also significantly improve reconstruction accuracy via dynamic control. The paper removes the inherent trade-off between reconstruction accuracy and disentanglement for -VAE. Existing methods, such as -VAE and FactorVAE, assign a large weight to the KL-divergence term in the objective function, leading to high reconstruction errors for the sake of better disentanglement. To mitigate this problem, a ControlVAE has recently been developed that dynamically tunes the KL-divergence weight in an attempt to control the trade-off to more a favorable point. However, ControlVAE fails to eliminate the conflict between the need…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image and Signal Denoising Methods · Advanced Image Processing Techniques
