Tighter Variational Bounds are Not Necessarily Better. A Research Report on Implementation, Ablation Study, and Extensions
Amine M'Charrak, V\'it R\r{u}\v{z}i\v{c}ka, Sangyun Shin, Madhu, Vankadari

TL;DR
This paper investigates the effects of increasing importance samples in IWAE, revealing that more samples can degrade gradient signal-to-noise ratio and proposing alternative methods to improve gradient estimates and posterior approximation.
Contribution
It introduces three novel importance weighted autoencoder variants that improve gradient estimation and posterior approximation, extending the original IWAE framework.
Findings
Increasing K can reduce gradient signal-to-noise ratio.
Tighter bounds benefit the generative network, looser bounds benefit inference.
Proposed methods produce closer posterior approximations than IWAE.
Abstract
This report explains, implements and extends the works presented in "Tighter Variational Bounds are Not Necessarily Better" (T Rainforth et al., 2018). We provide theoretical and empirical evidence that increasing the number of importance samples in the importance weighted autoencoder (IWAE) (Burda et al., 2016) degrades the signal-to-noise ratio (SNR) of the gradient estimator in the inference network and thereby affecting the full learning process. In other words, even though increasing decreases the standard deviation of the gradients, it also reduces the magnitude of the true gradient faster, thereby increasing the relative variance of the gradient updates. Extensive experiments are performed to understand the importance of . These experiments suggest that tighter variational bounds are beneficial for the generative network, whereas looser bounds are preferable for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis
