Tighter Variational Bounds are Not Necessarily Better. A Research Report   on Implementation, Ablation Study, and Extensions

Amine M'Charrak; V\'it R\r{u}\v{z}i\v{c}ka; Sangyun Shin; Madhu; Vankadari

arXiv:2209.11875·stat.ML·September 27, 2022

Tighter Variational Bounds are Not Necessarily Better. A Research Report on Implementation, Ablation Study, and Extensions

Amine M'Charrak, V\'it R\r{u}\v{z}i\v{c}ka, Sangyun Shin, Madhu, Vankadari

PDF

Open Access 1 Repo

TL;DR

This paper investigates the effects of increasing importance samples in IWAE, revealing that more samples can degrade gradient signal-to-noise ratio and proposing alternative methods to improve gradient estimates and posterior approximation.

Contribution

It introduces three novel importance weighted autoencoder variants that improve gradient estimation and posterior approximation, extending the original IWAE framework.

Findings

01

Increasing K can reduce gradient signal-to-noise ratio.

02

Tighter bounds benefit the generative network, looser bounds benefit inference.

03

Proposed methods produce closer posterior approximations than IWAE.

Abstract

This report explains, implements and extends the works presented in "Tighter Variational Bounds are Not Necessarily Better" (T Rainforth et al., 2018). We provide theoretical and empirical evidence that increasing the number of importance samples $K$ in the importance weighted autoencoder (IWAE) (Burda et al., 2016) degrades the signal-to-noise ratio (SNR) of the gradient estimator in the inference network and thereby affecting the full learning process. In other words, even though increasing $K$ decreases the standard deviation of the gradients, it also reduces the magnitude of the true gradient faster, thereby increasing the relative variance of the gradient updates. Extensive experiments are performed to understand the importance of $K$ . These experiments suggest that tighter variational bounds are beneficial for the generative network, whereas looser bounds are preferable for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

madhubabuv/TightIWAE
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis