Hierarchical Importance Weighted Autoencoders
Chin-Wei Huang, Kris Sankaran, Eeshan Dhekane, Alexandre Lacoste,, Aaron Courville

TL;DR
This paper introduces a hierarchical importance weighted autoencoder that uses correlated proposals to reduce variance and improve inference quality in importance sampling, supported by theoretical analysis and empirical validation.
Contribution
It proposes a hierarchical structure for importance sampling proposals to induce correlation, reducing variance and enhancing inference efficiency.
Findings
Hierarchical proposals induce negative correlation, reducing estimator variance.
Maximizing the lower bound implicitly minimizes variance.
Inference performance improves with more samples.
Abstract
Importance weighted variational inference (Burda et al., 2015) uses multiple i.i.d. samples to have a tighter variational lower bound. We believe a joint proposal has the potential of reducing the number of redundant samples, and introduce a hierarchical structure to induce correlation. The hope is that the proposals would coordinate to make up for the error made by one another to reduce the variance of the importance estimator. Theoretically, we analyze the condition under which convergence of the estimator variance can be connected to convergence of the lower bound. Empirically, we confirm that maximization of the lower bound does implicitly minimize variance. Further analysis shows that this is a result of negative correlation induced by the proposed hierarchical meta sampling scheme, and performance of inference also improves when the number of samples increases.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Domain Adaptation and Few-Shot Learning · Gaussian Processes and Bayesian Inference
