Variational Inference with Tail-adaptive f-Divergence
Dilin Wang, Hao Liu, Qiang Liu

TL;DR
This paper introduces tail-adaptive f-divergences for variational inference, improving stability and mass-covering ability by adaptively managing importance weight tails, with applications in Bayesian neural networks and reinforcement learning.
Contribution
The paper proposes a novel class of tail-adaptive f-divergences that ensure finite moments and enhance mass-covering properties in variational inference.
Findings
Significant improvements over classical KL and α-divergence methods.
Effective in Bayesian neural networks and deep reinforcement learning.
Theoretically guarantees finite moments of importance weights.
Abstract
Variational inference with {\alpha}-divergences has been widely used in modern probabilistic machine learning. Compared to Kullback-Leibler (KL) divergence, a major advantage of using {\alpha}-divergences (with positive {\alpha} values) is their mass-covering property. However, estimating and optimizing {\alpha}-divergences require to use importance sampling, which could have extremely large or infinite variances due to heavy tails of importance weights. In this paper, we propose a new class of tail-adaptive f-divergences that adaptively change the convex function f with the tail of the importance weights, in a way that theoretically guarantees finite moments, while simultaneously achieving mass-covering properties. We test our methods on Bayesian neural networks, as well as deep reinforcement learning in which our method is applied to improve a recent soft actor-critic (SAC) algorithm.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Fault Detection and Control Systems
