A Stein Identity for q-Gaussians with Bounded Support
Sophia Sklaviadis, Thomas Moellenhoff, Andre F. T. Martins, Mario A. T. Figueiredo, Mohammad Emtiyaz Khan

TL;DR
This paper introduces a new Stein identity for bounded-support q-Gaussians, enabling easier gradient estimation with reduced variance, which benefits machine learning tasks involving non-Gaussian distributions.
Contribution
The authors extend Stein's identity to bounded-support q-Gaussians, providing simplified gradient estimators and variance reduction techniques for non-Gaussian distributions.
Findings
Gradient estimators for q-Gaussians have low variance.
Bounded-support distributions improve gradient estimation in practice.
The method is applicable to Bayesian deep learning and sharpness-aware minimization.
Abstract
Stein's identity is a fundamental tool in machine learning with applications in generative models, stochastic optimization, and other problems involving gradients of expectations under Gaussian distributions. Less attention has been paid to problems with non-Gaussian expectations. Here, we consider the class of bounded-support -Gaussians and derive a new Stein identity leading to gradient estimators which have nearly identical forms to the Gaussian ones, and which are similarly easy to implement. We do this by extending the previous results of Landsman, Vanduffel, and Yao (2013) to prove new Bonnet- and Price-type theorems for q-Gaussians. We also simplify their forms by using escort distributions. Our experiments show that bounded-support distributions can reduce the variance of gradient estimators, which can potentially be useful for Bayesian deep learning and sharpness-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference
