Clipped Stochastic Methods for Variational Inequalities with Heavy-Tailed Noise
Eduard Gorbunov, Marina Danilova, David Dobre, Pavel Dvurechensky,, Alexander Gasnikov, Gauthier Gidel

TL;DR
This paper establishes the first high-probability convergence bounds for stochastic methods solving variational inequalities with heavy-tailed noise, demonstrating their effectiveness in unbounded domains and structured non-monotone problems.
Contribution
It provides novel high-probability complexity results for stochastic VIP methods under heavy-tailed noise, extending beyond previous sub-Gaussian assumptions.
Findings
High-probability bounds with logarithmic confidence dependence are proven.
Clipping improves the performance of SEG and SGDA in heavy-tailed noise scenarios.
Numerical experiments show practical GAN gradient noise is heavy-tailed.
Abstract
Stochastic first-order methods such as Stochastic Extragradient (SEG) or Stochastic Gradient Descent-Ascent (SGDA) for solving smooth minimax problems and, more generally, variational inequality problems (VIP) have been gaining a lot of attention in recent years due to the growing popularity of adversarial formulations in machine learning. However, while high-probability convergence bounds are known to reflect the actual behavior of stochastic methods more accurately, most convergence results are provided in expectation. Moreover, the only known high-probability complexity results have been derived under restrictive sub-Gaussian (light-tailed) noise and bounded domain assumption [Juditsky et al., 2011]. In this work, we prove the first high-probability complexity results with logarithmic dependence on the confidence level for stochastic methods for solving monotone and structured…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsStatistical Methods and Inference · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques
