On Relativistic $f$-Divergences
Alexia Jolicoeur-Martineau

TL;DR
This paper rigorously analyzes relativistic f-divergences in GANs, proving their properties, comparing them to Wasserstein distances, and examining estimators, revealing insights into their effectiveness and limitations.
Contribution
It introduces a formal framework for relativistic f-divergences, compares their strength to Wasserstein distances, and evaluates estimators, clarifying their roles in GAN performance.
Findings
Relativistic f-divergences are valid statistical divergences.
Wasserstein distance is weaker than relativistic f-divergences.
Removing bias in estimators does not improve GAN performance.
Abstract
This paper provides a more rigorous look at Relativistic Generative Adversarial Networks (RGANs). We prove that the objective function of the discriminator is a statistical divergence for any concave function with minimal properties (, , ). We also devise a few variants of relativistic -divergences. Wasserstein GAN was originally justified by the idea that the Wasserstein distance (WD) is most sensible because it is weak (i.e., it induces a weak topology). We show that the WD is weaker than -divergences which are weaker than relativistic -divergences. Given the good performance of RGANs, this suggests that WGAN does not performs well primarily because of the weak metric, but rather because of regularization and the use of a relativistic discriminator. We also take a closer look at estimators of relativistic -divergences. We introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Statistical Mechanics and Entropy · Adversarial Robustness in Machine Learning
MethodsConvolution · Wasserstein GAN
