Why Spectral Normalization Stabilizes GANs: Analysis and Improvements
Zinan Lin, Vyas Sekar, Giulia Fanti

TL;DR
This paper explains how spectral normalization stabilizes GAN training by controlling gradient issues, connects it to initialization methods, and proposes an improved normalization technique that enhances stability and sample quality.
Contribution
The paper provides a theoretical analysis of spectral normalization's effectiveness, links it to initialization strategies, and introduces Bidirectional Scaled Spectral Normalization (BSSN) for improved performance.
Findings
SN controls exploding and vanishing gradients in GANs
BSSN outperforms SN in stability and sample quality
Theoretical analysis links SN to LeCun initialization
Abstract
Spectral normalization (SN) is a widely-used technique for improving the stability and sample quality of Generative Adversarial Networks (GANs). However, there is currently limited understanding of why SN is effective. In this work, we show that SN controls two important failure modes of GAN training: exploding and vanishing gradients. Our proofs illustrate a (perhaps unintentional) connection with the successful LeCun initialization. This connection helps to explain why the most popular implementation of SN for GANs requires no hyper-parameter tuning, whereas stricter implementations of SN have poor empirical performance out-of-the-box. Unlike LeCun initialization which only controls gradient vanishing at the beginning of training, SN preserves this property throughout training. Building on this theoretical understanding, we propose a new spectral normalization technique: Bidirectional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks
MethodsXavier Initialization · Spectral Normalization
