Improved Techniques for Training Score-Based Generative Models
Yang Song, Stefano Ermon

TL;DR
This paper introduces improved training techniques for score-based generative models, enabling high-resolution image synthesis with enhanced stability and quality comparable to state-of-the-art GANs.
Contribution
The authors provide a new theoretical analysis, propose a weight averaging method, and demonstrate scalable high-resolution image generation with score-based models.
Findings
Successfully trained models on images up to 256x256 resolution
Generated high-fidelity images comparable to GANs on multiple datasets
Enhanced training stability across diverse datasets
Abstract
Score-based generative models can produce high quality image samples comparable to GANs, without requiring adversarial optimization. However, existing training procedures are limited to images of low resolution (typically below 32x32), and can be unstable under some settings. We provide a new theoretical analysis of learning and sampling from score models in high dimensional spaces, explaining existing failure modes and motivating new solutions that generalize across datasets. To enhance stability, we also propose to maintain an exponential moving average of model weights. With these improvements, we can effortlessly scale score-based generative models to images with unprecedented resolutions ranging from 64x64 to 256x256. Our score-based models can generate high-fidelity samples that rival best-in-class GANs on various image datasets, including CelebA, FFHQ, and multiple LSUN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Cell Image Analysis Techniques
