Supersampling Stable Diffusion and Beyond: A Seamless, Training-Free Approach for Scaling Neural Networks Using Common Interpolation Methods
Md Abu Obaida Zishan, Jannatun Noor, Annajiat Alim Rasel

TL;DR
This paper introduces a training-free kernel interpolation method for scaling Stable Diffusion models to generate higher-resolution images without finetuning, addressing object duplication artifacts.
Contribution
The authors propose a mathematically justified kernel interpolation technique that enables high-resolution image generation and neural network adaptation without additional training.
Findings
Kernel interpolation correctly scales convolution kernels with a constant coefficient.
Empirical results show competitive high-resolution image generation using the method.
The approach reduces memory footprint of neural networks by up to 4 times.
Abstract
Stable Diffusion (SD) has evolved DDPM (Denoising Diffusion Probabilistic Model) based image generation significantly by denoising in latent space instead of feature space. This popularized DDPM-based image generation as the cost and compute barrier was significantly lowered. However, these models could only generate fixed-resolution images according to their training configuration. When we attempt to generate higher resolutions, the resulting images show object duplication artifacts consistently. To solve this problem without finetuning SD models, recent works have tried dilating the convolution kernels of the models and have achieved a great level of success. But dilated kernels are harder to fine-tune due to being zero-gapped. Apart from this, other methods, such as patched diffusion, could not solve the object-duplication problem efficiently. Hence, to overcome the limitations of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
