RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance
Avideep Mukherjee, Soumya Banerjee, Piyush Rai, Vinay P. Namboodiri

TL;DR
This paper introduces RISSOLE, a parameter-efficient diffusion model that uses block-wise generation and retrieval-guidance to produce coherent images with fewer parameters, suitable for resource-limited devices.
Contribution
The paper proposes a retrieval-augmented block-wise diffusion approach that ensures coherence across image blocks, enabling compact yet high-quality image generation.
Findings
Achieves high-quality image generation with fewer parameters.
Ensures coherence across blocks through retrieval-guided conditioning.
Demonstrates effectiveness on latent diffusion models.
Abstract
Diffusion-based models demonstrate impressive generation capabilities. However, they also have a massive number of parameters, resulting in enormous model sizes, thus making them unsuitable for deployment on resource-constraint devices. Block-wise generation can be a promising alternative for designing compact-sized (parameter-efficient) deep generative models since the model can generate one block at a time instead of generating the whole image at once. However, block-wise generation is also considerably challenging because ensuring coherence across generated blocks can be non-trivial. To this end, we design a retrieval-augmented generation (RAG) approach and leverage the corresponding blocks of the images retrieved by the RAG module to condition the training and generation stages of a block-wise denoising diffusion model. Our conditioning schemes ensure coherence across the different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Time Series Analysis and Forecasting · Anomaly Detection Techniques and Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Softmax · Linear Layer · Adam · Weight Decay · Dropout · Layer Normalization · Dense Connections
