Loading paper
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation | Tomesphere