RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
Zeyue Xue, Guanglu Song, Qiushan Guo, Boxiao Liu, Zhuofan Zong, Yu Liu, Ping Luo

TL;DR
RAPHAEL is a large diffusion model that generates highly artistic and accurate images from text prompts by leveraging a mixture of diffusion paths, outperforming existing models in quality and style diversity.
Contribution
Introduces RAPHAEL, a novel text-conditional diffusion model with mixture-of-experts layers enabling billions of diffusion paths for improved image generation.
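To give a feel for how stacked mixture-of-experts layers can yield "billions of diffusion paths", here is a minimal counting sketch. It assumes each layer independently routes to exactly one expert, so the number of routes is the product of the expert counts per layer. The layer and expert counts are hypothetical values chosen for illustration, not figures from the paper, and `count_paths` and `route_top1` are illustrative helper names rather than part of any released code.

```python
import math


def count_paths(experts_per_layer):
    """Count distinct routes when every layer picks exactly one expert.

    Assumption: routing choices in different layers are independent,
    so the total is the product of the per-layer expert counts.
    """
    return math.prod(experts_per_layer)


def route_top1(gate_scores):
    """Return the index of the highest-scoring expert (top-1 gating)."""
    return max(range(len(gate_scores)), key=gate_scores.__getitem__)


# Hypothetical configuration (not from the paper): 16 MoE layers, 4 experts each.
hypothetical_layers = [4] * 16
print(count_paths(hypothetical_layers))  # 4**16 = 4294967296

# One route is the sequence of experts chosen layer by layer.
example_scores = [[0.1, 0.7, 0.2, 0.0], [0.5, 0.1, 0.3, 0.1]]
print([route_top1(scores) for scores in example_scores])  # [1, 0]
```

Under these assumed numbers, even a modest stack already exceeds four billion routes, which is the combinatorial effect the contribution statement points to.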
Findings
Outperforms recent models like DALL-E 2 and Stable Diffusion in quality and style diversity.
Achieves a state-of-the-art zero-shot FID score of 6.61 on COCO.
Surpasses counterparts in human evaluations on ViLG-300.
Abstract
Text-to-image generation has recently witnessed remarkable achievements. We introduce a text-conditional image diffusion model, termed RAPHAEL, to generate highly artistic images, which accurately portray the text prompts, encompassing multiple nouns, adjectives, and verbs. This is achieved by stacking tens of mixture-of-experts (MoEs) layers, i.e., space-MoE and time-MoE layers, enabling billions of diffusion paths (routes) from the network input to the output. Each path intuitively functions as a "painter" for depicting a particular textual concept onto a specified image region at a diffusion timestep. Comprehensive experiments reveal that RAPHAEL outperforms recent cutting-edge models, such as Stable Diffusion, ERNIE-ViLG 2.0, DeepFloyd, and DALL-E 2, in terms of both image quality and aesthetic appeal. Firstly, RAPHAEL exhibits superior performance in switching images across diverse…
Taxonomy
Topics: Generative Adversarial Networks and Image Synthesis · Multimodal Machine Learning Applications · Computational and Text Analysis Methods
Methods: Diffusion
