Create Your World: Lifelong Text-to-Image Diffusion
Gan Sun, Wenqi Liang, Jiahua Dong, Jun Li, Zhengming Ding, Yang Cong

TL;DR
This paper introduces L2DM, a lifelong text-to-image diffusion model that learns new concepts quickly while retaining previous knowledge, addressing catastrophic forgetting and semantic neglect in image synthesis.
Contribution
The paper proposes a novel lifelong diffusion framework with memory enhancement, concept distillation, and attention modules to improve continual concept learning in text-to-image generation.
Findings
L2DM outperforms state-of-the-art models in qualitative image fidelity.
The model effectively mitigates catastrophic forgetting of previous concepts.
It maintains semantic relevance across diverse continual prompts.
Abstract
Text-to-image generative models can produce diverse high-quality images of concepts with a text prompt, which have demonstrated excellent ability in image generation, image translation, etc. We in this work study the problem of synthesizing instantiations of a use's own concepts in a never-ending manner, i.e., create your world, where the new concepts from user are quickly learned with a few examples. To achieve this goal, we propose a Lifelong text-to-image Diffusion Model (L2DM), which intends to overcome knowledge "catastrophic forgetting" for the past encountered concepts, and semantic "catastrophic neglecting" for one or more concepts in the text prompt. In respect of knowledge "catastrophic forgetting", our L2DM framework devises a task-aware memory enhancement module and a elastic-concept distillation module, which could respectively safeguard the knowledge of both prior concepts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Advanced Image and Video Retrieval Techniques
MethodsDiffusion
