Reward Incremental Learning in Text-to-Image Generation
Maorong Wang, Jiafeng Mao, Xueting Wang, Toshihiko Yamasaki

TL;DR
This paper introduces Reward Incremental Learning (RIL) for text-to-image diffusion models, addressing the challenge of adapting to multiple objectives over time while mitigating catastrophic forgetting through a novel distillation method.
Contribution
It defines the RIL problem, identifies catastrophic forgetting in diffusion models during incremental learning, and proposes Reward Incremental Distillation (RID) to effectively address these issues.
Findings
RID maintains high-quality image generation across multiple reward tasks
RID significantly reduces catastrophic forgetting in diffusion models
Experimental results validate RID's effectiveness in RIL scenarios
Abstract
The recent success of denoising diffusion models has significantly advanced text-to-image generation. While these large-scale pretrained models show excellent performance in general image synthesis, downstream objectives often require fine-tuning to meet specific criteria such as aesthetics or human preference. Reward gradient-based strategies are promising in this context, yet existing methods are limited to single-reward tasks, restricting their applicability in real-world scenarios that demand adapting to multiple objectives introduced incrementally over time. In this paper, we first define this more realistic and unexplored problem, termed Reward Incremental Learning (RIL), where models are desired to adapt to multiple downstream objectives incrementally. Additionally, while the models adapt to the ever-emerging new objectives, we observe a unique form of catastrophic forgetting in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Video Analysis and Summarization · Topic Modeling
MethodsDiffusion
