Reward Incremental Learning in Text-to-Image Generation

Maorong Wang; Jiafeng Mao; Xueting Wang; Toshihiko Yamasaki

arXiv:2411.17310 · cs.CV · November 27, 2024

Open Access

TL;DR

This paper introduces Reward Incremental Learning (RIL) for text-to-image diffusion models, addressing the challenge of adapting to multiple objectives over time while mitigating catastrophic forgetting through a novel distillation method.

Contribution

The paper defines the RIL problem, identifies the catastrophic forgetting that diffusion models exhibit when fine-tuned on rewards incrementally, and proposes Reward Incremental Distillation (RID) to mitigate it.
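The setting can be illustrated with a toy sketch. It is not the paper's RID method, whose details are not given on this page. It assumes a hypothetical "generator" whose output is simply its parameter vector, quadratic toy rewards, and a squared-distance penalty toward a frozen copy of the previous-stage model as a stand-in for distillation. All names (`reward`, `train_incrementally`, `distill_weight`) are illustrative.

```python
from typing import List, Sequence


def reward(sample: Sequence[float], target: Sequence[float]) -> float:
    """Toy reward: higher when the generated sample is closer to the target."""
    return -sum((s - t) ** 2 for s, t in zip(sample, target))


def train_incrementally(
    targets: Sequence[Sequence[float]],
    distill_weight: float,
    steps: int = 200,
    lr: float = 0.05,
) -> List[float]:
    """Fine-tune on each reward task in turn (assumed toy setup).

    The "generator" output is theta itself. Before each task, the current
    model is frozen as a teacher, and a penalty
    distill_weight * ||theta - teacher||^2 keeps the student close to it.
    """
    theta = [0.0] * len(targets[0])  # stands in for the pretrained model
    for target in targets:
        teacher = list(theta)  # frozen copy of the model before this task
        for _ in range(steps):
            for i in range(len(theta)):
                # Gradient of the toy reward with respect to theta[i].
                reward_grad = -2.0 * (theta[i] - target[i])
                # Gradient of the distillation-style penalty.
                distill_grad = 2.0 * distill_weight * (theta[i] - teacher[i])
                # Ascend the reward while descending the penalty.
                theta[i] += lr * (reward_grad - distill_grad)
    return theta


if __name__ == "__main__":
    task_a, task_b = [1.0, 0.0], [0.0, 1.0]
    plain = train_incrementally([task_a, task_b], distill_weight=0.0)
    distilled = train_incrementally([task_a, task_b], distill_weight=1.0)
    print("reward on first task, no penalty:  ", reward(plain, task_a))
    print("reward on first task, with penalty:", reward(distilled, task_a))
```

In this toy, training on the second reward without the penalty moves the model entirely to the second target, so the first reward collapses. With the penalty, the model retains more of the first reward but gains less of the second, which is the stability versus plasticity trade-off that incremental methods have to balance.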

Findings

01. RID maintains high-quality image generation across multiple reward tasks.

02. RID significantly reduces catastrophic forgetting in diffusion models.

03. Experimental results validate RID's effectiveness in RIL scenarios.

Abstract

The recent success of denoising diffusion models has significantly advanced text-to-image generation. While these large-scale pretrained models show excellent performance in general image synthesis, downstream objectives often require fine-tuning to meet specific criteria such as aesthetics or human preference. Reward gradient-based strategies are promising in this context, yet existing methods are limited to single-reward tasks, restricting their applicability in real-world scenarios that demand adapting to multiple objectives introduced incrementally over time. In this paper, we first define this more realistic and unexplored problem, termed Reward Incremental Learning (RIL), where models are expected to adapt to multiple downstream objectives incrementally. Additionally, while the models adapt to the ever-emerging new objectives, we observe a unique form of catastrophic forgetting in…

Taxonomy

Topics: Multimodal Machine Learning Applications · Video Analysis and Summarization · Topic Modeling

Methods: Diffusion