CAT: Contrastive Adapter Training for Personalized Image Generation
Jae Wan Park, Sang Hyun Park, Jun Young Koh, Junha Lee, Min Song

TL;DR
This paper introduces Contrastive Adapter Training (CAT), a method that improves personalized image generation by preserving the base model's prior knowledge and output diversity, both of which conventional adapter training tends to degrade.
Contribution
The paper proposes the CAT loss, which improves adapter training by maintaining diversity and prior knowledge in personalized diffusion models, and the KPS metric, which measures how well that knowledge is preserved.
Findings
CAT improves diversity in object generation
KPS effectively measures knowledge preservation
Experimental results show enhanced image quality
Abstract
The emergence of various adapters, including Low-Rank Adaptation (LoRA) applied from the field of natural language processing, has allowed diffusion models to personalize image generation at a low cost. However, due to various challenges, including limited datasets and a shortage of regularization and computation resources, adapter training often produces unsatisfactory outcomes and corrupts the backbone model's prior knowledge. One well-known phenomenon is the loss of diversity in object generation, especially within the same class, which leads to the generation of almost identical objects with minor variations. This limits generation capabilities. To solve this issue, we present Contrastive Adapter Training (CAT), a simple yet effective strategy to enhance adapter training through the application of CAT loss. Our approach facilitates the preservation of…
Taxonomy
Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Topic Modeling
Methods: Balanced Selection · Adapter · Diffusion
