CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation
Yifeng Xu, Zhenliang He, Shiguang Shan, Xilin Chen

TL;DR
CtrLoRA introduces a flexible, efficient framework for controllable image generation that reduces training costs and complexity by learning shared knowledge and condition-specific details with minimal data and training time.
Contribution
The paper proposes CtrLoRA, a novel framework that enables quick adaptation to new conditions with fewer data and training time, while significantly reducing model complexity compared to existing methods.
Findings
Requires only 1,000 data pairs for new conditions
Reduces learnable parameters by 90% compared to ControlNet
Achieves satisfactory results with less than one hour of training
Abstract
Recently, large-scale diffusion models have made impressive progress in text-to-image (T2I) generation. To further equip these T2I models with fine-grained spatial control, approaches like ControlNet introduce an extra network that learns to follow a condition image. However, for every single condition type, ControlNet requires independent training on millions of data pairs with hundreds of GPU hours, which is quite expensive and makes it challenging for ordinary users to explore and develop new types of conditions. To address this problem, we propose the CtrLoRA framework, which trains a Base ControlNet to learn the common knowledge of image-to-image generation from multiple base conditions, along with condition-specific LoRAs to capture distinct characteristics of each condition. Utilizing our pretrained Base ControlNet, users can easily adapt it to new conditions, requiring as few as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Medical Image Segmentation Techniques
MethodsDiffusion · Balanced Selection
