SeqLoRA: Bilevel Orthogonal Adaptation for Continual Multi-Concept Generation
Javad Parsa, Enis Simsar, Amir Joudaki, Thomas Hofmann, Andr\'e M. H. Teixeira

TL;DR
SeqLoRA introduces a bilevel optimization framework for continual multi-concept generation in text-to-image models, improving fidelity and scalability while minimizing interference and catastrophic forgetting.
Contribution
It proposes a novel bilevel optimization method for LoRA adaptation, with theoretical guarantees and superior multi-concept generation performance.
Findings
Enhances identity preservation across 101 concepts.
Reduces attribute interference in generated images.
Provides convergence guarantees and models residual activations.
Abstract
Parameter-efficient fine-tuning enables fast personalization of text-to-image diffusion models, but composing multiple custom concepts remains challenging due to representation interference. Existing modular methods either rely on expensive post-hoc fusion or freeze adaptation subspaces, which limit expressiveness and concept fidelity. To address this trade-off, we propose Sequential regularized LoRA (SeqLoRA), a constrained continual learning framework that jointly optimizes both LoRA factors via bilevel optimization. Theoretically, we establish strong convergence guarantees for our algorithm and model the residual layer activations as a matrix sub-Gaussian process to derive high-probability bounds on catastrophic forgetting. We further prove that learning the LoRA basis from data minimizes residual interference energy more effectively than frozen-basis methods. Experiments on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
