CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models

Nick Stracke; Stefan Andreas Baumann; Joshua M. Susskind; Miguel Angel Bautista; Björn Ommer

arXiv:2405.07913·cs.CV·October 10, 2024·2 cites

TL;DR

This paper introduces LoRAdapter, a novel, efficient method for zero-shot control of text-to-image diffusion models, enabling detailed style and structure conditioning during image generation.

Contribution

LoRAdapter unifies style and structure conditioning with a new conditional LoRA block, improving zero-shot control in text-to-image models.

Findings

01 Outperforms recent state-of-the-art methods

02 Enables fine-grained control during generation

03 Architecture-agnostic and efficient approach

Abstract

Text-to-image generative models have become a prominent and powerful tool that excels at generating high-resolution realistic images. However, guiding the generative process of these models to consider detailed forms of conditioning reflecting style and/or structure information remains an open problem. In this paper, we present LoRAdapter, an approach that unifies both style and structure conditioning under the same formulation using a novel conditional LoRA block that enables zero-shot control. LoRAdapter is an efficient, powerful, and architecture-agnostic approach to condition text-to-image diffusion models, which enables fine-grained control conditioning during generation and outperforms recent state-of-the-art approaches.
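To make the idea of a conditional LoRA block more concrete, here is a minimal sketch in plain Python. The abstract does not spell out the block's internals, so everything below is an assumption for illustration: a frozen linear layer gets a low-rank update whose intermediate activation is scaled and shifted by a condition vector (for example a style or structure embedding). The class name `ConditionalLoRALinear`, the helper `matvec`, and the parameters `W_scale` and `W_shift` are hypothetical and are not taken from the paper or its released code.

```python
import random


def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


class ConditionalLoRALinear:
    """Frozen linear layer plus a condition-modulated low-rank update.

    Assumed form (illustrative, not the paper's stated formulation):
        y = W0 x + B ((1 + scale(c)) * (A x) + shift(c))
    """

    def __init__(self, d_in, d_out, rank, d_cond, seed=0):
        rng = random.Random(seed)

        def rand(rows, cols):
            return [[rng.gauss(0.0, 0.02) for _ in range(cols)] for _ in range(rows)]

        self.W0 = rand(d_out, d_in)        # pretrained weight, kept frozen
        self.A = rand(rank, d_in)          # LoRA down-projection
        # Zero-initialised up-projection: the adapter starts as a no-op.
        self.B = [[0.0] * rank for _ in range(d_out)]
        self.W_scale = rand(rank, d_cond)  # maps condition -> per-rank scale
        self.W_shift = rand(rank, d_cond)  # maps condition -> per-rank shift

    def forward(self, x, c):
        base = matvec(self.W0, x)          # output of the frozen layer
        z = matvec(self.A, x)              # low-rank activation
        scale = matvec(self.W_scale, c)
        shift = matvec(self.W_shift, c)
        # Condition-dependent affine modulation in the low-rank space.
        z_mod = [(1.0 + s) * zi + t for zi, s, t in zip(z, scale, shift)]
        delta = matvec(self.B, z_mod)      # project back to the output size
        return [b + d for b, d in zip(base, delta)]
```

In this sketch only `A`, `B`, `W_scale`, and `W_shift` would be trained, which is where the efficiency of a LoRA-style adapter comes from. Because `B` starts at zero, the layer initially reproduces the frozen model exactly, and the condition only starts to influence the output once `B` has been updated.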

Code & Models

Models

🤗 kliyer/LoRAdapter · model · ♡ 2

Taxonomy

Topics: Medical Imaging Techniques and Applications · Advanced Optical Sensing Technologies

Methods: Diffusion