TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space
Daniel Garibi, Shahar Yadin, Roni Paiss, Omer Tov, Shiran Zada, Ariel, Ephrat, Tomer Michaeli, Inbar Mosseri, Tali Dekel

TL;DR
TokenVerse introduces a versatile method for multi-concept personalization in text-to-image diffusion models, enabling disentangled, localized control over complex visual attributes from minimal input images.
Contribution
It offers a novel optimization framework that finds semantic directions in modulation space for multi-concept personalization, handling multiple images and diverse concepts.
Findings
Effective multi-concept disentanglement from few images
Supports complex concept combinations in generated images
Outperforms existing personalization methods
Abstract
We present TokenVerse -- a method for multi-concept personalization, leveraging a pre-trained text-to-image diffusion model. Our framework can disentangle complex visual elements and attributes from as little as a single image, while enabling seamless plug-and-play generation of combinations of concepts extracted from multiple images. As opposed to existing works, TokenVerse can handle multiple images with multiple concepts each, and supports a wide-range of concepts, including objects, accessories, materials, pose, and lighting. Our work exploits a DiT-based text-to-image model, in which the input text affects the generation through both attention and modulation (shift and scale). We observe that the modulation space is semantic and enables localized control over complex concepts. Building on this insight, we devise an optimization-based framework that takes as input an image and a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Video Analysis and Summarization · Embedded Systems Design Techniques
MethodsSoftmax · Attention Is All You Need · Diffusion
