DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation
Jing He, Haodong Li, Yongzhe Hu, Guibao Shen, Yingjie Cai, Weichao, Qiu, Ying-Cong Chen

TL;DR
DisEnvisioner introduces a tuning-free method that extracts and enriches subject-essential features from a single image, significantly improving personalized image generation by filtering irrelevant information and enhancing ID preservation.
Contribution
It presents a novel disentangled visual prompt approach that effectively separates and enriches subject features for improved customization without tuning.
Findings
Outperforms existing methods in instruction response and ID consistency.
Achieves faster inference and higher image quality.
Demonstrates superior customization performance in experiments.
Abstract
In the realm of image generation, creating customized images from visual prompt with additional textual instruction emerges as a promising endeavor. However, existing methods, both tuning-based and tuning-free, struggle with interpreting the subject-essential attributes from the visual prompt. This leads to subject-irrelevant attributes infiltrating the generation process, ultimately compromising the personalization quality in both editability and ID preservation. In this paper, we present DisEnvisioner, a novel approach for effectively extracting and enriching the subject-essential features while filtering out -irrelevant information, enabling exceptional customization performance, in a tuning-free manner and using only a single image. Specifically, the feature of the subject and other irrelevant components are effectively separated into distinctive visual tokens, enabling a much more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAesthetic Perception and Analysis · Data Visualization and Analytics
