InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences
Chenyang Zhu, Kai Li, Yue Ma, Longxiang Tang, Chengyu Fang, Chubin, Chen, Qifeng Chen, Xiu Li

TL;DR
InstantSwap introduces a fast, efficient method for customized concept swapping in images that maintains consistency despite sharp shape differences, using attention-based object extraction and semantic-enhanced representations.
Contribution
The paper presents InstantSwap, a novel CCS approach that handles large shape disparities efficiently by combining attention-based object extraction, background preservation, and periodic gradient computation.
Findings
Outperforms existing methods in speed and consistency.
Effectively maintains foreground and background integrity during swapping.
Demonstrates versatility across diverse image scenarios.
Abstract
Recent advances in Customized Concept Swapping (CCS) enable a text-to-image model to swap a concept in the source image with a customized target concept. However, the existing methods still face the challenges of inconsistency and inefficiency. They struggle to maintain consistency in both the foreground and background during concept swapping, especially when the shape difference is large between objects. Additionally, they either require time-consuming training processes or involve redundant calculations during inference. To tackle these issues, we introduce InstantSwap, a new CCS method that aims to handle sharp shape disparity at speed. Specifically, we first extract the bbox of the object in the source image automatically based on attention map analysis and leverage the bbox to achieve both foreground and background consistency. For background consistency, we remove the gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
MethodsSoftmax · Attention Is All You Need · Focus
