SmartBrush: Text and Shape Guided Object Inpainting with Diffusion Model
Shaoan Xie, Zhifei Zhang, Zhe Lin, Tobias Hinz, Kun Zhang

TL;DR
SmartBrush is a diffusion-based model that enables precise object inpainting guided by both text descriptions and shape masks, improving control and background preservation over previous methods.
Contribution
It introduces a novel multi-modal inpainting approach combining text and shape guidance with a new training and sampling strategy for better control and background retention.
Findings
Outperforms baselines in visual quality
Achieves better mask controllability
Preserves background more effectively
Abstract
Generic image inpainting aims to complete a corrupted image by borrowing surrounding information, which barely generates novel content. By contrast, multi-modal inpainting provides more flexible and useful controls on the inpainted content, \eg, a text prompt can be used to describe an object with richer attributes, and a mask can be used to constrain the shape of the inpainted object rather than being only considered as a missing area. We propose a new diffusion-based model named SmartBrush for completing a missing region with an object using both text and shape-guidance. While previous work such as DALLE-2 and Stable Diffusion can do text-guided inapinting they do not support shape guidance and tend to modify background texture surrounding the generated object. Our model incorporates both text and shape guidance with precision control. To preserve the background better, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques · Image Retrieval and Classification Techniques
MethodsConvolution · Max Pooling · *Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · Inpainting · U-Net · Diffusion
