Training-and-Prompt-Free General Painterly Harmonization via Zero-Shot Disentenglement on Style and Content References
Teng-Fang Hsiao, Bo-Kai Ruan, Hong-Han Shuai

TL;DR
The paper introduces TF-GPH, a training-and-prompt-free method for painterly image harmonization that uses novel disentanglement and reweighting mechanisms to improve style-content blending without additional training or prompts.
Contribution
It presents a new zero-shot harmonization approach with innovative disentanglement and reweighting techniques, along with a new benchmark and evaluation metrics.
Findings
Effective in harmonizing images without training or prompts
Outperforms existing methods on multiple benchmarks
Proposed metrics better reflect real-world performance
Abstract
Painterly image harmonization aims at seamlessly blending disparate visual elements within a single image. However, previous approaches often struggle due to limitations in training data or reliance on additional prompts, leading to inharmonious and content-disrupted output. To surmount these hurdles, we design a Training-and-prompt-Free General Painterly Harmonization method (TF-GPH). TF-GPH incorporates a novel ``Similarity Disentangle Mask'', which disentangles the foreground content and background image by redirecting their attention to corresponding reference images, enhancing the attention mechanism for multi-image inputs. Additionally, we propose a ``Similarity Reweighting'' mechanism to balance harmonization between stylization and content preservation. This mechanism minimizes content disruption by prioritizing the content-similar features within the given background style…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Industrial Vision Systems and Defect Detection · Image and Video Quality Assessment
MethodsLatent Diffusion Model · Diffusion
