HarmonPaint: Harmonized Training-Free Diffusion Inpainting

Ying Li; Xinzhe Li; Yong Du; Yangyang Xu; Junyu Dong; Shengfeng He

arXiv:2507.16732·cs.CV·July 23, 2025

HarmonPaint: Harmonized Training-Free Diffusion Inpainting

Ying Li, Xinzhe Li, Yong Du, Yangyang Xu, Junyu Dong, Shengfeng He

PDF

Open Access

TL;DR

HarmonPaint is a training-free diffusion inpainting method that uses attention mechanisms and masking strategies to produce coherent, style-harmonized inpainted images without retraining or fine-tuning.

Contribution

It introduces a novel training-free inpainting framework that leverages diffusion models' attention mechanisms for structural and style coherence.

Findings

01

Effective across diverse scenes and styles

02

Achieves high-quality, harmonized inpainting without training

03

Maintains structural fidelity and style transfer

Abstract

Existing inpainting methods often require extensive retraining or fine-tuning to integrate new content seamlessly, yet they struggle to maintain coherence in both structure and style between inpainted regions and the surrounding background. Motivated by these limitations, we introduce HarmonPaint, a training-free inpainting framework that seamlessly integrates with the attention mechanisms of diffusion models to achieve high-quality, harmonized image inpainting without any form of training. By leveraging masking strategies within self-attention, HarmonPaint ensures structural fidelity without model retraining or fine-tuning. Additionally, we exploit intrinsic diffusion model properties to transfer style information from unmasked to masked regions, achieving a harmonious integration of styles. Extensive experiments demonstrate the effectiveness of HarmonPaint across diverse scenes and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques