HAM: A Training-Free Style Transfer Approach via Heterogeneous Attention Modulation for Diffusion Models
Yeqi He, Liang Li, Zhiwen Yang, Xichun Sheng, Zhidong Zhao, Chenggang Yan

TL;DR
This paper introduces HAM, a training-free style transfer method for diffusion models that uses heterogeneous attention modulation to better preserve content identity while transferring complex styles.
Contribution
It proposes a novel training-free approach employing heterogeneous attention modulation, including GAR and LAT, to improve style transfer quality and content preservation in diffusion models.
Findings
Achieves state-of-the-art performance on multiple metrics.
Effectively preserves content identity during style transfer.
Handles complex style references better than existing methods.
Abstract
Diffusion models have demonstrated remarkable performance in image generation, particularly within the domain of style transfer. Prevailing style transfer approaches typically leverage pre-trained diffusion models' robust feature extraction capabilities alongside external modular control pathways to explicitly impose style guidance signals. However, these methods often fail to capture complex style reference or retain the identity of user-provided content images, thus falling into the trap of style-content balance. Thus, we propose a training-free style transfer approach via eterogeneous ttention odulation () to protect identity information during image/text-guided style reference transfer, thereby addressing the style-content trade-off challenge. Specifically, we first introduces style noise initialization to initialize latent noise for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Image Enhancement Techniques
