Region-to-Region: Enhancing Generative Image Harmonization with Adaptive Regional Injection

Zhiqiu Zhang; Dongqi Fan; Mingjie Wang; Qiang Tang; Jian Yang; Zili Yi

arXiv:2508.09746·cs.CV·August 14, 2025

Region-to-Region: Enhancing Generative Image Harmonization with Adaptive Regional Injection

Zhiqiu Zhang, Dongqi Fan, Mingjie Wang, Qiang Tang, Jian Yang, Zili Yi

PDF

TL;DR

This paper introduces R2R, a novel image harmonization model that uses adaptive regional injection and a new synthetic dataset to improve detail preservation and realism in composite images.

Contribution

The paper proposes the R2R model with Clear-VAE and MACA for enhanced harmonization, and introduces the RPHarmony dataset generated via Random Poisson Blending.

Findings

01

R2R outperforms existing methods in quantitative metrics.

02

The RPHarmony dataset improves model generalization to real images.

03

The approach effectively preserves details and enhances visual harmony.

Abstract

The goal of image harmonization is to adjust the foreground in a composite image to achieve visual consistency with the background. Recently, latent diffusion model (LDM) are applied for harmonization, achieving remarkable results. However, LDM-based harmonization faces challenges in detail preservation and limited harmonization ability. Additionally, current synthetic datasets rely on color transfer, which lacks local variations and fails to capture complex real-world lighting conditions. To enhance harmonization capabilities, we propose the Region-to-Region transformation. By injecting information from appropriate regions into the foreground, this approach preserves original details while achieving image harmonization or, conversely, generating new composite data. From this perspective, We propose a novel model R2R. Specifically, we design Clear-VAE to preserve high-frequency details…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.