Semantic-guided Multi-Mask Image Harmonization
Xuqian Ren, Yifan Liu

TL;DR
This paper introduces a new semantic-guided multi-mask image harmonization task that handles multiple perturbations without input masks, proposing a flexible operator mask-based approach and benchmarks to improve and evaluate image editing in complex scenes.
Contribution
It proposes a novel multi-mask harmonization task and a flexible operator mask-based method, along with new benchmarks for complex scene image harmonization.
Findings
Operator masks improve editing flexibility.
The method outperforms state-of-the-art RGB regression models.
Benchmarks demonstrate effectiveness in complex scenes.
Abstract
Previous harmonization methods focus on adjusting one inharmonious region in an image based on an input mask. They may face problems when dealing with different perturbations on different semantic regions without available input masks. To deal with the problem that one image has been pasted with several foregrounds coming from different images and needs to harmonize them towards different domain directions without any mask as input, we propose a new semantic-guided multi-mask image harmonization task. Different from the previous single-mask image harmonization task, each inharmonious image is perturbed with different methods according to the semantic segmentation masks. Two challenging benchmarks, HScene and HLIP, are constructed based on and semantic classes, respectively. Furthermore, previous baselines focus on regressing the exact value for each pixel of the harmonized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Image and Signal Denoising Methods · Advanced Vision and Imaging
