Inharmonious Region Localization via Recurrent Self-Reasoning

Penghao Wu; Li Niu; Jing Liang; Liqing Zhang

arXiv:2210.02036·cs.CV·October 6, 2022

Inharmonious Region Localization via Recurrent Self-Reasoning

Penghao Wu, Li Niu, Jing Liang, Liqing Zhang

PDF

Open Access

TL;DR

This paper introduces a Recurrent Self-Reasoning (RSR) module integrated into a UNet architecture to effectively localize inharmonious regions in synthetic images, enhancing image quality by identifying inconsistent regions.

Contribution

The novel RSR module enables pixel clustering into inharmonious and background regions within a UNet framework, improving localization accuracy in synthetic image analysis.

Findings

01

Achieves competitive performance on image harmonization datasets.

02

Effectively localizes inharmonious regions with improved accuracy.

03

Demonstrates both quantitative and qualitative improvements.

Abstract

Synthetic images created by image editing operations are prevalent, but the color or illumination inconsistency between the manipulated region and background may make it unrealistic. Thus, it is important yet challenging to localize the inharmonious region to improve the quality of synthetic image. Inspired by the classic clustering algorithm, we aim to group pixels into two clusters: inharmonious cluster and background cluster by inserting a novel Recurrent Self-Reasoning (RSR) module into the bottleneck of UNet structure. The mask output from RSR module is provided for the decoder as attention guidance. Finally, we adaptively combine the masks from RSR and the decoder to form our final mask. Experimental results on the image harmonization dataset demonstrate that our method achieves competitive performance both quantitatively and qualitatively.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Image Enhancement Techniques