Attention-GAN for Object Transfiguration in Wild Images
Xinyuan Chen, Chang Xu, Xiaokang Yang, Dacheng Tao

TL;DR
This paper introduces a novel Attention-GAN framework that decomposes the object transfiguration process into dedicated attention and transformation networks, improving focus on objects and enhancing image quality in wild images.
Contribution
The paper proposes separating attention prediction from object translation in GANs, utilizing sparse attention maps and segmentation annotations to improve object transfiguration.
Findings
Attention maps remain consistent before and after transfiguration.
The method achieves more accurate attention and higher quality generated images.
Segmentation annotations enhance attention learning.
Abstract
This paper studies the object transfiguration problem in wild images. The generative network in classical GANs for object transfiguration often undertakes a dual responsibility: to detect the objects of interests and to convert the object from source domain to target domain. In contrast, we decompose the generative network into two separat networks, each of which is only dedicated to one particular sub-task. The attention network predicts spatial attention maps of images, and the transformation network focuses on translating objects. Attention maps produced by attention network are encouraged to be sparse, so that major attention can be paid to objects of interests. No matter before or after object transfiguration, attention maps should remain constant. In addition, learning attention network can receive more instructions, given the available segmentation annotations of images.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Advanced Image and Video Retrieval Techniques
