Generative Semantic Manipulation with Contrasting GAN
Xiaodan Liang, Hao Zhang, Eric P. Xing

TL;DR
This paper introduces a contrastive GAN framework for high-level semantic image manipulation, enabling realistic modifications of object categories while preserving original image characteristics.
Contribution
The paper proposes a novel contrastive adversarial objective and a mask-conditional architecture for semantic manipulation, improving over existing GANs in realism and semantic accuracy.
Findings
Outperforms existing conditional GANs on semantic manipulation tasks
Generates high-fidelity images with accurate semantic changes
Demonstrates effectiveness on ImageNet and MSCOCO datasets
Abstract
Generative Adversarial Networks (GANs) have recently achieved significant improvement on paired/unpaired image-to-image translation, such as photo sketch and artist painting style transfer. However, existing models can only be capable of transferring the low-level information (e.g. color or texture changes), but fail to edit high-level semantic meanings (e.g., geometric structure or content) of objects. On the other hand, while some researches can synthesize compelling real-world images given a class label or caption, they cannot condition on arbitrary shapes or structures, which largely limits their application scenarios and interpretive capability of model results. In this work, we focus on a more challenging semantic manipulation task, which aims to modify the semantic meaning of an object while preserving its own characteristics (e.g. viewpoints and shapes), such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Multimodal Machine Learning Applications
