Generative Semantic Manipulation with Contrasting GAN

Xiaodan Liang; Hao Zhang; Eric P. Xing

arXiv:1708.00315·cs.CV·August 2, 2017·27 cites

Generative Semantic Manipulation with Contrasting GAN

Xiaodan Liang, Hao Zhang, Eric P. Xing

PDF

Open Access

TL;DR

This paper introduces a contrastive GAN framework for high-level semantic image manipulation, enabling realistic modifications of object categories while preserving original image characteristics.

Contribution

The paper proposes a novel contrastive adversarial objective and a mask-conditional architecture for semantic manipulation, improving over existing GANs in realism and semantic accuracy.

Findings

01

Outperforms existing conditional GANs on semantic manipulation tasks

02

Generates high-fidelity images with accurate semantic changes

03

Demonstrates effectiveness on ImageNet and MSCOCO datasets

Abstract

Generative Adversarial Networks (GANs) have recently achieved significant improvement on paired/unpaired image-to-image translation, such as photo $\to$ sketch and artist painting style transfer. However, existing models can only be capable of transferring the low-level information (e.g. color or texture changes), but fail to edit high-level semantic meanings (e.g., geometric structure or content) of objects. On the other hand, while some researches can synthesize compelling real-world images given a class label or caption, they cannot condition on arbitrary shapes or structures, which largely limits their application scenarios and interpretive capability of model results. In this work, we focus on a more challenging semantic manipulation task, which aims to modify the semantic meaning of an object while preserving its own characteristics (e.g. viewpoints and shapes), such as…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Multimodal Machine Learning Applications