Edge Guided GANs with Multi-Scale Contrastive Learning for Semantic Image Synthesis
Hao Tang, Guolei Sun, Nicu Sebe, Luc Van Gool

TL;DR
This paper introduces ECGAN, a novel semantic image synthesis framework that leverages edge guidance and multi-scale contrastive learning to improve local detail preservation, semantic consistency, and global semantic relations across multiple inputs.
Contribution
The paper proposes an innovative edge-guided attention module and a multi-scale contrastive learning approach to address key challenges in semantic image synthesis.
Findings
Enhanced preservation of local details and structures.
Improved semantic consistency across generated images.
Effective modeling of global semantic relations from multiple inputs.
Abstract
We propose a novel ECGAN for the challenging semantic image synthesis task. Although considerable improvements have been achieved by the community in the recent period, the quality of synthesized images is far from satisfactory due to three largely unresolved challenges. 1) The semantic labels do not provide detailed structural information, making it challenging to synthesize local details and structures; 2) The widely adopted CNN operations such as convolution, down-sampling, and normalization usually cause spatial resolution loss and thus cannot fully preserve the original semantic information, leading to semantically inconsistent results (e.g., missing small objects); 3) Existing semantic image synthesis methods focus on modeling 'local' semantic information from a single input semantic layout. However, they ignore 'global' semantic information of multiple input semantic layouts,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Advanced Neural Network Applications · Visual Attention and Saliency Detection
MethodsContrastive Learning · Focus
