Semantic Example Guided Image-to-Image Translation
Jialu Huang, Jing Liao, Tak Wu Sam Kwong

TL;DR
This paper introduces a semantically guided image-to-image translation method that uses reference images for controlled, diverse outputs, employing a self-supervised framework and non-local blocks to enhance quality.
Contribution
It proposes a novel semantic matching approach for reference-guided I2I translation, improving output control and diversity without paired data.
Findings
Enhanced output diversity and quality demonstrated.
Semantic matching effectively preserves reference semantics.
Outperforms several state-of-the-art models.
Abstract
Many image-to-image (I2I) translation problems are in nature of high diversity that a single input may have various counterparts. Prior works proposed the multi-modal network that can build a many-to-many mapping between two visual domains. However, most of them are guided by sampled noises. Some others encode the reference images into a latent vector, by which the semantic information of the reference image will be washed away. In this work, we aim to provide a solution to control the output based on references semantically. Given a reference image and an input in another domain, a semantic matching is first performed between the two visual contents and generates the auxiliary image, which is explicitly encouraged to preserve semantic characteristics of the reference. A deep network then is used for I2I translation and the final outputs are expected to be semantically similar to both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Video Analysis and Summarization
