Image-to-image translation for cross-domain disentanglement
Abel Gonzalez-Garcia, Joost van de Weijer, Yoshua Bengio

TL;DR
This paper introduces a novel cross-domain disentanglement approach that separates shared and exclusive features in image translation, enabling diverse, controllable, and label-free image transfer and retrieval across domains.
Contribution
The paper proposes a new model combining bidirectional translation and cross-domain autoencoders to achieve disentanglement and improve multi-modal image translation and retrieval.
Findings
Outperforms state-of-the-art in multi-modal translation
Enables diverse, controllable image transfer without labels
Achieves better results on challenging datasets
Abstract
Deep image translation methods have recently shown excellent results, outputting high-quality images covering multiple modes of the data distribution. There has also been increased interest in disentangling the internal representations learned by deep methods to further improve their performance and achieve a finer control. In this paper, we bridge these two objectives and introduce the concept of cross-domain disentanglement. We aim to separate the internal representation into three parts. The shared part contains information for both domains. The exclusive parts, on the other hand, contain only factors of variation that are particular to each domain. We achieve this through bidirectional image translation based on Generative Adversarial Networks and cross-domain autoencoders, a novel network component. Our model offers multiple advantages. We can output diverse samples covering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques · Digital Media Forensic Detection
