Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation
Yanwu Xu, Shaoan Xie, Wenhao Wu, Kun Zhang, Mingming Gong, Kayhan, Batmanghelich

TL;DR
This paper introduces a universal regularization called maximum spatial perturbation consistency (MSPC) for unpaired image-to-image translation, addressing limitations of existing constraints by better handling spatial variations and reducing content distortion.
Contribution
The paper proposes MSPC, a novel regularization enforcing commutativity between spatial perturbation and translation, with adversarial training components, improving I2I translation quality and robustness.
Findings
Outperforms state-of-the-art methods on most benchmarks.
Effective in handling spatial variations like object size and distortion.
Validated on a new front face to profile face dataset.
Abstract
Unpaired image-to-image translation (I2I) is an ill-posed problem, as an infinite number of translation functions can map the source domain distribution to the target distribution. Therefore, much effort has been put into designing suitable constraints, e.g., cycle consistency (CycleGAN), geometry consistency (GCGAN), and contrastive learning-based constraints (CUTGAN), that help better pose the problem. However, these well-known constraints have limitations: (1) they are either too restrictive or too weak for specific I2I tasks; (2) these methods result in content distortion when there is a significant spatial variation between the source and target domains. This paper proposes a universal regularization technique called maximum spatial perturbation consistency (MSPC), which enforces a spatial perturbation function (T ) and the translation operator (G) to be commutative (i.e., TG = GT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · AI in cancer detection · Multimodal Machine Learning Applications
MethodsALIGN
