Multi-cropping Contrastive Learning and Domain Consistency for Unsupervised Image-to-Image Translation
Chen Zhao, Wei-Ling Cai, Zheng Yuan, Cheng-Wei Hu

TL;DR
This paper introduces MCDUT, a novel unsupervised image-to-image translation framework that enhances contrastive learning with multi-cropping and domain consistency, achieving state-of-the-art results.
Contribution
The paper proposes a new framework combining multi-cropping contrastive learning and domain consistency loss, along with a dual coordinate attention network for improved image translation.
Findings
Achieves state-of-the-art results on multiple image translation tasks.
Demonstrates the effectiveness of multi-cropping and domain consistency in contrastive learning.
Validates the proposed methods through extensive experiments and ablation studies.
Abstract
Recently, unsupervised image-to-image translation methods based on contrastive learning have achieved state-of-the-art results in many tasks. However, in the previous works, the negatives are sampled from the input image itself, which inspires us to design a data augmentation method to improve the quality of the selected negatives. Moreover, the previous methods only preserve the content consistency via patch-wise contrastive learning in the embedding space, which ignores the domain consistency between the generated images and the real images of the target domain. In this paper, we propose a novel unsupervised image-to-image translation framework based on multi-cropping contrastive learning and domain consistency, called MCDUT. Specifically, we obtain the multi-cropping views via the center-cropping and the random-cropping with the aim of further generating the high-quality negative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Cancer-related molecular mechanisms research · Generative Adversarial Networks and Image Synthesis
MethodsCoordinate attention · *Communicated@Fast*How Do I Communicate to Expedia? · Sigmoid Activation · Average Pooling · Dense Connections · Global Average Pooling · Kaiming Initialization · Max Pooling · Squeeze-and-Excitation Block · Convolution
