Multi-cropping Contrastive Learning and Domain Consistency for   Unsupervised Image-to-Image Translation

Chen Zhao; Wei-Ling Cai; Zheng Yuan; Cheng-Wei Hu

arXiv:2304.12235·cs.CV·July 6, 2023·1 cites

Multi-cropping Contrastive Learning and Domain Consistency for Unsupervised Image-to-Image Translation

Chen Zhao, Wei-Ling Cai, Zheng Yuan, Cheng-Wei Hu

PDF

Open Access

TL;DR

This paper introduces MCDUT, a novel unsupervised image-to-image translation framework that enhances contrastive learning with multi-cropping and domain consistency, achieving state-of-the-art results.

Contribution

The paper proposes a new framework combining multi-cropping contrastive learning and domain consistency loss, along with a dual coordinate attention network for improved image translation.

Findings

01

Achieves state-of-the-art results on multiple image translation tasks.

02

Demonstrates the effectiveness of multi-cropping and domain consistency in contrastive learning.

03

Validates the proposed methods through extensive experiments and ablation studies.

Abstract

Recently, unsupervised image-to-image translation methods based on contrastive learning have achieved state-of-the-art results in many tasks. However, in the previous works, the negatives are sampled from the input image itself, which inspires us to design a data augmentation method to improve the quality of the selected negatives. Moreover, the previous methods only preserve the content consistency via patch-wise contrastive learning in the embedding space, which ignores the domain consistency between the generated images and the real images of the target domain. In this paper, we propose a novel unsupervised image-to-image translation framework based on multi-cropping contrastive learning and domain consistency, called MCDUT. Specifically, we obtain the multi-cropping views via the center-cropping and the random-cropping with the aim of further generating the high-quality negative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Cancer-related molecular mechanisms research · Generative Adversarial Networks and Image Synthesis

MethodsCoordinate attention · *Communicated@Fast*How Do I Communicate to Expedia? · Sigmoid Activation · Average Pooling · Dense Connections · Global Average Pooling · Kaiming Initialization · Max Pooling · Squeeze-and-Excitation Block · Convolution