CSHNet: A Novel Information Asymmetric Image Translation Method

Xi Yang; Haoyuan Shi; Zihan Wang; Nannan Wang; Xinbo Gao

arXiv:2501.10197·cs.CV·January 20, 2025

CSHNet: A Novel Information Asymmetric Image Translation Method

Xi Yang, Haoyuan Shi, Zihan Wang, Nannan Wang, Xinbo Gao

PDF

Open Access 1 Repo

TL;DR

CSHNet introduces a hybrid CNN-Swin transformer architecture with interactive and loss-based modules to improve asymmetric cross-domain image translation, especially in structure preservation and detail enhancement.

Contribution

The paper proposes a novel CNN-Swin hybrid network with interactive and edge-preserving components for improved asymmetric image translation.

Findings

01

Outperforms existing methods in visual quality and metrics

02

Effective in both scene-level and instance-level datasets

03

Enhances structural integrity during translation

Abstract

Despite advancements in cross-domain image translation, challenges persist in asymmetric tasks such as SAR-to-Optical and Sketch-to-Instance conversions, which involve transforming data from a less detailed domain into one with richer content. Traditional CNN-based methods are effective at capturing fine details but struggle with global structure, leading to unwanted merging of image regions. To address this, we propose the CNN-Swin Hybrid Network (CSHNet), which combines two key modules: Swin Embedded CNN (SEC) and CNN Embedded Swin (CES), forming the SEC-CES-Bottleneck (SCB). SEC leverages CNN's detailed feature extraction while integrating the Swin Transformer's structural bias. CES, in turn, preserves the Swin Transformer's global integrity, compensating for CNN's lack of focus on structure. Additionally, CSHNet includes two components designed to enhance cross-domain information…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xdushi/cshnet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques

MethodsFocus