SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised   Image-to-Image Translation

Xuning Shao; Weidong Zhang

arXiv:2103.16219·cs.CV·August 30, 2021

SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised Image-to-Image Translation

Xuning Shao, Weidong Zhang

PDF

Open Access 1 Repo 1 Models

TL;DR

SPatchGAN introduces a statistical feature-based discriminator for unsupervised image-to-image translation, improving stability and detail preservation without complex constraints, outperforming state-of-the-art models across diverse tasks.

Contribution

The paper presents a novel discriminator architecture that emphasizes statistical features over patches, simplifying the framework and enhancing translation quality.

Findings

01

Outperforms existing models in selfie-to-anime translation

02

Effective in male-to-female and glasses removal tasks

03

Stabilizes training through multi-scale distribution matching

Abstract

For unsupervised image-to-image translation, we propose a discriminator architecture which focuses on the statistical features instead of individual patches. The network is stabilized by distribution matching of key statistical features at multiple scales. Unlike the existing methods which impose more and more constraints on the generator, our method facilitates the shape deformation and enhances the fine details with a greatly simplified framework. We show that the proposed method outperforms the existing state-of-the-art models in various challenging applications including selfie-to-anime, male-to-female and glasses removal.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NetEase-GameAI/SPatchGAN
tfOfficial

Models

🤗
necoarc/spatchgan-model
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Video Analysis and Summarization · Handwritten Text Recognition Techniques