CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs

Jiteng Mu; Shalini De Mello; Zhiding Yu; Nuno Vasconcelos; Xiaolong; Wang; Jan Kautz; Sifei Liu

arXiv:2203.16521·cs.CV·March 31, 2022

CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs

Jiteng Mu, Shalini De Mello, Zhiding Yu, Nuno Vasconcelos, Xiaolong, Wang, Jan Kautz, Sifei Liu

PDF

Open Access 1 Repo

TL;DR

CoordGAN introduces a novel GAN architecture that explicitly learns dense pixel-level correspondences across images by disentangling structure and texture, enabling applications like segmentation transfer and improved interpretability.

Contribution

This work presents CoordGAN, a structure-texture disentangled GAN that explicitly learns dense correspondences and improves structure and texture disentanglement over prior methods.

Findings

01

Successfully extracts dense correspondence maps for generated and real images.

02

Achieves better structure and texture disentanglement compared to existing approaches.

03

Demonstrates effective segmentation mask transfer across datasets.

Abstract

Recent advances show that Generative Adversarial Networks (GANs) can synthesize images with smooth variations along semantically meaningful latent directions, such as pose, expression, layout, etc. While this indicates that GANs implicitly learn pixel-level correspondences across images, few studies explored how to extract them explicitly. In this work, we introduce Coordinate GAN (CoordGAN), a structure-texture disentangled GAN that learns a dense correspondence map for each generated image. We represent the correspondence maps of different images as warped coordinate frames transformed from a canonical coordinate frame, i.e., the correspondence map, which describes the structure (e.g., the shape of a face), is controlled via a transformation. Hence, finding correspondences boils down to locating the same coordinate in different correspondence maps. In CoordGAN, we sample a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NVlabs/CoordGAN
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Image Processing and 3D Reconstruction · Face recognition and analysis