CoordGAN: Self-Supervised Dense Correspondences Emerge from GANs
Jiteng Mu, Shalini De Mello, Zhiding Yu, Nuno Vasconcelos, Xiaolong, Wang, Jan Kautz, Sifei Liu

TL;DR
CoordGAN introduces a novel GAN architecture that explicitly learns dense pixel-level correspondences across images by disentangling structure and texture, enabling applications like segmentation transfer and improved interpretability.
Contribution
This work presents CoordGAN, a structure-texture disentangled GAN that explicitly learns dense correspondences and improves structure and texture disentanglement over prior methods.
Findings
Successfully extracts dense correspondence maps for generated and real images.
Achieves better structure and texture disentanglement compared to existing approaches.
Demonstrates effective segmentation mask transfer across datasets.
Abstract
Recent advances show that Generative Adversarial Networks (GANs) can synthesize images with smooth variations along semantically meaningful latent directions, such as pose, expression, layout, etc. While this indicates that GANs implicitly learn pixel-level correspondences across images, few studies explored how to extract them explicitly. In this work, we introduce Coordinate GAN (CoordGAN), a structure-texture disentangled GAN that learns a dense correspondence map for each generated image. We represent the correspondence maps of different images as warped coordinate frames transformed from a canonical coordinate frame, i.e., the correspondence map, which describes the structure (e.g., the shape of a face), is controlled via a transformation. Hence, finding correspondences boils down to locating the same coordinate in different correspondence maps. In CoordGAN, we sample a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Processing and 3D Reconstruction · Face recognition and analysis
