TL;DR
This paper proposes a geometry-guided conditional GAN approach for synthesizing images across drastically different views, such as ground and aerial perspectives, by leveraging homography to preserve details and improve realism.
Contribution
It introduces a novel method combining homography-based geometric constraints with GANs for cross-view image synthesis, enhancing detail preservation and realism.
Findings
Geometry constraints improve image detail quality
The method outperforms purely pixel-based synthesis approaches
Inpainting with GANs adds realism to transformed images
Abstract
We address the problem of generating images across two drastically different views, namely ground (street) and aerial (overhead) views. Image synthesis by itself is a very challenging computer vision task and is even more so when generation is conditioned on an image in another view. Due the difference in viewpoints, there is small overlapping field of view and little common content between these two views. Here, we try to preserve the pixel information between the views so that the generated image is a realistic representation of cross view input image. For this, we propose to use homography as a guide to map the images between the views based on the common field of view to preserve the details in the input image. We then use generative adversarial networks to inpaint the missing regions in the transformed image and add realism to it. Our exhaustive evaluation and model comparison…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
