Unsupervised Novel View Synthesis from a Single Image
Pierluigi Zama Ramirez, Diego Martin Arroyo, Alessio Tonioni, Federico, Tombari

TL;DR
This paper introduces an unsupervised approach for novel view synthesis from a single image, eliminating the need for 3D or multi-view supervision during training, and demonstrates its effectiveness on synthetic and natural datasets.
Contribution
It presents a novel unsupervised training framework that combines generative modeling and autoencoding for view synthesis without supervision.
Findings
Effective on ShapeNet and natural image datasets
No competing methods trained on unconstrained natural images
Generates consistent novel views from a single image
Abstract
Novel view synthesis from a single image has recently achieved remarkable results, although the requirement of some form of 3D, pose, or multi-view supervision at training time limits the deployment in real scenarios. This work aims at relaxing these assumptions enabling training of conditional generative models for novel view synthesis in a completely unsupervised manner. We first pre-train a purely generative decoder model using a 3D-aware GAN formulation while at the same time train an encoder network to invert the mapping from latent space to images. Then, we swap encoder and decoder and train the network as a conditioned GAN with a mixture of an autoencoder-like objective and self-distillation. At test time, given a view of an object, our model first embeds the image content in a latent code and regresses its pose, then generates novel views of it by keeping the code fixed and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
