Novel View Synthesis from Single Images via Point Cloud Transformation
Hoang-An Le, Thomas Mensink, Partha Das, Theo Gevers

TL;DR
This paper introduces a method for novel view synthesis from a single image by estimating a point cloud to represent object geometry, then using image completion to generate dense views, validated on the ShapeNet benchmark.
Contribution
The approach combines point cloud estimation from a single image with end-to-end training using warping techniques, enabling explicit 3D shape use for view synthesis without depth supervision.
Findings
Effective on ShapeNet benchmark
Produces dense novel views from single images
End-to-end training without depth supervision
Abstract
In this paper the argument is made that for true novel view synthesis of objects, where the object can be synthesized from any viewpoint, an explicit 3D shape representation isdesired. Our method estimates point clouds to capture the geometry of the object, which can be freely rotated into the desired view and then projected into a new image. This image, however, is sparse by nature and hence this coarse view is used as the input of an image completion network to obtain the dense target view. The point cloud is obtained using the predicted pixel-wise depth map, estimated from a single RGB input image,combined with the camera intrinsics. By using forward warping and backward warpingbetween the input view and the target view, the network can be trained end-to-end without supervision on depth. The benefit of using point clouds as an explicit 3D shape for novel view synthesis is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHermeneutics and Narrative Identity · Aging, Elder Care, and Social Issues · Health, Medicine and Society
