ViewNeRF: Unsupervised Viewpoint Estimation Using Category-Level Neural   Radiance Fields

Octave Mariotti; Oisin Mac Aodha; Hakan Bilen

arXiv:2212.00436·cs.CV·December 2, 2022·1 cites

ViewNeRF: Unsupervised Viewpoint Estimation Using Category-Level Neural Radiance Fields

Octave Mariotti, Oisin Mac Aodha, Hakan Bilen

PDF

Open Access

TL;DR

ViewNeRF is a novel unsupervised method that leverages category-level neural radiance fields to accurately estimate viewpoints from images, even in complex multi-scene scenarios, without requiring ground-truth camera poses.

Contribution

It introduces a self-supervised approach combining conditional NeRF with a viewpoint predictor and scene encoder for category-level viewpoint estimation.

Findings

01

Achieves accurate viewpoint prediction in complex scenarios

02

Performs well on synthetic and real datasets

03

Works on both single scenes and multi-instance collections

Abstract

We introduce ViewNeRF, a Neural Radiance Field-based viewpoint estimation method that learns to predict category-level viewpoints directly from images during training. While NeRF is usually trained with ground-truth camera poses, multiple extensions have been proposed to reduce the need for this expensive supervision. Nonetheless, most of these methods still struggle in complex settings with large camera movements, and are restricted to single scenes, i.e. they cannot be trained on a collection of scenes depicting the same object category. To address these issues, our method uses an analysis by synthesis approach, combining a conditional NeRF with a viewpoint predictor and a scene encoder in order to produce self-supervised reconstructions for whole object categories. Rather than focusing on high fidelity reconstruction, we target efficient and accurate viewpoint prediction in complex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · 3D Shape Modeling and Analysis