Structure-Aware NeRF without Posed Camera via Epipolar Constraint

Shu Chen, Yang Zhang, Yaxin Xu, and Beiji Zou

arXiv:2210.00183·cs.CV·October 4, 2022·5 cites

TL;DR

This paper presents a novel end-to-end method for NeRF that jointly optimizes camera poses and view synthesis using only RGB images, leveraging epipolar constraints and a CNN-based pose network.

Contribution

It introduces a unified framework that eliminates the need for pre-acquired camera poses, improving NeRF's robustness and scene understanding by integrating pose estimation into the training process.

Findings

01 Joint optimization improves view synthesis quality.

02 Method achieves better generalization across scenes.

03 Eliminates dependency on external pose estimation tools.

Abstract

The neural radiance field (NeRF) for realistic novel view synthesis requires camera poses to be pre-acquired by a structure-from-motion (SfM) approach. This two-stage strategy is inconvenient and degrades performance, because errors in pose extraction propagate to view synthesis. We integrate pose extraction and view synthesis into a single end-to-end procedure so that they can benefit from each other. To train NeRF models, only RGB images are given, without pre-known camera poses. The camera poses are obtained through the epipolar constraint, under which an identical feature observed in different views must have the same world coordinates once its local camera coordinates are transformed according to the extracted poses. The epipolar constraint is optimized jointly with the pixel color constraint. The poses are represented by a CNN-based deep network whose input is the related frames.…
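As a rough illustration of the constraint described in the abstract, the sketch below checks that one matched feature, expressed in the local coordinates of two cameras, maps to the same world point under the two poses, and then combines that residual with a color term. It is not taken from the paper or its repository: the camera-to-world convention, the assumption that local 3D coordinates are already available, the helper names, and the weight `lam` are all hypothetical choices made for this example.

```python
import math

def rot_y(theta):
    """Rotation about the y-axis (used here as a camera-to-world rotation)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]]

def cam_to_world(R, t, p):
    """Map a point p from local camera coordinates to world coordinates: R p + t."""
    return [sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3)]

def world_to_cam(R, t, x):
    """Inverse map, R^T (x - t); used only to build the toy data below."""
    d = [x[i] - t[i] for i in range(3)]
    return [sum(R[j][i] * d[j] for j in range(3)) for i in range(3)]

def consistency_residual(pose_a, p_a, pose_b, p_b):
    """Squared distance between the world positions of one matched feature."""
    xa = cam_to_world(*pose_a, p_a)
    xb = cam_to_world(*pose_b, p_b)
    return sum((u - v) ** 2 for u, v in zip(xa, xb))

def joint_loss(color_loss, residuals, lam=0.1):
    """Color term plus a weighted mean of residuals; lam is an assumed weight."""
    return color_loss + lam * sum(residuals) / len(residuals)

# Toy scene: one world point seen by two cameras with known (ground-truth) poses.
X = [0.5, -0.2, 4.0]
pose_a = (rot_y(0.0), [0.0, 0.0, 0.0])
pose_b = (rot_y(math.pi / 6), [1.0, 0.0, 0.0])
p_a = world_to_cam(*pose_a, X)
p_b = world_to_cam(*pose_b, X)

# With the true poses the residual vanishes (up to rounding).
exact = consistency_residual(pose_a, p_a, pose_b, p_b)

# With a slightly wrong estimate for camera B the residual becomes positive,
# which is the signal a pose estimator could be trained to reduce.
wrong_pose_b = (rot_y(math.pi / 6 + 0.05), [1.1, 0.0, 0.0])
perturbed = consistency_residual(pose_a, p_a, wrong_pose_b, p_b)
```

In a joint setup of the kind the abstract describes, a residual like this would be minimized together with the rendering loss, so that pose errors are penalized during NeRF training rather than fixed beforehand by SfM.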

Code & Models

Repositories

xtu-pr-lab/sanerf (PyTorch, official)

Taxonomy

Topics: Advanced Vision and Imaging · Robotics and Sensor-Based Localization · 3D Shape Modeling and Analysis

Methods: Attentive Walk-Aggregating Graph Neural Network