Structure-Aware NeRF without Posed Camera via Epipolar Constraint
Shu Chen, Yang Zhang, Yaxin Xu, and Beiji Zou

TL;DR
This paper presents a novel end-to-end method for NeRF that jointly optimizes camera poses and view synthesis using only RGB images, leveraging epipolar constraints and a CNN-based pose network.
Contribution
It introduces a unified framework that eliminates the need for pre-acquired camera poses, improving NeRF's robustness and scene understanding by integrating pose estimation into the training process.
Findings
Joint optimization improves view synthesis quality.
Method achieves better generalization across scenes.
Eliminates dependency on external pose estimation tools.
Abstract
The neural radiance field (NeRF) for realistic novel view synthesis requires camera poses to be pre-acquired by a structure-from-motion (SfM) approach. This two-stage strategy is not convenient to use and degrades the performance because the error in the pose extraction can propagate to the view synthesis. We integrate the pose extraction and view synthesis into a single end-to-end procedure so they can benefit from each other. For training NeRF models, only RGB images are given, without pre-known camera poses. The camera poses are obtained by the epipolar constraint in which the identical feature in different views has the same world coordinates transformed from the local camera coordinates according to the extracted poses. The epipolar constraint is jointly optimized with pixel color constraint. The poses are represented by a CNN-based deep network, whose input is the related frames.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · 3D Shape Modeling and Analysis
MethodsAttentive Walk-Aggregating Graph Neural Network
