Focused Specific Objects NeRF
Yuesong Li, Feng Pan, Helong Yan, Xiuli Xin, Xiaoxue Feng

TL;DR
This paper introduces a scene-specific NeRF approach utilizing semantic priors for faster training and improved rendering, along with weak supervision and novel scene editing techniques, applicable to all NeRF models.
Contribution
It proposes a semantic prior-based focus mechanism for NeRF, significantly accelerating training and enhancing rendering quality, with extensions to scene editing and self-supervised correction.
Findings
Training speed increased by 7.78 times
Applicable to all NeRF-based models
Enhanced scene editing with semantic masking
Abstract
Most NeRF-based models are designed for learning the entire scene, and complex scenes can lead to longer learning times and poorer rendering effects. This paper utilizes scene semantic priors to make improvements in fast training, allowing the network to focus on the specific targets and not be affected by complex backgrounds. The training speed can be increased by 7.78 times with better rendering effect, and small to medium sized targets can be rendered faster. In addition, this improvement applies to all NeRF-based models. Considering the inherent multi-view consistency and smoothness of NeRF, this paper also studies weak supervision by sparsely sampling negative ray samples. With this method, training can be further accelerated and rendering quality can be maintained. Finally, this paper extends pixel semantic and color rendering formulas and proposes a new scene editing technique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Generative Adversarial Networks and Image Synthesis
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Focus
