TL;DR
This paper introduces NeRP3D, a point-based 3D detector that leverages NeRF principles to learn continuous 3D representations, avoiding prior conflicts and improving scene understanding in autonomous driving.
Contribution
It proposes a novel NeRF-Resembled Point-based 3D detector that maintains the NeRF network for better 3D scene reconstruction and detection.
Findings
Outperforms previous state-of-the-art methods on nuScenes dataset.
Significantly improves downstream detection tasks.
Enhances scene reconstruction accuracy.
Abstract
Neural radiance fields (NeRFs) have emerged as a prominent pre-training paradigm for vision-centric autonomous driving, which enhances 3D geometry and appearance understanding in a fully self-supervised manner. To apply NeRF-based pretraining to 3D perception models, recent approaches have simply applied NeRFs to volumetric features obtained from view transformation. However, coupling NeRFs with view transformation inherits conflicting priors; view transformation imposes discrete and rigid representations, whereas radiance fields assume continuous and adaptive functions. When these opposing assumptions are forced into a single pipeline, the misalignment surfaces as blurry and ambiguous 3D representations that ultimately limit 3D scene understanding. Moreover, the NeRF network for pre-training is discarded during downstream tasks, resulting in inefficient utilization of enhanced 3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
