To View Transform or Not to View Transform: NeRF-based Pre-training Perspective

Hyeonjun Jeong; Juyeb Shin; Dongsuk Kum

arXiv:2603.28090·cs.CV·March 31, 2026

To View Transform or Not to View Transform: NeRF-based Pre-training Perspective

Hyeonjun Jeong, Juyeb Shin, Dongsuk Kum

PDF

1 Video

TL;DR

This paper introduces NeRP3D, a point-based 3D detector that leverages NeRF principles to learn continuous 3D representations, avoiding prior conflicts and improving scene understanding in autonomous driving.

Contribution

It proposes a novel NeRF-Resembled Point-based 3D detector that maintains the NeRF network for better 3D scene reconstruction and detection.

Findings

01

Outperforms previous state-of-the-art methods on nuScenes dataset.

02

Significantly improves downstream detection tasks.

03

Enhances scene reconstruction accuracy.

Abstract

Neural radiance fields (NeRFs) have emerged as a prominent pre-training paradigm for vision-centric autonomous driving, which enhances 3D geometry and appearance understanding in a fully self-supervised manner. To apply NeRF-based pretraining to 3D perception models, recent approaches have simply applied NeRFs to volumetric features obtained from view transformation. However, coupling NeRFs with view transformation inherits conflicting priors; view transformation imposes discrete and rigid representations, whereas radiance fields assume continuous and adaptive functions. When these opposing assumptions are forced into a single pipeline, the misalignment surfaces as blurry and ambiguous 3D representations that ultimately limit 3D scene understanding. Moreover, the NeRF network for pre-training is discarded during downstream tasks, resulting in inefficient utilization of enhanced 3D…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

To View Transform or Not to View Transform: NeRF-based Pre-training Perspective· slideslive