LeC$^2$O-NeRF: Learning Continuous and Compact Large-Scale Occupancy for   Urban Scenes

Zhenxing Mi; Dan Xu

arXiv:2411.11374·cs.CV·November 19, 2024

LeC$^2$O-NeRF: Learning Continuous and Compact Large-Scale Occupancy for Urban Scenes

Zhenxing Mi, Dan Xu

PDF

Open Access

TL;DR

This paper introduces LeC$^2$O-NeRF, a novel continuous occupancy network for large-scale urban scenes that improves training efficiency and accuracy by effectively modeling occupied and unoccupied regions without relying on traditional grid-based methods.

Contribution

The paper presents the first continuous and compact occupancy learning method for large-scale NeRF, incorporating an imbalanced loss, a dual-architecture design, and an explicit density loss.

Findings

01

Achieves more accurate and smooth occupancy modeling than grid-based methods.

02

Speeds up NeRF training on large-scale scenes without losing accuracy.

03

Provides higher accuracy in large-scale urban scene reconstruction.

Abstract

In NeRF, a critical problem is to effectively estimate the occupancy to guide empty-space skipping and point sampling. Grid-based methods work well for small-scale scenes. However, on large-scale scenes, they are limited by predefined bounding boxes, grid resolutions, and high memory usage for grid updates, and thus struggle to speed up training for large-scale, irregularly bounded and complex urban scenes without sacrificing accuracy. In this paper, we propose to learn a continuous and compact large-scale occupancy network, which can classify 3D points as occupied or unoccupied points. We train this occupancy network end-to-end together with the radiance field in a self-supervised manner by three designs. First, we propose a novel imbalanced occupancy loss to regularize the occupancy network. It makes the occupancy network effectively control the ratio of unoccupied and occupied…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Automated Road and Building Extraction · Remote Sensing and LiDAR Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings