BirdNeRF: Fast Neural Reconstruction of Large-Scale Scenes From Aerial Imagery
Huiqing Zhang, Yifei Xue, Ming Liao, Yizhen Lao

TL;DR
BirdNeRF introduces a scalable aerial scene reconstruction method that decomposes large datasets into manageable sub-scenes, enabling faster training and rendering with high visual fidelity on a single GPU.
Contribution
We propose a novel spatial decomposition and view re-rendering strategy for large-scale aerial scene reconstruction using NeRF, significantly improving speed and quality.
Findings
Reconstruction speed improved by 10x over classical photogrammetry.
Reconstruction speed improved by 50x over existing large-scale NeRF methods.
Achieved high-quality rendering on a single GPU.
Abstract
In this study, we introduce BirdNeRF, an adaptation of Neural Radiance Fields (NeRF) designed specifically for reconstructing large-scale scenes using aerial imagery. Unlike previous research focused on small-scale and object-centric NeRF reconstruction, our approach addresses multiple challenges, including (1) Addressing the issue of slow training and rendering associated with large models. (2) Meeting the computational demands necessitated by modeling a substantial number of images, requiring extensive resources such as high-performance GPUs. (3) Overcoming significant artifacts and low visual fidelity commonly observed in large-scale reconstruction tasks due to limited model capacity. Specifically, we present a novel bird-view pose-based spatial decomposition algorithm that decomposes a large aerial image set into multiple small sets with appropriately sized overlaps, allowing us to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing and LiDAR Applications · Remote Sensing in Agriculture · Species Distribution and Climate Change
MethodsSparse Evolutionary Training · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
