TL;DR
OccuFly introduces a novel camera-based aerial semantic scene completion benchmark, addressing the lack of LiDAR-free datasets and revealing current vision models' limitations in aerial 3D perception.
Contribution
The paper presents the first real-world, camera-based aerial SSC benchmark, OccuFly, with extensive data and benchmarks, and proposes a LiDAR-free data generation framework for aerial environments.
Findings
Current vision models show limitations in aerial SSC tasks.
OccuFly provides over 20,000 samples across diverse environments.
Benchmarking reveals fundamental challenges in aerial 3D scene understanding.
Abstract
Semantic Scene Completion (SSC) is essential for 3D perception in mobile robotics, as it enables holistic scene understanding by jointly estimating dense volumetric occupancy and per-voxel semantics. Although SSC has been widely studied in terrestrial domains such as autonomous driving, aerial settings like autonomous flying remain largely unexplored, thereby limiting progress on downstream applications. Furthermore, LiDAR sensors are the primary modality for SSC data generation, which poses challenges for most uncrewed aerial vehicles (UAVs) due to flight regulations, mass and energy constraints, and the sparsity of LiDAR point clouds from elevated viewpoints. To address these limitations, we propose a LiDAR-free, camera-based data generation framework. By leveraging classical 3D reconstruction, our framework automates semantic label transfer by lifting <10% of annotated images into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
