$M^2$-Occ: Resilient 3D Semantic Occupancy Prediction for Autonomous Driving with Incomplete Camera Inputs
Kaixin Lin, Kunyu Peng, Di Wen, Yufan Chen, Ruiping Liu, Kailun Yang

TL;DR
This paper introduces $M^2$-Occ, a novel framework for resilient 3D semantic occupancy prediction in autonomous driving that maintains performance despite incomplete multi-camera inputs by leveraging multi-view reconstruction and semantic memory modules.
Contribution
The paper proposes $M^2$-Occ, a new method combining multi-view masked reconstruction and semantic memory modules to improve robustness of 3D semantic occupancy prediction with missing camera views.
Findings
$M^2$-Occ improves IoU by 4.93% under missing back-view conditions.
The method boosts IoU by 5.01% with five missing views.
Robustness gap widens as the number of missing cameras increases.
Abstract
Semantic occupancy prediction enables dense 3D geometric and semantic understanding for autonomous driving. However, existing camera-based approaches implicitly assume complete surround-view observations, an assumption that rarely holds in real-world deployment due to occlusion, hardware malfunction, or communication failures. We study semantic occupancy prediction under incomplete multi-camera inputs and introduce -Occ, a framework designed to preserve geometric structure and semantic coherence when views are missing. -Occ addresses two complementary challenges. First, a Multi-view Masked Reconstruction (MMR) module leverages the spatial overlap among neighboring cameras to recover missing-view representations directly in the feature space. Second, a Feature Memory Module (FMM) introduces a learnable memory bank that stores class-level semantic prototypes. By retrieving and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Vision and Imaging · Autonomous Vehicle Technology and Safety
