$M^2$-Occ: Resilient 3D Semantic Occupancy Prediction for Autonomous Driving with Incomplete Camera Inputs

Kaixin Lin; Kunyu Peng; Di Wen; Yufan Chen; Ruiping Liu; Kailun Yang

arXiv:2603.09737·cs.CV·March 11, 2026

$M^2$-Occ: Resilient 3D Semantic Occupancy Prediction for Autonomous Driving with Incomplete Camera Inputs

Kaixin Lin, Kunyu Peng, Di Wen, Yufan Chen, Ruiping Liu, Kailun Yang

PDF

Open Access

TL;DR

This paper introduces $M^2$-Occ, a novel framework for resilient 3D semantic occupancy prediction in autonomous driving that maintains performance despite incomplete multi-camera inputs by leveraging multi-view reconstruction and semantic memory modules.

Contribution

The paper proposes $M^2$-Occ, a new method combining multi-view masked reconstruction and semantic memory modules to improve robustness of 3D semantic occupancy prediction with missing camera views.

Findings

01

$M^2$-Occ improves IoU by 4.93% under missing back-view conditions.

02

The method boosts IoU by 5.01% with five missing views.

03

Robustness gap widens as the number of missing cameras increases.

Abstract

Semantic occupancy prediction enables dense 3D geometric and semantic understanding for autonomous driving. However, existing camera-based approaches implicitly assume complete surround-view observations, an assumption that rarely holds in real-world deployment due to occlusion, hardware malfunction, or communication failures. We study semantic occupancy prediction under incomplete multi-camera inputs and introduce $M^{2}$ -Occ, a framework designed to preserve geometric structure and semantic coherence when views are missing. $M^{2}$ -Occ addresses two complementary challenges. First, a Multi-view Masked Reconstruction (MMR) module leverages the spatial overlap among neighboring cameras to recover missing-view representations directly in the feature space. Second, a Feature Memory Module (FMM) introduces a learnable memory bank that stores class-level semantic prototypes. By retrieving and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Vision and Imaging · Autonomous Vehicle Technology and Safety