UP-Fuse: Uncertainty-guided LiDAR-Camera Fusion for 3D Panoptic Segmentation
Rohit Mohan, Florian Drews, Yakov Miron, Daniele Cattaneo, Abhinav Valada

TL;DR
UP-Fuse is a robust uncertainty-aware LiDAR-camera fusion framework for 3D panoptic segmentation that maintains high performance under sensor degradation and failures by dynamically weighting visual cues based on learned uncertainty maps.
Contribution
It introduces a novel uncertainty-guided fusion module and a hybrid 2D-3D transformer, enhancing robustness and accuracy in adverse conditions for 3D perception.
Findings
Outperforms existing methods on Panoptic nuScenes, SemanticKITTI, and Panoptic Waymo benchmarks.
Maintains high segmentation accuracy under severe visual degradation.
Demonstrates robustness to sensor failure and calibration drift.
Abstract
LiDAR-camera fusion enhances 3D panoptic segmentation by leveraging camera images to complement sparse LiDAR scans, but it also introduces a critical failure mode. Under adverse conditions, degradation or failure of the camera sensor can significantly compromise the reliability of the perception system. To address this problem, we introduce UP-Fuse, a novel uncertainty-aware fusion framework in the 2D range-view that remains robust under camera sensor degradation, calibration drift, and sensor failure. Raw LiDAR data is first projected into the range-view and encoded by a LiDAR encoder, while camera features are simultaneously extracted and projected into the same shared space. At its core, UP-Fuse employs an uncertainty-guided fusion module that dynamically modulates cross-modal interaction using predicted uncertainty maps. These maps are learned by quantifying representational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Optical Sensing Technologies
