EPMF: Efficient Perception-aware Multi-sensor Fusion for 3D Semantic Segmentation
Mingkui Tan, Zhuangwei Zhuang, Sitao Chen, Rong Li, Kui Jia, Qicheng, Wang, Yuanqing Li

TL;DR
This paper introduces EPMF, an efficient perception-aware multi-sensor fusion method for 3D semantic segmentation that effectively combines RGB images and point clouds, improving accuracy and computational efficiency.
Contribution
The paper proposes an optimized multi-sensor fusion framework, EPMF, with novel perspective projection, cross-modal alignment, and fusion modules for enhanced 3D semantic segmentation.
Findings
EPMF outperforms state-of-the-art methods on nuScenes dataset.
The proposed approach improves mIoU by 0.9% over RangeFormer.
Efficient architecture reduces computational costs while maintaining high accuracy.
Abstract
We study multi-sensor fusion for 3D semantic segmentation that is important to scene understanding for many applications, such as autonomous driving and robotics. Existing fusion-based methods, however, may not achieve promising performance due to the vast difference between the two modalities. In this work, we investigate a collaborative fusion scheme called perception-aware multi-sensor fusion (PMF) to effectively exploit perceptual information from two modalities, namely, appearance information from RGB images and spatio-depth information from point clouds. To this end, we project point clouds to the camera coordinate using perspective projection, and process both inputs from LiDAR and cameras in 2D space while preventing the information loss of RGB images. Then, we propose a two-stream network to extract features from the two modalities, separately. The extracted features are fused…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optical Sensing Technologies · Advanced Neural Network Applications · Industrial Vision Systems and Defect Detection
