CMDFusion: Bidirectional Fusion Network with Cross-modality Knowledge Distillation for LIDAR Semantic Segmentation
Jun Cen, Shiwei Zhang, Yixuan Pei, Kun Li, Hang Zheng, Maochun Luo,, Yingya Zhang, Qifeng Chen

TL;DR
CMDFusion introduces a bidirectional fusion network with cross-modality knowledge distillation that enhances 3D LIDAR semantic segmentation by leveraging 2D image knowledge without requiring RGB images during inference.
Contribution
The paper proposes a novel bidirectional fusion scheme combined with cross-modality knowledge distillation, enabling improved 3D segmentation without needing RGB images at inference.
Findings
Achieves state-of-the-art performance on SemanticKITTI and nuScenes datasets.
Outperforms existing fusion-based methods in LIDAR semantic segmentation.
Effectively distills 2D knowledge to enhance 3D feature representation.
Abstract
2D RGB images and 3D LIDAR point clouds provide complementary knowledge for the perception system of autonomous vehicles. Several 2D and 3D fusion methods have been explored for the LIDAR semantic segmentation task, but they suffer from different problems. 2D-to-3D fusion methods require strictly paired data during inference, which may not be available in real-world scenarios, while 3D-to-2D fusion methods cannot explicitly make full use of the 2D information. Therefore, we propose a Bidirectional Fusion Network with Cross-Modality Knowledge Distillation (CMDFusion) in this work. Our method has two contributions. First, our bidirectional fusion scheme explicitly and implicitly enhances the 3D feature via 2D-to-3D fusion and 3D-to-2D fusion, respectively, which surpasses either one of the single fusion schemes. Second, we distillate the 2D knowledge from a 2D network (Camera branch) to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Remote Sensing and LiDAR Applications
MethodsKnowledge Distillation
