MapKD: Unlocking Prior Knowledge with Cross-Modal Distillation for Efficient Online HD Map Construction
Ziyang Yan, Ruikai Li, Zhiyong Cui, Bohan Li, Han Jiang, Yilong Ren, Aoyong Li, Zhenning Li, Sijia Wen, Haiyang Yu

TL;DR
MapKD introduces a multi-level cross-modal knowledge distillation framework that leverages prior map knowledge to train efficient, vision-centric models for real-time HD map construction in autonomous driving, reducing computational costs.
Contribution
The paper proposes a novel Teacher-Coach-Student distillation paradigm with two specific strategies, TGPD and MSRD, to effectively transfer knowledge from multimodal models to lightweight vision-based models.
Findings
Improves student model performance by +6.68 mIoU and +10.94 mAP.
Accelerates inference speed of the student model.
Demonstrates effectiveness on the nuScenes dataset.
Abstract
Online HD map construction is a fundamental task in autonomous driving systems, aiming to acquire semantic information of map elements around the ego vehicle based on real-time sensor inputs. Recently, several approaches have achieved promising results by incorporating offline priors such as SD maps and HD maps or by fusing multi-modal data. However, these methods depend on stale offline maps and multi-modal sensor suites, resulting in avoidable computational overhead at inference. To address these limitations, we employ a knowledge distillation strategy to transfer knowledge from multimodal models with prior knowledge to an efficient, low-cost, and vision-centric student model. Specifically, we propose MapKD, a novel multi-level cross-modal knowledge distillation framework with an innovative Teacher-Coach-Student (TCS) paradigm. This framework consists of: (1) a camera-LiDAR fusion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
