MapKD: Unlocking Prior Knowledge with Cross-Modal Distillation for Efficient Online HD Map Construction

Ziyang Yan; Ruikai Li; Zhiyong Cui; Bohan Li; Han Jiang; Yilong Ren; Aoyong Li; Zhenning Li; Sijia Wen; Haiyang Yu

arXiv:2508.15653·cs.CV·August 25, 2025

MapKD: Unlocking Prior Knowledge with Cross-Modal Distillation for Efficient Online HD Map Construction

Ziyang Yan, Ruikai Li, Zhiyong Cui, Bohan Li, Han Jiang, Yilong Ren, Aoyong Li, Zhenning Li, Sijia Wen, Haiyang Yu

PDF

TL;DR

MapKD introduces a multi-level cross-modal knowledge distillation framework that leverages prior map knowledge to train efficient, vision-centric models for real-time HD map construction in autonomous driving, reducing computational costs.

Contribution

The paper proposes a novel Teacher-Coach-Student distillation paradigm with two specific strategies, TGPD and MSRD, to effectively transfer knowledge from multimodal models to lightweight vision-based models.

Findings

01

Improves student model performance by +6.68 mIoU and +10.94 mAP.

02

Accelerates inference speed of the student model.

03

Demonstrates effectiveness on the nuScenes dataset.

Abstract

Online HD map construction is a fundamental task in autonomous driving systems, aiming to acquire semantic information of map elements around the ego vehicle based on real-time sensor inputs. Recently, several approaches have achieved promising results by incorporating offline priors such as SD maps and HD maps or by fusing multi-modal data. However, these methods depend on stale offline maps and multi-modal sensor suites, resulting in avoidable computational overhead at inference. To address these limitations, we employ a knowledge distillation strategy to transfer knowledge from multimodal models with prior knowledge to an efficient, low-cost, and vision-centric student model. Specifically, we propose MapKD, a novel multi-level cross-modal knowledge distillation framework with an innovative Teacher-Coach-Student (TCS) paradigm. This framework consists of: (1) a camera-LiDAR fusion…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.