MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction
Bencheng Liao, Shaoyu Chen, Xinggang Wang, Tianheng Cheng, Qian Zhang,, Wenyu Liu, Chang Huang

TL;DR
MapTR introduces a structured Transformer model for real-time, camera-only, vectorized HD map construction in autonomous driving, outperforming existing methods in accuracy and speed.
Contribution
The paper proposes a novel permutation-equivalent modeling approach and hierarchical query scheme for efficient, stable, and accurate online HD map construction using only camera input.
Findings
MapTR-nano runs at 25.1 FPS, 8x faster than previous methods.
MapTR achieves 5.0 higher mAP than state-of-the-art camera-based approaches.
MapTR models outperform multi-modality methods in accuracy and speed.
Abstract
High-definition (HD) map provides abundant and precise environmental information of the driving scene, serving as a fundamental and indispensable component for planning in autonomous driving system. We present MapTR, a structured end-to-end Transformer for efficient online vectorized HD map construction. We propose a unified permutation-equivalent modeling approach, i.e., modeling map element as a point set with a group of equivalent permutations, which accurately describes the shape of map element and stabilizes the learning process. We design a hierarchical query embedding scheme to flexibly encode structured map information and perform hierarchical bipartite matching for map element learning. MapTR achieves the best performance and efficiency with only camera input among existing vectorized map construction approaches on nuScenes dataset. In particular, MapTR-nano runs at real-time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization · Data Management and Algorithms
MethodsAttention Is All You Need · Linear Layer · Softmax · Absolute Position Encodings · Byte Pair Encoding · Adam · Layer Normalization · Label Smoothing · Multi-Head Attention · Dense Connections
