Improving Hierarchical Representations of Vectorized HD Maps with Perspective Clues
Chi Zhang, Qi Song, Feifei Li, Jie Li, and Rui Huang

TL;DR
PerCMap enhances vectorized HD map construction from surround-view images by leveraging perspective clues through cross-view instance activation and dual-view point embedding, significantly improving accuracy and consistency in autonomous driving applications.
Contribution
The paper introduces PerCMap, a novel method that explicitly exploits perspective-view clues at both instance and point levels to improve map vector estimation.
Findings
Achieves 67.1 mAP on nuScenes
Achieves 70.5 mAP on Argoverse 2
Demonstrates strong, consistent performance across benchmarks
Abstract
The construction of vectorized High-Definition (HD) maps from onboard surround-view cameras has become a significant focus in autonomous driving. However, current map vector estimation pipelines face two key limitations: input-agnostic queries struggle to capture complex map structures, and the view transformation leads to information loss. These issues often result in inaccurate shape restoration or missing instances in map predictions. To address this concern, we propose a novel approach, namely \textbf{PerCMap}, which explicitly exploits clues from perspective-view features at both instance and point level. Specifically, at instance level, we propose Cross-view Instance Activation (CIA) to activate instance queries across surround-view images, thereby helping the model recover the instance attributes of map vectors. At point level, we design Dual-view Point Embedding (DPE), which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Modeling in Geospatial Applications · Multimedia Communication and Technology
