Improving Hierarchical Representations of Vectorized HD Maps with Perspective Clues

Chi Zhang; Qi Song; Feifei Li; Jie Li; and Rui Huang

arXiv:2404.11155·cs.CV·October 14, 2025·1 cites

Improving Hierarchical Representations of Vectorized HD Maps with Perspective Clues

Chi Zhang, Qi Song, Feifei Li, Jie Li, and Rui Huang

PDF

Open Access

TL;DR

PerCMap enhances vectorized HD map construction from surround-view images by leveraging perspective clues through cross-view instance activation and dual-view point embedding, significantly improving accuracy and consistency in autonomous driving applications.

Contribution

The paper introduces PerCMap, a novel method that explicitly exploits perspective-view clues at both instance and point levels to improve map vector estimation.

Findings

01

Achieves 67.1 mAP on nuScenes

02

Achieves 70.5 mAP on Argoverse 2

03

Demonstrates strong, consistent performance across benchmarks

Abstract

The construction of vectorized High-Definition (HD) maps from onboard surround-view cameras has become a significant focus in autonomous driving. However, current map vector estimation pipelines face two key limitations: input-agnostic queries struggle to capture complex map structures, and the view transformation leads to information loss. These issues often result in inaccurate shape restoration or missing instances in map predictions. To address this concern, we propose a novel approach, namely \textbf{PerCMap}, which explicitly exploits clues from perspective-view features at both instance and point level. Specifically, at instance level, we propose Cross-view Instance Activation (CIA) to activate instance queries across surround-view images, thereby helping the model recover the instance attributes of map vectors. At point level, we design Dual-view Point Embedding (DPE), which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Modeling in Geospatial Applications · Multimedia Communication and Technology