SATMapTR: Satellite Image Enhanced Online HD Map Construction
Bingyuan Huang, Guanyi Zhao, Qian Xu, Yang Lou, Yung-Hui Li, Jianping Wang

TL;DR
SATMapTR introduces a novel online HD map construction method that effectively fuses satellite imagery with onboard sensor data, significantly improving accuracy and robustness in autonomous driving scenarios.
Contribution
The paper presents SATMapTR, a new model with gated feature refinement and geometry-aware fusion modules for better satellite and sensor data integration.
Findings
Achieves 73.8 mAP on nuScenes, outperforming previous models by 14.2 mAP.
Shows lower performance degradation under adverse weather and sensor failures.
Nearly triples mAP at extended perception ranges.
Abstract
High-definition (HD) maps are evolving from pre-annotated to real-time construction to better support autonomous driving in diverse scenarios. However, this process is hindered by low-quality input data caused by onboard sensors limited capability and frequent occlusions, leading to incomplete, noisy, or missing data, and thus reduced mapping accuracy and robustness. Recent efforts have introduced satellite images as auxiliary input, offering a stable, wide-area view to complement the limited ego perspective. However, satellite images in Bird's Eye View are often degraded by shadows and occlusions from vegetation and buildings. Prior methods using basic feature extraction and fusion remain ineffective. To address these challenges, we propose SATMapTR, a novel online map construction model that effectively fuses satellite image through two key components: (1) a gated feature refinement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutomated Road and Building Extraction · Remote-Sensing Image Classification · Advanced Image and Video Retrieval Techniques
