Localization-Guided Foreground Augmentation in Autonomous Driving
Jiawei Yong, Deyuan Qu, Qi Chen, Kentaro Oguchi, Shintaro Fukushima

TL;DR
LG-FA is a lightweight module that enhances foreground perception in autonomous driving by online geometric context enrichment, improving BEV stability, localization, and topology reconstruction under adverse conditions.
Contribution
It introduces a novel, plug-and-play inference module that constructs a global geometric context online, aiding perception without requiring HD maps or backbone modifications.
Findings
Improves geometric completeness and temporal stability of BEV representations.
Reduces localization error and enhances lane and topology reconstruction.
Seamlessly integrates into existing perception systems without backbone changes.
Abstract
Autonomous driving systems often degrade under adverse visibility conditions-such as rain, nighttime, or snow-where online scene geometry (e.g., lane dividers, road boundaries, and pedestrian crossings) becomes sparse or fragmented. While high-definition (HD) maps can provide missing structural context, they are costly to construct and maintain at scale. We propose Localization-Guided Foreground Augmentation (LG-FA), a lightweight and plug-and-play inference module that enhances foreground perception by enriching geometric context online. LG-FA: (i) incrementally constructs a sparse global vector layer from per-frame Bird's-Eye View (BEV) predictions; (ii) estimates ego pose via class-constrained geometric alignment, jointly improving localization and completing missing local topology; and (iii) reprojects the augmented foreground into a unified global frame to improve per-frame…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
