Localization-Guided Foreground Augmentation in Autonomous Driving

Jiawei Yong; Deyuan Qu; Qi Chen; Kentaro Oguchi; Shintaro Fukushima

arXiv:2604.18940 · cs.CV · April 22, 2026

TL;DR

LG-FA is a lightweight inference module that enhances foreground perception in autonomous driving by enriching geometric context online, improving BEV stability, localization, and topology reconstruction under adverse conditions.

Contribution

It introduces a novel, plug-and-play inference module that constructs a global geometric context online, aiding perception without requiring HD maps or backbone modifications.

Findings

01. Improves geometric completeness and temporal stability of BEV representations.

02. Reduces localization error and enhances lane and topology reconstruction.

03. Seamlessly integrates into existing perception systems without backbone changes.

Abstract

Autonomous driving systems often degrade under adverse visibility conditions, such as rain, nighttime, or snow, where online scene geometry (e.g., lane dividers, road boundaries, and pedestrian crossings) becomes sparse or fragmented. While high-definition (HD) maps can provide missing structural context, they are costly to construct and maintain at scale. We propose Localization-Guided Foreground Augmentation (LG-FA), a lightweight, plug-and-play inference module that enhances foreground perception by enriching geometric context online. LG-FA: (i) incrementally constructs a sparse global vector layer from per-frame Bird's-Eye View (BEV) predictions; (ii) estimates ego pose via class-constrained geometric alignment, jointly improving localization and completing missing local topology; and (iii) reprojects the augmented foreground into a unified global frame to improve per-frame…
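To make step (ii) more concrete, the following is a minimal sketch of what a single class-constrained alignment step could look like in 2D. It is not the authors' implementation: the abstract does not specify the matching or optimization scheme, so this sketch assumes nearest-neighbor matching restricted to points of the same class, followed by a closed-form least-squares rigid fit. The function names (`match_by_class`, `fit_rigid_2d`, `to_global`) and the `(class, (x, y))` point format are hypothetical.

```python
import math


def match_by_class(local_pts, global_pts):
    """Pair each local point with the nearest global point of the SAME class.

    Both inputs are lists of (class_name, (x, y)). Points whose class has no
    counterpart in the global layer are skipped. (Assumed matching rule.)
    """
    pairs = []
    for cls, (x, y) in local_pts:
        same_class = [g for g_cls, g in global_pts if g_cls == cls]
        if not same_class:
            continue
        nearest = min(same_class, key=lambda g: (g[0] - x) ** 2 + (g[1] - y) ** 2)
        pairs.append(((x, y), nearest))
    return pairs


def fit_rigid_2d(pairs):
    """Least-squares 2D rotation + translation mapping source points onto targets.

    pairs is a list of ((sx, sy), (dx, dy)). Returns (theta, tx, ty).
    """
    n = len(pairs)
    if n < 2:
        raise ValueError("need at least two matched points")
    src_cx = sum(s[0] for s, _ in pairs) / n
    src_cy = sum(s[1] for s, _ in pairs) / n
    dst_cx = sum(d[0] for _, d in pairs) / n
    dst_cy = sum(d[1] for _, d in pairs) / n
    dot = cross = 0.0
    for (sx, sy), (dx, dy) in pairs:
        ax, ay = sx - src_cx, sy - src_cy  # centered source point
        bx, by = dx - dst_cx, dy - dst_cy  # centered target point
        dot += ax * bx + ay * by
        cross += ax * by - ay * bx
    theta = math.atan2(cross, dot)  # rotation that best aligns the centered sets
    c, s = math.cos(theta), math.sin(theta)
    tx = dst_cx - (c * src_cx - s * src_cy)
    ty = dst_cy - (s * src_cx + c * src_cy)
    return theta, tx, ty


def to_global(pose, point):
    """Reproject an ego-frame point into the global frame using (theta, tx, ty)."""
    theta, tx, ty = pose
    c, s = math.cos(theta), math.sin(theta)
    x, y = point
    return (c * x - s * y + tx, s * x + c * y + ty)
```

In an iterative scheme, one would typically alternate `match_by_class` and `fit_rigid_2d` until the pose stops changing, then use `to_global` to place the current frame's foreground into the global frame. Restricting matches to the same class keeps a lane-divider point from snapping to a nearby road boundary, which is one plausible reading of "class-constrained".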
