RESBev: Making BEV Perception More Robust
Lifeng Zhuo, Kefan Jin, Zhe Liu, Hesheng Wang

TL;DR
RESBev is a robust, plug-and-play method that enhances bird's-eye-view perception in autonomous driving by predicting clean features from corrupted observations, improving safety under disturbances and attacks.
Contribution
It introduces a latent semantic prediction framework that boosts BEV perception robustness without altering existing models, applicable to natural and adversarial disturbances.
Findings
Significantly improves robustness against disturbances and attacks
Operates at the semantic feature level of BEV perception pipeline
Requires only few-shot fine-tuning for effective performance
Abstract
Bird's-eye-view (BEV) perception has emerged as a cornerstone of autonomous driving systems, providing a structured, ego-centric representation critical for downstream planning and control. However, real-world deployment faces challenges from sensor degradation and adversarial attacks, which can cause severe perceptual anomalies and ultimately compromise the safety of autonomous driving systems. To address this, we propose a resilient and plug-and-play BEV perception method, RESBev, which can be easily applied to existing BEV perception methods to enhance their robustness to diverse disturbances. Specifically, we reframe perception robustness as a latent semantic prediction problem. A latent world model is constructed to extract spatiotemporal correlations across sequential BEV observations, thereby learning the underlying BEV state transitions to predict clean BEV features for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Autonomous Vehicle Technology and Safety · Advanced Neural Network Applications
