NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models
Siyu Li, Fei Teng, Yihong Cao, Kailun Yang, Zhiyong Li, Yaonan Wang

TL;DR
This paper introduces NRSeg, a noise-resilient learning framework for BEV semantic segmentation in autonomous driving, leveraging synthetic data and novel metrics to improve robustness and accuracy in semi-supervised and unsupervised settings.
Contribution
NRSeg proposes a new framework with a perspective-geometry consistency metric, a dual-distribution prediction approach, and a hierarchical local semantic exclusion module for robust BEV segmentation.
Findings
Achieves state-of-the-art mIoU improvements of 13.8% and 11.4% in unsupervised and semi-supervised tasks.
Effectively utilizes synthetic data despite generation noise.
Demonstrates robustness and superior performance over existing methods.
Abstract
Birds' Eye View (BEV) semantic segmentation is an indispensable perception task in end-to-end autonomous driving systems. Unsupervised and semi-supervised learning for BEV tasks, as pivotal for real-world applications, underperform due to the homogeneous distribution of the labeled data. In this work, we explore the potential of synthetic data from driving world models to enhance the diversity of labeled data for robustifying BEV segmentation. Yet, our preliminary findings reveal that generation noise in synthetic data compromises efficient BEV model learning. To fully harness the potential of synthetic data from world models, this paper proposes NRSeg, a noise-resilient learning framework for BEV semantic segmentation. Specifically, a Perspective-Geometry Consistency Metric (PGCM) is proposed to quantitatively evaluate the guidance capability of generated data for model learning. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Natural Language Processing Techniques · Topic Modeling
