OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation
Jian Sun, Yuqi Dai, Chi-Man Vong, Qing Xu, Shengbo Eben Li, Jianqiang, Wang, Lei He, Keqiang Li

TL;DR
OE-BevSeg is a novel multimodal framework that significantly improves bird's-eye-view vehicle segmentation by enhancing environment understanding and target object recognition using global modeling, object supervision, and multimodal fusion.
Contribution
The paper introduces OE-BevSeg, a comprehensive framework that combines environment-aware BEV compression, center-informed object enhancement, and multimodal fusion for superior segmentation performance.
Findings
Achieves state-of-the-art results on nuScenes dataset.
Improves long-distance environment perception and object detail recognition.
Enhances segmentation accuracy in both camera-only and multimodal setups.
Abstract
Bird's-eye-view (BEV) semantic segmentation is becoming crucial in autonomous driving systems. It realizes ego-vehicle surrounding environment perception by projecting 2D multi-view images into 3D world space. Recently, BEV segmentation has made notable progress, attributed to better view transformation modules, larger image encoders, or more temporal information. However, there are still two issues: 1) a lack of effective understanding and enhancement of BEV space features, particularly in accurately capturing long-distance environmental features and 2) recognizing fine details of target objects. To address these issues, we propose OE-BevSeg, an end-to-end multimodal framework that enhances BEV segmentation performance through global environment-aware perception and local target object enhancement. OE-BevSeg employs an environment-aware BEV compressor. Based on prior knowledge about…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications
