Fast Road Segmentation via Uncertainty-aware Symmetric Network
Yicong Chang, Feng Xue, Fei Sheng, Wenteng Liang, Anlong Ming

TL;DR
This paper introduces USNet, a real-time RGB-D road segmentation network that effectively fuses data using an uncertainty-aware approach, achieving high accuracy and speed suitable for autonomous driving.
Contribution
USNet employs separate lightweight subnetworks for RGB and depth, with a multiscale evidence collection and uncertainty-guided fusion, advancing real-time, high-accuracy road segmentation.
Findings
Achieves state-of-the-art accuracy in RGB-D road segmentation.
Runs at over 43 FPS in real-time.
Effectively utilizes uncertainty to improve fusion reliability.
Abstract
The high performance of RGB-D based road segmentation methods contrasts with their rare application in commercial autonomous driving, which is owing to two reasons: 1) the prior methods cannot achieve high inference speed and high accuracy in both ways; 2) the different properties of RGB and depth data are not well-exploited, limiting the reliability of predicted road. In this paper, based on the evidence theory, an uncertainty-aware symmetric network (USNet) is proposed to achieve a trade-off between speed and accuracy by fully fusing RGB and depth data. Firstly, cross-modal feature fusion operations, which are indispensable in the prior RGB-D based methods, are abandoned. We instead separately adopt two light-weight subnetworks to learn road representations from RGB and depth inputs. The light-weight structure guarantees the real-time inference of our method. Moreover, a multiscale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInfrastructure Maintenance and Monitoring · Automated Road and Building Extraction · Autonomous Vehicle Technology and Safety
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
