TL;DR
This paper introduces a cost-effective multi-modal road segmentation approach using raw sensor data and multi-task learning, achieving high accuracy and real-time performance without relying on extensive pre-processing.
Contribution
It presents a novel fusion architecture integrating RGB, LiDAR, and IMU/GNSS data for efficient, multi-modal road segmentation with minimal pre-processing and high accuracy.
Findings
High accuracy on KITTI dataset
Competitive results on Cityscapes
Real-time inference performance
Abstract
Multi-modal systems have the capacity of producing more reliable results than systems with a single modality in road detection due to perceiving different aspects of the scene. We focus on using raw sensor inputs instead of, as it is typically done in many SOTA works, leveraging architectures that require high pre-processing costs such as surface normals or dense depth predictions. By using raw sensor inputs, we aim to utilize a low-cost model thatminimizes both the pre-processing andmodel computation costs. This study presents a cost-effective and highly accurate solution for road segmentation by integrating data from multiple sensorswithin a multi-task learning architecture.Afusion architecture is proposed in which RGB and LiDAR depth images constitute the inputs of the network. Another contribution of this study is to use IMU/GNSS (inertial measurement unit/global navigation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
