Multi-Modal Multi-Task (3MT) Road Segmentation

Erkan Milli; \"Ozg\"ur Erkent; As{\i}m Egemen Y{\i}lmaz

arXiv:2308.11983·cs.RO·August 24, 2023

Multi-Modal Multi-Task (3MT) Road Segmentation

Erkan Milli, \"Ozg\"ur Erkent, As{\i}m Egemen Y{\i}lmaz

PDF

1 Repo

TL;DR

This paper introduces a cost-effective multi-modal road segmentation approach using raw sensor data and multi-task learning, achieving high accuracy and real-time performance without relying on extensive pre-processing.

Contribution

It presents a novel fusion architecture integrating RGB, LiDAR, and IMU/GNSS data for efficient, multi-modal road segmentation with minimal pre-processing and high accuracy.

Findings

01

High accuracy on KITTI dataset

02

Competitive results on Cityscapes

03

Real-time inference performance

Abstract

Multi-modal systems have the capacity of producing more reliable results than systems with a single modality in road detection due to perceiving different aspects of the scene. We focus on using raw sensor inputs instead of, as it is typically done in many SOTA works, leveraging architectures that require high pre-processing costs such as surface normals or dense depth predictions. By using raw sensor inputs, we aim to utilize a low-cost model thatminimizes both the pre-processing andmodel computation costs. This study presents a cost-effective and highly accurate solution for road segmentation by integrating data from multiple sensorswithin a multi-task learning architecture.Afusion architecture is proposed in which RGB and LiDAR depth images constitute the inputs of the network. Another contribution of this study is to use IMU/GNSS (inertial measurement unit/global navigation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

erkanmilli/3mt-roadseg
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus