InfraParis: A multi-modal and multi-task autonomous driving dataset

Gianni Franchi; Marwane Hariat; Xuanlong Yu; Nacim Belkhir; Antoine; Manzanera; David Filliat

arXiv:2309.15751·cs.CV·November 7, 2023

InfraParis: A multi-modal and multi-task autonomous driving dataset

Gianni Franchi, Marwane Hariat, Xuanlong Yu, Nacim Belkhir, Antoine, Manzanera, David Filliat

PDF

1 Repo 1 Video

TL;DR

InfraParis is a comprehensive multi-modal dataset for autonomous driving, supporting RGB, depth, and infrared data across multiple tasks, aiming to improve model robustness in diverse and challenging scenarios.

Contribution

We introduce InfraParis, a new multi-modal, multi-task dataset for autonomous driving, enabling better training and evaluation of models in varied conditions.

Findings

01

Baseline models show varying performance across modalities.

02

InfraParis facilitates research on multi-modal sensor fusion.

03

The dataset covers diverse scenarios including nighttime and noise conditions.

Abstract

Current deep neural networks (DNNs) for autonomous driving computer vision are typically trained on specific datasets that only involve a single type of data and urban scenes. Consequently, these models struggle to handle new objects, noise, nighttime conditions, and diverse scenarios, which is essential for safety-critical applications. Despite ongoing efforts to enhance the resilience of computer vision DNNs, progress has been sluggish, partly due to the absence of benchmarks featuring multiple modalities. We introduce a novel and versatile dataset named InfraParis that supports multiple tasks across three modalities: RGB, depth, and infrared. We assess various state-of-the-art baseline techniques, encompassing models for the tasks of semantic segmentation, object detection, and depth estimation. More visualizations and the download link for InfraParis are available at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ENSTA-U2IS-AI/Multimodal_Deep_segmentation
pytorchOfficial

Videos

InfraParis: A Multi-Modal and Multi-Task Autonomous Driving Dataset· youtube