Multi-modal Experts Network for Autonomous Driving
Shihong Fang, Anna Choromanska

TL;DR
This paper introduces a multi-modal experts network architecture with a gating mechanism for autonomous driving, addressing computational complexity and overfitting issues in multi-sensor systems, demonstrated on a scaled truck platform.
Contribution
A novel multi-modal experts network with a gating mechanism and multi-stage training for efficient, robust sensor data integration in autonomous driving.
Findings
Effective sensor selection at inference time.
Reduced overfitting to dominant sensors.
Validated on a scaled autonomous vehicle platform.
Abstract
End-to-end learning from sensory data has shown promising results in autonomous driving. While employing many sensors enhances world perception and should lead to more robust and reliable behavior of autonomous vehicles, it is challenging to train and deploy such network and at least two problems are encountered in the considered setting. The first one is the increase of computational complexity with the number of sensing devices. The other is the phenomena of network overfitting to the simplest and most informative input. We address both challenges with a novel, carefully tailored multi-modal experts network architecture and propose a multi-stage training procedure. The network contains a gating mechanism, which selects the most relevant input at each inference time step using a mixed discrete-continuous policy. We demonstrate the plausibility of the proposed approach on our 1/6 scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
