Vision-Guided Forecasting -- Visual Context for Multi-Horizon Time Series Forecasting
Eitan Kosman, Dotan Di Castro

TL;DR
This paper introduces a multi-modal forecasting approach for vehicle states in autonomous driving, combining visual and sensor data to improve prediction accuracy across multiple horizons, outperforming existing methods.
Contribution
It proposes three end-to-end architectures that fuse visual and sensor data for multi-horizon vehicle state forecasting, demonstrating significant performance improvements.
Findings
Vision features reduce forecasting error by over 50%.
Models with visual data outperform those without by large margins.
The approach achieves state-of-the-art results on public datasets.
Abstract
Autonomous driving gained huge traction in recent years, due to its potential to change the way we commute. Much effort has been put into trying to estimate the state of a vehicle. Meanwhile, learning to forecast the state of a vehicle ahead introduces new capabilities, such as predicting dangerous situations. Moreover, forecasting brings new supervision opportunities by learning to predict richer a context, expressed by multiple horizons. Intuitively, a video stream originated from a front-facing camera is necessary because it encodes information about the upcoming road. Besides, historical traces of the vehicle's states give more context. In this paper, we tackle multi-horizon forecasting of vehicle states by fusing the two modalities. We design and experiment with 3 end-to-end architectures that exploit 3D convolutions for visual features extraction and 1D convolutions for features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications
