TL;DR
This paper proposes a multi-stream RNN model for predicting future vehicle locations and scales from egocentric views, incorporating optical flow and ego-vehicle motion modeling, evaluated on a new challenging dataset.
Contribution
It introduces a novel multi-stream RNN approach that jointly predicts vehicle position and scale in first-person views, enhanced by optical flow and ego-vehicle motion modeling.
Findings
Optical flow significantly improves prediction accuracy.
Modeling ego-vehicle motion enhances localization performance.
The approach performs well on a new diverse dataset.
Abstract
Predicting the future location of vehicles is essential for safety-critical applications such as advanced driver assistance systems (ADAS) and autonomous driving. This paper introduces a novel approach to simultaneously predict both the location and scale of target vehicles in the first-person (egocentric) view of an ego-vehicle. We present a multi-stream recurrent neural network (RNN) encoder-decoder model that separately captures both object location and scale and pixel-level observations for future vehicle localization. We show that incorporating dense optical flow improves prediction results significantly since it captures information about motion as well as appearance change. We also find that explicitly modeling future motion of the ego-vehicle improves the prediction accuracy, which could be especially beneficial in intelligent and automated vehicles that have motion planning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
