# INFER: INtermediate representations for FuturE pRediction

**Authors:** Shashank Srikanth, Junaid Ahmed Ansari, Karnik Ram R, Sarthak, Sharma, Krishna Murthy J., Madhava Krishna K

arXiv: 1903.10641 · 2019-03-27

## TL;DR

This paper introduces semantic-based intermediate representations for vehicle trajectory prediction in urban driving, significantly improving generalization across diverse datasets and driving conditions.

## Contribution

It proposes a novel semantic representation approach and autoregressive model that outperform pixel-based methods in trajectory prediction and generalize across datasets and countries.

## Key findings

- Semantic representations outperform raw pixel methods.
- Models generalize across datasets from different cities and countries.
- Application demonstrated in multi-object tracking.

## Abstract

In urban driving scenarios, forecasting future trajectories of surrounding vehicles is of paramount importance. While several approaches for the problem have been proposed, the best-performing ones tend to require extremely detailed input representations (eg. image sequences). But, such methods do not generalize to datasets they have not been trained on. We propose intermediate representations that are particularly well-suited for future prediction. As opposed to using texture (color) information, we rely on semantics and train an autoregressive model to accurately predict future trajectories of traffic participants (vehicles) (see fig. above). We demonstrate that using semantics provides a significant boost over techniques that operate over raw pixel intensities/disparities. Uncharacteristic of state-of-the-art approaches, our representations and models generalize to completely different datasets, collected across several cities, and also across countries where people drive on opposite sides of the road (left-handed vs right-handed driving). Additionally, we demonstrate an application of our approach in multi-object tracking (data association). To foster further research in transferrable representations and ensure reproducibility, we release all our code and data.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.10641/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/1903.10641/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/1903.10641/full.md

---
Source: https://tomesphere.com/paper/1903.10641