Scene Compliant Trajectory Forecast with Agent-Centric Spatio-Temporal   Grids

Daniela Ridel; Nachiket Deo; Denis Wolf; Mohan Trivedi

arXiv:1909.07507·cs.CV·September 18, 2019

Scene Compliant Trajectory Forecast with Agent-Centric Spatio-Temporal Grids

Daniela Ridel, Nachiket Deo, Denis Wolf, Mohan Trivedi

PDF

TL;DR

This paper introduces a novel scene-compliant trajectory forecasting model that uses agent-centric spatio-temporal grids to effectively integrate scene context and past motion, outperforming previous methods on the Stanford Drone Dataset.

Contribution

The paper proposes a grid-based representation and a ConvLSTM decoder for joint modeling of scene and trajectory, improving long-term human motion prediction accuracy.

Findings

01

Outperforms prior approaches on Stanford Drone Dataset

02

Produces realistic, scene-compliant future trajectories

03

Effectively encodes scene and motion using convolutional architectures

Abstract

Forecasting long-term human motion is a challenging task due to the non-linearity, multi-modality and inherent uncertainty in future trajectories. The underlying scene and past motion of agents can provide useful cues to predict their future motion. However, the heterogeneity of the two inputs poses a challenge for learning a joint representation of the scene and past trajectories. To address this challenge, we propose a model based on grid representations to forecast agent trajectories. We represent the past trajectories of agents using binary 2-D grids, and the underlying scene as a RGB birds-eye view (BEV) image, with an agent-centric frame of reference. We encode the scene and past trajectories using convolutional layers and generate trajectory forecasts using a Convolutional LSTM (ConvLSTM) decoder. Results on the publicly available Stanford Drone Dataset (SDD) show that our model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory