Goal-driven Self-Attentive Recurrent Networks for Trajectory Prediction
Luigi Filippo Chiara, Pasquale Coscia, Sourav Das, Simone Calderara,, Rita Cucchiara, Lamberto Ballan

TL;DR
This paper introduces a lightweight, attention-based recurrent model for human trajectory prediction that, when combined with a scene-aware goal estimation module, achieves competitive accuracy with reduced complexity.
Contribution
The paper presents a novel, simple recurrent architecture enhanced with a scene-aware goal module, improving trajectory prediction accuracy while maintaining low model complexity.
Findings
Performs on par with state-of-the-art methods
Reduces model complexity significantly
Effective scene-aware goal estimation improves accuracy
Abstract
Human trajectory forecasting is a key component of autonomous vehicles, social-aware robots and advanced video-surveillance applications. This challenging task typically requires knowledge about past motion, the environment and likely destination areas. In this context, multi-modality is a fundamental aspect and its effective modeling can be beneficial to any architecture. Inferring accurate trajectories is nevertheless challenging, due to the inherently uncertain nature of the future. To overcome these difficulties, recent models use different inputs and propose to model human intentions using complex fusion mechanisms. In this respect, we propose a lightweight attention-based recurrent backbone that acts solely on past observed positions. Although this backbone already provides promising results, we demonstrate that its prediction accuracy can be improved considerably when combined…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Autonomous Vehicle Technology and Safety · Human Pose and Action Recognition
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Concatenated Skip Connection · Convolution · U-Net
