StarNet: Joint Action-Space Prediction with Star Graphs and Implicit Global Frame Self-Attention
Faris Janjo\v{s}, Maxim Dolgov, and J. Marius Z\"ollner

TL;DR
StarNet introduces a novel graph-based architecture for multi-agent trajectory prediction, effectively modeling map topology and agent interactions, and extends to joint scene prediction with improved accuracy on real-world datasets.
Contribution
It presents a new star graph-based map modeling approach and extends to joint multi-agent prediction using masked self-attention, enhancing scene understanding.
Findings
Achieves state-of-the-art results on inD and INTERACTION datasets.
Effectively models map topology with star graphs for better prediction.
Joint-StarNet improves multi-agent trajectory forecasts.
Abstract
In this work, we present a novel multi-modal multi-agent trajectory prediction architecture, focusing on map and interaction modeling using graph representation. For the purposes of map modeling, we capture rich topological structure into vector-based star graphs, which enable an agent to directly attend to relevant regions along polylines that are used to represent the map. We denote this architecture StarNet, and integrate it in a single-agent prediction setting. As the main result, we extend this architecture to joint scene-level prediction, which produces multiple agents' predictions simultaneously. The key idea in joint-StarNet is integrating the awareness of one agent in its own reference frame with how it is perceived from the points of view of other agents. We achieve this via masked self-attention. Both proposed architectures are built on top of the action-space prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
