Hyper-STTN: Hypergraph Augmented Spatial-Temporal Transformer Network for Trajectory Prediction
Weizheng Wang, Baijian Yang, Sungeun Hong, Wenhai Sun, and Byung-Cheol Min

TL;DR
Hyper-STTN introduces a hypergraph-augmented spatial-temporal transformer that effectively models complex group and pairwise interactions for accurate crowd trajectory prediction, outperforming existing methods.
Contribution
The paper presents a novel hypergraph-based spatial-temporal transformer network that captures multiscale groupwise and pairwise interactions for improved trajectory prediction.
Findings
Hyper-STTN outperforms state-of-the-art baselines on public datasets.
Multiscale hypergraph modeling enhances group interaction representation.
Fusion of heterogeneous features improves prediction accuracy.
Abstract
Predicting crowd intentions and trajectories is critical for a range of real-world applications, involving social robotics and autonomous driving. Accurately modeling such behavior remains challenging due to the complexity of pairwise spatial-temporal interactions and the heterogeneous influence of groupwise dynamics. To address these challenges, we propose Hyper-STTN, a Hypergraph-based Spatial-Temporal Transformer Network for crowd trajectory prediction. Hyper-STTN constructs multiscale hypergraphs of varying group sizes to model groupwise correlations, captured through spectral hypergraph convolution based on random-walk probabilities. In parallel, a spatial-temporal transformer is employed to learn pedestrians' pairwise latent interactions across spatial and temporal dimensions. These heterogeneous groupwise and pairwise features are subsequently fused and aligned via a multimodal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Autonomous Vehicle Technology and Safety · Traffic and Road Safety
Methodstravel james · Sparse Evolutionary Training · Multi-Head Attention · Attention Is All You Need · Label Smoothing · Absolute Position Encodings · Layer Normalization · Softmax · Residual Connection · Linear Layer
