SocialFormer: Social Interaction Modeling with Edge-enhanced Heterogeneous Graph Transformers for Trajectory Prediction
Zixu Wang, Zhigang Sun, Juergen Luettin, Lavdim Halilaj

TL;DR
SocialFormer is a novel trajectory prediction model that uses edge-enhanced heterogeneous graph transformers and temporal encoding to better capture complex traffic interactions and improve autonomous driving safety.
Contribution
It introduces an edge-enhanced heterogeneous graph transformer and a comprehensive information fusion framework for improved trajectory prediction.
Findings
Achieves state-of-the-art performance on nuScenes benchmark.
Effectively models complex interactions between traffic participants.
Integrates semantic, spatial, and temporal information for accurate predictions.
Abstract
Accurate trajectory prediction is crucial for ensuring safe and efficient autonomous driving. However, most existing methods overlook complex interactions between traffic participants that often govern their future trajectories. In this paper, we propose SocialFormer, an agent interaction-aware trajectory prediction method that leverages the semantic relationship between the target vehicle and surrounding vehicles by making use of the road topology. We also introduce an edge-enhanced heterogeneous graph transformer (EHGT) as the aggregator in a graph neural network (GNN) to encode the semantic and spatial agent interaction information. Additionally, we introduce a temporal encoder based on gated recurrent units (GRU) to model the temporal social behavior of agent movements. Finally, we present an information fusion framework that integrates agent encoding, lane encoding, and agent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Human Mobility and Location-Based Analysis · Traffic and Road Safety
MethodsAttention Is All You Need · Linear Layer · Laplacian EigenMap · Multi-Head Attention · Position-Wise Feed-Forward Layer · Dropout · Label Smoothing · Residual Connection · Softmax · Absolute Position Encodings
