MacFormer: Map-Agent Coupled Transformer for Real-time and Robust Trajectory Prediction
Chen Feng, Hangning Zhou, Huadong Lin, Zhigang Zhang, Ziyao Xu, Chi, Zhang, Boyu Zhou, Shaojie Shen

TL;DR
MacFormer is a novel transformer-based framework that integrates map constraints and efficient modules to achieve real-time, robust trajectory prediction for autonomous vehicles, outperforming existing methods on multiple benchmarks.
Contribution
The paper introduces MacFormer, a transformer model with explicit map integration, multi-task optimization, and lightweight design, advancing real-time and robust trajectory prediction.
Findings
Achieved state-of-the-art performance on Argoverse 1, Argoverse 2, and nuScenes datasets.
Demonstrated lowest inference latency and smallest model size among competitors.
Proved robustness to imperfect tracklet inputs and versatility across classical models.
Abstract
Predicting the future behavior of agents is a fundamental task in autonomous vehicle domains. Accurate prediction relies on comprehending the surrounding map, which significantly regularizes agent behaviors. However, existing methods have limitations in exploiting the map and exhibit a strong dependence on historical trajectories, which yield unsatisfactory prediction performance and robustness. Additionally, their heavy network architectures impede real-time applications. To tackle these problems, we propose Map-Agent Coupled Transformer (MacFormer) for real-time and robust trajectory prediction. Our framework explicitly incorporates map constraints into the network via two carefully designed modules named coupled map and reference extractor. A novel multi-task optimization strategy (MTOS) is presented to enhance learning of topology and rule constraints. We also devise bilateral query…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Autonomous Vehicle Technology and Safety · Traffic Prediction and Management Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Adam · Label Smoothing · Layer Normalization · Softmax · Dense Connections
