ADFormer: Aggregation Differential Transformer for Passenger Demand Forecasting
Haichen Wang, Liu Yang, Xinyuan Zhang, Haomin Yu, Ming Li, Jilin Hu

TL;DR
ADFormer introduces a novel transformer model that effectively captures both original and high-level spatio-temporal correlations for improved passenger demand forecasting, demonstrating superior performance on real-world datasets.
Contribution
The paper proposes Differential Attention and aggregation strategies to unify original and high-level correlations, enhancing spatio-temporal modeling in demand forecasting.
Findings
Outperforms existing methods on taxi and bike datasets
Demonstrates improved accuracy and efficiency
Validates the importance of high-level correlation integration
Abstract
Passenger demand forecasting helps optimize vehicle scheduling, thereby improving urban efficiency. Recently, attention-based methods have been used to adequately capture the dynamic nature of spatio-temporal data. However, existing methods that rely on heuristic masking strategies cannot fully adapt to the complex spatio-temporal correlations, hindering the model from focusing on the right context. These works also overlook the high-level correlations that exist in the real world. Effectively integrating these high-level correlations with the original correlations is crucial. To fill this gap, we propose the Aggregation Differential Transformer (ADFormer), which offers new insights to demand forecasting promotion. Specifically, we utilize Differential Attention to capture the original spatial correlations and achieve attention denoising. Meanwhile, we design distinct aggregation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Human Mobility and Location-Based Analysis · Transportation and Mobility Innovations
MethodsAbsolute Position Encodings · Layer Normalization · Byte Pair Encoding · Label Smoothing · Softmax · Dropout · Dense Connections · Transformer · Attention Is All You Need
