xMTrans: Temporal Attentive Cross-Modality Fusion Transformer for Long-Term Traffic Prediction
Huy Quang Ung, Hao Niu, Minh-Son Dao, Shinya Wada, Atsunori Minamikawa

TL;DR
xMTrans is a novel transformer-based model that leverages multi-modal data with temporal attention for improved long-term traffic prediction accuracy.
Contribution
The paper introduces xMTrans, a new cross-modality transformer model that effectively captures temporal correlations between different data modalities for traffic prediction.
Findings
xMTrans outperforms state-of-the-art methods on real-world datasets.
The model effectively captures temporal correlations between modalities.
Ablation studies confirm the importance of each module.
Abstract
Traffic predictions play a crucial role in intelligent transportation systems. The rapid development of IoT devices allows us to collect different kinds of data with high correlations to traffic predictions, fostering the development of efficient multi-modal traffic prediction models. Until now, there are few studies focusing on utilizing advantages of multi-modal data for traffic predictions. In this paper, we introduce a novel temporal attentive cross-modality transformer model for long-term traffic predictions, namely xMTrans, with capability of exploring the temporal correlations between the data of two modalities: one target modality (for prediction, e.g., traffic congestion) and one support modality (e.g., people flow). We conducted extensive experiments to evaluate our proposed model on traffic congestion and taxi demand predictions using real-world datasets. The results showed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Neural Networks and Applications
