XicorAttention: Time Series Transformer Using Attention with Nonlinear Correlation
Daichi Kimura, Tomonori Izumitani, Hisashi Kashima

TL;DR
XicorAttention introduces a nonlinear correlation-based attention mechanism for time series forecasting, enhancing accuracy by capturing complex dependencies that traditional methods may miss.
Contribution
The paper proposes a novel attention mechanism using Chatterjee's rank correlation, integrated with SoftSort and SoftRank, to better model nonlinear dependencies in time series data.
Findings
Improved forecasting accuracy by up to 9.1% over existing models.
Effective integration of nonlinear correlation into Transformer architectures.
Demonstrated benefits on real-world datasets.
Abstract
Various Transformer-based models have been proposed for time series forecasting. These models leverage the self-attention mechanism to capture long-term temporal or variate dependencies in sequences. Existing methods can be divided into two approaches: (1) reducing computational cost of attention by making the calculations sparse, and (2) reshaping the input data to aggregate temporal features. However, existing attention mechanisms may not adequately capture inherent nonlinear dependencies present in time series data, leaving room for improvement. In this study, we propose a novel attention mechanism based on Chatterjee's rank correlation coefficient, which measures nonlinear dependencies between variables. Specifically, we replace the matrix multiplication in standard attention mechanisms with this rank coefficient to measure the query-key relationship. Since computing Chatterjee's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
