Efficient Linear Attention for Multivariate Time Series Modeling via Entropy Equality
Mingtao Zhang, Guoli Yang, Zhanxing Zhu, Mengzhu Wang, Xiaoying Bai

TL;DR
This paper introduces a linear attention mechanism for multivariate time series that reduces computational complexity by leveraging entropy equality, enabling scalable and efficient modeling with competitive forecasting results.
Contribution
The work presents a novel entropy-based linear attention mechanism that significantly improves scalability for long sequences in time series modeling.
Findings
Achieves linear complexity in attention computation.
Demonstrates superior or comparable forecasting accuracy.
Reduces memory usage and computational time substantially.
Abstract
Attention mechanisms have been extensively employed in various applications, including time series modeling, owing to their capacity to capture intricate dependencies; however, their utility is often constrained by quadratic computational complexity, which impedes scalability for long sequences. In this work, we propose a novel linear attention mechanism designed to overcome these limitations. Our approach is grounded in a theoretical demonstration that entropy, as a strictly concave function on the probability simplex, implies that distributions with aligned probability rankings and similar entropy values exhibit structural resemblance. Building on this insight, we develop an efficient approximation algorithm that computes the entropy of dot-product-derived distributions with only linear complexity, enabling the implementation of a linear attention mechanism based on entropy equality.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Machine Learning in Healthcare · Forecasting Techniques and Applications
