A Comparison of LSTMs and Attention Mechanisms for Forecasting Financial Time Series
Thomas Hollis, Antoine Viscardi, Seung Eun Yi

TL;DR
This paper compares LSTM models with and without attention mechanisms for financial time series forecasting, showing that attention can improve performance but also introduces new challenges.
Contribution
The paper introduces an LSTM with attention mechanism and compares its performance against standard LSTMs on stock data.
Findings
LSTM with attention outperforms standalone LSTM in some cases
Both models achieve up to 60% accuracy on stock forecasting
Attention mechanisms can help mitigate long-term dependency issues
Abstract
While LSTMs show increasingly promising results for forecasting Financial Time Series (FTS), this paper seeks to assess if attention mechanisms can further improve performance. The hypothesis is that attention can help prevent long-term dependencies experienced by LSTM models. To test this hypothesis, the main contribution of this paper is the implementation of an LSTM with attention. Both the benchmark LSTM and the LSTM with attention were compared and both achieved reasonable performances of up to 60% on five stocks from Kaggle's Two Sigma dataset. This comparative analysis demonstrates that an LSTM with attention can indeed outperform standalone LSTMs but further investigation is required as issues do arise with such model architectures.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Energy Load and Power Forecasting · Forecasting Techniques and Applications
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
