Making Good on LSTMs' Unfulfilled Promise
Daniel Philps, Artur d'Avila Garcez, Tillman Weyde

TL;DR
This paper demonstrates that memory-augmented continual learning approaches, especially those based on feed-forward neural networks, outperform traditional LSTMs in real-world financial time-series tasks, offering better explainability and robustness.
Contribution
It introduces Continual Learning Augmentation (CLA) as a flexible framework, compares different learners and similarity measures, and highlights the superiority of FFNN-based CL over LSTM-based CL in financial applications.
Findings
FFNN-based CL outperforms LSTM-based CL and simple sliding window models.
Warp-AE similarity approach yields the best performance in memory recall.
Time-series noise impacts similarity measures, with warp-AE being most robust.
Abstract
LSTMs promise much to financial time-series analysis, temporal and cross-sectional inference, but we find that they do not deliver in a real-world financial management task. We examine an alternative called Continual Learning (CL), a memory-augmented approach, which can provide transparent explanations, i.e. which memory did what and when. This work has implications for many financial applications including credit, time-varying fairness in decision making and more. We make three important new observations. Firstly, as well as being more explainable, time-series CL approaches outperform LSTMs as well as a simple sliding window learner using feed-forward neural networks (FFNN). Secondly, we show that CL based on a sliding window learner (FFNN) is more effective than CL based on a sequential learner (LSTM). Thirdly, we examine how real-world, time-series noise impacts several similarity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Data Stream Mining Techniques · Stock Market Forecasting Methods
MethodsTest · Dynamic Time Warping · Autoencoders · Sigmoid Activation · Tanh Activation · Long Short-Term Memory
