Risk forecasting using Long Short-Term Memory Mixture Density Networks
Nico Herrig

TL;DR
This paper explores the use of LSTM mixture density networks for Value-at-Risk forecasting, comparing their effectiveness with traditional models across different market conditions, highlighting their potential and current limitations.
Contribution
Introduces and evaluates LSTM-MDN architectures for VaR forecasting, including a novel 3-component model, and assesses their performance against established methods in real market data.
Findings
LSTM-MDNs outperform benchmarks during turbulent periods
Neural networks capture volatility clustering effectively
Limited success during calm market periods
Abstract
This work aims to implement Long Short-Term Memory mixture density networks (LSTM-MDNs) for Value-at-Risk forecasting and compare their performance with established models (historical simulation, CMM, and GARCH) using a defined backtesting procedure. The focus was on the neural network's ability to capture volatility clustering and its real-world applicability. Three architectures were tested: a 2-component mixture density network, a regularized 2-component model (Arimond et al., 2020), and a 3-component mixture model, the latter being tested for the first time in Value-at-Risk forecasting. Backtesting was performed on three stock indices (FTSE 100, S&P 500, EURO STOXX 50) over two distinct two-year periods (2017-2018 as a calm period, 2021-2022 as turbulent). Model performance was assessed through unconditional coverage and independence assumption tests. The neural network's ability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Neural Networks and Applications · Anomaly Detection Techniques and Applications
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Focus
