Comparing Deep Learning Models for the Task of Volatility Prediction Using Multivariate Data
Wenbo Ge, Pooia Lalbakhsh, Leigh Isai, Artem Lensky, Hanna Suominen

TL;DR
This paper compares various deep learning models for volatility prediction across multiple assets, demonstrating that advanced architectures like the Temporal Fusion Transformer generally outperform classical models and shallow networks.
Contribution
It provides a comprehensive comparison of deep learning architectures for volatility forecasting, highlighting the superior performance of the Temporal Fusion Transformer over traditional methods.
Findings
Temporal Fusion Transformer outperforms classical GARCH models
Deep architectures outperform shallow networks in volatility prediction
Results are statistically significant across multiple assets
Abstract
This study aims to compare multiple deep learning-based forecasters for the task of predicting volatility using multivariate data. The paper evaluates a range of models, starting from simpler and shallower ones and progressing to deeper and more complex architectures. Additionally, the performance of these models is compared against naive predictions and variations of classical GARCH models. The prediction of volatility for five assets, namely S&P500, NASDAQ100, gold, silver, and oil, is specifically addressed using GARCH models, Multi-Layer Perceptrons, Recurrent Neural Networks, Temporal Convolutional Networks, and the Temporal Fusion Transformer. In the majority of cases, the Temporal Fusion Transformer, followed by variants of the Temporal Convolutional Network, outperformed classical approaches and shallow networks. These experiments were repeated, and the differences observed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMarket Dynamics and Volatility · Stock Market Forecasting Methods · Currency Recognition and Detection
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Layer Normalization · Label Smoothing · Adam · Byte Pair Encoding · Residual Connection
