Is attention truly all we need? An empirical study of asset pricing in pretrained RNN sparse and global attention models
Shanyan Lai

TL;DR
This paper empirically evaluates pretrained RNN attention models with various attention mechanisms for asset pricing on large-cap US stocks, demonstrating their ability to better capture temporal dependencies and market dynamics, especially during extreme conditions.
Contribution
First large-scale application of state-of-the-art attention mechanisms in asset pricing, addressing limitations of traditional models and providing insights for future economic research.
Findings
Self-att and Sparse-att models excel in absolute returns and downside risk hedging.
Sparse-att model shows more stable performance across different stock sizes.
Models perform well during COVID-19 market extremes, with high Sortino ratios.
Abstract
This study investigates the pretrained RNN attention models with the mainstream attention mechanisms such as additive attention, Luong's three attentions, global self-attention (Self-att) and sliding window sparse attention (Sparse-att) for the empirical asset pricing research on top 420 large-cap US stocks. This is the first paper on the large-scale state-of-the-art (SOTA) attention mechanisms applied in the asset pricing context. They overcome the limitations of the traditional machine learning (ML) based asset pricing, such as mis-capturing the temporal dependency and short memory. Moreover, the enforced causal masks in the attention mechanisms address the future data leaking issue ignored by the more advanced attention-based models, such as the classic Transformer. The proposed attention models also consider the temporal sparsity characteristic of asset pricing data and mitigate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
