A Deep Learning Approach for Trading Factor Residuals
Wo Long, Victor Xiao

TL;DR
This paper replicates a deep learning-based arbitrage methodology on recent data, achieving high out-of-sample Sharpe ratios but raising questions about overfitting and real-world applicability.
Contribution
It extends the original DLSA methodology to a new time period with strict PIT principles, providing insights into model performance and potential overfitting issues.
Findings
Out-of-sample Sharpe ratios sometimes exceed 10
Strong performance metrics observed in certain tests
Potential overfitting or market-specific effects identified
Abstract
The residuals in factor models prevalent in asset pricing presents opportunities to exploit the mis-pricing from unexplained cross-sectional variation for arbitrage. We performed a replication of the methodology of Guijarro-Ordonez et al. (2019) (G-P-Z) on Deep Learning Statistical Arbitrage (DLSA), originally applied to U.S. equity data from 1998 to 2016, using a more recent out-of-sample period from 2016 to 2024. Adhering strictly to point-in-time (PIT) principles and ensuring no information leakage, we follow the same data pre-processing, factor modeling, and deep learning architectures (CNNs and Transformers) as outlined by G-P-Z. Our replication yields unusually strong performance metrics in certain tests, with out-of-sample Sharpe ratios occasionally exceeding 10. While such results are intriguing, they may indicate model overfitting, highly specific market conditions, or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
MethodsALIGN
