Statistical Arbitrage Risk Premium by Machine Learning
Raymond C. W. Leung, Yu-Man Tam

TL;DR
This paper introduces a machine learning approach to estimate the statistical arbitrage risk premium (SARP) by constructing peer portfolios without knowing specific factors, revealing that unique stocks have higher SARP and excess returns.
Contribution
It develops a novel method using elastic-net to estimate SARP, linking peer-based residual risk to expected returns, and demonstrates its empirical significance.
Findings
High SAR stocks have 1.101% higher monthly SARP than low SAR stocks.
High SAR stocks have 0.710% higher monthly excess returns.
Average SAR is countercyclical across the market.
Abstract
How to hedge factor risks without knowing the identities of the factors? We first prove a general theoretical result: even if the exact set of factors cannot be identified, any risky asset can use some portfolio of similar peer assets to hedge against its own factor exposures. A long position of a risky asset and a short position of a "replicate portfolio" of its peers represent that asset's factor residual risk. We coin the expected return of an asset's factor residual risk as its Statistical Arbitrage Risk Premium (SARP). The challenge in empirically estimating SARP is finding the peers for each asset and constructing the replicate portfolios. We use the elastic-net, a machine learning method, to project each stock's past returns onto that of every other stock. The resulting high-dimensional but sparse projection vector serves as investment weights in constructing the stocks'…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Markets and Investment Strategies · Financial Risk and Volatility Modeling · Complex Systems and Time Series Analysis
