Deep Reinforcement Learning and Convex Mean-Variance Optimisation for Portfolio Management
Ruan Pretorius, Terence van Zyl

TL;DR
This paper compares reinforcement learning and traditional convex mean-variance optimisation for portfolio management across different markets, showing RL's potential advantages under certain conditions.
Contribution
It introduces RL models incorporating investor preferences and realistic transaction costs, providing a comprehensive comparison with traditional methods in diverse market scenarios.
Findings
RL outperforms traditional methods in upward trending markets.
RL models match traditional models in sideways markets for most risk levels.
Market conditions influence the relative performance of RL and traditional optimisation.
Abstract
Traditional portfolio management methods can incorporate specific investor preferences but rely on accurate forecasts of asset returns and covariances. Reinforcement learning (RL) methods do not rely on these explicit forecasts and are better suited for multi-stage decision processes. To address limitations of the evaluated research, experiments were conducted on three markets in different economies with different overall trends. By incorporating specific investor preferences into our RL models' reward functions, a more comprehensive comparison could be made to traditional methods in risk-return space. Transaction costs were also modelled more realistically by including nonlinear changes introduced by market volatility and trading volume. The results of this study suggest that there can be an advantage to using RL methods compared to traditional convex mean-variance optimisation methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Markets and Investment Strategies · Advanced Bandit Algorithms Research · Energy Load and Power Forecasting
