Deep Reinforcement Learning and Convex Mean-Variance Optimisation for   Portfolio Management

Ruan Pretorius; Terence van Zyl

arXiv:2203.11318·q-fin.PM·March 23, 2022

Deep Reinforcement Learning and Convex Mean-Variance Optimisation for Portfolio Management

Ruan Pretorius, Terence van Zyl

PDF

Open Access

TL;DR

This paper compares reinforcement learning and traditional convex mean-variance optimisation for portfolio management across different markets, showing RL's potential advantages under certain conditions.

Contribution

It introduces RL models incorporating investor preferences and realistic transaction costs, providing a comprehensive comparison with traditional methods in diverse market scenarios.

Findings

01

RL outperforms traditional methods in upward trending markets.

02

RL models match traditional models in sideways markets for most risk levels.

03

Market conditions influence the relative performance of RL and traditional optimisation.

Abstract

Traditional portfolio management methods can incorporate specific investor preferences but rely on accurate forecasts of asset returns and covariances. Reinforcement learning (RL) methods do not rely on these explicit forecasts and are better suited for multi-stage decision processes. To address limitations of the evaluated research, experiments were conducted on three markets in different economies with different overall trends. By incorporating specific investor preferences into our RL models' reward functions, a more comprehensive comparison could be made to traditional methods in risk-return space. Transaction costs were also modelled more realistically by including nonlinear changes introduced by market volatility and trading volume. The results of this study suggest that there can be an advantage to using RL methods compared to traditional convex mean-variance optimisation methods…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFinancial Markets and Investment Strategies · Advanced Bandit Algorithms Research · Energy Load and Power Forecasting