Bridging the gap between Markowitz planning and deep reinforcement learning
Eric Benhamou, David Saltiel, Sandrine Ungari, Abhishek Mukhopadhyay

TL;DR
This paper explores how deep reinforcement learning can be applied to portfolio allocation, offering a flexible, adaptive approach that overcomes traditional financial assumptions and incorporates complex data inputs.
Contribution
It demonstrates the potential of DRL for portfolio management by framing it as a continuous control problem, bridging finance and machine learning methodologies.
Findings
DRL can adapt to changing market conditions.
DRL does not rely on traditional risk assumptions.
Preliminary experiments with convolutional networks show promise.
Abstract
While researchers in the asset management industry have mostly focused on techniques based on financial and risk planning techniques like Markowitz efficient frontier, minimum variance, maximum diversification or equal risk parity, in parallel, another community in machine learning has started working on reinforcement learning and more particularly deep reinforcement learning to solve other decision making problems for challenging task like autonomous driving, robot learning, and on a more conceptual side games solving like Go. This paper aims to bridge the gap between these two approaches by showing Deep Reinforcement Learning (DRL) techniques can shed new lights on portfolio allocation thanks to a more general optimization setting that casts portfolio allocation as an optimal control problem that is not just a one-step optimization, but rather a continuous control optimization with a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConvolution
