A novel multi-agent dynamic portfolio optimization learning system based on hierarchical deep reinforcement learning
Ruoyu Sun, Yue Xi, Angelos Stefanidis, Zhengyong Jiang, Jionglong Su

TL;DR
This paper introduces a hierarchical multi-agent deep reinforcement learning framework for portfolio optimization, addressing challenges like reward sparsity and high dimensionality to improve risk-adjusted returns.
Contribution
It proposes a novel multi-agent HDRL system with an auxiliary agent to enhance policy exploration and training efficiency in sparse reward environments.
Findings
Improved risk-adjusted profitability in portfolio management.
Enhanced training efficiency through hierarchical multi-agent cooperation.
Overcame curse of dimensionality in DRL-based portfolio optimization.
Abstract
Deep Reinforcement Learning (DRL) has been extensively used to address portfolio optimization problems. The DRL agents acquire knowledge and make decisions through unsupervised interactions with their environment without requiring explicit knowledge of the joint dynamics of portfolio assets. Among these DRL algorithms, the combination of actor-critic algorithms and deep function approximators is the most widely used DRL algorithm. Here, we find that training the DRL agent using the actor-critic algorithm and deep function approximators may lead to scenarios where the improvement in the DRL agent's risk-adjusted profitability is not significant. We propose that such situations primarily arise from the following two problems: sparsity in positive reward and the curse of dimensionality. These limitations prevent DRL agents from comprehensively learning asset price change patterns in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
