MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management
Zhenhan Huang, Fumihide Tanaka

TL;DR
This paper introduces MSPM, a modular multi-agent reinforcement learning system for financial portfolio management that enhances scalability and reusability, outperforming traditional methods on long-term stock data.
Contribution
The paper proposes a novel modular multi-agent RL system with reusable asset-specific agents, improving scalability and adaptability in dynamic financial markets.
Findings
MSPM outperforms five baselines in accumulated return.
EAM modules significantly boost portfolio performance.
System demonstrates high reusability and scalability in experiments.
Abstract
Financial portfolio management (PM) is one of the most applicable problems in reinforcement learning (RL) owing to its sequential decision-making nature. However, existing RL-based approaches rarely focus on scalability or reusability to adapt to the ever-changing markets. These approaches are rigid and unscalable to accommodate the varying number of assets of portfolios and increasing need for heterogeneous data. Also, RL agents in the existing systems are ad-hoc trained and hardly reusable for different portfolios. To confront the above problems, a modular design is desired for the systems to be compatible with reusable asset-dedicated agents. In this paper, we propose a multi-agent RL-based system for PM (MSPM). MSPM involves two types of asynchronously-updated modules: Evolving Agent Module (EAM) and Strategic Agent Module (SAM). An EAM is an information-generating module with a DQN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
