MSPM: A Modularized and Scalable Multi-Agent Reinforcement Learning-based System for Financial Portfolio Management

Zhenhan Huang; Fumihide Tanaka

arXiv:2102.03502·q-fin.PM·February 22, 2022

TL;DR

This paper introduces MSPM, a modular multi-agent reinforcement learning system for financial portfolio management designed for scalability and reusability. In the reported experiments on long-term stock data, it outperforms five baselines in accumulated return.

Contribution

The paper proposes a novel modular multi-agent RL system with reusable asset-specific agents, improving scalability and adaptability in dynamic financial markets.

Findings

01 MSPM outperforms five baselines in accumulated return.

02 EAM modules significantly boost portfolio performance.

03 System demonstrates high reusability and scalability in experiments.
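
The first finding is stated in terms of accumulated return. As a minimal sketch, assuming the common definition in which simple per-period returns are compounded (the paper's exact formula is not shown on this page and may differ), the metric can be computed as below. The function name `accumulated_return` and the sample values are illustrative assumptions, not taken from the paper.

```python
from math import prod


def accumulated_return(period_returns):
    """Compound simple per-period returns into a single accumulated return.

    Assumes each entry is a simple return for one period (0.10 means +10%).
    This is the standard compounding definition, used here as an assumption.
    """
    return prod(1.0 + r for r in period_returns) - 1.0


# Hypothetical per-period portfolio returns, for illustration only.
sample_returns = [0.10, -0.05, 0.02]
print(f"{accumulated_return(sample_returns):.4f}")
```

Compounding is used rather than summing because each period's gain or loss applies to the portfolio value left by the previous period.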

Abstract

Financial portfolio management (PM) is one of the most applicable problems in reinforcement learning (RL) owing to its sequential decision-making nature. However, existing RL-based approaches rarely focus on scalability or reusability to adapt to ever-changing markets. These approaches are rigid and cannot scale to accommodate the varying number of assets in portfolios and the increasing need for heterogeneous data. Also, RL agents in existing systems are trained ad hoc and are hardly reusable for different portfolios. To confront these problems, a modular design is desired so that systems are compatible with reusable asset-dedicated agents. In this paper, we propose a multi-agent RL-based system for PM (MSPM). MSPM involves two types of asynchronously updated modules: the Evolving Agent Module (EAM) and the Strategic Agent Module (SAM). An EAM is an information-generating module with a DQN…
