CAD: Clustering And Deep Reinforcement Learning Based Multi-Period Portfolio Management Strategy
Zhengyong Jiang, Jeyan Thiayagalingam, Jionglong Su, Jinjun Liang

TL;DR
This paper introduces a novel multi-period portfolio management strategy combining clustering and deep reinforcement learning, demonstrating superior performance over traditional methods through extensive back-testing on real stock data.
Contribution
It is the first to integrate clustering with reinforcement learning for multi-period portfolio management, enhancing decision-making and returns.
Findings
Achieved an average return of 151% over 360 periods.
Outperformed traditional strategies like Robust Median Reversion.
Validated effectiveness across multiple datasets and metrics.
Abstract
In this paper, we present a novel trading strategy that integrates reinforcement learning methods with clustering techniques for portfolio management in multi-period trading. Specifically, we leverage the clustering method to categorize stocks into various clusters based on their financial indices. Subsequently, we utilize the algorithm Asynchronous Advantage Actor-Critic to determine the trading actions for stocks within each cluster. Finally, we employ the algorithm DDPG to generate the portfolio weight vector, which decides the amount of stocks to buy, sell, or hold according to the trading actions of different clusters. To the best of our knowledge, our approach is the first to combine clustering methods and reinforcement learning methods for portfolio management in the context of multi-period trading. Our proposed strategy is evaluated using a series of back-tests on four…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFinancial Markets and Investment Strategies · Stock Market Forecasting Methods · Advanced Bandit Algorithms Research
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Adam · Weight Decay · Experience Replay · Batch Normalization · Convolution · Dense Connections · Deep Deterministic Policy Gradient
