Deep Reinforcement Learning for Automated Stock Trading: An Ensemble Strategy
Hongyang Yang, Xiao-Yang Liu, Shan Zhong, Anwar Walid

TL;DR
This paper introduces an ensemble deep reinforcement learning approach for automated stock trading that combines multiple algorithms to adaptively maximize returns and outperform traditional strategies on Dow Jones stocks.
Contribution
It proposes a novel ensemble strategy integrating PPO, A2C, and DDPG algorithms for robust stock trading in dynamic markets.
Findings
Ensemble strategy outperforms individual algorithms in risk-adjusted returns.
The approach effectively adapts to different market conditions.
Open-sourced implementation available on GitHub.
Abstract
Stock trading strategies play a critical role in investment. However, it is challenging to design a profitable strategy in a complex and dynamic stock market. In this paper, we propose an ensemble strategy that employs deep reinforcement schemes to learn a stock trading strategy by maximizing investment return. We train a deep reinforcement learning agent and obtain an ensemble trading strategy using three actor-critic based algorithms: Proximal Policy Optimization (PPO), Advantage Actor Critic (A2C), and Deep Deterministic Policy Gradient (DDPG). The ensemble strategy inherits and integrates the best features of the three algorithms, thereby robustly adjusting to different market situations. In order to avoid the large memory consumption in training networks with continuous action space, we employ a load-on-demand technique for processing very large data. We test our algorithms on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods · Financial Markets and Investment Strategies · Advanced Bandit Algorithms Research
