Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Zheli Xiong

TL;DR
This paper explores ensemble reinforcement learning models combined with classifiers to improve financial trading strategies, demonstrating enhanced risk-adjusted returns and stability over individual models.
Contribution
It introduces a novel ensemble approach integrating RL algorithms with classifiers, highlighting the importance of dynamic variance threshold adjustment for optimal performance.
Findings
Ensemble methods outperform individual RL models in risk-adjusted returns.
Dynamic adjustment of variance threshold { au} improves ensemble performance.
Ensemble models better manage drawdowns and stability in trading strategies.
Abstract
This paper presents a comprehensive study on the use of ensemble Reinforcement Learning (RL) models in financial trading strategies, leveraging classifier models to enhance performance. By combining RL algorithms such as A2C, PPO, and SAC with traditional classifiers like Support Vector Machines (SVM), Decision Trees, and Logistic Regression, we investigate how different classifier groups can be integrated to improve risk-return trade-offs. The study evaluates the effectiveness of various ensemble methods, comparing them with individual RL models across key financial metrics, including Cumulative Returns, Sharpe Ratios (SR), Calmar Ratios, and Maximum Drawdown (MDD). Our results demonstrate that ensemble methods consistently outperform base models in terms of risk-adjusted returns, providing better management of drawdowns and overall stability. However, we identify the sensitivity of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStock Market Forecasting Methods
Methods1x1 Convolution · Average Pooling · Convolution · Logistic Regression · Dilated Convolution · Global Average Pooling · Switchable Atrous Convolution · Entropy Regularization · A2C · Proximal Policy Optimization
