Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies

Zheli Xiong

arXiv:2502.17518·cs.LG·May 21, 2026

Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies

Zheli Xiong

PDF

TL;DR

This paper explores ensemble reinforcement learning models combined with classifiers to improve financial trading strategies, demonstrating enhanced risk-adjusted returns and stability over individual models.

Contribution

It introduces a novel ensemble approach integrating RL algorithms with classifiers, highlighting the importance of dynamic variance threshold adjustment for optimal performance.

Findings

01

Ensemble methods outperform individual RL models in risk-adjusted returns.

02

Dynamic adjustment of variance threshold { au} improves ensemble performance.

03

Ensemble models better manage drawdowns and stability in trading strategies.

Abstract

This paper presents a comprehensive study on the use of ensemble Reinforcement Learning (RL) models in financial trading strategies, leveraging classifier models to enhance performance. By combining RL algorithms such as A2C, PPO, and SAC with traditional classifiers like Support Vector Machines (SVM), Decision Trees, and Logistic Regression, we investigate how different classifier groups can be integrated to improve risk-return trade-offs. The study evaluates the effectiveness of various ensemble methods, comparing them with individual RL models across key financial metrics, including Cumulative Returns, Sharpe Ratios (SR), Calmar Ratios, and Maximum Drawdown (MDD). Our results demonstrate that ensemble methods consistently outperform base models in terms of risk-adjusted returns, providing better management of drawdowns and overall stability. However, we identify the sensitivity of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStock Market Forecasting Methods

Methods1x1 Convolution · Average Pooling · Convolution · Logistic Regression · Dilated Convolution · Global Average Pooling · Switchable Atrous Convolution · Entropy Regularization · A2C · Proximal Policy Optimization