Improving Portfolio Optimization Results with Bandit Networks
Gustavo de Freitas Fonseca, Lucas Coelho e Silva, and Paulo Andr\'e, Lima de Castro

TL;DR
This paper introduces novel bandit algorithms and architectures, including Bandit Networks, to improve portfolio optimization in non-stationary environments, demonstrating superior performance over classical methods in financial data experiments.
Contribution
The paper develops Adaptive Discounted Thompson Sampling and its extension for portfolio optimization, along with Bandit Networks that integrate these algorithms for better dynamic decision-making.
Findings
Bandit Networks outperform classical portfolio methods in financial data.
Proposed algorithms adapt effectively to non-stationary reward environments.
Best network achieves 20% higher out-of-sample Sharpe Ratio.
Abstract
In Reinforcement Learning (RL), multi-armed Bandit (MAB) problems have found applications across diverse domains such as recommender systems, healthcare, and finance. Traditional MAB algorithms typically assume stationary reward distributions, which limits their effectiveness in real-world scenarios characterized by non-stationary dynamics. This paper addresses this limitation by introducing and evaluating novel Bandit algorithms designed for non-stationary environments. First, we present the Adaptive Discounted Thompson Sampling (ADTS) algorithm, which enhances adaptability through relaxed discounting and sliding window mechanisms to better respond to changes in reward distributions. We then extend this approach to the Portfolio Optimization problem by introducing the Combinatorial Adaptive Discounted Thompson Sampling (CADTS) algorithm, which addresses computational challenges within…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Reservoir Engineering and Simulation Methods · Stock Market Forecasting Methods
