Improving Portfolio Optimization Results with Bandit Networks

Gustavo de Freitas Fonseca; Lucas Coelho e Silva; and Paulo Andr\'e; Lima de Castro

arXiv:2410.04217·cs.AI·October 10, 2024

Improving Portfolio Optimization Results with Bandit Networks

Gustavo de Freitas Fonseca, Lucas Coelho e Silva, and Paulo Andr\'e, Lima de Castro

PDF

Open Access 1 Repo

TL;DR

This paper introduces novel bandit algorithms and architectures, including Bandit Networks, to improve portfolio optimization in non-stationary environments, demonstrating superior performance over classical methods in financial data experiments.

Contribution

The paper develops Adaptive Discounted Thompson Sampling and its extension for portfolio optimization, along with Bandit Networks that integrate these algorithms for better dynamic decision-making.

Findings

01

Bandit Networks outperform classical portfolio methods in financial data.

02

Proposed algorithms adapt effectively to non-stationary reward environments.

03

Best network achieves 20% higher out-of-sample Sharpe Ratio.

Abstract

In Reinforcement Learning (RL), multi-armed Bandit (MAB) problems have found applications across diverse domains such as recommender systems, healthcare, and finance. Traditional MAB algorithms typically assume stationary reward distributions, which limits their effectiveness in real-world scenarios characterized by non-stationary dynamics. This paper addresses this limitation by introducing and evaluating novel Bandit algorithms designed for non-stationary environments. First, we present the Adaptive Discounted Thompson Sampling (ADTS) algorithm, which enhances adaptability through relaxed discounting and sliding window mechanisms to better respond to changes in reward distributions. We then extend this approach to the Portfolio Optimization problem by introducing the Combinatorial Adaptive Discounted Thompson Sampling (CADTS) algorithm, which addresses computational challenges within…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gfonseca92/bandits-lib
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reservoir Engineering and Simulation Methods · Stock Market Forecasting Methods