Risk-Aware Multi-Armed Bandit Problem with Application to Portfolio   Selection

Xiaoguang Huo; Feng Fu

arXiv:1709.04415·q-fin.PM·September 14, 2017·2 cites

Risk-Aware Multi-Armed Bandit Problem with Application to Portfolio Selection

Xiaoguang Huo, Feng Fu

PDF

Open Access

TL;DR

This paper introduces a risk-aware multi-armed bandit algorithm for portfolio selection, balancing risk and return by integrating market structure filtering and coherent risk measures within a reinforcement learning framework.

Contribution

It extends the classic multi-armed bandit model by incorporating risk-awareness and market topology, providing a novel approach to sequential portfolio optimization.

Findings

01

Effective risk-return balance achieved

02

Incorporates market topology into portfolio selection

03

Demonstrates improved decision-making under uncertainty

Abstract

Sequential portfolio selection has attracted increasing interests in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential decision making under uncertainty, namely the exploration versus exploitation dilemma, and therefore provides a natural connection to portfolio selection. In this paper, we incorporate risk-awareness into the classic multi-armed bandit setting and introduce an algorithm to construct portfolio. Through filtering assets based on the topological structure of financial market and combining the optimal multi-armed bandit policy with the minimization of a coherent risk measure, we achieve a balance between risk and return.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Smart Grid Energy Management