Off-Policy Optimization of Portfolio Allocation Policies under   Constraints

Nymisha Bandi; Theja Tulabandhula

arXiv:2012.11715·cs.AI·December 23, 2020

Off-Policy Optimization of Portfolio Allocation Policies under Constraints

Nymisha Bandi, Theja Tulabandhula

PDF

Open Access 1 Repo

TL;DR

This paper develops a framework for off-policy optimization of portfolio allocation policies that satisfy constraints, using a minimax approach with off-policy estimators and online learning, validated on historical equities data.

Contribution

It introduces a novel minimax framework for constraint-aware portfolio policy optimization using off-policy data and online learning strategies.

Findings

01

Effective constraint satisfaction in portfolio policies.

02

Robust performance across different market regimes.

03

Promising back-test results on equities data.

Abstract

The dynamic portfolio optimization problem in finance frequently requires learning policies that adhere to various constraints, driven by investor preferences and risk. We motivate this problem of finding an allocation policy within a sequential decision making framework and study the effects of: (a) using data collected under previously employed policies, which may be sub-optimal and constraint-violating, and (b) imposing desired constraints while computing near-optimal policies with this data. Our framework relies on solving a minimax objective, where one player evaluates policies via off-policy estimators, and the opponent uses an online learning strategy to control constraint violations. We extensively investigate various choices for off-policy estimation and their corresponding optimization sub-routines, and quantify their impact on computing constraint-aware allocation policies.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

NymishaBandi/constrained-batch-policy-learning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Reservoir Engineering and Simulation Methods · Risk and Portfolio Optimization