A Block Coordinate Ascent Algorithm for Mean-Variance Optimization

Bo Liu; Tengyang Xie; Yangyang Xu; Mohammad Ghavamzadeh; Yinlam Chow; Daoming Lyu; Daesub Yoon

arXiv:1809.02292·cs.LG·November 5, 2018·25 cites

Open Access

TL;DR

This paper introduces a model-free block coordinate ascent algorithm for mean-variance optimization, providing finite-sample guarantees and addressing tuning challenges of existing stochastic approximation methods.

Contribution

It develops a novel stochastic block coordinate ascent policy search method with convergence guarantees and finite-sample error bounds for mean-variance optimization.
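To make the idea of alternating block updates concrete, here is a minimal toy sketch, not the authors' algorithm. It assumes a hypothetical one-step, two-armed problem with Gaussian rewards, a softmax policy with parameters `theta`, a risk weight `lam`, and a scalar auxiliary variable `y` that comes from the identity (E[R])^2 = max_y (2y E[R] - y^2). All names, step sizes, and the reward model are illustrative assumptions.

```python
import numpy as np


def softmax(theta):
    """Numerically stable softmax over a 1-D parameter vector."""
    z = np.exp(theta - theta.max())
    return z / z.sum()


def block_coordinate_ascent(lam=1.0, iters=500, batch=64,
                            lr_theta=0.05, lr_y=0.1, seed=0):
    """Toy stochastic block coordinate ascent on
    f(theta, y) = (1 + 2*lam*y) E[R] - lam E[R^2] - lam y^2,
    whose maximum over y equals E[R] - lam * Var(R)."""
    rng = np.random.default_rng(seed)
    # Hypothetical arms: arm 0 is safe, arm 1 has a higher mean but is risky.
    means = np.array([1.0, 1.5])
    stds = np.array([0.1, 2.0])
    theta = np.zeros(2)
    y = 0.0
    for _ in range(iters):
        pi = softmax(theta)
        actions = rng.choice(2, size=batch, p=pi)
        rewards = rng.normal(means[actions], stds[actions])

        # Block 1: ascent step in y; df/dy = 2*lam*(E[R] - y).
        y += lr_y * 2.0 * lam * (rewards.mean() - y)

        # Block 2: likelihood-ratio (REINFORCE-style) ascent step in theta,
        # reusing the same batch for simplicity.
        weight = (1.0 + 2.0 * lam * y) * rewards - lam * rewards ** 2
        score = np.eye(2)[actions] - pi  # grad of log softmax, shape (batch, 2)
        theta += lr_theta * (weight[:, None] * score).mean(axis=0)
    return softmax(theta), y
```

Calling `block_coordinate_ascent()` returns the final action probabilities and the value of `y`. With the assumed rewards and `lam=1.0`, the safe arm has the higher mean-variance score, so the learned policy should concentrate on arm 0 and `y` should track the mean reward.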

Findings

01. Convergence guarantees for both the last iterate and a randomly selected solution.

02. Finite-sample error bounds, with respect to local optima.

03. Validation on several benchmark domains.

Abstract

Risk management in dynamic decision problems is a primary concern in many fields, including financial investment, autonomous driving, and healthcare. The mean-variance function is one of the most widely used objective functions in risk management due to its simplicity and interpretability. Existing algorithms for mean-variance optimization are based on multi-time-scale stochastic approximation, whose learning rate schedules are often hard to tune and which have only asymptotic convergence proofs. In this paper, we develop a model-free policy search framework for mean-variance optimization with finite-sample error bound analysis (to local optima). Our starting point is a reformulation of the original mean-variance function with its Fenchel dual, from which we propose a stochastic block coordinate ascent policy search algorithm. Both the asymptotic convergence guarantee of the last iteration's…
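As a hedged illustration of the Fenchel-dual reformulation the abstract refers to, the standard identity for the square function can be applied to the squared-mean term of the variance. The notation below is assumed for illustration and may differ from the paper's: R is the return under policy parameters θ, λ ≥ 0 is the risk weight, and y is a scalar auxiliary variable.

```latex
\begin{align*}
J_\lambda(\theta)
  &= \mathbb{E}[R] - \lambda\,\mathrm{Var}(R)
   = \mathbb{E}[R] - \lambda\,\mathbb{E}[R^2] + \lambda\,\bigl(\mathbb{E}[R]\bigr)^2, \\
\bigl(\mathbb{E}[R]\bigr)^2
  &= \max_{y \in \mathbb{R}} \bigl( 2y\,\mathbb{E}[R] - y^2 \bigr)
   \quad \text{(attained at } y = \mathbb{E}[R]\text{)}, \\
J_\lambda(\theta)
  &= \max_{y \in \mathbb{R}} \Bigl( (1 + 2\lambda y)\,\mathbb{E}[R]
     - \lambda\,\mathbb{E}[R^2] - \lambda y^2 \Bigr).
\end{align*}
```

In this form the objective is a joint maximization over (θ, y) in which every expectation appears linearly, so each block admits an unbiased sample-based gradient estimate. That structure is what makes alternating updates in θ and y natural.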

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Risk and Portfolio Optimization · Advanced Bandit Algorithms Research