First-Order Methods for Wasserstein Distributionally Robust MDP

Julien Grand-Cl\'ement; Christian Kroer

arXiv:2009.06790·math.OC·May 5, 2021·5 cites

First-Order Methods for Wasserstein Distributionally Robust MDP

Julien Grand-Cl\'ement, Christian Kroer

PDF

Open Access 1 Video

TL;DR

This paper introduces a first-order method framework for solving Wasserstein distributionally robust MDPs, achieving faster convergence and better scalability than existing approaches, with strong empirical performance.

Contribution

The paper develops a novel first-order optimization framework for Wasserstein distributionally robust MDPs, improving convergence rates and computational efficiency over prior methods.

Findings

01

Proposed algorithms achieve a convergence rate of $O(NA^{2.5}S^{3.5} ext{log}(S) ext{log}(rac{1}{ extepsilon}) extepsilon^{-1.5}$.

02

Dependence on $N$, $A$, and $S$ is significantly better than existing methods.

03

Numerical experiments demonstrate superior scalability and performance across multiple domains.

Abstract

Markov decision processes (MDPs) are known to be sensitive to parameter specification. Distributionally robust MDPs alleviate this issue by allowing for \emph{ambiguity sets} which give a set of possible distributions over parameter sets. The goal is to find an optimal policy with respect to the worst-case parameter distribution. We propose a framework for solving Distributionally robust MDPs via first-order methods, and instantiate it for several types of Wasserstein ambiguity sets. By developing efficient proximal updates, our algorithms achieve a convergence rate of $O (N A^{2.5} S^{3.5} lo g (S) lo g (ϵ^{- 1}) ϵ^{- 1.5})$ for the number of kernels $N$ in the support of the nominal distribution, states $S$ , and actions $A$ ; this rate varies slightly based on the Wasserstein setup. Our dependence on $N, A$ and $S$ is significantly better than existing methods, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

First-Order Methods for Wasserstein Distributionally Robust MDP· slideslive

Taxonomy

TopicsRisk and Portfolio Optimization · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications