Gradient-Bounded Dynamic Programming with Submodular and Concave   Extensible Value Functions

Denis Lebedev; Paul Goulart; Kostas Margellos

arXiv:2005.11213·math.OC·May 25, 2020

Gradient-Bounded Dynamic Programming with Submodular and Concave Extensible Value Functions

Denis Lebedev, Paul Goulart, Kostas Margellos

PDF

TL;DR

This paper introduces a novel algorithm for high-dimensional dynamic programming problems with submodular and concave extensible value functions, providing bounds and demonstrating effectiveness in delivery slot pricing.

Contribution

The paper presents a new algorithm that computes bounds for complex dynamic programs with specific value function properties, ensuring finite termination.

Findings

01

Algorithm computes deterministic upper bounds and stochastic lower bounds.

02

Proven finite termination of the proposed algorithm.

03

Effective application demonstrated in high-dimensional delivery pricing example.

Abstract

We consider dynamic programming problems with finite, discrete-time horizons and prohibitively high-dimensional, discrete state-spaces for direct computation of the value function from the Bellman equation. For the case that the value function of the dynamic program is concave extensible and submodular in its state-space, we present a new algorithm that computes deterministic upper and stochastic lower bounds of the value function similar to dual dynamic programming. We then show that the proposed algorithm terminates after a finite number of iterations. Finally, we demonstrate the efficacy of our approach on a high-dimensional numerical example from delivery slot pricing in attended home delivery.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.