An Approximate Dynamic Programming Algorithm for Monotone Value Functions
Daniel R. Jiang, Warren B. Powell

TL;DR
This paper introduces Monotone-ADP, a convergent approximate dynamic programming algorithm that leverages the monotonicity of value functions to efficiently solve large-scale Markov Decision Processes, demonstrated through three application domains.
Contribution
The paper presents a new ADP algorithm that exploits value function monotonicity, with proven convergence and significant computational efficiency improvements over traditional methods.
Findings
Achieves high-quality solutions with fewer iterations.
Uses up to 100 times less computation than exact methods.
Effective across multiple application domains.
Abstract
Many sequential decision problems can be formulated as Markov Decision Processes (MDPs) where the optimal value function (or cost-to-go function) can be shown to satisfy a monotone structure in some or all of its dimensions. When the state space becomes large, traditional techniques, such as the backward dynamic programming algorithm (i.e., backward induction or value iteration), may no longer be effective in finding a solution within a reasonable time frame, and thus we are forced to consider other approaches, such as approximate dynamic programming (ADP). We propose a provably convergent ADP algorithm called Monotone-ADP that exploits the monotonicity of the value functions in order to increase the rate of convergence. In this paper, we describe a general finite-horizon problem setting where the optimal value function is monotone, present a convergence proof for Monotone-ADP under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSmart Grid Energy Management · Cardiovascular Function and Risk Factors · Supply Chain and Inventory Management
