On Bellman's principle with inequality constraints

Edwin K. P. Chong; Scott A. Miller; Jason Adaska

arXiv:1111.3271·math.OC·November 15, 2011

TL;DR

This paper addresses an example of a constrained Markov decision process in which Bellman's principle appears to be violated, and proposes a modified form of the principle that accounts for a change of constraint at states reachable from the initial state.

Contribution

It introduces a revised form of Bellman's principle for constrained MDPs that accounts for a change of constraint at reachable states, resolving the apparent violation in Haviv's (1996) example.

Findings

01

A modified form of Bellman's principle is preserved once the constraint is allowed to change at reachable states.

02

Resolves Haviv's (1996) example of an apparent violation of Bellman's principle.

03

Provides a framework for constrained decision processes with dynamic constraints.

Abstract

We consider an example by Haviv (1996) of a constrained Markov decision process that, in some sense, violates Bellman's principle. We resolve this issue by showing how to preserve a form of Bellman's principle that accounts for a change of constraint at states that are reachable from the initial state.
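The idea of a "change of constraint" at reachable states can be illustrated with a minimal sketch. The two-state problem below is hypothetical and is not Haviv's example. The names `PROB`, `ACTIONS`, `BUDGET`, `solve_from_start`, and `solve_tail` are assumptions made for illustration, policies are restricted to deterministic ones, and the adjusted budget used here (the cost the optimal policy actually incurs at each state) is one reading of the abstract rather than the paper's exact formulation.

```python
from itertools import product

# Hypothetical toy problem (not Haviv's example): from the initial state, chance
# moves to state "A" or "B" with probability 0.5 each; one action is then taken.
PROB = {"A": 0.5, "B": 0.5}
# ACTIONS[state][action] = (reward, cost)
ACTIONS = {
    "A": {"cheap": (1.0, 0.0), "costly": (3.0, 2.0)},
    "B": {"cheap": (1.0, 0.0), "costly": (10.0, 2.0)},
}
BUDGET = 1.0  # constraint: expected cost from the initial state <= BUDGET


def solve_from_start(budget):
    """Brute-force the best deterministic policy under the expected-cost constraint."""
    states = list(ACTIONS)
    best_policy, best_reward = None, float("-inf")
    for choice in product(*(ACTIONS[s] for s in states)):
        policy = dict(zip(states, choice))
        reward = sum(PROB[s] * ACTIONS[s][policy[s]][0] for s in states)
        cost = sum(PROB[s] * ACTIONS[s][policy[s]][1] for s in states)
        if cost <= budget and reward > best_reward:
            best_policy, best_reward = policy, reward
    return best_policy, best_reward


def solve_tail(state, budget):
    """Best action at `state` when the cost incurred there must be <= budget."""
    feasible = {a: rc for a, rc in ACTIONS[state].items() if rc[1] <= budget}
    return max(feasible, key=lambda a: feasible[a][0])


policy, value = solve_from_start(BUDGET)
# Tail problems with the original budget, versus the budget that the optimal
# policy actually spends at each reachable state.
naive_tail = {s: solve_tail(s, BUDGET) for s in ACTIONS}
adjusted_tail = {s: solve_tail(s, ACTIONS[s][policy[s]][1]) for s in ACTIONS}

print("optimal policy:", policy, "value:", value)
print("tail with original budget:", naive_tail)
print("tail with adjusted budget:", adjusted_tail)
```

In this toy instance the optimal policy spends the entire expected budget at state B, so its action there is infeasible for the tail problem under the original budget. Re-solving the tail problem under the adjusted budget recovers the same action, which is the flavor of consistency the modified principle is meant to restore.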
