Integer programming for weakly coupled stochastic dynamic programs with partial information
Victor Cohen, Axel Parmentier

TL;DR
This paper develops algorithms combining integer programming and POMDP techniques to efficiently solve weakly coupled stochastic dynamic programs with partial information, addressing high-dimensional challenges.
Contribution
It introduces the concept of weakly coupled POMDPs and provides novel mixed integer linear formulations and algorithms for their solution.
Findings
The MILP formulation computes optimal memoryless policies.
The linear relaxation offers tight upper bounds on policy value.
Algorithms perform efficiently on benchmark and maintenance problems.
Abstract
This paper introduces algorithms for problems where a decision maker has to control a system composed of several components and has access to only partial information on the state of each component. Such problems are difficult because of the partial observations, and because of the curse of dimensionality that appears when the number of components increases. Partially observable Markov decision processes (POMDPs) have been introduced to deal with the first challenge, while weakly coupled stochastic dynamic programs address the second. Drawing from these two branches of the literature, we introduce the notion of weakly coupled POMDPs. The objective is to find a policy maximizing the total expected reward over a finite horizon. Our algorithms rely on two ingredients. The first, which can be used independently, is a mixed integer linear formulation for generic POMDPs that computes an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSupply Chain and Inventory Management · Reinforcement Learning in Robotics · Energy, Environment, and Transportation Policies
