A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems
Xin Chen, Yuze Chen, Yuan Zhou

TL;DR
This paper introduces a minimax-MDP framework for sequential decision-making with evolving predictions, enabling robust, competitive policies in uncertain environments with refined future information.
Contribution
It develops a novel minimax-MDP approach with future-imposed conditions, facilitating efficient robust policies for decision problems with predictive uncertainty.
Findings
Framework applied to inventory and resource allocation problems
Policies often have closed-form solutions
Demonstrates robustness and competitiveness of the approach
Abstract
We study a class of sequential decision-making problems with augmented predictions, potentially provided by a machine learning algorithm. In this setting, the decision-maker receives prediction intervals for unknown parameters that become progressively refined over time, and seeks decisions that are competitive with the hindsight optimal under all possible realizations of both parameters and predictions. We propose a minimax Markov Decision Process (minimax-MDP) framework, where the system state consists of an adversarially evolving environment state and an internal state controlled by the decision-maker. We introduce a set of future-imposed conditions that characterize the feasibility of minimax-MDPs and enable the design of efficient, often closed-form, robustly competitive policies. We illustrate the framework through three applications: multi-period inventory ordering with refining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms
MethodsSparse Evolutionary Training
