A Minimax-MDP Framework with Future-imposed Conditions for   Learning-augmented Problems

Xin Chen; Yuze Chen; Yuan Zhou

arXiv:2505.00973·cs.LG·May 5, 2025

A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems

Xin Chen, Yuze Chen, Yuan Zhou

PDF

Open Access

TL;DR

This paper introduces a minimax-MDP framework for sequential decision-making with evolving predictions, enabling robust, competitive policies in uncertain environments with refined future information.

Contribution

It develops a novel minimax-MDP approach with future-imposed conditions, facilitating efficient robust policies for decision problems with predictive uncertainty.

Findings

01

Framework applied to inventory and resource allocation problems

02

Policies often have closed-form solutions

03

Demonstrates robustness and competitiveness of the approach

Abstract

We study a class of sequential decision-making problems with augmented predictions, potentially provided by a machine learning algorithm. In this setting, the decision-maker receives prediction intervals for unknown parameters that become progressively refined over time, and seeks decisions that are competitive with the hindsight optimal under all possible realizations of both parameters and predictions. We propose a minimax Markov Decision Process (minimax-MDP) framework, where the system state consists of an adversarially evolving environment state and an internal state controlled by the decision-maker. We introduce a set of future-imposed conditions that characterize the feasibility of minimax-MDPs and enable the design of efficient, often closed-form, robustly competitive policies. We illustrate the framework through three applications: multi-period inventory ordering with refining…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms

MethodsSparse Evolutionary Training