Bidding Games on Markov Decision Processes with Quantitative Reachability Objectives
Guy Avni, Martin Kure\v{c}ka, Kaushik Mallik, Petr Novotn\'y, Suman, Sadhukhan

TL;DR
This paper introduces a new class of bidding games on Markov decision processes where two players bid for control over actions to influence reachability probabilities, extending traditional bidding game analysis to stochastic environments.
Contribution
It formalizes bidding games on MDPs with quantitative reachability objectives and develops algorithms to approximate thresholds and optimal strategies, including exact solutions for acyclic MDPs.
Findings
Developed value-iteration algorithms for general MDPs
Computed exact solutions for acyclic MDPs
Showed threshold determination is as hard as simple-stochastic games
Abstract
Graph games are fundamental in strategic reasoning of multi-agent systems and their environments. We study a new family of graph games which combine stochastic environmental uncertainties and auction-based interactions among the agents, formalized as bidding games on (finite) Markov decision processes (MDP). Normally, on MDPs, a single decision-maker chooses a sequence of actions, producing a probability distribution over infinite paths. In bidding games on MDPs, two players -- called the reachability and safety players -- bid for the privilege of choosing the next action at each step. The reachability player's goal is to maximize the probability of reaching a target vertex, whereas the safety player's goal is to minimize it. These games generalize traditional bidding games on graphs, and the existing analysis techniques do not extend. For instance, the central property of traditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Search Problems · Data Management and Algorithms
