Multiple Mean-Payoff Optimization under Local Stability Constraints

David Klaška; Antonín Kučera; Vojtěch Kůr; Vít Musil; Vojtěch Řehák

arXiv:2412.13369 · cs.AI · December 19, 2024

TL;DR

This paper introduces an efficient, scalable method for optimizing multiple mean payoffs in Markov decision processes while ensuring local stability, addressing a computationally hard problem with practical algorithms.

Contribution

It presents the first practical algorithm for simultaneous mean-payoff optimization under local stability constraints in Markov decision processes.

Findings

01. Developed an efficient algorithm for the problem.

02. Demonstrated scalability on complex models.

03. Achieved stable mean payoff optimization.

Abstract

The long-run average payoff per transition (mean payoff) is the main tool for specifying the performance and dependability properties of discrete systems. The problem of constructing a controller (strategy) simultaneously optimizing several mean payoffs has been deeply studied for stochastic and game-theoretic models. One common issue of the constructed controllers is the instability of the mean payoffs, measured by the deviations of the average rewards per transition computed in a finite "window" sliding along a run. Unfortunately, the problem of simultaneously optimizing the mean payoffs under local stability constraints is computationally hard, and the existing works do not provide a practically usable algorithm even for non-stochastic models such as two-player games. In this paper, we design and evaluate the first efficient and scalable solution to this problem applicable to Markov…
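To make the two quantities in the abstract concrete, the sketch below computes the average reward per transition over a finite run prefix and the average reward inside each window sliding along that run. It is an illustration only, not the paper's algorithm: the function names, the example reward sequences, and the choice of maximum absolute deviation from a target as the stability measure are all assumptions, and a finite prefix only approximates the long-run mean payoff.

```python
from typing import List, Sequence


def mean_payoff(rewards: Sequence[float]) -> float:
    """Average reward per transition over a finite run prefix."""
    if not rewards:
        raise ValueError("need at least one reward")
    return sum(rewards) / len(rewards)


def window_means(rewards: Sequence[float], window: int) -> List[float]:
    """Average reward in each length-`window` block sliding along the run."""
    if window <= 0 or window > len(rewards):
        raise ValueError("window must be between 1 and len(rewards)")
    total = sum(rewards[:window])
    means = [total / window]
    for i in range(window, len(rewards)):
        # Slide by one transition: add the new reward, drop the oldest one.
        total += rewards[i] - rewards[i - window]
        means.append(total / window)
    return means


def max_window_deviation(
    rewards: Sequence[float], window: int, target: float
) -> float:
    """Largest gap between a window average and the target value.

    Assumption: this is one simple way to quantify local instability;
    the paper's exact measure is not given in the excerpt above.
    """
    return max(abs(m - target) for m in window_means(rewards, window))


# Two hypothetical runs with the same mean payoff but different stability.
steady = [1, 0] * 4
bursty = [1, 1, 1, 1, 0, 0, 0, 0]

if __name__ == "__main__":
    for name, run in (("steady", steady), ("bursty", bursty)):
        print(name, mean_payoff(run), max_window_deviation(run, 2, 0.5))
```

Both runs have the same overall average, yet the window averages of the second run drift away from it. That gap is the kind of instability the local stability constraints are meant to bound.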

Code & Models

Repositories

https://gitlab.fi.muni.cz/formela/2025-aaai-mmp (official)

Videos

Multiple Mean-Payoff Optimization Under Local Stability Constraints

Taxonomy

Topics: Advanced Manufacturing and Logistics Optimization