# Minimizing Maximum Regret in Commitment Constrained Sequential Decision   Making

**Authors:** Qi Zhang, Satinder Singh, Edmund Durfee

arXiv: 1703.04587 · 2017-03-16

## TL;DR

This paper introduces a method for multiagent planning where an agent commits to behaviors with probabilistic guarantees across multiple possible environments, adapting policies based on observations to minimize worst-case regret.

## Contribution

It extends commitment-based planning to a worst-case setting with multiple environments, providing algorithms that adapt policies to observations to meet commitments and minimize maximum regret.

## Key findings

- Algorithms ensure commitment satisfaction with history-dependent policies.
- The approach minimizes maximum regret across possible environments.
- Preliminary empirical results demonstrate effectiveness.

## Abstract

In cooperative multiagent planning, it can often be beneficial for an agent to make commitments about aspects of its behavior to others, allowing them in turn to plan their own behaviors without taking the agent's detailed behavior into account. Extending previous work in the Bayesian setting, we consider instead a worst-case setting in which the agent has a set of possible environments (MDPs) it could be in, and develop a commitment semantics that allows for probabilistic guarantees on the agent's behavior in any of the environments it could end up facing. Crucially, an agent receives observations (of reward and state transitions) that allow it to potentially eliminate possible environments and thus obtain higher utility by adapting its policy to the history of observations. We develop algorithms and provide theory and some preliminary empirical results showing that they ensure an agent meets its commitments with history-dependent policies while minimizing maximum regret over the possible environments.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1703.04587/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1703.04587/full.md

## References

14 references — full list in the complete paper: https://tomesphere.com/paper/1703.04587/full.md

---
Source: https://tomesphere.com/paper/1703.04587