# SOS: Safe, Optimal and Small Strategies for Hybrid Markov Decision   Processes

**Authors:** Pranav Ashok, Jan K\v{r}et\'insk\'y, Kim Guldstrand Larsen, Adrien Le, Co\"ent, Jakob Haahr Taankvist, Maximilian Weininger

arXiv: 1906.10640 · 2021-02-02

## TL;DR

This paper introduces methods to convert strategies for hybrid Markov decision processes into compact, understandable decision trees that preserve safety and near-optimality, facilitating deployment in embedded systems.

## Contribution

It presents a novel approach to learn small, safe, and near-optimal decision-tree representations of strategies from UPPAAL Stratego, enabling practical deployment.

## Key findings

- Decision trees are significantly smaller and more understandable.
- Guarantees on safety and near-optimality are maintained.
- Trade-offs between size and optimality can be controlled.

## Abstract

For hybrid Markov decision processes, UPPAAL Stratego can compute strategies that are safe for a given safety property and (in the limit) optimal for a given cost function. Unfortunately, these strategies cannot be exported easily since they are computed as a very long list. In this paper, we demonstrate methods to learn compact representations of the strategies in the form of decision trees. These decision trees are much smaller, more understandable, and can easily be exported as code that can be loaded into embedded systems. Despite the size compression and actual differences to the original strategy, we provide guarantees on both safety and optimality of the decision-tree strategy. On the top, we show how to obtain yet smaller representations, which are still guaranteed safe, but achieve a desired trade-off between size and optimality.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.10640/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/1906.10640/full.md

## References

57 references — full list in the complete paper: https://tomesphere.com/paper/1906.10640/full.md

---
Source: https://tomesphere.com/paper/1906.10640