# Exploring Computational User Models for Agent Policy Summarization

**Authors:** Isaac Lage, Daphna Lifschitz, Finale Doshi-Velez, Ofra Amir

arXiv: 1905.13271 · 2019-06-03

## TL;DR

This paper investigates how different user models affect the quality of policy summaries for AI agents, emphasizing the importance of aligning summary extraction models with user reconstruction models to improve interpretability.

## Contribution

It introduces an imitation learning-based approach for policy summarization and demonstrates the impact of model mismatch on reconstruction quality through simulations and human studies.

## Key findings

- Mismatch between extraction and reconstruction models worsens policy understanding.
- Matching summary models to user reconstruction models improves interpretability.
- User context influences the choice of models for policy reconstruction.

## Abstract

AI agents are being developed to support high stakes decision-making processes from driving cars to prescribing drugs, making it increasingly important for human users to understand their behavior. Policy summarization methods aim to convey strengths and weaknesses of such agents by demonstrating their behavior in a subset of informative states. Some policy summarization methods extract a summary that optimizes the ability to reconstruct the agent's policy under the assumption that users will deploy inverse reinforcement learning. In this paper, we explore the use of different models for extracting summaries. We introduce an imitation learning-based approach to policy summarization; we demonstrate through computational simulations that a mismatch between the model used to extract a summary and the model used to reconstruct the policy results in worse reconstruction quality; and we demonstrate through a human-subject study that people use different models to reconstruct policies in different contexts, and that matching the summary extraction model to these can improve performance. Together, our results suggest that it is important to carefully consider user models in policy summarization.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.13271/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1905.13271/full.md

## References

27 references — full list in the complete paper: https://tomesphere.com/paper/1905.13271/full.md

---
Source: https://tomesphere.com/paper/1905.13271