MAVIPER: Learning Decision Tree Policies for Interpretable Multi-Agent   Reinforcement Learning

Stephanie Milani; Zhicheng Zhang; Nicholay Topin; Zheyuan; Ryan Shi; Charles Kamhoua; Evangelos E. Papalexakis; Fei Fang

arXiv:2205.12449·cs.LG·July 13, 2022·1 cites

MAVIPER: Learning Decision Tree Policies for Interpretable Multi-Agent Reinforcement Learning

Stephanie Milani, Zhicheng Zhang, Nicholay Topin, Zheyuan, Ryan Shi, Charles Kamhoua, Evangelos E. Papalexakis, Fei Fang

PDF

Open Access

TL;DR

This paper introduces MAVIPER, a novel method for extracting interpretable decision tree policies from neural networks trained with multi-agent reinforcement learning, improving interpretability and coordination.

Contribution

It extends single-agent interpretable RL methods to multi-agent settings and proposes a centralized training algorithm for better coordination.

Findings

01

MAVIPER outperforms baselines in multi-agent environments.

02

IVIPER learns high-quality decision-tree policies for individual agents.

03

MAVIPER achieves better coordination among agents.

Abstract

Many recent breakthroughs in multi-agent reinforcement learning (MARL) require the use of deep neural networks, which are challenging for human experts to interpret and understand. On the other hand, existing work on interpretable reinforcement learning (RL) has shown promise in extracting more interpretable decision tree-based policies from neural networks, but only in the single-agent setting. To fill this gap, we propose the first set of algorithms that extract interpretable decision-tree policies from neural networks trained with MARL. The first algorithm, IVIPER, extends VIPER, a recent method for single-agent interpretable RL, to the multi-agent setting. We demonstrate that IVIPER learns high-quality decision-tree policies for each agent. To better capture coordination between agents, we propose a novel centralized decision-tree training algorithm, MAVIPER. MAVIPER jointly grows…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning