Making Teams and Influencing Agents: Efficiently Coordinating Decision Trees for Interpretable Multi-Agent Reinforcement Learning

Rex Chen; Stephanie Milani; Zhicheng Zhang; Norman Sadeh; Fei Fang

arXiv:2505.19316·cs.MA·August 13, 2025

Making Teams and Influencing Agents: Efficiently Coordinating Decision Trees for Interpretable Multi-Agent Reinforcement Learning

Rex Chen, Stephanie Milani, Zhicheng Zhang, Norman Sadeh, Fei Fang

PDF

TL;DR

This paper introduces HYDRAVIPER, a decision tree-based interpretable MARL algorithm that balances performance and computational efficiency, enabling safe and verifiable multi-agent decision-making in real-world scenarios.

Contribution

HYDRAVIPER is a novel interpretable MARL method that adaptively manages environment interaction budgets to optimize both performance and efficiency.

Findings

01

HYDRAVIPER matches state-of-the-art performance with less runtime.

02

It maintains a Pareto frontier of performance across different interaction budgets.

03

Experiments demonstrate effectiveness in multi-agent coordination and traffic control.

Abstract

Poor interpretability hinders the practical applicability of multi-agent reinforcement learning (MARL) policies. Deploying interpretable surrogates of uninterpretable policies enhances the safety and verifiability of MARL for real-world applications. However, if these surrogates are to interact directly with the environment within human supervisory frameworks, they must be both performant and computationally efficient. Prior work on interpretable MARL has either sacrificed performance for computational efficiency or computational efficiency for performance. To address this issue, we propose HYDRAVIPER, a decision tree-based interpretable MARL algorithm. HYDRAVIPER coordinates training between agents based on expected team performance, and adaptively allocates budgets for environment interaction to improve computational efficiency. Experiments on standard benchmark environments for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.