RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean   Metric Space

Jingdi Chen; Hanhan Zhou; Yongsheng Mei; Carlee Joe-Wong; Gina Adam,; Nathaniel D. Bastian; Tian Lan

arXiv:2410.16517·cs.LG·October 23, 2024

RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space

Jingdi Chen, Hanhan Zhou, Yongsheng Mei, Carlee Joe-Wong, Gina Adam,, Nathaniel D. Bastian, Tian Lan

PDF

Open Access 1 Video

TL;DR

This paper introduces RGMDT, a novel decision tree extraction method for multi-agent deep reinforcement learning that minimizes return gap and guarantees near-optimal performance within complexity constraints.

Contribution

It establishes a return gap upper bound, formulates a non-euclidean clustering approach, and develops a simple, effective RGMDT algorithm for multi-agent interpretability.

Findings

01

RGMDT outperforms heuristic baselines on D4RL tasks

02

Achieves near-optimal returns with limited decision tree complexity

03

Provides quantitative guarantees on return gap in multi-agent settings

Abstract

Deep Reinforcement Learning (DRL) algorithms have achieved great success in solving many challenging tasks while their black-box nature hinders interpretability and real-world applicability, making it difficult for human experts to interpret and understand DRL policies. Existing works on interpretable reinforcement learning have shown promise in extracting decision tree (DT) based policies from DRL policies with most focus on the single-agent settings while prior attempts to introduce DT policies in multi-agent scenarios mainly focus on heuristic designs which do not provide any quantitative guarantees on the expected return. In this paper, we establish an upper bound on the return gap between the oracle expert policy and an optimal decision tree policy. This enables us to recast the DT extraction problem into a novel non-euclidean clustering problem over the local observation and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space· slideslive

Taxonomy

TopicsData Mining Algorithms and Applications · Data Management and Algorithms · Rough Sets and Fuzzy Logic

MethodsFocus