Interpretable Imitation Learning with Dynamic Causal Relations
Tianxiang Zhao, Wenchao Yu, Suhang Wang, Lu Wang, Xiang Zhang, Yuncong, Chen, Yanchi Liu, Wei Cheng, Haifeng Chen

TL;DR
This paper introduces a self-explainable imitation learning framework that models dynamic causal relations to improve interpretability of neural policies while maintaining high accuracy.
Contribution
It proposes a novel dynamic causal discovery approach based on Granger causality, integrated into an end-to-end imitation learning framework for better interpretability.
Findings
Effective causal graph learning on synthetic and real datasets
Enhanced interpretability of policies without sacrificing accuracy
Model captures time-varying causal relations in decision-making
Abstract
Imitation learning, which learns agent policy by mimicking expert demonstration, has shown promising results in many applications such as medical treatment regimes and self-driving vehicles. However, it remains a difficult task to interpret control policies learned by the agent. Difficulties mainly come from two aspects: 1) agents in imitation learning are usually implemented as deep neural networks, which are black-box models and lack interpretability; 2) the latent causal mechanism behind agents' decisions may vary along the trajectory, rather than staying static throughout time steps. To increase transparency and offer better interpretability of the neural agent, we propose to expose its captured knowledge in the form of a directed acyclic causal graph, with nodes being action and state variables and edges denoting the causal relations behind predictions. Furthermore, we design this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Machine Learning in Healthcare
