Approximating a deep reinforcement learning docking agent using linear model trees
Vilde B. Gj{\ae}rum, Ella-Lovise H. R{\o}rvik, Anastasios M. Lekkas

TL;DR
This paper demonstrates that linear model trees can effectively approximate deep reinforcement learning policies for autonomous surface vehicle docking, providing transparent and real-time explanations of control decisions.
Contribution
The novel use of linear model trees to approximate DNN policies in autonomous docking, enabling explainability and real-time interpretability.
Findings
LMTs accurately approximate DNN control policies.
LMTs provide transparent feature attributions.
LMTs operate efficiently in real-time scenarios.
Abstract
Deep reinforcement learning has led to numerous notable results in robotics. However, deep neural networks (DNNs) are unintuitive, which makes it difficult to understand their predictions and strongly limits their potential for real-world applications due to economic, safety, and assurance reasons. To remedy this problem, a number of explainable AI methods have been presented, such as SHAP and LIME, but these can be either be too costly to be used in real-time robotic applications or provide only local explanations. In this paper, the main contribution is the use of a linear model tree (LMT) to approximate a DNN policy, originally trained via proximal policy optimization(PPO), for an autonomous surface vehicle with five control inputs performing a docking operation. The two main benefits of the proposed approach are: a) LMTs are transparent which makes it possible to associate directly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Reliability and Analysis Research · Risk and Safety Analysis · Adversarial Robustness in Machine Learning
MethodsLocal Interpretable Model-Agnostic Explanations · Shapley Additive Explanations
