Approximating a deep reinforcement learning docking agent using linear   model trees

Vilde B. Gj{\ae}rum; Ella-Lovise H. R{\o}rvik; Anastasios M. Lekkas

arXiv:2203.00369·cs.RO·March 2, 2022

Approximating a deep reinforcement learning docking agent using linear model trees

Vilde B. Gj{\ae}rum, Ella-Lovise H. R{\o}rvik, Anastasios M. Lekkas

PDF

Open Access

TL;DR

This paper demonstrates that linear model trees can effectively approximate deep reinforcement learning policies for autonomous surface vehicle docking, providing transparent and real-time explanations of control decisions.

Contribution

The novel use of linear model trees to approximate DNN policies in autonomous docking, enabling explainability and real-time interpretability.

Findings

01

LMTs accurately approximate DNN control policies.

02

LMTs provide transparent feature attributions.

03

LMTs operate efficiently in real-time scenarios.

Abstract

Deep reinforcement learning has led to numerous notable results in robotics. However, deep neural networks (DNNs) are unintuitive, which makes it difficult to understand their predictions and strongly limits their potential for real-world applications due to economic, safety, and assurance reasons. To remedy this problem, a number of explainable AI methods have been presented, such as SHAP and LIME, but these can be either be too costly to be used in real-time robotic applications or provide only local explanations. In this paper, the main contribution is the use of a linear model tree (LMT) to approximate a DNN policy, originally trained via proximal policy optimization(PPO), for an autonomous surface vehicle with five control inputs performing a docking operation. The two main benefits of the proposed approach are: a) LMTs are transparent which makes it possible to associate directly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Reliability and Analysis Research · Risk and Safety Analysis · Adversarial Robustness in Machine Learning

MethodsLocal Interpretable Model-Agnostic Explanations · Shapley Additive Explanations