TL;DR
This paper introduces a novel evolutionary approach to reinforcement learning that constructs interpretable decision trees by combining evolutionary algorithms with Q-learning, enhancing performance and interpretability.
Contribution
It proposes a two-level optimization scheme for decision trees in reinforcement learning, integrating evolutionary algorithms with Q-learning for better interpretability and performance.
Findings
Competitive results on benchmark tasks
Two-level optimization improves performance
Method enhances interpretability of RL models
Abstract
Reinforcement learning techniques achieved human-level performance in several tasks in the last decade. However, in recent years, the need for interpretability emerged: we want to be able to understand how a system works and the reasons behind its decisions. Not only we need interpretability to assess the safety of the produced systems, we also need it to extract knowledge about unknown problems. While some techniques that optimize decision trees for reinforcement learning do exist, they usually employ greedy algorithms or they do not exploit the rewards given by the environment. This means that these techniques may easily get stuck in local optima. In this work, we propose a novel approach to interpretable reinforcement learning that uses decision trees. We present a two-level optimization scheme that combines the advantages of evolutionary algorithms with the advantages of Q-learning.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGrammatical evolution and Q-learning
