Explaining Online Reinforcement Learning Decisions of Self-Adaptive Systems
Felix Feit, Andreas Metzger, Klaus Pohl

TL;DR
This paper introduces XRL-DINE, a combined explainable reinforcement learning technique for self-adaptive systems, improving understanding of Deep RL decisions to enhance trust and debugging at runtime.
Contribution
It proposes a novel combination of explainable RL techniques specifically tailored for Deep RL in self-adaptive systems, addressing interpretability challenges.
Findings
XRL-DINE effectively explains Deep RL decisions in self-adaptive systems.
Qualitative and quantitative results demonstrate improved interpretability.
The approach enhances trust and debugging capabilities in runtime adaptation.
Abstract
Design time uncertainty poses an important challenge when developing a self-adaptive system. As an example, defining how the system should adapt when facing a new environment state, requires understanding the precise effect of an adaptation, which may not be known at design time. Online reinforcement learning, i.e., employing reinforcement learning (RL) at runtime, is an emerging approach to realizing self-adaptive systems in the presence of design time uncertainty. By using Online RL, the self-adaptive system can learn from actual operational data and leverage feedback only available at runtime. Recently, Deep RL is gaining interest. Deep RL represents learned knowledge as a neural network whereby it can generalize over unseen inputs, as well as handle continuous environment states and adaptation actions. A fundamental problem of Deep RL is that learned knowledge is not explicitly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Advanced Software Engineering Methodologies
