Symbolic Explanation of Affinity-Based Reinforcement Learning Agents with Markov Models
Charl Maree, Christian W. Omlin

TL;DR
This paper introduces a method to interpret reinforcement learning policies by using symbolic Markov models that capture the global affinities of learned strategies, demonstrated in personalized financial decision-making.
Contribution
It presents a novel policy regularization technique that makes reinforcement learning policies inherently interpretable through symbolic Markov models.
Findings
Successfully explained policies in personalized prosperity management
Demonstrated the interpretability of strategies via discretized Markov models
Enhanced understanding of reinforcement learning behaviors
Abstract
The proliferation of artificial intelligence is increasingly dependent on model understanding. Understanding demands both an interpretation - a human reasoning about a model's behavior - and an explanation - a symbolic representation of the functioning of the model. Notwithstanding the imperative of transparency for safety, trust, and acceptance, the opacity of state-of-the-art reinforcement learning algorithms conceals the rudiments of their learned strategies. We have developed a policy regularization method that asserts the global intrinsic affinities of learned strategies. These affinities provide a means of reasoning about a policy's behavior, thus making it inherently interpretable. We have demonstrated our method in personalized prosperity management where individuals' spending behavior in time dictate their investment strategies, i.e. distinct spending personalities may have…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Time Series Analysis · Explainable Artificial Intelligence (XAI) · Stock Market Forecasting Methods
