Explainable Post hoc Portfolio Management Financial Policy of a Deep Reinforcement Learning agent
Alejandra de la Rica Escudero, Eduardo C. Garrido-Merchan, Maria, Coronado-Vaca

TL;DR
This paper introduces an explainable deep reinforcement learning approach for portfolio management that enhances transparency by interpreting agent actions in real-time using SHAP and LIME, addressing the need for interpretability in financial policies.
Contribution
The work presents the first explainable post hoc portfolio management policy for a DRL agent, integrating PPO with feature importance techniques for real-time interpretability.
Findings
Successfully identified key features influencing decisions
Demonstrated ability to explain agent actions in prediction time
Enhanced transparency in DRL-based portfolio management
Abstract
Financial portfolio management investment policies computed quantitatively by modern portfolio theory techniques like the Markowitz model rely on a set on assumptions that are not supported by data in high volatility markets. Hence, quantitative researchers are looking for alternative models to tackle this problem. Concretely, portfolio management is a problem that has been successfully addressed recently by Deep Reinforcement Learning (DRL) approaches. In particular, DRL algorithms train an agent by estimating the distribution of the expected reward of every action performed by an agent given any financial state in a simulator. However, these methods rely on Deep Neural Networks model to represent such a distribution, that although they are universal approximator models, they cannot explain its behaviour, given by a set of parameters that are not interpretable. Critically, financial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Systems and Time Series Analysis
MethodsSparse Evolutionary Training · High-Order Consensuses · Shapley Additive Explanations · Local Interpretable Model-Agnostic Explanations
