Exploratory Randomization for Discrete-Time Risk-Sensitive Benchmarked Investment Management with Reinforcement Learning
Sebastien Lleo, Wolfgang Runggaldier

TL;DR
This paper introduces a novel exploration mechanism for risk-sensitive portfolio management using reinforcement learning, combining stochastic control, duality, and fractional Kelly strategies to improve investment policies.
Contribution
It develops a tractable, risk-sensitive portfolio model with endogenous regularization, connecting RL and control theory to guide policy-gradient methods.
Findings
Analytical solution for risk-sensitive control problem
Bounds on exploration based on risk and asset covariance
Interpretation of strategies via fractional Kelly approaches
Abstract
This paper bridges reinforcement learning (RL) and risk-sensitive stochastic control by introducing a tractable exploration mechanism for policy search in risk-sensitive portfolio management, with known and unknown model parameters, that yields an endogenous relative-entropy regularization. We construct a discrete-time risk-sensitive benchmarked investment model. This model combines a factor-based asset universe with periodic portfolio rebalancing. Exploration is incorporated through user-specified Gaussian perturbations to baseline (exploitative) controls. The risk-sensitive stochastic control problem is solved analytically using the Free Energy-Entropy Duality. The Duality recasts the control problem as a linear-quadratic-Gaussian game and introduces a natural penalty for exploration. This approach yields simple sufficiency conditions for optimality. It also induces intuitive bounds…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Risk and Portfolio Optimization · Stochastic processes and financial applications
