Continuous-time optimal investment with portfolio constraints: a reinforcement learning approach
Huy Chau, Duy Nguyen, Thai Nguyen

TL;DR
This paper develops a reinforcement learning framework for continuous-time portfolio optimization with constraints, deriving explicit policies and demonstrating how exploration influences wealth distribution and investment opportunities.
Contribution
It introduces a novel RL approach for constrained continuous-time investment, providing explicit Gaussian and truncated Gaussian policies, and establishes a policy improvement theorem.
Findings
Optimal policies are Gaussian or truncated Gaussian.
Exploration increases wealth dispersion and tail heaviness.
Constraints reduce exploration costs significantly.
Abstract
In a reinforcement learning (RL) framework, we study the exploratory version of the continuous time expected utility (EU) maximization problem with a portfolio constraint that includes widely-used financial regulations such as short-selling constraints and borrowing prohibition. The optimal feedback policy of the exploratory unconstrained classical EU problem is shown to be Gaussian. In the case where the portfolio weight is constrained to a given interval, the corresponding constrained optimal exploratory policy follows a truncated Gaussian distribution. We verify that the closed form optimal solution obtained for logarithmic utility and quadratic utility for both unconstrained and constrained situations converge to the non-exploratory expected utility counterpart when the exploration weight goes to zero. Finally, we establish a policy improvement theorem and devise an implementable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCapital Investment and Risk Analysis
