Reinforcement Learning for optimal dividend problem under diffusion model
Lihua Bai, Thejani Gamage, Jin Ma, Gaozhan Wang

TL;DR
This paper develops a novel reinforcement learning algorithm using neural networks to solve the optimal dividend problem under a diffusion model, especially when system parameters are unknown or state-dependent.
Contribution
It introduces a new RL-based algorithm with neural networks for optimal dividend control, handling unknown parameters and state dependencies.
Findings
The RL algorithm effectively approximates optimal strategies in complex dividend problems.
Numerical experiments demonstrate the algorithm's robustness and applicability beyond standard models.
Abstract
In this paper, we study the optimal dividend problem under the continuous time diffusion model with the bounded dividend rate from the Reinforcement Learning (RL) perspective. Unlike the standard literature, our main focus will be on numerical algorithms that allow part or all of the system parameters to be unspecified so that the optimal control cannot be explicitly determined. Following the RL literature we introduce the entropy-regularized exploratory control problem, which randomizes the control actions and balances the levels of exploitation and exploration, and carry out a theoretical analysis of the associated Policy Improvement (PI) and Policy Evaluation (PE) devices and the corresponding sequence of the approximating optimal strategies. Specifically, our algorithm will be based on two independent neural networks that approximate the value function and its derivative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
