Reinforcement Learning for Jump-Diffusions, with Financial Applications
Xuefeng Gao, Lingfei Li, Xun Yu Zhou

TL;DR
This paper extends continuous-time reinforcement learning to jump-diffusion processes, demonstrating that existing algorithms can be applied without modification, and explores financial applications like portfolio optimization and option hedging.
Contribution
The paper generalizes continuous-time RL algorithms from diffusions to jump-diffusions, showing that the algorithms themselves are unchanged by the presence of jumps and remain applicable to financial models with jumps.
Findings
RL algorithms for diffusions apply to jump-diffusions without modification.
The presence of jumps affects how actors and critics should be parameterized, but not the algorithms themselves.
Application to financial models confirms the invariance and effectiveness of the approach.
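To make the "same algorithm, different data" point concrete, here is a toy sketch, not the paper's algorithm. It simulates an uncontrolled one-dimensional process with an Euler scheme and feeds the sampled paths to a plain Monte Carlo evaluator that never inspects whether jumps are present. The function names, the Gaussian (Merton-style) jump sizes, and all parameter values are illustrative assumptions, not taken from the paper.

```python
import numpy as np


def simulate_jump_diffusion(x0, mu, sigma, lam, jump_mean, jump_std,
                            horizon, n_steps, n_paths, seed=0):
    """Euler scheme for dX = mu dt + sigma dW + J dN (hypothetical toy model).

    N is a Poisson process with intensity lam and J ~ Normal(jump_mean, jump_std^2).
    Setting lam = 0 recovers a pure diffusion.
    """
    rng = np.random.default_rng(seed)
    dt = horizon / n_steps
    paths = np.empty((n_paths, n_steps + 1))
    paths[:, 0] = x0
    for k in range(n_steps):
        dw = rng.normal(0.0, np.sqrt(dt), size=n_paths)
        n_jumps = rng.poisson(lam * dt, size=n_paths)
        # The sum of n i.i.d. Normal(m, s^2) jumps is Normal(n*m, n*s^2).
        jumps = (n_jumps * jump_mean
                 + np.sqrt(n_jumps) * jump_std * rng.normal(size=n_paths))
        paths[:, k + 1] = paths[:, k] + mu * dt + sigma * dw + jumps
    return paths


def monte_carlo_value(paths, terminal_reward):
    """Data-driven evaluation: only sampled trajectories are used, so the
    routine is identical for diffusion and jump-diffusion data."""
    return float(np.mean(terminal_reward(paths[:, -1])))


if __name__ == "__main__":
    common = dict(x0=0.0, mu=0.05, sigma=0.2, horizon=1.0,
                  n_steps=100, n_paths=20_000)
    diffusion = simulate_jump_diffusion(lam=0.0, jump_mean=0.0, jump_std=0.0, **common)
    jumpy = simulate_jump_diffusion(lam=2.0, jump_mean=-0.1, jump_std=0.05, **common)
    for name, data in [("diffusion", diffusion), ("jump-diffusion", jumpy)]:
        print(name, monte_carlo_value(data, lambda x: x))
```

The evaluator takes only observed trajectories as input, which is the sense in which a data-driven procedure need not know a priori which kind of process generated the data. Any function approximator used in place of the sample mean would still have to be chosen with the jump structure in mind.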
Abstract
We study continuous-time reinforcement learning (RL) for stochastic control in which system dynamics are governed by jump-diffusion processes. We formulate an entropy-regularized exploratory control problem with stochastic policies to capture the exploration-exploitation balance essential for RL. Unlike the pure diffusion case initially studied by Wang et al. (2020), the derivation of the exploratory dynamics under jump-diffusions calls for a careful formulation of the jump part. Through a theoretical analysis, we find that one can simply use the same policy evaluation and q-learning algorithms in Jia and Zhou (2022a, 2023), originally developed for controlled diffusions, without needing to check a priori whether the underlying data come from a pure diffusion or a jump-diffusion. However, we show that the presence of jumps ought to affect parameterizations of actors and critics in…
Taxonomy
Topics: Traffic control and management
Methods: Q-Learning · Diffusion
