Reinforcement Learning for Jump-Diffusions, with Financial Applications

Xuefeng Gao; Lingfei Li; Xun Yu Zhou

arXiv:2405.16449·cs.LG·August 26, 2025

TL;DR

This paper extends continuous-time reinforcement learning to jump-diffusion processes, demonstrating that existing algorithms can be applied without modification, and explores financial applications like portfolio optimization and option hedging.

Contribution

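It generalizes continuous-time RL algorithms from controlled diffusions to jump-diffusions, showing that the same algorithms apply whether the underlying data come from a pure diffusion or a jump-diffusion, including in financial models with jumps.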

Findings

01. RL algorithms for diffusions apply to jump-diffusions without modification.

02. The presence of jumps affects how actors and critics should be parameterized, but not the algorithms themselves.

03. Applications to financial models confirm this invariance and the effectiveness of the approach.

Abstract

We study continuous-time reinforcement learning (RL) for stochastic control in which system dynamics are governed by jump-diffusion processes. We formulate an entropy-regularized exploratory control problem with stochastic policies to capture the exploration-exploitation balance essential for RL. Unlike the pure diffusion case initially studied by Wang et al. (2020), the derivation of the exploratory dynamics under jump-diffusions calls for a careful formulation of the jump part. Through a theoretical analysis, we find that one can simply use the same policy evaluation and $q$-learning algorithms in Jia and Zhou (2022a, 2023), originally developed for controlled diffusions, without needing to check a priori whether the underlying data come from a pure diffusion or a jump-diffusion. However, we show that the presence of jumps ought to affect parameterizations of actors and critics in…
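To make the distinction between a pure diffusion and a jump-diffusion concrete, here is a minimal sketch that simulates one path of a generic jump-diffusion with an Euler scheme. It is an illustration only and is not taken from the paper: the function name `simulate_jump_diffusion`, the dynamics (proportional drift and volatility plus compound Poisson jumps with normally distributed relative sizes), and all parameter names are assumptions. Setting `jump_rate=0` recovers a pure diffusion, which is the kind of data the abstract says the algorithms do not need to distinguish a priori.

```python
import numpy as np


def simulate_jump_diffusion(x0, mu, sigma, jump_rate, jump_mean, jump_std,
                            horizon, n_steps, seed=0):
    """Euler scheme for dX = X (mu dt + sigma dW + dJ), J compound Poisson.

    Hypothetical illustrative model, not the paper's specification.
    """
    rng = np.random.default_rng(seed)
    dt = horizon / n_steps
    path = np.empty(n_steps + 1)
    path[0] = x0
    for k in range(n_steps):
        # Brownian increment over one time step
        dw = rng.normal(0.0, np.sqrt(dt))
        # Number of jumps in the step, then the sum of their relative sizes
        n_jumps = rng.poisson(jump_rate * dt)
        jump = rng.normal(jump_mean, jump_std, size=n_jumps).sum()
        path[k + 1] = path[k] * (1.0 + mu * dt + sigma * dw + jump)
    return path


# One path with jumps, and one pure-diffusion path (jump_rate = 0)
with_jumps = simulate_jump_diffusion(1.0, 0.05, 0.2, 3.0, -0.02, 0.05, 1.0, 252)
no_jumps = simulate_jump_diffusion(1.0, 0.05, 0.2, 0.0, 0.0, 0.0, 1.0, 252)
```

Because the jump sizes here are normal rather than bounded below, a path can in principle turn negative; a lognormal jump size would avoid that at the cost of a slightly longer example.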

Taxonomy

Topics: Traffic control and management

Methods: Q-Learning · Diffusion