Deep Hedging of Derivatives Using Reinforcement Learning
Jay Cao, Jacky Chen, John Hull, Zissis Poulos

TL;DR
This paper demonstrates how reinforcement learning can optimize derivatives hedging strategies considering transaction costs, outperforming traditional delta hedging in different stochastic asset models.
Contribution
It introduces a novel reinforcement learning framework with dual Q-functions and continuous spaces for optimal hedging, extending prior methods.
Findings
Reinforcement learning effectively minimizes hedging costs with transaction costs.
Hybrid accounting P&L approach with simple valuation models performs well.
The method adapts to both geometric Brownian motion and stochastic volatility models.
Abstract
This paper shows how reinforcement learning can be used to derive optimal hedging strategies for derivatives when there are transaction costs. The paper illustrates the approach by showing the difference between using delta hedging and optimal hedging for a short position in a call option when the objective is to minimize a function equal to the mean hedging cost plus a constant times the standard deviation of the hedging cost. Two situations are considered. In the first, the asset price follows a geometric Brownian motion. In the second, the asset price follows a stochastic volatility process. The paper extends the basic reinforcement learning approach in a number of ways. First, it uses two different Q-functions so that both the expected value of the cost and the expected value of the square of the cost are tracked for different state/action combinations. This approach increases the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCapital Investment and Risk Analysis · Supply Chain and Inventory Management · Innovation Diffusion and Forecasting
