Deep Reinforcement Learning for Equal Risk Pricing and Hedging under Dynamic Expectile Risk Measures
Saeed Marzban, Erick Delage, Jonathan Yumeng Li

TL;DR
This paper introduces a novel deep reinforcement learning approach to compute fair, time-consistent risk-averse hedging strategies and prices for complex financial derivatives, overcoming limitations of traditional methods.
Contribution
It extends a deterministic actor-critic deep RL algorithm to risk-averse, time-consistent pricing, enabling solutions for high-dimensional and incomplete market models.
Findings
Achieves nearly optimal hedging and accurate pricing in simple environments.
Provides good quality policies and prices in complex scenarios with limited resources.
Outperforms static risk measure strategies in dynamic risk evaluation.
Abstract
Recently equal risk pricing, a framework for fair derivative pricing, was extended to consider dynamic risk measures. However, all current implementations either employ a static risk measure that violates time consistency, or are based on traditional dynamic programming solution schemes that are impracticable in problems with a large number of underlying assets (due to the curse of dimensionality) or with incomplete asset dynamics information. In this paper, we extend for the first time a famous off-policy deterministic actor-critic deep reinforcement learning (ACRL) algorithm to the problem of solving a risk averse Markov decision process that models risk using a time consistent recursive expectile risk measure. This new ACRL algorithm allows us to identify high quality time consistent hedging policies (and equal risk prices) for options, such as basket options, that cannot be handled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Stochastic processes and financial applications · Market Dynamics and Volatility
