EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional Reinforcement Learning
Parvin Malekzadeh, Zissis Poulos, Jacky Chen, Zeyu Wang, Konstantinos, N. Plataniotis

TL;DR
This paper introduces EX-DRL, a novel reinforcement learning approach that improves the estimation of extreme loss quantiles in derivatives hedging by modeling the tail with a Generalized Pareto Distribution, enhancing risk assessment accuracy.
Contribution
The paper proposes a new method combining distributional RL with GPD modeling to better estimate extreme quantiles in loss distributions, addressing limitations of traditional QR-based DRL.
Findings
EX-DRL provides more accurate extreme quantile estimates.
Improved risk metrics for derivatives hedging.
Enhanced reliability in financial risk management.
Abstract
Recent advancements in Distributional Reinforcement Learning (DRL) for modeling loss distributions have shown promise in developing hedging strategies in derivatives markets. A common approach in DRL involves learning the quantiles of loss distributions at specified levels using Quantile Regression (QR). This method is particularly effective in option hedging due to its direct quantile-based risk assessment, such as Value at Risk (VaR) and Conditional Value at Risk (CVaR). However, these risk measures depend on the accurate estimation of extreme quantiles in the loss distribution's tail, which can be imprecise in QR-based DRL due to the rarity and extremity of tail data, as highlighted in the literature. To address this issue, we propose EXtreme DRL (EX-DRL), which enhances extreme quantile prediction by modeling the tail of the loss distribution with a Generalized Pareto Distribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSupply Chain and Inventory Management · Reinforcement Learning in Robotics · Blockchain Technology Applications and Security
