Autonomous AI Agents for Option Hedging: Enhancing Financial Stability through Shortfall Aware Reinforcement Learning
Minxuan Hu, Ziheng Chen, Jiayu Yi, Wenxi Sun

TL;DR
This paper introduces reinforcement learning frameworks for autonomous AI agents in derivatives markets that focus on minimizing shortfall risk and improving tail risk management, aiming to enhance financial stability.
Contribution
It presents two novel RL approaches, RLOP and QLBS, that prioritize downside risk and align learning with practical hedging objectives, addressing a gap in existing models.
Findings
RLOP reduces shortfall frequency across most scenarios.
RLOP improves tail risk measures in stress testing.
Implied volatility models fit well but perform poorly in after-cost hedging.
Abstract
The deployment of autonomous AI agents in derivatives markets has widened a practical gap between static model calibration and realized hedging outcomes. We introduce two reinforcement learning frameworks, a novel Replication Learning of Option Pricing (RLOP) approach and an adaptive extension of Q-learner in Black-Scholes (QLBS), that prioritize shortfall probability and align learning objectives with downside sensitive hedging. Using listed SPY and XOP options, we evaluate models using realized path delta hedging outcome distributions, shortfall probability, and tail risk measures such as Expected Shortfall. Empirically, RLOP reduces shortfall frequency in most slices and shows the clearest tail-risk improvements in stress, while implied volatility fit often favors parametric models yet poorly predicts after-cost hedging performance. This friction-aware RL framework supports a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Stock Market Forecasting Methods · Stochastic processes and financial applications
