Distributional Reinforcement Learning on Path-dependent Options

Ahmet Umur Özsoy

arXiv:2507.12657 · q-fin.MF · July 18, 2025

TL;DR

This paper introduces a distributional reinforcement learning framework for pricing path-dependent financial derivatives, enabling risk-aware valuation and tail-risk estimation by modeling the full payoff distribution rather than just expected values.

Contribution

It presents a novel approach that applies distributional reinforcement learning to derivative pricing, capturing the entire payoff distribution for improved risk management.

Findings

01. Effective modeling of payoff distributions for Asian options.

02. Enhanced risk and tail-risk estimation capabilities.

03. Demonstrated superiority over traditional expected-value methods.

Abstract

We reinterpret and propose a framework for pricing path-dependent financial derivatives by estimating the full distribution of payoffs using Distributional Reinforcement Learning (DistRL). Unlike traditional methods that focus on expected option value, our approach models the entire conditional distribution of payoffs, allowing for risk-aware pricing, tail-risk estimation, and enhanced uncertainty quantification. We demonstrate the efficacy of this method on Asian options, using quantile-based value function approximators.
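
To make the idea of a quantile-based view of the payoff concrete, here is a minimal sketch in Python with NumPy. It is not the paper's implementation: it assumes geometric Brownian motion for the underlying, an arithmetic-average Asian call, and hypothetical parameter values (`s0`, `strike`, `r`, `sigma`, `maturity`, `n_steps`). Instead of a learned value function it fits a fixed set of quantiles of the discounted payoff by full-batch subgradient descent on the pinball (quantile regression) loss, the same loss that quantile-based distributional RL methods minimize. The function names and the step-size schedule are illustrative choices.

```python
import numpy as np


def simulate_asian_payoffs(n_paths, s0=100.0, strike=100.0, r=0.05,
                           sigma=0.2, maturity=1.0, n_steps=50, seed=0):
    """Discounted payoffs of an arithmetic-average Asian call under GBM.

    All parameter defaults are hypothetical and chosen for illustration only.
    """
    rng = np.random.default_rng(seed)
    dt = maturity / n_steps
    z = rng.standard_normal((n_paths, n_steps))
    log_increments = (r - 0.5 * sigma**2) * dt + sigma * np.sqrt(dt) * z
    paths = s0 * np.exp(np.cumsum(log_increments, axis=1))
    average_price = paths.mean(axis=1)  # path-dependent quantity
    return np.exp(-r * maturity) * np.maximum(average_price - strike, 0.0)


def pinball_loss(theta, samples, tau):
    """Mean pinball loss of the candidate quantile `theta` at level `tau`."""
    u = samples - theta
    return np.mean(np.maximum(tau * u, (tau - 1.0) * u))


def fit_quantiles(samples, taus, lr0=5.0, n_iters=3000):
    """Fit one scalar per quantile level by subgradient descent.

    The subgradient of the expected pinball loss with respect to theta is
    F(theta) - tau, where F is the CDF of the samples, so each step moves
    theta by lr * (tau - F(theta)). `lr0` is tuned to the payoff scale of
    this example and is an assumption, not a general recommendation.
    """
    taus = np.asarray(taus, dtype=float)
    sorted_samples = np.sort(samples)
    n = sorted_samples.size
    theta = np.full(taus.shape, sorted_samples.mean())  # start at the mean
    for t in range(n_iters):
        # Fraction of samples strictly below each theta (empirical CDF).
        cdf_at_theta = np.searchsorted(sorted_samples, theta, side="left") / n
        lr = lr0 / (1.0 + 0.01 * t)  # decaying step size
        theta = theta + lr * (taus - cdf_at_theta)
    return theta


def tail_expectation(samples, alpha):
    """Average payoff at or above the alpha-quantile (a CVaR-style measure)."""
    threshold = np.quantile(samples, alpha)
    return samples[samples >= threshold].mean()


payoffs = simulate_asian_payoffs(n_paths=20000)
taus = (2 * np.arange(10) + 1) / 20.0  # midpoints 0.05, 0.15, ..., 0.95
quantiles = fit_quantiles(payoffs, taus)

print("expected-value price:", payoffs.mean())
print("fitted quantiles:", np.round(quantiles, 2))
print("tail expectation at 95%:", tail_expectation(payoffs, 0.95))
```

The mean of `payoffs` is the usual Monte Carlo price, while `quantiles` summarizes the whole payoff distribution, including the large probability mass at zero and the right tail that matters to an option seller. In a distributional RL setting the scalars in `theta` would be replaced by the outputs of a function approximator conditioned on the state; that step is omitted here.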

Taxonomy

Topics: Smart Grid Energy Management · Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics