Quantile Geometry Regularization for Distributional Reinforcement Learning

Zhaofan Zhang; Minghao Yang; Rufeng Chen; Sihong Xie; Hui Xiong

arXiv:2605.08182·cs.LG·May 12, 2026

Quantile Geometry Regularization for Distributional Reinforcement Learning

Zhaofan Zhang, Minghao Yang, Rufeng Chen, Sihong Xie, Hui Xiong

PDF

TL;DR

This paper introduces RQIQN, a novel distributional RL method that enhances quantile estimation robustness using Wasserstein regularization, improving performance in risk-sensitive tasks.

Contribution

The paper proposes RQIQN, a Wasserstein distributionally robust enhancement for quantile regression in distributional RL, addressing distributional degeneration without altering the core value objective.

Findings

01

RQIQN outperforms existing quantile-based methods in risk-sensitive navigation.

02

RQIQN achieves superior results in Atari game benchmarks.

03

The method effectively regularizes quantile geometry, preventing distribution collapse.

Abstract

Quantile-based distributional reinforcement learning methods learn return distributions through sampled quantile regression, but their bootstrapped target quantiles may induce distorted or degenerate distribution estimates. We propose Robust Quantile-based Implicit Quantile Networks (RQIQN), a lightweight Wasserstein distributionally robust enhancement boosted from a quantile estimation perspective. We first reinterpret a snapshot of IQN loss as a collection of local empirical quantile estimation problems over sampled current fractions. We then robustify each local slot with a Wasserstein distributionally robust quantile estimation formulation, yielding a closed-form, fraction-dependent correction to the Bellman target. This correction directly addresses distributional degeneration: its median antisymmetry preserves the risk-neutral quantile average, while its monotonicity enlarges…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.