Scalable Bi-causal Optimal Transport via KL Relaxation and Policy Gradients
Haoyang Cao, Jesse Hoekstra, Renyuan Xu, Yumin Xu, Ruixun Zhang

TL;DR
This paper introduces a scalable policy-gradient method for bi-causal optimal transport, enabling efficient coupling of stochastic processes under information constraints with applications in finance and time series analysis.
Contribution
It develops a KL-relaxation framework with policy gradients, providing convergence guarantees and practical algorithms for bi-causal OT with continuous distributions.
Findings
Accurately captures marginal laws and temporal dependence.
Performs well in robust subhedging and time series downscaling.
Provides a scalable approach for bi-causal OT in complex settings.
Abstract
Bi-causal optimal transport (OT) is a natural framework for comparing and coupling stochastic processes under nonanticipative information constraints, with important applications in robust finance, sequential uncertainty quantification, and multistage stochastic optimization. In particular, a learned bi-causal coupling naturally serves as a simulator for generating joint sample paths that respect both prescribed marginal laws and the underlying information flow. Its practical use, however, is limited by the computational difficulty of enforcing bi-causal coupling constraints over path space, especially for continuous distributions and long horizons. We develop a scalable stochastic-optimization framework for computing bi-causal OT couplings under general marginals. Our approach introduces a Kullback--Leibler (KL)-penalized relaxation that replaces hard marginal constraints with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
