Multi-marginal Schrodinger bridges
Yongxin Chen, Giovanni Conforti, Tryphon T. Georgiou, Luigia Ripani

TL;DR
This paper extends the classical Schrödinger Bridge Problem to second-order particle dynamics with partial temporal data, formulating it as an optimal control problem with a Fisher information term, and connecting it to measure-valued splines in Wasserstein space.
Contribution
It introduces a novel second-order Schrödinger bridge formulation with partial observations, incorporating Fisher information and linking to measure-valued splines in Wasserstein space.
Findings
Derived a time-symmetric optimal control formulation with Fisher information.
Established a connection to measure-valued splines and Sinkhorn-like algorithms.
Demonstrated potential applications in signal processing and data science.
Abstract
We consider the problem to identify the most likely flow in phase space, of (inertial) particles under stochastic forcing, that is in agreement with spatial (marginal) distributions that are specified at a set of points in time. The question raised generalizes the classical Schrodinger Bridge Problem (SBP) which seeks to interpolate two specified end-point marginal distributions of overdamped particles driven by stochastic excitation. While we restrict our analysis to second-order dynamics for the particles, the data represents partial (i.e., only positional) information on the flow at {\em multiple} time-points. The solution sought, as in SBP, represents a probability law on the space of paths this closest to a uniform prior while consistent with the given marginals. We approach this problem as an optimal control problem to minimize an action integral a la Benamou-Brenier, and derive a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Fluid Dynamics and Turbulent Flows · Statistical Methods and Inference
11institutetext: Georgia Institute of Technology, Atlanta, GA 30332, USA 22institutetext: École Polytechnique, Route de Saclay, 91128, Palaiseau Cedex, France 33institutetext: University of California, Irvine, CA 92697, USA
Multi-marginal Schrödinger bridges††thanks: Partial support was provided by NSF under grants 1665031, 1807664, 1839441 and 1901599, and by AFOSR under grant FA9550-17-1-0435.
Yongxin Chen 11
Giovanni Conforti 22
Tryphon T. Georgiou 33
Luigia Ripani 33
Abstract
We consider the problem to identify the most likely flow in phase space, of (inertial) particles under stochastic forcing, that is in agreement with spatial (marginal) distributions that are specified at a set of points in time. The question raised generalizes the classical Schrödinger Bridge Problem (SBP) which seeks to interpolate two specified end-point marginal distributions of overdamped particles driven by stochastic excitation. While we restrict our analysis to second-order dynamics for the particles, the data represents partial (i.e., only positional) information on the flow at multiple time-points. The solution sought, as in SBP, represents a probability law on the space of paths this closest to a uniform prior while consistent with the given marginals. We approach this problem as an optimal control problem to minimize an action integral a la Benamou-Brenier, and derive a time-symmetric formulation that includes a Fisher information term on the velocity field. We underscore the relation of our problem to recent measure-valued splines in Wasserstein space, which is akin to that between SBP and Optimal Mass Transport (OMT). The connection between the two provides a Sinkhorn-like approach to computing measure-valued splines. We envision that interpolation between measures as sought herein will have a wide range of applications in signal/images processing as well as in data science in cases where data have a temporal dimension.
Keywords:
Schrödinger bridge Optimal mass transport Optimal control Multi-marginal.
1 Introduction
In 1931/32, in an attempt to gain insights into the stochastic nature of quantum mechanics, Schrödinger [22, 23] raised the following question regarding a system of a large number of classical independent identically distributed (i.i.d.) Brownian particles. He hypothesized that this “cloud” of particles is observed to have (empirical) distributions and at two points in time and , respectively, and further that differs from what is dictated by the law of large numbers, i.e., that
[TABLE]
where
[TABLE]
denotes the Brownian transition probability kernel. Schrödinger then sought to find the “most likely” evolution for the cloud of particles to have transitioned from to . In the language or large deviation theory (which was not in place at the time), Schrödinger’s question amounts to seeking a probability law on the path space that is in agreement with the two marginals while being the closest to the Brownian prior in the sense of relative entropy [12]. The solution is known as the Schrödinger bridge since the law “bridges” the two given end-point marginal distributions.
Renewed interest in the Schrödinger Bridge Problem (SBP) has been fueled by its connections to the Monge-Kantorovic Optimal Mass Transport (OMT) and a wide range aplications in image analysis, stochastic control, and physics [18, 19, 6, 13, 4, 9, 21]. More specifically, SBP, seen as a suitable regularization of OMT, provides a natural model for uncertainty in the transport of distributions as well as a valuable computational tool for interpolating distributional data.
In this work we consider a natural generalization of the Schrödinger bridge theory to address the situation where the data consist of possible partial marginal distributions at various points in time. Thus, we postulate a similar experiment with stochastic particles. However, in contrast to the standard SB theory, we concieve these particles to obey second order stochastic differential equations with Brownian stochastic forcing that accounts for random acceleration along trajectories in phase space (see Section 2). This new setting connects with recent results on measure-valued splines [3, 1, 8], which are general notions of splines on the Wasserstein space of measures. In this short paper, we provide a summary of the theory. A more detailed account will appear in a forthcoming publication (in preparation).
2 Multi-marginal Schrödinger bridges for inertial particles
Suppose we are given a large number of independent inertial particles driven by white noise, that is, they follow the dynamics
[TABLE]
where denotes standard Brownian motion and is the phase space ( denoting position and velocity). The flow of probability densities in phase space obeys the Fokker-Planck equation
[TABLE]
with initial condition at time . The law of large numbers dictates that, when the number of particles is large enough, the distributions of the particles will be closed to . The data of the problem we consider will, as in the standard SB problem, be inconsistent with the Fokker-Planck equation, and viewed as a “atypical/rare” event. In standard SB setting, where only two end-point marginals are specified, the “most likely” evolution amounts to an adjustment of the Fokker-Planck equation by adding a suitable drift term to match the two marginals. For inertial particles driven by white noise, the generator is hypoelliptic [16] and the SB theory carries over to matching marginals in phase space.
Throughout, we suppose that we have only access to position and that empirical marginals represent projections, accordingly. The extra degree of freedom, since ’s are partially specified, make the corresponing multi-marginal problem nontrivial. Specifically, we seek the most like paths the particles have taken that match positional distributions at times . To this end, we let be the law of (1) (on path space) and we let be the any other law. We seek the minimizer of
[TABLE]
over all laws that are consistent with the marginals .
To guarantee the boundedness of the relative entropy between and the prior process , has to be of the form
[TABLE]
where is a suitable drift that may depend on the current and past values of the process state. Invoking Girsanov’s theorem [14, 17, 11, 7], we obtain that
[TABLE]
Thus, we arrive at the optimal control formulation
[TABLE]
The difference to the standard Schrödinger bridge problem lies in the constraint (5d); multiple marginals are partially specified. The existence and uniqueness of the solution follow from the fact that it is a strongly convex optimization problem on path space measures. The argument is similar to the standard argument in SB theory [12, 18, 19, 5] and will be presented in an extended version of the paper. By utilizing the Fokker-Planck equation, we can rewrite the above as
[TABLE]
Here since .
Now let , then the diffusion term in (6c) can be absorbed into the convection terms, and then the cost becomes
[TABLE]
Direct calculation yields
[TABLE]
which only depends on the two end distributions. Thus, we need to consider
[TABLE]
A similar formulation with a Fisher information term has been studied in [25, 6, 13, 20] for standard Schrödinger bridge problems. However, in the standard setting, the term can be dropped since the full-state marginal distributions are already specified. It important to note that, compared to (6), (8) is time symmetric.
3 Connections to measure-valued splines
A variational formulation [15] of splines going through in Euclidean space is given by
[TABLE]
where the minimization is taken over all twice-differentiable trajectories that satisfy the constraints. This formation has been generalized to the Wasserstein space of measures [3, 1]. In particular, the fluid dynamic formulation for measure-valued splines in [3] with marginals at reads
[TABLE]
We note that the above formulation is almost the same as (6) except for a missing diffusion term in the constraint. This resembles the relation between standard Schrödinger bridges and optimal mass transport [18, 19, 5].
Indeed, a zero-noise limit argument follows. If we replace the dynamics (1) of the inertial particles by
[TABLE]
then the multi-marginal SB problem becomes
[TABLE]
The “slowed down” formulation (11) reduces to (6) when we take the limit . Therefore, we establish the measure-valued spline as a zero-noise limit of a multi-marginal SB, and SB as a regularized version of measure-valued spline. Rigorous proof of these conclusions will be presented in a forthcoming paper.
4 Algorithms
The Sinkhorn [24, 10] algorithm is a natural iterative scheme in SB problems. Due to its efficiency and simplicity, it has became a workhorse for data science applications of optimal transport [10]. In this section, we develop a Sinkhorn-type algorithm for multi-marginal Schrödinger bridge problems. The relation established in Section 3 implies that the same algorithm can also be used to approximate measure-valued splines.
Using a measure decomposition argument, the Schrödinger problem (11) can be rewritten as
[TABLE]
where and
[TABLE]
The optimization variables are joint distributions on the consecutive time points over the phase space. Since the cost is a summation of relative entropies and the constraints are convex, a natural algorithm is that of Bregman projections [2].
Define convex constraint sets
[TABLE]
for , then the Bregman iterative projection becomes
[TABLE]
where enumerates repeatedly. The projection operator is
[TABLE]
These projections can be derived via Lagrangian method. The projections to are easier and the projections to are more involved. Specifically,
[TABLE]
whereas for :
[TABLE]
In real implementation, we need to discretize the phase space over a grid. After discretization, the algorithm only involves matrix multiplication, pointwise-division, multiplication, square root, and therefore can be parallelized easily. The linear convergence rate is guarantee by the property of Bregman projections [2]. Our algorithm should be compared to that developed in [1]. A major difference is that our algorithm doesn’t require discretization over the time domain.
5 Conclusion
We considered a natural extension of the Schrödinger bridge problems to multi-marginal partially observable setting. We focused on inertial particles, but more general dynamics can be examined similarly. We discussed the physical meaning, stochastic control formulation and several other aspects of the problems. Just like in the standard SB problem, it has a natural relation to the measure-valued spline theory. An efficient algorithm was also developed, which makes ready for possible applications. We envision that this line of research is going to spark interest in optimal transport theory and application with multiple time points.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Benamou, J.D., Gallouët, T., Vialard, F.X.: Second order models for optimal transport and cubic splines on the Wasserstein space. ar Xiv preprint ar Xiv:1801.04144 (2018)
- 2[2] Benamou, J.D., Carlier, G., Cuturi, M., Nenna, L., Peyré, G.: Iterative bregman projections for regularized transportation problems. SIAM Journal on Scientific Computing 37 (2), A 1111–A 1138 (2015)
- 3[3] Chen, Y., Conforti, G., Georgiou, T.: Measure-valued spline curves: an optimal transport viewpoint. SIAM Journal of Mathematical Analysis 50 (6), 5947–5968 (2018)
- 4[4] Chen., Y., Georgiou, T., Pavon, M.: Optimal steering of a linear stochastic system to a final probability distribution, Part I. IEEE Trans. on Automatic Control 61 (5), 1158–1169 (2016)
- 5[5] Chen, Y., Georgiou, T., Pavon, M.: Entropic and displacement interpolation: a computational approach using the Hilbert metric. SIAM Journal on Applied Mathematics 76 (6), 2375–2396 (2016)
- 6[6] Chen, Y., Georgiou, T.T., Pavon, M.: On the relation between optimal transport and Schrödinger bridges: A stochastic control viewpoint. Journal of Optimization Theory and Applications 169 (2), 671–691 (2016)
- 7[7] Chen, Y., Georgiou, T.T., Pavon, M.: Optimal transport over a linear dynamical system. IEEE Transactions on Automatic Control 62 (5), 2137–2152 (2017)
- 8[8] Chen, Y., Karlsson, J.: State tracking of linear ensembles via optimal mass transport. IEEE Control Systems Letters 2 (2), 260–265 (2018)
