Cyclically monotone non-optimal $N$-marginal transport plans and Smirnov-type decompositions for $N$-flows
Mircea Petrache

TL;DR
This paper investigates the limitations of $c$-cyclical monotonicity as a criterion for optimality in multi-marginal transport problems, providing counterexamples for $N \,\geq\ 3$ and introducing $N$-flows and Smirnov-type decompositions.
Contribution
It demonstrates that $c$-cyclical monotonicity is not sufficient for optimality when $N\geq 3$, and introduces the concept of $N$-flows and a Smirnov-type decomposition to analyze these cases.
Findings
Counterexample shows $c$-cyclical monotonicity is not sufficient for $N\geq 3$
Comparison highlights differences from the $N=2$ case
Introduction of $N$-flows and Smirnov-type decomposition for analysis
Abstract
In the setting of optimal transport with marginals, a necessary condition for transport plans to be optimal is that they are -cyclically monotone. For there exist several proofs that in very general settings -cyclical monotoncity is also sufficient for optimality, while for this is only known under strong conditions on . Here we give a counterexample which shows that -cylclical monotonicity is in general not sufficient for optimality if . Comparison with the case shows how the main proof strategies valid for the case might fail for . We leave open the question of what is the optimal condition on under which -cyclical monotonicity is sufficient for optimality. The new concept of an -flow seems to be helpful for understanding the counterexample: our construction is based on the absence of finite-support -cycles…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPoint processes and geometric inequalities · Markov Chains and Monte Carlo Methods · Nonlinear Partial Differential Equations
Cyclically monotone non-optimal -marginal transport plans and Smirnov-type decompositions for -flows.
Mircea Petrache
Pontificia Universidad Catolica de Chile, Santiago, Chile
Abstract.
In the setting of optimal transport with marginals, a necessary condition for transport plans to be optimal is that they are -cyclically monotone. For there exist several proofs that in very general settings -cyclical monotoncity is also sufficient for optimality, while for this is only known under strong conditions on . Here we give a counterexample which shows that -cylclical monotonicity is in general not sufficient for optimality if . Comparison with the case shows how the main proof strategies valid for the case might fail for . We leave open the question of what is the optimal condition on under which -cyclical monotonicity is sufficient for optimality. The new concept of an -flow seems to be helpful for understanding the counterexample: our construction is based on the absence of finite-support -cycles in the set where our counterexample cost is finite. To follow this idea we formulate a Smirnov-type decomposition for -flows.
The author acknowledges support from the FONDECYT Iniciacion en Investigacion 2017 grant N. 11170264.
Contents
1. Introduction, basic definitions and setting
Let be a cost, where , are Polish spaces, and is an ordered group (usually we take or ). If are probability measures with and is the canonical projection of onto , then the possible ways to couple the measures are encoded by the space of so-called transport plans defined as
[TABLE]
The -marginal optimal transport problem with cost as above with marginals , is the following minimization problem:
[TABLE]
Existence of minimizers can be ensured e.g. by assuming that is lower semicontinuous and bounded below, by the direct method of the calculus of variations and via Prokhorov’s theorem. We do not focus on existence and uniqueness in the present work, and for our purposes, when needed, we can just assume that minimizers exist. Both in the theoretical development and in algorithmic implementations, it becomes important to limit the class of plans that we are required to test in the minimization above (see [8] and the references therein).
One important property which in many cases ensures optimality of , and which is the main focus of the present paper, is -monotonicity. This condition was introduced in the case of marginals as a natural extension valid in the case for , and makes a natural connection to convex analysis (see [15] and [12]). For a recent treatment of the case of general from the point of view of the analogy with convex geometry, see [1], [2] and the references therein. We now pass to give the precise definition for general .
A set is -monotone (or -cyclically monotone) if for all and every -ple of permutations , for every -ple with for each index there holds
[TABLE]
As pointed out for example in [9], this is equivalent to requiring that for any finitely supported measure which is absolutely continuous with respect to , the plan has minimal cost amongst plans with the same marginals as . Indeed, assume we have a probability measure with marginals and such that is an atomic measure with finitely many atoms, i.e.
[TABLE]
Then we can replace by the plan where the action of on atomic probability measures with equal atoms is inherited from the action on by composing with the surjective “empirical measure” map
[TABLE]
For , explicitly we have
[TABLE]
As is well known (see [8] and the references therein), the above measures are the extremals of the set of probabilities with marginals equal to those of , and thus the linear optimization problem (1.1) achieves its minimum on this set. Therefore (1.2) implies that has cost lower or equal than any competitor in this case.
In the case the link between -monotonicity of the support of transport plans and their optimality is well understood, see [11], [3], [13] and [4] and the references therein. Recent progress has been made in [9], [1] and [7] for the case of general , with special focus on the case of costs coming from potential theory and mathematical physics.
However a result at the same level of generality as e.g. [11] or [3] is missing for . The result [11] proves that -monotonicity is equivalent to optimality for the case of general in the case of atomic marginals, with the only requirement being that there exists a finite-cost plan. A consequence of the counterexample from the present work is that such general result is simply false for any , due to phenomena absent in the case . This makes the characterization of optimality even more interesting, and a possible source of new mathematics, for . Note that is of exponential growth in both and , thus even in the case of measures with finitely many atoms, condition (1.2) is prohibitively hard to test in practice. It is even prohibitive to efficiently store on a computer all the competitors appearing in (1.2). The later problem is addressed in [8], to which we refer for further references in this direction.
2. The counterexample
It has been proved in [11] that for marginals, -cyclical monotonicity is equivalent to optimality of transport plans. The same has been proved under stronger conditions for a general number of marginals, but the question remained open, of whether or not for marginals the no-hypothesis theorem from [11] about the case of atomic measures also holds or if the ergodic approach [3] can be adapted. On a space , we find:
[TABLE]
2.1. A simple construction for taking the value
Proposition 2.1**.**
There exist satisfying (2.1) with and .
Remark 2.2**.**
- •
(Other ) The case of more than three marginals follows from the above, e.g. by defining a cost which only depends on three coordinates.
- •
(Same result on other metric spaces) The counterexample construction which we will do works on the space , however can be embedded injectively into another space of at least countable cardinality, such as , in several ways: any sequence such that are distinct is such an embedding. In our proof only the value of over the image of such embedding is used, and thus could be extended arbitrarily outside . Therefore one can generate a series of counterexamples in any such as well.
- •
(The fact that is infinite is crucial) For finite , monotonicity implies optimality of finite--cost plans, for any and any . This follows by direct verification from the case of the monotonicity condition (1.2).
- •
(Growth requirements on the cost) The proof becomes clearer if we first allow on a large subset of . So we first provide the proof in this case, and then provide a second proof for finite .
We cannot have Proposition 2.1 to hold for too tame. Indeed Griessler [9] showed that for , if holds -almost everywhere for some , then -cyclical monotonicity implies optimality. The positive result also holds also for general and can be extended by the De Pascale method [6] to more singular , see also [7].
Proof of Proposition 2.1:.
A cost taking the value . The cost will be defined in terms of an auxiliary bounded function as follows:
- •
is symmetric with respect to permutations of the triple .
- •
.
- •
for .
- •
for all the triples not described in the previous two cases.
Required properties of the function . We don’t need to fully specify the values of but we require that
[TABLE]
(This is achieved, for example, if and is decreasing.)
Two plans and their common marginal. We define
[TABLE]
It is easy to check that and that is the first marginal of and of . The two plans and have equal marginals because the plans are symmetric.
The plan is non-optimal. The -cost of is lower than the -cost of due to the properties of :
[TABLE]
The plan is -cyclically monotone. To prove the above, we first note that, due to the symmetry of the problem, without loss of generality we may restrict to symmetric measures with atoms corresponding to integers smaller or equal to :
[TABLE]
The marginals of the above are all equal to
[TABLE]
We need to show that if is another plan with the same marginals then .
If then the desired inequality holds, so we are left with the case . In this case can be written in the form
[TABLE]
and the marginal condition on translates into
[TABLE]
Note that the equations (2.7) imply that
[TABLE]
This means that for . For we have thus we find for all , and by (2.7) we get for all , which inserted in (2.1) gives . Thus is the only symmetric plan with marginals equal to and finite cost. As this is true for every with finitely many atoms, we have that is -cyclically monotone, as desired. ∎
2.2. About the existence of everywhere finite
In the search for a counterexample which is now everywhere finite, we first consider the problem abstractly, in a very general setting.
Assume that there exist finite-cost transport plans, such that is non-optimal and cyclically monotone, and is optimal. Then we consider the following sets of measures:
[TABLE]
Note that is composed of measures with finite-cardinality support by definition, and is also composed of measures of finite support, because for all and . Then the set of “counterexample” costs for which is cyclically monotone, but has better cost than , is given by
[TABLE]
Our assumption that a counterexample cost exists (which we showed for ), means that , and we now consider the question of whether or not
[TABLE]
as well. Note that and are convex, but not closed under weak- convergence of measures since belongs to the closure of but not to , and is in the closure of but not in . As is composed on functionals of the form which are required to be nonpositive on and negative on , this directly shows that all are not continuous under weak- convergence:
Lemma 2.3**.**
Let and be Polish spaces, and assume that is a cost for which there exist of finite cost, which is is cyclically monotone and not optimal. Then the assignment defined on is not continuous with respect to weak- convergence.
Nevertheless, we cannot exclude that there exist counterexamples which are everywhere finite:
Open Problem**.**
Do there exist Polish spaces and a cost function for which there exists a -cyclically monotone non-optimal transport plan ?
3. Cyclical monotonicity and -flows
3.1. Definition of -graphs ad -flows
We give here a principle which allows to construct more complicated counterexamples with taking the value . The idea is that we can interpret as a (continuous or discrete) directed hypergraph and as a weight on .
The case . The simplest case gives usual graphs: We then associate to the weighted directed graph
[TABLE]
Here we obtain usual (countable) weighted graphs if is countable, and a generalization thereof otherwise. We will mostly concern ourselves with atomic measures and discrete spaces for the time being.
The -graph associated to our counterexample. Weighted directed hypergraphs are defined analogously with directed edges replaced by -ples of points in . For example, the construction from the previous section gives a -graph with vertex set given by three copies of
[TABLE]
If we abbreviate the -edge by then the hypergraph relevant to the previous section has -edge set (see Figure 1)
[TABLE]
Anytime we study optimal transport with the cost from the previous section, the only edges which matter to our problem are the edges in , due to being infinite on couplings outside . This is why we can say that the -graph with weight given by encodes our problem.
Definition of -graphs and -flows. More generally, if we consider -marginal optimal transport on the set , we have vertex set and -edges set given by the -ples such that . Such edges will again be denoted , forgetting the second index, for simplicity. Again we put weight according to on -edges. We call the -graph associated to the transport cost . Note that this graph could possibly be uncountable. See Figure 2 for an example of -graph.
To a signed weight and a subset of edges we can associate an -flow, which is the formal sum
[TABLE]
It will be useful later to also introduce the mass of the -flow above, defined as
[TABLE]
Given and a finite -flow, we can integrate on (the integral is set to be if takes the value on ):
[TABLE]
To an -flow as above we associate its -boundary, which is an -ple of atomic measures given by
[TABLE]
whenever the sum is defined (in particular it is, if is finite). If has then we say that is closed (see Figure 3 for an example).
Remark 3.1**.**
We could easily generalize and adapt the above definitions to the case that is a measure over , in case the latter is infinite, replacing the sums in our definitions by suitable integrals. However, defining -cyclical monotonicity does not per se require us to talk about measures, as the definition (1.2) is be formulated in purely combinatorial terms. Indeed, it turns out that the case in which the support of is finite is all we need, and this allows us to avoid discussing measurability issues which would make the discussion superfluously complicated.
3.2. Reformulating -cyclical monotonicity in terms of -flows
We are now ready for a combinatorial reformulation of the monotonicity condition (1.2), which allows us to describe more complicated counterexamples. It says that a set is -monotone if and only if its flows are cost-minimizers at fixed boundary.
Proposition 3.2**.**
A set is -monotone if and only if whenever is finite, and are finite-mass -flows in the -graph associated to such that is supported on and , then there also holds .
Proof.
The proof is an easy verification, based on the following fact: an equivalent condition for to be -monotone is that for any measure supported on a finite subset and any with marginals equal to those of , there holds . The measure support included in the finite set implies that . We can thus associate the -flow to . Similarly, since has atomic marginals it is itself atomic too and we can associate another -flow to as above. The condition of is equivalent to requiring as can be easily verified, and the condition is equivalent to . ∎
The counterexample from the previous section is generalized by the following:
Corollary 3.3**.**
If the -graph associated to supports no finite closed -flows then all transport plans of finite -cost are -montone.
Proof.
Let be a transport plan with . We claim that the condition is never verified for of finite support contained in the set . Indeed, if has finite support then so does , and then also has finite support, and thus would be a finite closed -flow supported on , which is precluded by our hypothesis. The condition of Proposition 3.2 is then automatically verified. ∎
3.3. Special properties available for the case
Before discussing some necessary and some sufficient conditions for the existence of -flows, it is instructive to consider the case , reinterpreting the proof methods [11] and [3].
We define a -flow-path to be any -flow of the form
[TABLE]
where , and and for . If or respectively in the above but no other pair of points coincide, then we call a -flow-loop.
For the next definition, let be three -flows such that . In general we have . If the equality sign holds in the latter inequality, we say that form a decomposition of wihtout cancellations.
It is easy to see that if a closed -flow has the same support as above, then automatically it is a nonzero multiple of . The principle that “mass is conserved along -flow-loops” (which lacks an easy analogue for -flows if , as a consequence of the counterexample in the proof of proposition 2.1) helps us to prove the following:
Lemma 3.4**.**
A finite -flow closed if and only if is a finite sum of weighted -flow-loops without cancellations.
Proof.
The “if” part of the implication is obvious, as a sum of closed -flows is closed. The proof of the other implication is an adaptation of a folklore result on weighted directed graphs satisfying the Kirchhoff balance equation at each vertex, so we only sketch it for the convenience of the reader. The basic idea is to start with an edge of the -graph which is assigned lowest weight in absolute value, and use the Kirchhoff law to find a path of edges which is locally satisfying Kirchhoff law. When such path first self-intersects, we find one -flow-loop such that forms a decomposition of without cancellations. By repeating the procedure with replaced by we can keep diminishing the mass of the part not yet decomposed into loops, while diminishing the total number of loops in the graph corresponding to . When the remaining graph is one single loop, the procedure ends.
In order to describe how we single out one loop in the above sketch of proof, consider the nontrivial case and assume (as we can do up to change of sign symmetry) that for some there holds . We start building a “-path-flow” by defining . By considering the -coefficient in the expression for , and due to the fact that is minimal by definition, we find that there exists such that belongs to the support of and . We add this edge with coefficient to and obtain . By focusing now on the -coefficient in the expression for we find such that and this edge has weight , and we define . The procedure continues this way till we visit one point which was already visited before, and in this case a subpath of the -flow-path we reached is a -flow-loop. The procedure ends before step because is of finite mass, we have that is a decomposition of without cancellations for every , and for every . ∎
The above proof strategy also gives the following countable version of Lemma 3.4:
Lemma 3.5**.**
A finite-mass -flow over a countable set is a linear combination of -flow-loops without cancellations.
If is countable, , and is arbitrary, Lemma 3.5 can be applied to optimal transport as follows. Given two distinct plans with finite cost and with the same marginal, they define flows and such that is a closed -flow. So it is a superposition of finite loops. If we use -monotonicity assumption on for each such loop, we find by linearity that , which due to the arbitrarity of implies the optimality of . This is basically the strategy of [11]. The failure of the above lemmas for shows that such strategy does not extend for general due to the higher combinatorial complexity of this case.
4. Generalized Smirnov decomposition for -flows
In Proposition 3.2 we translate -monotonicity in terms of a condition on closed finite -flows supported in the -graph associated to , and Corollary 3.3 produces counterexamples using the absence of such -flows. It becomes useful to consider the classification of -flows depending on the type of cycles they contain. Here we concentrate more on such properties for the case of -graphs. Note that as before, we could consider countable -graphs and avoid measurability issues, or consider more general -graphs, on which weights and masses would be well-defined by introducing a measure space structure.
The properties emerging in this context are analogous to those from the decomposition of currents/flows into cycles and acyclic parts from [14] and [10], from which we imitate the terminology. We also mention that the use of Smirnov decomposition in Optimal Transport type problems is not new, and has appeared in another setting in [5].
We start with the following definitions:
- •
An -flow is called a cycle if it is closed.
- •
An -flow is a subflow of the -flow if there exists an -flow such that
[TABLE]
- •
If or then is a trivial subflow of .
- •
An -flow is called acyclic if it has no nonzero cyclic subflows.
- •
An -flow is called solenoidal if it is cyclic and has no finite cyclic subflows.
We have the following basic decomposition result, similar to [14] and [10]. A similar result holds with measurability assumptions for more general , corresponding to the case where transport plans correspond to general measures. We do not treat this generalization here, as the below countable case already contains the basic principle. As the method for the case of currents in the above papers is very similar, we only sketch the proof here.
Proposition 4.1**.**
If is an -flow with countable support such that then there exist unique subflows of such that is acyclic, is solenoidal and is a superposition of finite cyclic -flows, such that .
Sketch of proof of Proposition 4.1.
We call a sequence of -flows an increasing sequence if each has finite mass and is a subflow of . We will use the fact that an increasing sequence of subflows has a limit which is itself a subflow. If has a finite-support cyclic subflow, we may remove the largest-mass finite cycle subflow from and diminish its mass. This can be done an at most countable number of times, as each time the mass of diminishes by a nonzero amount. Thus we may assume that has no finite cycle subflow and that . Now note that when we have and , there also holds and . Thus we may remove from acyclic subflows in mass-decreasing order, and we terminate the procedure in a countable number of steps, as each such flow removes a positive amount of boundary mass. The sum of all such subflows is then acyclic and is a subflow of and the difference is then by construction solenoidal. ∎
In order to make Corollary 3.3 more concretely useful, we would need to consider the following.
Open Problem**.**
Find necessary conditions and sufficient conditions for an -graph to support no finite cyclic -flows. Equivalently, find necessary conditions and sufficient conditions such that supports only acyclic and solenoidal flows.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Bartz, H.H. Bauschke, J.M. Borwein, S. Reich, X. Wang, Fitzpatrick functions, cyclic monotonicity and Rockafellar’s antiderivative , Nonlinear Analysis: Theory, Methods and Applications, 66 no.5 (2007), 1198–1223.
- 2[2] S. Bartz, H.H. Sedi, X. Wang, A class of multi-marginal c 𝑐 c -cyclically monotone sets with explicit c 𝑐 c -splitting potentials , Journ. Math. Anal. Appl., 461 no.1 (2018), 333–348.
- 3[3] M. Beiglböck, Cyclical monotonicity and the ergodic theorem , Ergodic Theory and Dynamical Systems, 35 no. 3 (2015), 710–713.
- 4[4] S. Bianchini, L. Caravenna, On optimality of c-cyclically monotone transference plans , C. R. Math. Acad. Sci. Paris, 348 , no. 11-12 (2010), 613–618.
- 5[5] L. Brasco, M. Petrache, A continuous model of transportation revisited , Journal of mathematical sciences, 196 no. 2 (2014), 119–137.
- 6[6] L. De Pascale. Optimal transport with Coulomb cost. Approximation and duality , ESAIM Math. Model. Numer. Anal., 49 no. 6, 1643–1657.
- 7[7] L. De Pascale, On c 𝑐 c -cyclical monotonicity for optimal transport problem with Coulomb cost , to appear in European Journal of Applied Mathematics (2019).
- 8[8] G. Friesecke, D. Vögler, Breaking the curse of dimension in multi-marginal Kantorovich optimal transport on finite state spaces , SIAM Journal on Mathematical Analysis 50 no.4 (2018), 3996–4019.
