On the extremal points of the ball of the Benamou-Brenier energy
Kristian Bredies, Marcello Carioni, Silvio Fanzon, Francisco Romero

TL;DR
This paper characterizes the extremal points of the unit ball in the Benamou-Brenier energy space, revealing they are pairs of measures on characteristic curves, and applies this to sparse solutions in inverse problems.
Contribution
It provides a novel characterization of extremal points in the Benamou-Brenier energy space and applies it to sparse inverse problem solutions.
Findings
Extremal points are pairs of measures on characteristic curves.
Representation formula for sparse solutions in inverse problems.
Extension to a coercive generalization of the energy.
Abstract
In this paper we characterize the extremal points of the unit ball of the Benamou--Brenier energy and of a coercive generalization of it, both subjected to the homogeneous continuity equation constraint. We prove that extremal points consist of pairs of measures concentrated on absolutely continuous curves which are characteristics of the continuity equation. Then, we apply this result to provide a representation formula for sparse solutions of dynamic inverse problems with finite dimensional data and optimal-transport based regularization.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the extremal points of the ball of the Benamou–Brenier energy
Kristian Bredies
Institute of Mathematics and Scientific Computing, University of Graz, Heinrichstraße 36, 8010 Graz, Austria.
,
Marcello Carioni
University of Cambridge, Department of Applied Mathematics and Theoretical Physics, Wilberforce Road, Cambridge CB3 0WA, UK
,
Silvio Fanzon
Institute of Mathematics and Scientific Computing, University of Graz, Heinrichstraße 36, 8010 Graz, Austria.
and
Francisco Romero
Institute of Mathematics and Scientific Computing, University of Graz, Heinrichstraße 36, 8010 Graz, Austria.
Abstract.
In this paper we characterize the extremal points of the unit ball of the Benamou–Brenier energy and of a coercive generalization of it, both subjected to the homogeneous continuity equation constraint. We prove that extremal points consist of pairs of measures concentrated on absolutely continuous curves which are characteristics of the continuity equation. Then, we apply this result to provide a representation formula for sparse solutions of dynamic inverse problems with finite dimensional data and optimal-transport based regularization.
Keywords: Benamou–Brenier energy, extremal points, continuity equation, superposition principle, dynamic inverse problems, sparsity
Mathematics Subject Classification (2010): 52A05, 49N45, 49J45, 35F05
1. Introduction
The classical theory of Optimal Transport deals with the problem of efficiently transporting mass from a probability distribution into a target one. In the last thirty years, great advances in the understanding of the underlying theory have been achieved [2, 46, 49]. However, only recently these techniques are starting to be applied in order to solve computational problems in a great variety of fields, with logistic problems [8, 18, 19, 20], crowd dynamics [37, 38], image processing [29, 35, 39, 41, 44, 47, 48], inverse problems [16, 32] and machine learning [5, 27, 28, 40, 45, 52] being a few examples.
In this paper we focus on the so-called Benamou–Brenier formula, which provides an equivalent dynamic formulation of the classical Monge–Kantorovich transport problem [31]. Introduced by Benamou and Brenier in [6], such formula allows to compute an optimal transport between two probability measures and on a closed bounded domain through the minimization of the kinetic energy
[TABLE]
among all the pairs , where is a curve of probability measures on , is a time-dependent vector field and the pair satisfies distributionally the continuity equation
[TABLE]
The interest around the Benamou–Brenier formulation is motivated by its remarkable properties. First, it allows to compute an optimal transport in an efficient way [6] by means of a convex reformulation of (1), by introducing the momentum . More precisely, denoting by the time-space cylinder, the Benamou–Brenier energy (1) can be equivalently defined as a convex functional on the space of bounded Borel measures by setting
[TABLE]
whenever are such that , , and otherwise. With this change of variables, the continuity equation at (2) assumes the form
[TABLE]
In addition, the dynamic nature of the Benamou–Brenier reformulation of optimal transport is at the core of many recent developments in the fields of PDEs, optimal transport and inverse problems. Indeed, the dynamic formulation allows to endow the space of probability measures with a differentiable structure [2, 49], making possible the characterization of differential equations as gradient flows in spaces of measures [4, 30, 43] or the derivation of sharp inequalities [23, 36, 42]. Moreover, it motivated recent developments in unbalanced optimal transport theory [21, 22, 33, 34], i.e., when the marginals are arbitrary positive measures. Finally, as the Benamou–Brenier energy provides a description of the optimal flow of the transported mass at each time , which is a valuable information in applications, it was recently employed as a regularizer for variational inverse problems [12, 13, 16, 29, 35, 50].
The goal of this paper is to characterize the extremal points of the unit ball of the Benamou–Brenier energy at (3), and of a coercive version of it, which is obtained by adding the total variation of to . Both functionals are constrained via the continuity equation (2). Precisely, we introduce the functional
[TABLE]
defined for all and , . We then characterize the extremal points of the subset of defined by
[TABLE]
We emphasize that we do not enforce initial conditions to the continuity equation in (5). To be more specific, we prove the following result (see Theorem 6).
Theorem**.**
Let , . The extremal points of the set are exactly given by the zero measure and the pairs of measures such that
[TABLE]
where is an absolutely continuous curve with weak derivative , and such that . If the condition is satisfied if and only if is not constant.
We therefore show that the extremal points of the set are pairs of measures , with concentrated on some absolutely continuous curve in , and the density of with respect to is given by . Notice that such conditions are equivalent to the existence of a measurable field such that
[TABLE]
thus showing that is a characteristic associated to the continuity equation at (2) with respect to the field . We prove the above Theorem in Section 3, with the aid of a probabilistic version of the superposition principle for positive measure solutions to the continuity equation (2) on the domain (see Theorem 7). We mention that the ideas behind such superposition principle are not new, and they were originally introduced in [3] for positive measures on (see also [2, 7, 51]). The result of Theorem 7 allows to decompose any measure solution of the continuity equation (4) with bounded Benamou–Brenier energy, as superposition of measures concentrated on absolutely continuous characteristics of (4), that is, curves solving (6) with . As a consequence, we show any pair of measures that is not of such a form can be written as a proper convex combination of elements of and thus it is not an extremal point. The opposite inclusion follows from the convexity of the energy and the properties of the continuity equation.
The interest in characterizing extremal points of the Benamou–Brenier energy is not only theoretical. It has been recently shown in [15] and [11] that in the context of variational inverse problems with finite-dimensional data, the structure of sparse solutions is linked to the extremal points of the unit ball of the regularizer. In the classical theory of variational inverse problems one aims to solve
[TABLE]
where is the target space, is a convex regularizer, is a linear observation operator mapping to a finite-dimensional space and is the observation. It has been empirically observed that the presence of the regularizer is promoting the existence of sparse solutions, namely minimizers that can be represented as a finite linear combination of simpler atoms. While this effect has been well-understood in the case when is finite dimensional, the infinite-dimensional case has been only recently addressed [11, 15, 24, 25, 54, 53, 55]. In particular, in [11, 15], it has been shown that, under suitable assumptions on and , there exists a minimizer of (7) that can be represented as a finite linear combination of extremal points of the unit ball of ; namely the atoms forming a sparse solution are the extremal points of the ball of the regularizer.
In view of the above discussion, in Section 4 we apply our characterization of the extremal points of the energy at (5) to understand the structure of sparse solutions for inverse problems with such transport energy acting as regularizer. We mention that the analysis is carried out for the case , as the functional , corresponding to the rescaled Benamou–Brenier energy, lacks of compactness properties (see Remark 1). We verify that the assumptions needed to apply the representation theorems in [15] and [11] are satisfied by , and consequently we deduce the existence of a minimizer that is given by a finite linear combination of measures concentrated on absolutely continuous curves in (see Theorem 10). As a specific application of Theorem 10 we consider the setting introduced in [16], where the regularizer is coupled with a fidelity term that penalizes the distance between the unknown measure computed at , and the observation at such times (see Section 4.2). This setting is relevant for applications, such as variational reconstruction in undersampled dynamic MRI. Employing the previous results we are able to prove the existence of a sparse solution represented with a finite linear combination of measures concentrated on absolutely continuous curves in (see Corollary 12).
To conclude, we mention that characterizing the extremal points for a given regularizer has important consequences in devising algorithms able to compute a sparse solution. Notable examples have been proposed for the total variation regularizer in the space of measures [10, 17] using so-called generalized conditional gradient methods (or Frank–Wolfe-type algorithms [26]). Inspired by the previous methods, and building on the theoretical results obtained in the present paper, we plan to develop numerical algorithms to compute sparse solutions of dynamic inverse problems with the optimal transport energy as a regularizer [12, 13], effectively providing a numerical counterpart to the theoretical framework established in [16]. Finally, we remark that similar results to the ones presented in this paper can be obtained for unbalanced optimal transport energies. This has been recently achieved in [14], by introducing a novel superposition principle for measure solutions to the inhomogeneous continuity equation.
2. Mathematical setting and preliminaries
In this section we give the basic notions about the continuity equation, the Benamou–Brenier energy, and its coercive version anticipated in the introduction. We refer to [2, 6, 16] for a more detailed overview. For measure theoretical notions, we refer to the definitions in [1].
Given a metric space we will denote by (resp. ) the space of bounded Borel measures (resp. bounded vector Borel measures) on . Similarly, and denote the set of bounded positive Borel measures and Borel probability measures on , respectively. Let be an open, bounded domain with . Set ,
[TABLE]
and
[TABLE]
where the solutions of the continuity equation are intended in a distributional sense, that is,
[TABLE]
We remark that the above weak formulation includes no-flux boundary conditions for the momentum on . Also, no initial and final data is prescribed in (8). Moreover, by standard approximation arguments, we can consider in (8) test functions in (see [2, Remark 8.1.1]).
We now introduce the Benamou–Brenier energy. For this purpose, define the convex, lower semicontinuous and one-homogeneous map by setting
[TABLE]
The Benamou–Brenier energy is defined for every pair as
[TABLE]
is such that . Since is one-homogeneous, the above representation of does not depend on . For some fixed , , we consider the following functional
[TABLE]
where denotes the total variation norm in .
Remark 1*.*
Note that in the definition of we add the total variation of to the Benamou–Brenier energy. If this choice enforces the balls of the energy to be compact in the weak* topology of (see Lemma 4). As a consequence, the functional is a natural regularizer for dynamic inverse problems when the initial and final data are not prescribed [16]. We remark that, although in the case the unit ball of the energy is not compact, we can still characterize its extremal points. However, in this case, due to the lack of coercivity, has limited use as a regularizer for dynamic inverse problems.
For a measure , we say that disintegrates with respect to time if there exists a Borel family of measures in such that
[TABLE]
We denote such disintegration with the symbol . Further, we say that a curve of measures is narrowly continuous if the map
[TABLE]
is continuous for each fixed . The family of narrowly continuous curves will be denoted by . We also introduce , as the family of narrowly continuous curves with values into the positive measures on .
We now recall several results about , and measure solutions of the continuity equation (8), which will be useful in the following analysis. For proofs of such results, we refer the interested reader to Propositions 2.2, 2.4, 2.6 and Lemmas 4.5, 4.6 in [16].
Lemma 2** (Properties of ).**
The functional defined in (9) is convex, positively one-homogeneous and sequentially lower semicontinuous with respect to the weak topology on . Moreover it satisfies the following properties:*
- i)
* for all ,* 2. ii)
if , then and , that is, there exists a measurable map such that , 3. iii)
if and for some measurable, then
[TABLE]
Lemma 3** (Properties of the continuity equation).**
Assume that satisfies (8) and that . Then disintegrates with respect to time into , where for a.e. . Moreover is constant, with for a.e. . If in addition , that is,
[TABLE]
where for some measurable, then belongs to .
Lemma 4** (Properties of ).**
Let , . The functional is non-negative, convex, positively one-homogeneous and sequentially lower semicontinuous with respect to weak convergence on . Assume now . For such that we have*
[TABLE]
Moreover, if is a sequence in such that
[TABLE]
then for some and there exists some with , such that, up to subsequences,
[TABLE]
3. Characterization of extremal points
The aim of this section is to characterize the extremal points of the unit ball of the functional at (10) for all , namely, of the convex set
[TABLE]
To this end, let us first introduce the following set.
Definition 5** (Characteristics).**
For , define the set of all the pairs such that
[TABLE]
where satisfies for each and .
We remind that denotes the space of absolutely continuous curves having a weak derivative in . We point out that by definition for all choices of , . Moreover the condition is always satisfied if . When we instead have if and only if , that is, the set does not contain constant curves.
For the extremal points of we have the following characterization.
Theorem 6**.**
Let , be fixed. Then
[TABLE]
The proof of Theorem 6 is postponed to Section 3.2. In order to show the inclusion we will make use of a superposition principle for measure solutions of the continuity equation (8). This result is not new, and it is proved in [2, Ch 8.2] for the case . In Section 3.1 we show that it also holds for bounded closed domains.
3.1. The superposition principle
Before stating the superposition principle in , we introduce the following notation. Let
[TABLE]
be equipped with the supremum norm, i.e., . For every fixed let be the evaluation at , that is, . Notice that is continuous. For a measurable vector field , we define the following subset of consisting of curves solving the ODE (6) in the sense of Carathéodory:
[TABLE]
Moreover define the set of solutions to the ODE which live inside for all times:
[TABLE]
The superposition principle for probability solutions to (8) states as follows.
Theorem 7**.**
Let be a narrowly continuous solution of the continuity equation in the sense of (8), for some measurable such that
[TABLE]
Then there exists a probability measure concentrated on and such that for every , that is,
[TABLE]
Proof.
Let be the extension to zero of to the whole . Similarly, for each , let be the extension to zero of in . Note that the pair is a solution of the continuity equation in in the sense of (8). Moreover and satisfy (14) in . Therefore we can apply Theorem 8.2.1 in [2] and obtain a probability measure concentrated on and such that for all , that is,
[TABLE]
We claim that is concentrated on . In order to show that, partition into
[TABLE]
where
[TABLE]
Notice that, since is open and in , the curves in are constant, so that we can write
[TABLE]
From this, it follows that . Moreover, (16) implies . Therefore, using that is concentrated on , we conclude that , showing that is concentrated on . Finally, (16) implies (15) since is supported in and it coincides with in . Also by definition of , thus concluding the proof. ∎
3.2. Proof of Theorem 6
Let , . We divide the proof into two parts.
Part 1: .
We start by showing that . The fact that follows immediately, since solves the continuity equation and (by Lemma 2). Consider now . Notice that satisfies the continuity equation in the sense of (8): indeed for every we have
[TABLE]
since is compactly supported in . Moreover, thanks to the fact that and , we can invoke (11) to obtain
[TABLE]
proving that .
We now want to show that any is an extremal point for . Hence assume that are such that
[TABLE]
for some . We need to show that . Set . Since is such that , from in Lemma 2 we have that and for some Borel field . In particular, if , (19) forces , hence showing that is an extremal point of .
Let us now consider the case . By (18) we have . From (19), convexity of , and the fact that , , we conclude
[TABLE]
Since solves the continuity equation, and , from Lemma 3 we deduce that for some narrowly continuous curve , with constant in time. We define and notice that : Indeed, would imply , yielding . This would contradict (20). Now, from condition (19) and uniqueness of the disintegration we deduce
[TABLE]
Since (and hence ), the above equality implies that , i.e.,
[TABLE]
We now show that on , that is
[TABLE]
By assumption, in the sense of (8). Therefore, recalling (22) and the fact that , we get that for each ,
[TABLE]
where the last equality follows from (17), since . Let and define , where , so that is a test function for (24). By plugging into (24) we obtain
[TABLE]
where and are the -th component of and , respectively. This implies that for a.e. , that is, a.e. on . With this at hand, by means of (11) we can see that . Since (20) holds, we obtain , thus proving and hence extremality for in .
Part 2: .
Let be an extremal point. In particular, so that by Lemma 2 , we obtain and for some Borel field . Notice that by extremality of and one-homogeneity of we immediately infer that either or . If , by decomposing as
[TABLE]
and using the extremality of together with the one-homogeneity of we deduce that . Thus, we consider the case
[TABLE]
Since by definition, solves the continuity equation in the sense of (8) and , we can apply Lemma 3 to obtain that for some narrowly continuous curve , where .
Claim: is a singleton for each .
Proof of Claim: The hypotheses of Theorem 7 are satisfied, therefore there exists a measure concentrated on and such that for every . Assume by contradiction that there exists a time such that is not a singleton. Therefore, we can find a Borel set such that
[TABLE]
Define the Borel set
[TABLE]
By the relation and definition of we obtain . Therefore, from (26)
[TABLE]
Define
[TABLE]
where . Note that are well defined (possibly being equal to ) as the map
[TABLE]
is lower semicontinuous on , and hence measurable. Notice that
[TABLE]
because is concentrated on . Since belongs to for a.e. , by [9, Theorem 3.6.1] we obtain that the representation formula (15) holds for and a.e. , that is,
[TABLE]
Therefore, from (11), (25), (29) and (30) we deduce .
We now proceed with the proof of the claim separately for the cases and . Suppose first . Notice that thanks to (27) and the fact that . Decompose as
[TABLE]
where we defined
[TABLE]
for , with and . Notice that , since is a positive measure concentrated on , and . We now claim that . First, we prove that in the sense of (8). Let and fix . Since belongs to for a.e. , by [9, Theorem 3.6.1], (15) and the definition of , we get
[TABLE]
Now recall that is concentrated on and that is compactly supported in time, so that
[TABLE]
The calculation for is similar. Also, by definition of and of , one can perform similar calculations to the ones in (29), (30), and prove that . Hence . We now claim that . Suppose by contradiction that . Then in particular , so that by (32) we get
[TABLE]
As are solutions of the continuity equation and , from Lemma 3 it follows that the maps are narrowly continuous. In particular, (35) holds for each . However, by (27) and by definition of , , , we have
[TABLE]
which contradicts (35). Therefore , which shows that the decomposition (31) is non-trivial. This is a contradiction, since we are assuming that is an extremal point for . Thus the claim follows.
Suppose now that and define the set
[TABLE]
Notice that is measurable, due to the measurability of the map at (28). We claim that . In order to prove that, let and define the measures , , so that . Recalling that for all , we can decompose
[TABLE]
where
[TABLE]
Let . Notice that since is a positive measure concentrated on and . Following similar computation as (33)-(34) we infer that solves the continuity equation in the sense of (8). Moreover, by definition of and the fact that is concentrated on we obtain
[TABLE]
where we employed Fubini’s Theorem, which holds thanks to the measurability of the map and the identity (30), the latter implying boundedness of the last term in (37). By (37) and arguing as in (29)-(30), it is immediate to check that . Recalling (25) we then obtain . As is an extremal point of , from (36) we deduce that and thus . In particular, there exists such that . Hence for every measurable, by the positivity of , we have , implying that . By (27) and the definition of , , we conclude that . With this property established, the claim that is a singleton for each follows by repeating the same arguments of the case , employing the decomposition of as in (31).
We have shown that for each , is a singleton. We now conclude the proof of Theorem 6. Since , the latter implies the existence of a curve such that for each . We will now prove that . By narrow continuity of , we have that the map is continuous for all . By testing against the coordinate functions , we obtain continuity for . Consider now with , . Notice that the scalar map is continuous. Moreover, by testing the continuity equation against we get
[TABLE]
which implies that the distributional derivative of the map is given by
[TABLE]
We now remark that the above map belongs to , since
[TABLE]
Therefore, belongs to for every fixed . By choosing , , we conclude that . Since , we can repeat the same argument employed to prove (23), and infer
[TABLE]
From (38) and the fact that , , we then conclude . As , we can apply (11) to compute
[TABLE]
Recalling that (see (25)), from (39) we conclude that with . Therefore belongs to according to Definition 5, and the proof of Theorem 6 is concluded.
4. Application to sparse representation for inverse problems with optimal transport regularization
In this section we deal with the problem of reconstructing a family of time-dependent Radon measures given a finite number of observations. To be more specific, let be a finite dimensional Hilbert space and be a linear continuous operator, where continuity is understood in the following sense: given a sequence in , we require that
[TABLE]
where, with a little abuse of notation, we will denote by both the curve , as well as the measure .
For some given data , we aim to reconstruct a solution to the dynamic inverse problem
[TABLE]
For and we regularize the above inverse problem by means of the energy defined in (10), following the approach in [16]. In practice, upon introducing the space
[TABLE]
we consider the Tikhonov functional defined as
[TABLE]
where is a given fidelity functional for the data , which is assumed to be convex, lower semicontinuous and bounded from below. Additionally, we assume that is proper. We then replace (41) by
[TABLE]
Remark 8*.*
Two common choices for the fidelity term in the case are, for example,
- i)
for a given that forces the constraint ,
- ii)
that recovers a classical penalization.
Remark 9*.*
Under the above assumptions on and , problem (43) admits a solution. Indeed, since is proper, any minimizing sequence is such that is bounded. As is bounded from below and , we deduce that is bounded. Therefore, Lemma 4 implies that converges (up to subsequences) to some , in the sense of (13). By weak* lower semicontinuity of in (see Lemma 4) and by (40) together with the lower-semicontinuity of , we infer that solves (43).
It is well-known that the presence of a finite-dimensional constraint in an inverse problem, such as (41), promotes sparsity in the reconstruction. This observation has been recently made rigorous in [15] and [11], where it has been shown that the atoms of a sparse minimizer are the extremal points of the ball of the regularizers. In Theorem 6, we provided a characterization for the extremal points of the ball of . Therefore, specializing the above-mentioned results to our setting yields the following characterization theorem for sparse minimizers to (43).
Theorem 10**.**
Let . There exists a minimizer of (43) that can be represented as
[TABLE]
where , , , and
[TABLE]
where with for each , and .
In other words, the above theorem ensures the existence of a minimizer of (43) which is a finite linear combination of measures concentrated on the graphs of -trajectories contained in . In Section 4.1 we give a proof of Theorem 10, and we conclude the paper with Section 4.2, where we apply the sparsity result of Theorem 10 to dynamic inverse problems with optimal transport regularization, following the approach of [16].
4.1. Proof of Theorem 10
As already mentioned, the proof is an immediate consequence of Theorem 6 and a particular case of [11, Corollary ] (see also [11, Theorem ]). Before proceeding with the proof, for the reader’s convenience, we recall Corollary from [11]. The definitions appearing in the statement below will be briefly recalled in the proof of Theorem 10.
Theorem 11** ([11]).**
Let be a locally convex space, be a finite-dimensional Hilbert space, , be convex, and be linear. Consider the variational problem
[TABLE]
Suppose that the set of minimizers of (45), denoted by , is non-empty. Additionally, assume that there exists such that the set
[TABLE]
is linearly closed, the lineality space of is and . Then, exactly one of the following conditions holds:
- i)
* is a convex combination of at most extremal points of ,*
- ii)
* is as a convex combination of at most points, which are either extremal points of , or belong to an extreme ray of .*
Proof of Theorem 10.
We just need to verify that we can apply Theorem 11 to the variational problem (43). So, we choose , and and satisfying the assumptions stated above. Let be the set of solutions to (43).
First, notice that in Remark 9 we have already shown that the set of minimizers for (43) is non-empty, so that . Moreover is compact with respect to the weak* topology. Indeed, given a sequence in we can use Lemma 4 to extract a subsequence (not relabelled) such that in and in for every . Using the sequential lower semicontinuity of with respect to weak* convergence combined with the continuity of (according to (40)) and the lower semicontinuity of , we obtain . We conclude that is sequentially weakly* compact and hence weakly* compact, thanks to the metrizability of the weak* convergence on bounded sets. Finally note that is convex thanks to the convexity of and (Lemma 4). By Krein–Milman’s Theorem, we then infer the existence of a .
The lineality space of is defined as , where is the recession cone of defined as the set of all such that . Hence, from the coercivity of in Lemma 4 it is immediate to conclude that . Moreover, is linearly closed if the intersection of with every line is closed. It is easy to verify that, as is weakly* closed (Remark 9), it is also linearly closed. Finally, the assumption is satisfied whenever , as in this case , while . Hence, the hypotheses of Theorem 11 for the functional (42) are verified. Notice also that does not contain extreme rays. In order to prove that, we first recall that a ray of is any set of the form for , . An extreme ray of is a ray such that for every segment intersecting , the whole segment is contained in . Thanks to the coercivity of in Lemma 4, it is immediate to see that contains no rays and thus no extreme rays. Hence, from either of the conclusions in Theorem 11, we deduce that there exists a minimizer of (43) that can be represented as
[TABLE]
where , , and . We remark that if , the assumption in Theorem 11 is not satisfied, but the representation (47) holds trivially. Using the characterization of extremal points in Theorem 6 and (47), we obtain an explicit sparse representation for solutions of (43) and the proof is achieved. ∎
4.2. Dynamic inverse problems
Theorem 10 provides a representation formula for sparse solutions of (43) that holds for every and satisfying the above-stated hypotheses. A relevant choice for and is proposed in [16] as a model for dynamic inverse problems: in particular, the authors apply their framework to variational reconstruction in undersampled dynamic MRI. In what follows we make an explicit choice of and in order to apply Theorem 10 to a special case of the framework in [16], namely the case of discrete time sampling, and finite-dimensionality of the data for each sampled time.
To be more specific, consider a discretization of the interval in points and assume that we want to reconstruct an element of , by only making observations at the time instants . To this aim, let be a family of finite-dimensional Hilbert spaces and introduce the product space , normed by . Let be linear operators, which are assumed to be weak* continuous for each . For a given observation , consider the problem of finding such that
[TABLE]
Following [16], we regularize the above problem by
[TABLE]
In order to recast the above problem into the form (43), let be the linear operator defined by
[TABLE]
Notice that is continuous in the sense of (40), thanks to the assumptions on . We can then equivalently rewrite (48) as
[TABLE]
In this way, we recover a problem of the type of (43), where . Notice that is convex, lower semicontinuous and bounded from below. Moreover, the functional in (49) is proper, since . Hence, we can apply Theorem 10 to conclude the following result.
Corollary 12**.**
Let . There exists a minimizer of (48) that can be represented as
[TABLE]
where , , , and
[TABLE]
where with for each , and .
Remark 13*.*
The upper bound in the representation formula (50) might not be optimal. However, a careful analysis of the faces of the ball of the Benamou–Brenier energy, possibly under additional assumptions on the operator and fidelity term , could be needed to substantiate such conjecture. We leave this question open for future research.
Acknowledgements
We thank the referee for the useful suggestions provided, particularly for encouraging us to include the case in the characterization of Theorem 6, which was previously missing. KB and SF gratefully acknowledge support by the Christian Doppler Research Association (CDG) and Austrian Science Fund (FWF) through the Partnership in Research project PIR-27 “Mathematical methods for motion-aware medical imaging” and project P 29192 “Regularization graphs for variational imaging”. MC is supported by the Royal Society (Newton International Fellowship NIF\R1\192048 Minimal partitions as a robustness boost for neural network classifiers). The Institute of Mathematics and Scientific Computing, to which KB, SF and FR are affiliated, is a member of NAWI Graz (http://www.nawigraz.at/en/). The authors KB, SF and FR are further members of/associated with BioTechMed Graz (https://biotechmedgraz.at/en/).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] L. Ambrosio, N. Fusco, and D. Pallara. Functions of Bounded Variation and Free Discontinuity Problems . Oxford Science Publications, 2000.
- 2[2] L. Ambrosio, N. Gigli, and G. Savaré. Gradient Flows: In Metric Spaces and in the Space of Probability Measures . Birkhäuser Basel, 2005.
- 3[3] L. Ambrosio. Transport equation and Cauchy problem for BV vector fields. Inventiones mathematicae , 158(2):227–260, 2004.
- 4[4] L. Ambrosio, N. Gigli, and G. Savaré. Calculus and heat flow in metric measure spaces and applications to spaces with Ricci bounds from below. Inventiones mathematicae , 195(2):289–391, 2014.
- 5[5] M. Arjovsky, S. Chintala, and L. Bottou. Wasserstein generative adversarial networks. In Proceedings of the 34th International Conference on Machine Learning , volume 70, pages 214–223, 2017.
- 6[6] J.-D. Benamou and Y. Brenier. A computational fluid mechanics solution to the Monge–Kantorovich mass transfer problem. Numerische Mathematik , 84(3):375–393, 2000.
- 7[7] P. Bernard. Young measures, superposition and transport. Indiana University Mathematics Journal , 57(1):247–275, 2008.
- 8[8] M. Bernot, V. Caselles, and J.-M. Morel. Optimal transportation networks . Lecture Notes in Mathematics. Springer-Verlag, Berlin, 2009.
