An optimal transport approach for solving dynamic inverse problems in spaces of measures
Kristian Bredies, Silvio Fanzon

TL;DR
This paper introduces a novel regularization method for dynamic inverse problems using optimal transport, enabling the recovery of measure-valued curves in time-dependent data spaces, with applications in dynamic MRI.
Contribution
It develops a functional-analytic framework for optimal transport-based regularization of dynamic inverse problems, proving existence, uniqueness, and regularization properties of solutions.
Findings
Established existence and uniqueness of minimizers in certain cases.
Applied the framework to dynamic MRI reconstruction with promising results.
Modeled time-varying acquisition, motion, and contrast agent effects.
Abstract
In this paper we propose and study a novel optimal transport based regularization of linear dynamic inverse problems. The considered inverse problems aim at recovering a measure valued curve and are dynamic in the sense that (i) the measured data takes values in a time dependent family of Hilbert spaces, and (ii) the forward operators are time dependent and map, for each time, Radon measures into the corresponding data space. The variational regularization we propose is based on dynamic (un-)balanced optimal transport which means that the measure valued curves to recover (i) satisfy the continuity equation, i.e., the Radon measure at time is advected by a velocity field and varies with a growth rate , and (ii) are penalized with the kinetic energy induced by and a growth energy induced by . We establish a functional-analytic framework for these regularized inverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
An optimal transport approach for solving dynamic inverse problems in spaces of measures
Kristian Bredies
University of Graz, Institute of Mathematics and Scientific Computing, Heinrichstraße 36, 8010 Graz, Austria
and
Silvio Fanzon
University of Graz, Institute of Mathematics and Scientific Computing, Heinrichstraße 36, 8010 Graz, Austria
Abstract.
In this paper we propose and study a novel optimal transport based regularization of linear dynamic inverse problems. The considered inverse problems aim at recovering a measure valued curve and are dynamic in the sense that (i) the measured data takes values in a time dependent family of Hilbert spaces, and (ii) the forward operators are time dependent and map, for each time, Radon measures into the corresponding data space. The variational regularization we propose is based on dynamic (un-)balanced optimal transport which means that the measure valued curves to recover (i) satisfy the continuity equation, i.e., the Radon measure at time is advected by a velocity field and varies with a growth rate , and (ii) are penalized with the kinetic energy induced by and a growth energy induced by . We establish a functional-analytic framework for these regularized inverse problems, prove that minimizers exist and are unique in some cases, and study regularization properties. This framework is applied to dynamic image reconstruction in undersampled magnetic resonance imaging (MRI), modelling relevant examples of time varying acquisition strategies, as well as patient motion and presence of contrast agents.
Key words: dynamic inverse problems, optimal transport regularization, continuity equation, time dependent Bochner spaces, dynamic image reconstruction, dynamic MRI. 2010 Mathematics Subject Classification: 65J20, 49J20, 35F05, 46G12, 92C55.
Contents
1. Introduction
In this paper we are concerned with solving ill-posed dynamic inverse problems where the sought unknown is a curve of Radon measures. We propose to regularize such problems via balanced and unbalanced dynamic optimal transport and establish a functional-analytic framework that takes the specificities of dynamic inverse problems, such as the time-varying nature of the measurement process, into account. Well-posedness as well as regularization properties are proven, and the application to magnetic resonance imaging (MRI) is discussed.
Our motivation to consider dedicated strategies for dynamic inverse problems arises from the shortcomings of static reconstruction strategies for inverse problems in the presence of motion during the measurement process. In the static case, measurement data is usually continuously collected such that sufficiently many data is available to enable the unique solution of the underlying inverse problem. In this context, one has to assume that no dynamics occur during the measurement process. However, this assumption is often violated for many applications, including medical imaging techniques such as MRI and computed tomography (CT) that image, e.g., the beating heart or the lung while breathing. Consequently, a consistent reconstruction is no longer possible and static approaches usually admit motion artifacts. A strategy to overcome this is to temporally resolve the dynamics, meaning that for each time instance during the measurement, one seeks to reconstruct a solution where only a small fraction of the necessary data is available. In addition to that, generally, in each time instance, a different part of the data set is acquired. This results in a dynamic inverse problem with time-variant forward operators and data spaces, where for each fixed time instance, the corresponding inverse problem is massively underdetermined. In order to solve such a challenging problem, both an appropriate dynamic regularization strategy as well as a suitable modelling of forward operators and data spaces is necessary.
We propose and study a regularization strategy that bases on optimal transport energies, both in a balanced and unbalanced context, see below for a detailed description. Such strategies are naturally linked with curves of Radon measures, inverse problems in the space of Radon measures and appropriate Radon-norm-based regularizers. Indeed, the fact that for each point in time, an inverse problem with underdetermined data has to be solved calls for dedicated regularization such as the intensively studied sparsity-promoting -type penalties. In the discrete setting, this leads to the celebrated theory of compressed-sensing [16, 27], in which one is able to reconstruct the unknown starting from very few random measurements, yielding better stability properties. The continuous, infinite-dimensional counterpart is given by the space of Radon measures [15, 17], where the regularization can be achieved penalizing the Radon norm and formulating the inverse problem in measure space. Recovering the unknown from very few observations is then possible since the data admits redundancies, particularly in applications to medical imaging.
In the dynamic setting, data redundancy additionally needs to be exploited by taking time correlation into account. Indeed, one can expect displacements between consecutive time samples to be small, and incorporate this information in the regularizer in order to achieve better reconstruction. In particular, the fluid-mechanics formulation of both balanced and unbalanced optimal transport, known as Wasserstein [6, 8] and Wasserstein–Fisher–Rao distance [20, 41, 42], respectively, are particularly well-suited to keep track of motion and possible mass change occurring in the ground truth. Further, this formulation is based on curves of Radon measures and is thus attractive for the regularization of dynamic inverse problems, where in each time instance, a Radon measure should be recovered. The regularization is then enforced by subjecting potential solutions of the dynamic problem, i.e., curves of Radon measures, to the continuity equation, possibly with source, while at the same time penalizing displacement field and growth rate. As we will see, this approach indeed establishes a convex regularization strategy that is sparsity-promoting in each time frame, exploits data redundancy in time and intrinsically recovers the velocity field associated with the motion as well as the rate of brightness changes. Let us emphasize that in particular, the approach allows for continuous measurement in time while providing spatial Radon-norm regularization. In contrast, a straightforward generalization of [15] to the space-time cylinder would, e.g., allow for measures that are singular in time and hence, not regular enough to consistently define global-in-time forward operators and data discrepancies.
Another aspect that has to be considered for dynamic inverse problems is a faithful modelling of the measurement process with respect to time, taking into account the time-varying nature of the measurements. In this paper, the latter is achieved by the construction of ad hoc Bochner-type spaces in which the data can take values in a time-dependent family of Hilbert spaces, which are correlated in time in a very weak way. This enables us to model the dynamic inverse problem with a time-dependent family of linear forward operators, mapping Radon measures to the associated data Hilbert space in each time instance, by a global-in-time forward operator that takes curves of Radon measures to the time-varying measurement data, making it thus possible to consistently define a data-mismatch term. In this respect, our model is truly dynamic and well-adapted to undersampled data as outlined above.
The overall approach then realizes a reconstruction by inverting the global forward operator subject to optimal transport penalization and continuity equation. Such an approach leads to a well-posed and convex variational problem of Tikhonov type, in which we are able to reconstruct the sought solution, along with the displacement field and mass growth rate. The main task of the paper is to establish a rigorous functional-analytic framework in which to set the problem and to obtain well-posedness and stability properties for the proposed variational optimal transport-based regularization. We then apply our theoretical results to dynamic medical imaging, focusing on the case of undersampled MRI, showing that we are able to treat an almost arbitrary variety of sampling strategies, as well as being principally able to reconstruct the image sequence, recover the motion displacement and track the possible presence of contrast agents.
Let us shortly review the existing literature on dynamic inverse problems and optimal transport approaches for inverse problems, image processing as well as computer vision. While the theory of regularization of dynamic inverse problems is a relatively new field of research [61], regularization theory for static ill-posed inverse problems dates back several decades. In this context, the approach in this paper can be classified as Tikhonov regularization in Banach space, a well-established technique where one penalizes the data mismatch by a convex regularizer and solves the corresponding variational problem [29, 60, 62, 63]. For computer vision and image processing applications, research in the last decades focused on specific convex regularizers [56] such as, for instance, edge-preserving functionals (total variation [18, 54], total generalized variation [12, 13]), or functionals that enforce sparsity with respect to a given basis, frame or learned dictionary [1, 16, 24, 34, 64]. In this context, Radon-norm penalties can also be interpreted as sparsity-promoting regularization [15, 17].
Concerning optimal transport, the classical theory deals with the problem of transporting mass from a probability distribution into a target one, while minimizing, e.g., the average squared displacement. Such a minimization problem defines a metric over the space of probability measures, called Wasserstein distance [6, 21, 55, 65]. In [8], the authors showed that the Wasserstein distance can be computed via solving a convex dynamic problem that corresponds to finding a geodesic path in the space of probabilities subjected to the continuity equation that minimizes the kinetic energy. This formulation is the basis for the balanced optimal-transport regularization studied in this paper. Such approach, however, intrinsically assumes mass constancy, which is not always desired in applications, e.g., in mathematical imaging. In the recent years, several ways to overcome this limitation were proposed, leading to so-called unbalanced optimal transport [7, 31, 32, 35, 45, 46, 50, 51]. In this context, common strategies are to add a source term to the continuity equation and consequently, to the kinetic energy, or to allow mass to escape/enter the domain, by interpreting the boundary as an infinite reservoir. In this paper, the energy introduced independently in [20, 41, 42], known as Wasserstein–Fisher–Rao or Hellinger–Kantorovich, is used to provide a unbalanced optimal-transport regularization. The remarkable feature of such formulation is that geodesics have a clear meaning, as they can be interpreted as joint displacement and change of mass, and therefore capture the dynamics of, e.g., image sequences.
Returning to inverse problems, as already mentioned, research in the dynamic framework recently gained some momentum [61], where convex regularizers that penalize the time derivative, interpret the space-time cylinder as a higher-dimensional set or enforce a spatio-temporal decomposition of low rank have been studied in the literature, most prominently in the context of medical imaging applications [23, 36, 44, 48, 57, 58, 66]. In comparison, such approaches, however, do only implicitly account for motion information in contrast to the proposed optimal-transport regularizer which explicitly yields a motion field. In this respect, the employment of optimal transport energies as regularizers for inverse problems is a very recent development. Here, existing literature mainly focuses on static inverse problems and static optimal transport leading, for instance, to Wasserstein-distance type regularization [38, 47]. In contrast, dynamic optimal transport has been utilized for specific image processing and computer vision tasks such as image interpolation [20, 37, 49]. We also mention the work [59], which appeared after the present work. In [59], the authors propose to regularize an inverse problem related to PET image reconstruction through balanced optimal transport, subsequently applying it to the problem of tracking radiolabelled cells. The regularizer they propose is similar to ours, however the forward operator they consider is static and application-specific, whereas we are able to deal with general dynamic inverse problems. Moreover, their analytical framework is greatly simplified, dealing only with discretized unknowns in space-time satisfying a discrete version of the continuity equation, rather than with actual curves of measures, which is the natural framework to obtain well-posedness, as proposed in the present paper. To the best knowledge of the authors, no other works employ dynamic optimal transport regularization for dynamic inverse problems. In particular, a framework for recovering curves of Radon measures from continuously acquired measurements does not exist to date. Let us also mention that the realization of the time-dependent Bochner spaces introduced in this paper is new. Indeed, existing approaches usually assume that almost every data space is isomorphic, which is sufficient to model, i.e., function spaces over time-varying domains [4, 22]. Such isomorphy is not required in our approach which can thus be used to model very general data acquisition strategies.
The paper is organized as follows. In the remainder of this section, we precise the mathematical setting employed for regularizing dynamic inverse problems and summarize the main theoretical results obtained (Section 1.1), including details on the MRI application (Section 1.2). In Section 2 we lay the theoretical foundations to rigorously define the optimal transport regularizer. In Section 3 we introduce and study the above mentioned class of time dependent Bochner spaces, which will be used to model the data measurements. After this preliminary part, in Section 4, we introduce the Tikhonov regularization for the dynamic inverse problem and show well-posedness as well as regularization properties. In Section 5 we apply our theoretical results to dynamic MRI, also providing examples of sampling strategies. Finally, Section 6 concludes with some perspectives for future research and some comments on the related paper [11] as well as forthcoming work, in which we perform numerical analysis for the model proposed in this paper.
1.1. Outline of the mathematical setting and main theoretical results
Let be an open and bounded domain, with , , and consider a time variable . Let be a time-dependent collection of Hilbert spaces modelling the data. The time regularity required for such family will be very mild, as specified below. At each time instance corresponds a given linear continuous forward operator , mapping from the space of Radon measures into . We consider the following inverse problem: Given some data for , find a curve of Radon measures such that
[TABLE]
We propose to regularize (1) by means of balanced/unbalanced optimal transport. This is enforced by subjecting to the continuity equation
[TABLE]
where is a flow field transporting the mass , while is a growth rate keeping track of mass creation and destruction, thus allowing for local mass change. We point out that no initial conditions are prescribed on in (2), since in the context of the inverse problem (1) we only have available indirect measurements on the whole time interval . We propose to regularize (1) by minimizing the Tikhonov functional
[TABLE]
subject to (2). Here, are regularization parameters, is a penalty parameter and the optimization is done for the triple . The second term in (3) is known in the literature as Wasserstein–Fisher–Rao energy for unbalanced optimal transport [20, 41, 42], and as Benamou–Brenier energy [8] for balanced optimal transport when , enforcing and hence mass preservation.
Our main task is to establish problem (3) subject to (2) as a regularizer for (1) in a rigorous functional-analytic framework. In the following we provide some details on how to make the terms appearing in (3) rigorous, in particular providing suitable assumptions on and . The natural space in which to cast (3) is given by where . For define the transport energy as the 1-homogeneous convex functional
[TABLE]
where is any positive measure such that and for we define if , and in all other cases. Introduce the affine set where the continuity equation is in the distributional sense, without initial conditions (Definition 2.1). Whenever and , it follows that , and . Moreover with narrowly continuous, i.e., is continuous for all (Proposition 2.4). By setting in (4) we recover the second term in (3) (Proposition 2.6). Next, we outline how we define the space of measurements. Assume given a family of real Hilbert spaces for , with inner products denoted by , satisfying the following.
Assumption 1.1**.**
There exist a Banach space and linear continuous operators with the properties:
- (H1)
for some constant not depending on , 2. (H2)
is dense in , 3. (H3)
the map is Lebesgue measurable for every fixed .
In other words, we assume that each possesses a dense subset , and such subsets are related by the time-measurability condition (H3). In particular, Assumption 1.1 allows us to define suitable notions of strong measurability and integrability for measurements for such that for a.e. in (see Definitions 3.2, 3.8), leading to the definition of the measurements space
[TABLE]
In Theorem 3.13 we show that (5) is a Hilbert space with . Notice that our construction provides a natural extension of the classic Bochner theory to the case of varying codomains, in the sense that (5) coincides with the classical Bochner space when for all , with given Hilbert space. Details about the above construction are contained in Section 3. We now address the assumptions we make on the forward operators appearing in (1).
Assumption 1.2**.**
For a.e. the linear continuous operators satisfy:
- (K1)
is the adjoint of a linear continuous operator , 2. (K2)
for some constant not depending on , 3. (K3)
the map is strongly measurable for every fixed .
Under (K1)–(K3), (H1)–(H3) we have the following: if is narrowly continuous then the map belongs to (Lemma 4.2). At this point, we are ready to rigorously define the regularization functional anticipated in (3) as , where
[TABLE]
if and otherwise. The discrepancy term in is well defined since, if and , then with narrowly continuous, so that belongs to (Proposition 4.3). Notice that, in addition to the regularizer , we also included in the definition of : This serves the purpose of enforcing weak* coercivity on , since no initial data on is prescribed. Our main theoretical results concerning existence of minimizers for and regularization properties are summarized in the following statements, which are contained in Theorem 4.4 and Theorems 4.7, 4.10, respectively.
Theorem 1.3**.**
Assume (H1)–(H3), (K1)–(K3). Let , . Then admits a minimizer satisfying , with narrowly continuous. If in addition the operators are injective for a.e. , then the minimizer is unique.
In the next theorem, denotes the functional in (6) for and .
Theorem 1.4** (Regularization).**
Assume (H1)–(H3), (K1)–(K3). Let be noisy and exact data respectively, for noise level .
- i)
Suppose that strongly in , and ., Then, up to subsequences, converges weakly to .* 2. ii)
Assume that and , such that . If then, up to subsequences, converges weakly to solving (1) and there exist such that*
[TABLE]
1.2. Application to dynamic MRI
We apply the model (1) and its regularization (3) to undersampled dynamic MRI, yielding a reconstruction approach via convex optimization which is principally capable of capturing motion during the acquisition. A common limiting factor to medical imaging techniques and MRI in particular is acquisition speed such that, for instance, data cannot be collected sufficiently fast in order to temporally resolve the beating heart or the lung while breathing. Consequently, static reconstruction approaches lead to severe artifacts. Thus, motion has to be taken into account by considering the dynamic setting in which at each time instance, data is severely undersampled and temporal data redundancies have to be exploited. For this purpose, we show that the optimal-transport regularization framework developed in this paper can be applied, leading to a regularizer that penalizes the displacements caused by motion and intrinsically recovers the motion field as well as the growth rate.
The forward problem in undersampled dynamic MRI in two dimensions is commonly stated as follows: In each time instance , the proton density , a non-negative quantity, needs to be recovered from the measured data . Taking coil sensitivities into account, and are linked via the Fourier transform. However, for each , the Fourier data is only acquired on subsets specified by the sampling strategy, leading to each generally living on a different subset of the so-called -space and hence, being contained in a time-varying data Hilbert space . Modelling the proton density as a positive measure on the image domain , denoting by an appropriately masked Fourier transform and considering the unit time interval , the forward problem then indeed reads as in for . This is made precise in the following.
Adopting the common model for parallel data acquisition (see, e.g., [53, 40, 39, 57]), let be an open bounded domain representing the image domain and let the complex coil sensitivities for with to each of the receiver coils be given. The time-dependent sampling method is represented by a family of measures for . Such measures are required to satisfy some mild regularity assumptions, namely,
- (M1)
for a.e. , where does not depend on , 2. (M2)
the map is measurable for each ,
allowing for a variety of sampling methods, see Section 5 for details. The data space of measurements is then defined by , interpreted as a real Hilbert space and equipped with the norm , where we denote . The forward operators are given by defined via
[TABLE]
where is the Fourier transform and we interpret each as an element of . In Lemma 5.4 we show that under (M1)–(M2), the spaces and the forward operators fulfill (H1)–(H3) and (K1)–(K3), respectively. In this way the hypotheses of Theorems 1.3, 1.4 are satisfied, and we can regularize the reconstruction problem (1) with the functional defined in (6), which in this framework corresponds to
[TABLE]
where the measurements belong to . This shows in particular that optimal-transport regularization for undersampled dynamic MRI leads to well-posed convex optimization problems. These are accessible to analysis as well as efficient and stable numerical minimization algorithms. Section 6 provides some perspectives for the latter.
2. Dynamic optimal transport
The aim of this section is to provide the essential elements to define the optimal transport regularizer appearing in (6). We refer the reader to Appendix A.1 for measure theory definitions and results which will be needed in the following. Throughout the section, is an open bounded domain, with , and we set to be the time-space cylinder. We also define the space . In Section 2.1 we introduce the concept of measure solution to the continuity equation with source
[TABLE]
where . Here represents a density, a momentum field advecting and a source term, accounting for local mass change. We then investigate properties of solutions of (7). In particular in Proposition 2.2 we show that positive solutions to (7) disintegrate as with Borel family of positive measures over . In Proposition 2.4 we prove that, under some growth assumptions on and , the curve is actually narrowly continuous. Finally, in Section 2.2 we introduce the optimal transport energy at (6), and list some of its properties in Proposition 2.6.
2.1. Continuity equation
We want to consider measure valued (distributional) solutions to the continuity equation (7) with suitable boundary conditions. The precise definition is as follows.
Definition 2.1**.**
We say that is a measure solution to (7) if
[TABLE]
We remark that the above weak formulation includes zero flux boundary conditions for the momentum on , and no initial and final data is prescribed on . Moreover one can test (8) against functions in (see [6, Remark 8.1.1]). In the following proposition we show that positive solutions to (7) can be disintegrated with respect to the Lebesgue measure on (see Section A.1.4 for details on disintegration). To this end, let be the projection on the time coordinate.
Proposition 2.2**.**
Assume that satisfies (8), with . Then disintegrates, with respect to , as , where for a.e. . Moreover is a function of bounded variation, with distributional derivative . In particular, if the source , then the total mass is constant in time.
Proof.
In order to apply Theorem A.1 we need to show that . Let and define . By plugging in (8) we get , so that in the sense of distributions. Since , there exists such that . Therefore and there exists a Borel family such that . In particular, since is a Borel family, the map is measurable. Moreover it belongs to , since which is finite by assumption. Finally notice that , which together with implies that belongs to , with distributional derivative given by . ∎
Definition 2.3**.**
A curve is narrowly continuous if the map
[TABLE]
is continuous for every fixed . We denote by the set of such curves, and by the set of narrowly continuous curves of positive measures.
In the next proposition we show that if solves the continuity equation with appropriate energy bounds, the disintegration measures are defined for every and are narrowly continuous. This is a well-known result for . For completeness we will carry out the proof in Appendix A.2, by adapting the argument used to prove the homogeneous version (see Lemma 8.1.2 in [6]).
Proposition 2.4** (Continuous representative).**
Let be a solution of (8), with . Let be the disintegration of with respect to . Assume that and with , measurable functions such that
[TABLE]
Then there exists a narrowly continuous curve such that a.e. in . Moreover for each and we have
[TABLE]
In the rest of the paper we will identify with its narrowly continuous representative whenever the assumptions of Proposition 2.4 hold, and use the notation .
2.2. Optimal transport energy
We want to introduce the Wasserstein–Fisher–Rao energy (or Hellinger–Kantorovich) as originally done in [19, 20, 42, 43, 41]. First, define the convex set
[TABLE]
with fixed parameter and for every . For set
[TABLE]
where for and for . We have that is the Legendre conjugate of the characteristic function {\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{K_{\delta}} [20], that is,
[TABLE]
In particular is convex, lower semicontinuous and 1-homogeneous.
Definition 2.5** (Transport energy).**
Let . We define the transport energy as
[TABLE]
We summarize some of the properties of the functional that will be needed throughout this paper. The proof is omitted, and it can be easily adapted from the one in [55, Prop 5.18].
Proposition 2.6**.**
The functional defined in (12) is convex and lower semicontinuous for the weak convergence. Moreover it satisfies the following properties:*
- (i)
, 2. (ii)
assume that for some . Then
[TABLE] 3. (iii)
if then and , 4. (iv)
if and , then , for measurable , and
[TABLE]
3. Time dependent Bochner spaces
In this section we construct a class of Bochner spaces of Hilbert spaces valued functions, where the Hilbert space can vary in time. Here the underlying measure space is the unit interval with the Lebesgue measure. This can however be easily generalized to arbitrary measure spaces. Moreover a generalization to Banach spaces valued functions seems possible, however, it is out of the scope of this paper. More precisely, we want to define a concept of integrability for functions , where is a Hilbert space for each time , and for all . In order to do that we will closely follow the approach to define classic Bochner spaces (see [26, Ch II], [3, Ch 11]). In Section 3.1 we establish the functional analytic setting and assumptions under which we carry out the construction. In Section 3.2 we define suitable notions of measurability and provide the equivalent of the classic Pettis measurability theorem (see Theorem 3.5). Such result is instrumental to the following analysis, as it provides a practical characterization of strong measurability. In Section 3.3 we define integrability for maps and characterize it in Theorem 3.9. We then proceed to define the time dependent Bochner spaces . Notice that, in contrast to the classic Bochner theory, we will not define a notion of integral for integrable maps , but only of integrability. However, a comparison with the classical theory is possible, and it will be carried out in Appendix B.2.
3.1. Functional setting
Let for be a family of real Hilbert spaces with norms and scalar products denoted by and , respectively. The interval is equipped with the Lebesgue measure. As usual, we denote by the measure of a set and by {\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E} its characteristic function, defined as {\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E}(t):=1 if and {\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E}(t):=0 otherwise. Let . We will denote by maps such that for a.e. . Let be a real Banach space with norm denoted by and duality by . Assume that for a.e. there exists a linear continuous operator with the following properties:
- (H1)
for some constant not depending on , 2. (H2)
is dense in , 3. (H3)
the map is Lebesgue measurable for every fixed .
The adjoint of is , defined by for all (here we identified with its dual). Notice that from (H1) it follows that is linear continuous and such that . Moreover from (H2) we have that is injective. Throughout the section, we say that if the equality holds a.e. in . Moreover we say that a.e. if for a.e. .
3.2. Measurability in time dependent spaces
In this section we introduce suitable measurability notions for maps , and prove our version of Pettis’ Theorem. We refer the reader to [26, Ch II] for classic measurability definitions.
Definition 3.1** (Step function).**
A map is a step function if f=\sum_{j=1}^{N}{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{j}}\varphi_{j} with , and Lebesgue measurable pairwise disjoint subsets of .
Definition 3.2** (Measurability).**
Let . We say that
- i)
is strongly measurable if there exists a sequence of step functions such that
[TABLE] 2. ii)
is weakly measurable if is Lebesgue measurable for each , 3. iii)
is essentially separably valued if there exist a measurable set with and a countable subset with the following property: for every and , there exists an element such that
[TABLE]
Notice that, if for each , with fixed Hilbert space, and , then Definitions 3.1 and 3.2 are equivalent to the classic ones given in Chapter II of [26].
Remark 3.3**.**
Let and f=\sum_{j=1}^{N}{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{j}}\varphi_{j} be a step function. Then the map is measurable and . Indeed, {\left\|i_{t}f(t)\right\|}_{H_{t}}^{2}=\sum_{j=1}^{N}{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{j}}(t){\langle i_{t}\varphi_{j},i_{t}\varphi_{j}\rangle}_{H_{t}}, so that is measurable by (H3), and hence also is. Moreover by (H1)
[TABLE]
Remark 3.4**.**
It is easy to check that strong measurability is stable under sums, scalar multiplication and pointwise a.e. convergence. Moreover if is strongly measurable then the map is Lebesgue measurable, since can be approximated a.e. by step functions and is measurable for every fixed by Remark 3.3.
The above definitions are linked together by the analogous of the classic Pettis measurability Theorem (see [26, Ch II.1, Thm 2]).
Theorem 3.5** (Pettis).**
Let . Then is strongly measurable if and only if is weakly measurable and essentially separably valued.
For a proof of this theorem, see Appendix B.1. By inspecting the proof, one can see that the following corollary holds.
Corollary 3.6**.**
Let . Then is strongly measurable if and only if it is the a.e. uniform limit of a sequence of countably valued functions , that is, if
[TABLE]
uniformly for a.e. , for some f_{n}=\sum_{j=1}^{\infty}{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{n,j}}\varphi_{n_{j}} with and measurable and pairwise disjoint subsets of .
Proposition 3.7** (Separable case).**
Assume that is separable. Then strong measurability is equivalent to weak measurability.
Proof.
Let be countable and dense. Then is dense in : indeed fix and . By (H2) there exists such that . Let be such that where is the constant in (H1). Then by triangle inequality and (H1). Therefore is separable and it is immediate to check that in this case every function is essentially separably valued. By Theorem 3.5 the thesis follows. ∎
3.3. Integration and spaces
Definition 3.8** (Integrability).**
Let be strongly measurable according to Definition 3.2. We say that is integrable if there exists a sequence of step functions such that
[TABLE]
Notice that the definition is well posed, since the map is Lebesgue measurable for each fixed (see Remark 3.4), hence its integral is well defined. Analogously to the classic case ([26, Ch II.2, Thm 2]), we can characterize integrability as stated in the following theorem.
Theorem 3.9** (Characterization of integrability).**
Let be strongly measurable. Then is integrable if and only if .
Proof.
Assume that is integrable. By (13), for sufficiently large we have
[TABLE]
and the thesis follows by Remark 3.3, since is a step function. Conversely, assume that . Since is strongly measurable, by Corollary 3.6 there exists a sequence of countably valued maps such that for a.e. . In particular , so that . By construction g_{n}(t)=\sum_{j=1}^{\infty}{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{n,j}}\varphi_{n,j} with sets measurable and pairwise disjoint. Hence there exists a sequence in such that . Therefore by setting f_{n}(t):=\sum_{j=1}^{k_{n}}{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{n,j}}\varphi_{n,j} we obtain
[TABLE]
proving that is integrable. ∎
As a corollary of the above theorem, we obtain that a suitable version of Lebesgue’s dominated convergence theorem holds in our setting. We postpone its proof to Appendix B.1.
Theorem 3.10** (Dominated convergence).**
Let be a sequence of integrable functions such that a.e. in and that there exists satisfying a.e. in . Then is integrable and strongly in .
Definition 3.11** ( space).**
Fix . We define the space of the -integrable functions
[TABLE]
In (14) we identify functions that coincide almost everywhere. Notice that (14) is well posed: Indeed since is strongly measurable, then the map is measurable (Remark 3.4), and hence is well defined (possibly infinite).
Remark 3.12**.**
If with fixed Hilbert space and , then coincides with the respective classic Bochner spaces (see [26, Ch II]).
Theorem 3.13**.**
Let . We have that is a Banach space with the norm . Moreover is a Hilbert space with the inner product .
The proof of the above theorem is postponed to Appendix B.1.
Remark 3.14** ().**
It is is possible to treat the case by defining as the set of strongly measurable functions such that . By adapting the proof for classical Bochner spaces, one can show that is a Banach space with the norm .
Remark 3.15** (Dual spaces).**
The space is self-dual, being a Hilbert space. We believe that for any one has the isometry with .
Example 3.16** (Narrowly continuous curves).**
Let be an open bounded domain with , . Let , with the supremum norm, with the norm . Define by . It is left to the reader to check that (H1)–(H3) are satisfied. Therefore we can define the space for each . Notice that in this case is isometric to , where .
Finally, we would like to mention that, although we do not define a notion of integral for maps in , a comparison to the classical Bochner theory is still possible. Indeed, if , then by definition and the codomain is a fixed Banach space. In Appendix B.2 we show that, assuming , we always have that is weakly* integrable (see Proposition B.2). However, Bochner integrability fails in general, as shown in Example B.3. Nevertheless, under suitable additional assumptions, one can show that Bochner integrability can be guaranteed (Proposition B.4).
4. Regularization of dynamic inverse problems
In this section we define and study the properties of the optimal transport based functional (6), and we establish it as a regularizer for the dynamic inverse problem (1). Throughout the section, the functional analytic setting will be the following. Let be an open bounded domain, where , , and define again the time space domain . Let be a family of Hilbert spaces for , a Banach space and linear operators which satisfy the assumptions (H1)–(H3) of Section 3.1. Assume given a family of linear continuous operators such that, for a.e. ,
- (K1)
is the adjoint of a linear continuous operator , 2. (K1’)
is weak*-to-weak continuous, 3. (K2)
for some constant that does not depend on , 4. (K3)
the map is strongly measurable in the sense of Definition 3.2, for all .
We remark that conditions (K1) and (K1’) are equivalent. As before, the space of narrowly continuous curves with values in the measures and in the positive measures are denoted by and respectively. The dynamic inverse problem we aim to regularize is the following: Given some data , find a narrowly continuous curve such that
[TABLE]
We regularize (15) as follows. Let be defined by
[TABLE]
and introduce the convex linear space of triples in satisfying the continuity equation
[TABLE]
Definition 4.1** (Regularized problem).**
Let and , . The regularizer of (15) is the functional defined by
[TABLE]
if and otherwise. Here is the disintegration of with respect to time and is the transport energy defined in (12).
We will proceed as follows. First, in Section 4.1 we show that the inverse problem in (15) and the functional in (16) are well defined, in the following sense: Given a triple with finite transport energy , then and it disintegrates into with narrowly continuous. For such curves, we show in Lemma 4.2, that is a measurement belonging to , providing well-definition for (15)–(16). In Section 4.2 we show that
[TABLE]
admits at least one solution, and the minimizer is unique under additional assumptions on the operators . This will be the content of Theorem 4.4. Finally, in Section 4.3, we investigate stability of the solutions to (17) and convergence for vanishing noise level.
4.1. Well-definition
In this section we want to show that the definition of the functional at (16) is well posed. The first step is to ensure that the fidelity term is well defined, namely, that given then the map belongs to . This fact will be established in the following Lemma.
Lemma 4.2**.**
If then .
Let us postpone the proof for a moment. As a consequence of the Lemma we have the following.
Proposition 4.3**.**
The functional at (16) is well defined.
Proof of Lemma 4.2.
Part 1. Let . First we show that the map is strongly measurable according to Definition 3.2. We do so by means of Theorem 3.5, by proving that is weakly measurable and essentially separably valued.
Claim 1: the map is weakly measurable as per Definition 3.2, that is, is measurable for every fixed .
Proof of Claim 1. By definition and (K1) we have Notice that the map is strongly measurable in the classic sense ([26, Ch II]). To see this, since is separable, by the classic Pettis Theorem ([26, Ch II.1, Thm 2]), it is enough to prove that is weakly measurable, meaning that is measurable for every fixed . The latter holds because and the map is strongly measurable by assumption (K3), and hence weakly measurable by Theorem 3.5. By definition of classic strong measurability, there exists a sequence of step functions , such that f_{n}(t)=\sum_{j=1}^{N_{n}}{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{j,n}}f_{j,n} with measurable partition of , , and such that
[TABLE]
for a.e. . We have that the map is measurable for each fixed , since {\langle\rho_{t},f_{n}(t)\rangle}_{\mathcal{M}(\overline{\Omega}),C(\overline{\Omega})}=\sum_{j=1}^{N_{n}}{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{j,n}}\int_{\overline{\Omega}}f_{j,n}\,d\rho_{t} and the maps are continuous by narrow continuity of . By Proposition A.3 we have that . Combining this with (18) yields
[TABLE]
as , for a.e. . Hence is measurable, being the a.e. limit of measurable maps, and the claim follows.
Claim 2: the map is essentially separably valued, that is, there exists a measurable set such that and a countable set with the following property: for every and there exists such that .
Proof of Claim 2. Let be a countable dense subset. Fix . By (K3) the map is strongly measurable and hence essentially separably valued by Theorem 3.5. Therefore there exists a measurable set with and a countable subset with the following property: for every and , there exists such that
[TABLE]
Denote by . Since is countable, the set is measurable, and . Moreover let , so that is countable. Define the set of averages of elements of as
[TABLE]
We have that is countable. Fix , . The claim follows by showing there exists such that
[TABLE]
Indeed, by density, there exists a sequence in such that as . Since it follows that in . By weak*-to-weak continuity of we have weakly in as . By the Banach–Saks property in [25, Ch VIII, Thm 1], there exists a subsequence (not relabelled) such that strongly in . Hence we can choose such that
[TABLE]
Since is a sequence in , by (19) and the definitions of and , we have that for every there exists such that
[TABLE]
Define , so that . By triangle inequality, linearity of , and (21)–(22),
[TABLE]
which yields (20).
Part 2. Since is strongly measurable, also is measurable. By (K2) and Proposition A.3 we have
[TABLE]
Hence by Theorem 3.9 we conclude that is integrable and it belongs to . ∎
Proof of Proposition 4.3.
If then also , hence by Proposition 2.6 we have that , , for Borel maps , such that
[TABLE]
By assumption solves the continuity equation, hence (Proposition 2.2) for some Borel family . In particular we have and . By (23) and Proposition 2.4 we have that . Therefore by Lemma 4.2, and the first term in is finite. ∎
4.2. Existence of minimizers
The aim of this section is to prove that the functional defined in (16) admits at least one minimizer. Such minimizer is unique under suitable hypotheses on the operators . The precise statement is the following.
Theorem 4.4**.**
Let and , . Then there exists with , , that solves the minimization problem
[TABLE]
If in addition is injective for a.e. , then the minimizer is unique.
The proof of the above theorem is based on the direct method of calculus of variations. Before proceeding to the proof, we will establish compactness and lower semicontinuity properties for the functional . This is the object of the following two lemmas.
Lemma 4.5** (Compactness for ).**
Let and , . Assume that there exists a constant such that the sequence in satisfies
[TABLE]
Then for some . Moreover there exists with , such that, up to subsequences,
[TABLE]
Proof.
By the energy bound (24), there exists such that, up to subsequences, weakly* in . From (24) we also have
[TABLE]
so that Proposition 2.6 implies , , for Borel maps , such that
[TABLE]
By (24) we have . Hence Proposition 2.2 implies that for some Borel family in . In particular we have and . Hence by (27) and Proposition 2.4 we get that . Now notice that if , then by definition of (see (12)) we infer
[TABLE]
where is defined in (11) and
[TABLE]
Therefore, by taking we conclude
[TABLE]
By combining (24) with the above estimates we conclude that (up to subsequences) and for some . Since is weak* closed, we get . By Proposition 2.6 the functional is weak* lower semicontinuous. Therefore (26) implies and by repeating the arguments above, we get that , , with and
[TABLE]
In particular . We will now show the second condition in (25). Since solves the continuity equation, by Proposition 2.2 we have that the map belongs to with distributional derivative given by , where we recall that is the projection on the time coordinate. Therefore, by the embedding of into
[TABLE]
where we used (28) and (24). Hence the set is uniformly bounded in , so it belongs to some set which is weak* sequentially compact. Moreover as a consequence of Lemma A.2 we have that for every
[TABLE]
where are the constants defined in the same lemma. The last inequality follows from (27) and the fact that is uniformly bounded, so that the constant does not depend on . Hence by Proposition A.4 there exists a subsequence (not relabelled) and a -continuous curve such that
[TABLE]
In particular is narrowly continuous, since it is -continuous (this fact can be obtained by repeating the same argument given in the proof of Proposition 2.4). Notice that (30) implies that where . Hence . By uniqueness of the disintegration we also get that and the thesis follows. ∎
Lemma 4.6** (Lower semicontinuity for ).**
Let and , . Assume that with , is such that converges to in the sense of (25), where , . Then we have
[TABLE]
In particular is lower semicontinuous with respect to the convergence in (25), that is,
[TABLE]
Proof.
Let us start by showing (31). To this end, fix . By assumption we have that weakly* in , for every . In particular, by (K1’), we have weakly in for a.e. , so that
[TABLE]
as . By proceeding as in (29) we obtain
[TABLE]
for some constant , since and are uniformly bounded by weak* convergence in . Hence by Cauchy–Schwarz and by (K2) we have
[TABLE]
Since we have that belongs to . By combining the above estimate with (33) and invoking the classic dominated convergence theorem we conclude (31).
Let us now prove the remaining part of the Lemma. From (31) we have that converges weakly to in , therefore by lower semicontinuity of the norm with respect to the weak convergence we have
[TABLE]
Moreover is weak* lower semicontinuous by Proposition 2.6, thus
[TABLE]
and (32) follows. ∎
Proof of Theorem 4.4.
Existence: Set with . Then , and by (iv) in Proposition 2.6. Moreover by Lemma 4.2, so that and the infimum is finite. Let be a minimizing sequence, that is, as . Therefore for some constant . From Lemma 4.5 we have and . Moreover there exists with , such that, up to subsequences, converges to in the sense of (25). By Lemma 4.6 and the fact that is a minimizing sequence we conclude that is a minimizer for .
Uniqueness: Assume that is injective for a.e. . The term is convex by Proposition 2.6. Also the term is convex as it is a norm. Since minimizers are necessarily of the form with , in order to prove uniqueness it is sufficient to show that
[TABLE]
is strictly convex for . First, consider such that . As a consequence the set is open (by continuity of ) and not empty, so . Let . By assumption we have . Therefore
[TABLE]
since for . Now let with and . The coefficient of the leading term of is , which is non zero by (35), since . Hence the map in (34) is strictly convex and we conclude. ∎
4.3. Regularization properties
In this section we denote by the exact data and by the noisy data for the noise level , that is, . For a datum and parameters we adopt the following notation:
[TABLE]
if and otherwise.
Theorem 4.7** (Stability).**
Assume that strongly in and that
[TABLE]
Then with , . Moreover admits a subsequence converging in the sense of (25). The limit of each converging subsequence of is a minimizer of .
Proof.
A sequence satisfying (36) exists by Theorem 4.4. By the same theorem it also follows that with . We have
[TABLE]
Since is a minimizer for we can test (36) against to obtain
[TABLE]
where last inequality follows from the convergence in . By applying Lemma 4.5 to , there exists , with , such that converges to in the sense of (25). We are left to show that is a minimizer for . Since for every , by Lemma 4.6 and the convergence we have weakly in . Also by Lemma 4.6,
[TABLE]
for every , since (36) holds and . Hence is a minimizer. ∎
We are now interested in studying properties of the minimizers of for vanishing noise level, that is, for data such that for every . To this end, we need to understand how the regularization term
[TABLE]
behaves for fixed argument . Since multiple parameters are involved, we will also allow and to take the value . We define
[TABLE]
where denotes the convex indicator function of the set . In order to give a similar definition for the case we first need to characterize the subset of where .
Proposition 4.8**.**
Assume that . We have that if and only if , and for some .
Proof.
By Proposition 2.6 point (iv) we have that for any . Conversely, assume that is such that . In particular the energy is finite, so points (iii)–(iv) of Proposition 2.6 imply that , , for some Borel maps , , and we have . Since and , we conclude that and . By assumption solves the continuity equation in the sense of (8), therefore Proposition 2.2 guarantees that for some Borel family . Since and a.e. in , we can apply Proposition 2.4 and conclude that . In particular, for every and with , formula (10) reads . By a density argument one can show that the previous holds for all , and hence for every . The thesis follows by setting . ∎
Proposition 4.8 motivates the following definition:
[TABLE]
where We are now in the position to define minimal energy solutions of the inverse problem
[TABLE]
Definition 4.9** (Minimal energy solution).**
Let and , . We say that is a minimal energy solution of (39) with parameters if
[TABLE]
In the following theorem we show that the minimizers for vanishing noise level converge in the sense of (25) to an energy minimizing solution of the inverse problem (39).
Theorem 4.10** (Convergence for vanishing noise level).**
Let be the exact data and be a sequence of noisy data such that . Let be such that
[TABLE]
Let , , so that, up to subsequences, and as , with . Assume there exists satisfying , , (39) and
[TABLE]
Let be such that
[TABLE]
Then with , and converges to in the sense of (25), up to subsequences. Moreover, every such weak limit of is a minimal energy solution of (39) with parameters and .*
Proof.
First notice that and as by (40). If or , we set and respectively. If either of the sequences do not diverge to , it is possible to find accumulation points . In particular, up to extracting a subsequence, we can assume that and as . A sequence satisfying (42) exists by Theorem 4.4. By the same theorem it also follows that with . By testing (42) against and using (39)–(40) we get
[TABLE]
In particular in . Since by assumption , we obtain in . Dividing the inequality at (43) by , taking the limes superior and keeping (40) intro account yields
[TABLE]
Notice that the right hand side in (44) is always bounded, thanks to definitions (37), (38) and assumption (41). By definition we have , thus
[TABLE]
where we used (44) and the convergence . Therefore an application of Lemma 4.5 guarantees the existence of with , , such that, up to subsequences, converges to in the sense of (25). In particular by Lemma 4.6 we have weakly in . Since we already proved that , by uniqueness of the weak limit we have
[TABLE]
We are left to show that is an energy minimizing solution of (39). By Lemma 4.6
[TABLE]
where we used (44) and that , . Replacing by an arbitrary solution of (39) with finite energy , the argument can be repeated, and from (45)–(46) we conclude that is an energy minimizing solution of (39). ∎
5. Application to dynamic undersampled MRI
We will now detail on the application of the above results to dynamic magnetic resonance imaging as outlined in the introduction. Let be an open bounded domain representing the image frame, and for with be the coil sensitivities. Let for be a family of measures such that
- (M1)
for a.e. , where does not depend on , 2. (M2)
the map is measurable for each .
Let be the Banach space normed by , where . Define Hilbert spaces , equipped with the norm , where . Define as the identity map, acting component-wise. Note that here we are interpreting and as real vector spaces. For a measure we denote its Fourier transform as
[TABLE]
so that . Notice that in the above definition we extend to be zero outside of . For each , define the linear operator as
[TABLE]
In the MRI context, the family of measures for represents the proton density at each time step. Given some data , we want to reconstruct a solution to the dynamic inverse problem
[TABLE]
As proposed in the previous sections, we relax the problem to measures , with , and minimize the functional introduced in (16). Under the assumptions (M1)–(M2) the functional admits at least one minimizer, and minimizers are unique under suitable additional assumptions. This claim is the object of the following theorem.
Theorem 5.1**.**
Let , , . Let for be a family of Radon measures in satisfying (M1)–(M2). Let be coil sensitivities. Then the regularization of (48) according to
[TABLE]
admits a solution with and the curve belonging to . If in addition the supports of the measures have non empty interior for a.e. , and the vector of coil sensitivities satisfies for every , then the minimizer is unique.
Before proceeding with the proof, we want to show how this analytical framework allows us to treat a wide variety of sampling patterns. We will give two examples.
Example 5.2** (Continuous sampling).**
Let and for define the line . Set , that is, the restriction of the -dimensional Hausdorff measure to the lines . It is immediate to check that satisfies (M1)–(M2): indeed , while the map is continuous for in . In the same way we can treat radial sampling, by setting to be a collection of diameters through the origin, evolving in time (see Example B.3).
Example 5.3** (Compressed-sensing sampling).**
In this example we propose to sample along a finite collection of moving points in an open bounded domain . To be more specific, fix , and for every let be a measurable curve. For a.e. define and . Notice that (M1) is satisfied since . Given a map we have that is measurable by construction. Therefore also (M2) is satisfied.
We now want to prove Theorem 5.1. Before that, we need a preliminary lemma, stating that under (M1)–(M2) the above definitions of satisfy assumptions (H1)–(H3) and (K1)–(K3).
Lemma 5.4**.**
Assume that (M1)–(M2) hold. The spaces , and the operators satisfy (H1)–(H3). Moreover the operators in (47) satisfy (K1)–(K3).
Proof.
Notice that is linear and continuous, with . In particular (H1) follows from (M1). Moreover (H2) is trivially satisfied. Finally for we have
[TABLE]
which is measurable by (M2), as it is the real part of a sum of measurable maps. Hence (H3) is also satisfied. Let us now show that (K1)–(K3) hold. For we have
[TABLE]
Hence, each is square integrable with respect to , so that maps into . Moreover, by the above estimate we also have
[TABLE]
where is the vector of coil sensitivities. Therefore is continuous, with and (K2) is satisfied because of assumption (M1). Let us show that is weak*-to-weak continuous. To this end, let in be such that . Since are supported in the compact set , it follows that
[TABLE]
as . Moreover, by weak* convergence we have . As a consequence of (49), there exists a constant such that
[TABLE]
By invoking the dominated convergence theorem in conjunction with (50)–(51) we conclude that weakly in . Hence (K1’) is satisfied and, as a consequence, is the adjoint of some linear continuous operator . Finally we need to show (K3): that the map is strongly measurable according to Definition 3.2, for every fixed . Notice that the space is separable, hence by Proposition 3.7 it is sufficient to show that is weakly measurable according to Definition 3.2. However, since is bounded (see (49)) and maps in are continuous, this is an immediate consequence of (M2). ∎
Proof of Theorem 5.1.
The existence of a minimizer follows from Proposition 5.4 and of Theorem 4.4. For the uniqueness, from Theorem 4.4, it is sufficient to check that the operators are injective for a.e. . To this end, choose such that has non empty interior, and let be such that . In particular in , for every . Since is analytic and contains an open ball, we conclude that in . By injectivity of the Fourier transform we have that , and since we are assuming that for every , we conclude that . ∎
6. Conclusions and perspectives
In the paper, we have shown that it is possible to successfully use energy functionals that are associated with a dynamic formulation of optimal transport as regularization functionals for dynamic inverse problems that aim at the recovery of measure-valued curves. Let us point out some future directions of research. On the one hand, the focus of the paper is on regularizers that penalize mass transport by the squared distance as well as mass growth in terms of quadratic costs for growth rate. Thus, a generalization to other convex optimal transport energies (such as, e.g., the -th power of the euclidean distance) in an appropriate dynamic context (i.e., where the dynamic formulation involves a continuity type equation), would be interesting and seems to be possible. On the other hand, the regularized problems involve, in addition to the transport energy, a Radon-norm term which corresponds to a penalization of the total mass. Also here, a generalization to other regularization functionals should be possible, provided that one can still ensure boundedness of the total mass. This way, it might be possible to impose, e.g., spatial smoothness of the solution curve.
Finally, we would like to mention that a numerical optimization algorithm for the solution of the regularized problem is currently under preparation, in the general setting and also with focus on the application to dynamic MRI. The numerical solution to the dynamic optimal transport problem outside the inverse problems context is addressed for example in [8, 20, 49], where the authors employ proximal splitting algorithms, after a careful discretization of the continuity equation constraint by means of staggered grids, and, subsequently, of the energy. The recent application to dynamic PET imaging [59] also builds on these algorithms. The method we propose, based on Frank-Wolfe-type algorithms [14, 28, 33], is inherently discretization-free and therefore genuinely cast in the space of Radon measures, in the spirit of [15, 52]. Generally, such algorithms are attractive for obtaining sparse solutions for inverse problems in terms of extremal points of the regularizer [9, 10]. In our setting, the idea is to linearize the fidelity term in (3) around some initial guess, and then proceed to minimize (a suitably modified version) of the functional obtained. The key part of the analysis is that such linearized problem admits a solution which is an extremal point of the unit ball of the regularizer, making the minimization problem numerically accessible, although non-convex in general. Indeed, as shown in [11], the extremal points of the Benamou-Brenier regularizer are given by measures concentrated on curves (with a certain regularity), yielding an optimization problem in some Sobolev space. It is then possible to show convergence of the algorithm in the space of measures. Such an approach is then particularly well-suited to recover sparse solutions, given by travelling Dirac deltas with the addition of noise, making it attractive for medical applications in which the tracking of point-sources is relevant. With this approach one can, in principle, deal also with the unbalanced optimal transport case. The key ingredient is of course the characterization of the extremal points of the Wasserstein–Fisher–Rao energy, which is currently under preparation by the authors, and bases on a measure-theoretic superposition principle for the non-homogeneous continuity equation, which is in itself a novel result.
Acknowledgments
The authors gratefully acknowledge support by the Christian Doppler Research Association (CDG) and Austrian Science Fund (FWF) through the Partnership in Research project PIR-27 “Mathematical methods for motion-aware medical imaging”. The Institute of Mathematics and Scientific Computing, to which the authors are affiliated, is a member of NAWI Graz (http://www.nawigraz.at/en/). The authors are further members of/associated with BioTechMed Graz (https://biotechmedgraz.at/en/).
Appendix A Measure theory
A.1. Measure theory preliminaries
In this paper we follow the definitions and notations of [5]. In particular, scalar or vectorial measures will always be defined on the Borel -algebra of some locally compact, separable metric space . Given a measure , we denote with its total variation. We always assume that is at least locally finite. The set of -valued measures for which is denoted by and , while the set of positive finite measures is denoted by .
A.1.1. Absolute continuity, support and restriction
Given and , we say that is absolutely continuous with respect to , in symbols , if whenever , . If and are real or vector valued measures, we say that they are mutually singular, , if there exists a set such that and . For a measure its support, denoted by , is the closure of the set of points such that for every neighbourhood of . If and , the restriction of to is the measure defined as for every .
A.1.2. Push-forward
Let be a locally compact, separable metric space. A map is Borel if for each . If is a real or vector valued measure on we define the push-forward of through as the measure on , defined by for each . If is a real or vector valued map on integrable with respect to then We recall that is is continuous and proper ( is compact if is compact) and is finite, then also is finite.
A.1.3. Convergences
Let be a sequence of measures on . We say that narrowly converges to , in symbols , if for all , i.e., continuous and bounded. We say that weak converges* to , in symbols , if for all . We recall that if and for each there exists compact such that in . Note that if is compact, then narrow convergence and weak* convergence coincide.
A.1.4. Disintegration
Let , be locally compact, separable metric spaces. Consider family of measures on . We say that the family is Borel if the map is Borel measurable for every measurable. Such condition implies that for every bounded Borel function , the map is Borel measurable (see [5, Prop 2.26]). We now state the disintegration theorem. For this version and the following properties, see [2, Sections 2.3, 2.4] and [5, Thm 2.28].
Theorem A.1** (Disintegration).**
Let , be locally compact, separable metric spaces. Let be a real (resp. vector valued) measure on , the projection on the first factor, and a positive measure on , with the property that . Then there exists a Borel family of real (resp. vector valued) measures on such that , that is,
[TABLE]
for every . A family such that is called a disintegration of with respect to .
In the setting of Theorem A.1 the following properties hold:
- (i)
If is another disintegration of with respect to , then for -a.e. , 2. (ii)
If is finite, then also is finite for -a.e. , 3. (iii)
Let , . Then if and only if for -a.e. .
A.2. Narrow continuity results
As in Section 2.1 we denote by the set of narrowly continuous curves and by the set of positive narrowly continuous curves. The remaining notations are the same as in Section 2.
Lemma A.2**.**
Let , , , , with , measurable. Assume that in the sense of (8) and
[TABLE]
Set and . Then and
[TABLE]
We remark that the above lemma is an easy generalization of Lemma 2.2 in [41], where the authors prove the same result under the restriction that .
Proof.
Since , we have . Arguing as in the proof of Proposition 2.4, we obtain that for all , the weak derivative of satisfies
[TABLE]
In particular by applying twice the Cauchy–Schwarz inequality we get
[TABLE]
The proof is concluded if we show that . By applying the above estimate to we get . If we pick and such that and , then we get , from which follows . ∎
Proposition A.3**.**
If then
Proof.
The curve defines a family of functionals in , via . By narrow continuity, the map is continuous for each , yielding that . The principle of uniform boundedness then implies the thesis. ∎
Proposition A.4** (A refined version of Ascoli–Arzelà’s Theorem).**
Let be sequentially weak compact and be such that for all , and*
[TABLE]
where is continuous, symmetric and such that for every . Then there exists a -continuous curve and a subsequence such that for every .
The above statement is a particular case of [6, Prop 3.3.1], since is a metric space and the -norm is weak* sequentially lower semicontinuous.
Proof of Proposition 2.4. Let , . Set , so that is a test function for (8). Define . Notice that is measurable since is a Borel family. Moreover belongs to since where . Testing (8) against yields that weakly. Notice that for a.e. we have
[TABLE]
In particular by assumption (9), so that with
[TABLE]
By the embedding , there exists a unique such that for a.e. , and where does not depend on , thanks to (53). Moreover
[TABLE]
By density of in , for each the map can be uniquely extended to an element of , since the maps are linear and the extensions are unique. This defines a bounded curve in . Such curve is uniformly continuous in , since (52) and (54) imply for every . We are left to prove that for each . This follows from the fact that there is a Borel set with such that for every and is weak* sequentially precompact in , since by Proposition 2.2 we have that is a.e. bounded as belongs to . The weak* continuity of the curve in automatically follows from the one in : indeed let and let be a sequence in such that as . Then it is immediate to check that so that is continuous, since it is uniform limit of continuous maps . Finally let , and define , where is such that , \lim_{\varepsilon\to 0}a_{\varepsilon}(t)={\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{(t_{1},t_{2})}(t) for almost every and weakly* in . Testing (8) against and passing to the limit as (by continuity of and (9)) yields (10).
Appendix B Time-Dependent Bochner Spaces
In this appendix we assume (H1)–(H3) as in Section 3.1. Definitions of step functions, strong measurability, weak measurability, separably valued and integrability are as in Sections 3.2, 3.3.
B.1. Auxiliary results and proofs of Section 3
Here we state and prove a suitable version of Egoroff’s Theorem, as well as present the proofs of Theorems 3.5, 3.10, 3.13.
Proposition B.1** (Egoroff).**
Let be strongly measurable and such that, for a.e. , . Then for each fixed there exists a Lebesgue measurable set with and such that uniformly in , that is,
[TABLE]
Proof.
The proof follows by replacing absolute values with the norms in the proof of the classic Egoroff Theorem. Indeed, since are assumed to be strongly measurable, the map is Lebesgue measurable (see Remark 3.4). Then the sets are measurable for each fixed . Moreover, for fixed, we have that and as , since we are assuming that a.e. in . Let be an increasing sequence of indices such that . It is immediate to see that the measurable set satisfies (55). ∎
Proof of Theorem 3.5. Part 1. Assume that is strongly measurable. Hence there exists a sequence of step functions with f_{n}=\sum_{j=1}^{N_{n}}{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{n,j}}\varphi_{n,j} such that a.e. in . We claim that is weakly measurable: Indeed fix and define , for . Clearly is measurable for fixed , by (H3). Moreover using Cauchy–Schwarz and (H1) yields , so that for a.e. , implying that is measurable, and hence is weakly measurable. We will now show that is essentially separably valued. By definition is strongly measurable and a.e., hence Proposition B.1 implies that for every there exists a measurable set with and such that uniformly on . Define the countable set . Let so that . Fix and . Hence there exists an index such that . By uniform convergence we conclude that , for sufficiently large . Therefore Definition 3.2 iii) is satisfied by setting .
Part 2. Let be weakly measurable and essentially separably valued. Let be countable and with satisfying Definition 3.2. For define if and otherwise. Notice that is Lebesgue measurable for each fixed , since is measurable by (H3). We will now show that
[TABLE]
Indeed, the supremum never exceeds by the Cauchy–Schwarz inequality. Conversely, if the equality is trivial, hence assume . Fix . Since is essentially separably valued, there exists such that . In particular . Then
[TABLE]
and since is arbitrarily small we conclude. Notice that the map is measurable by weak measurability of . Thus is measurable, being product of measurable maps. Since the countable pointwise supremum of measurable functions is measurable, by (56) we conclude that is measurable. Also the map is measurable at fixed, as
[TABLE]
is a sum of measurable functions, where the second element is measurable by weak measurability of and the third by (H3). Fix and define the measurable sets and the map by setting if for some , and otherwise. Note that for there exists some index such that . Therefore, by picking the smallest such that this condition is verified, we have . Since this is true for each , this means that a.e. in . Hence we can approximate essentially uniformly by with countably valued. By choosing we obtain countably valued functions such that
[TABLE]
where . Note that by definition g_{n}=\sum_{j=1}^{\infty}{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{n,j}}\varphi_{n,j} with measurable partition of . Therefore for every , there exists such that the set satisfies
[TABLE]
Set , . In this way by (58), since as . Now define step functions obtained by truncating , that is, f_{n}:=\sum_{j=1}^{k_{n}}{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{n,j}}\varphi_{n,j}. If we prove that
[TABLE]
then we conclude that is strongly measurable, since . In order to show (59), fix and . By using (57) and (H1) we have that for all
[TABLE]
Since , by definition there exists an index such that . Hence for every , so that \sum_{j=k_{n}+1}^{\infty}{\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{n,j}}(t)\,\varphi_{n,j}=0 for each . Set . From (60) we have for every , implying (59).
Proof of Theorem 3.10. Since a.e., the map in strongly measurable and is Lebesgue measurable. By assumption we have that and a.e. in . Therefore, by the classic dominated convergence theorem, each is integrable and strongly in . To conclude integrability for it is sufficient to employ triangle inequality, integrability of and Theorem 3.9.
Proof of Theorem 3.13. The fact that is a norm follows immediately from the classic case as well as the fact that the map is measurable for each , when is assumed to be strongly measurable (see Remark 3.4). Moreover is an inner product, since the spaces are Hilbert. In order to show completeness, it is sufficient to follow the lines of the proof of the classic Riesz–Fischer theorem. Let and let be a Cauchy sequence in . In any normed linear space, a Cauchy sequence having a convergent subsequence converges to the same limit. Therefore, up to extracting a subsequence, we can assume that
[TABLE]
For every define measurable sets , so that {\mathchoice{\raisebox{0.0pt}{\displaystyle\chi}}{\raisebox{0.0pt}{\textstyle\chi}}{\raisebox{0.0pt}{\scriptstyle\chi}}{\raisebox{0.0pt}{\scriptscriptstyle\chi}}}_{E_{n}}(t)/n^{2}\leq{\left\|f_{n+1}(t)-f_{n}(t)\right\|}_{H_{t}} a.e. in and by (61). In particular one has . Define , so that is a nested sequence of measurable sets, with as . Finally set , which satisfies . By definition, if , then for sufficiently large. Hence for , and sufficiently large one has , and since as , we conclude that is a Cauchy sequence in . For denote by the strong limit of , which exists since is complete. For set . This defines a map , which is strongly measurable since it is the a.e. pointwise limit of a sequence of strongly measurable maps (see Remark 3.4). Moreover, by the a.e. pointwise convergence, we also have that as for a.e. . Since the maps are measurable, we can apply Fatou’s Lemma and conclude that, which is bounded since is a Cauchy sequence in . Hence . Finally, one more application of Fatou’s Lemma combined with (61) yields as .
B.2. Comparison with classic Bochner theory
In this section we will investigate integrability properties for when . Since the codomain of is the fixed space , it makes sense to check whether is integrable in a classic sense. Specifically, in Proposition B.2, we will see that is always Gelfand integrable. On the other hand, is not always Bochner integrable, as we show in Example B.3. The main impediment is that is not strongly measurable in general. Finally in Proposition B.4 we will give sufficient conditions under which is Bochner integrable.
Proposition B.2**.**
Assume that . Then is Gelfand integrable in , that is, for each measurable set there exists an element such that
[TABLE]
Proof.
Let . By duality one has . Therefore is weak* measurable since is weak measurable. By (H1),
[TABLE]
since is integrable. This shows that belongs to for each . Hence is Gelfand integrable by Theorem 11.52 in [3], and (62) holds. ∎
Example B.3** (Radial sampling).**
Let and for define the lines through the origin , so that . Define equipped with the supremum norm. Hence . Define with inner product , where is the 1-dimensional Hausdorff measure. Finally define by . It is straightforward to check that (H1)-(H3) are satisfied, so that we can consider the space defined as in (14).
We will now construct a map belonging to , compute the Gelfand integral of and show that is not Bochner integrable. To this end, notice that for a map such that we have that
[TABLE]
Note that (63) is an easy consequence of the classical coarea formula [30, Thm 3.11], and its proof is left to the reader. Let now be such that and . By applying (63) to and by the assumptions on we have that belongs to for a.e. . Define by . Notice that is strongly measurable, since can be approximated in by functions. Moreover by (63) we infer
[TABLE]
which is finite by assumption on . Hence is integrable by Theorem 3.9, and it belongs to . The Gelfand integral of , which exists by Proposition B.2, is given by
[TABLE]
The above follows immediately by applying (63) to with . However is not Bochner integrable, since it is not strongly measurable in the classic sense [26, Ch II]: For every with we have that the set is not norm separable in . Indeed, it is easy to show that for a.e.
[TABLE]
Since , we infer that is a discrete set for any with . Therefore is norm separable if and only if is countable, which is never the case. Hence is not essentially separably valued, and the classic Pettis Theorem [26, Ch II.1, Thm 2] implies that is not strongly measurable and hence not Bochner integrable.
Proposition B.4**.**
Assume that is essentially norm separable for each countable set and that is reflexive. If then is Bochner integrable.
Proof.
Since is strongly measurable, it is also weakly measurable and essentially separably valued by Theorem 3.5. We start by showing that weak measurability for implies weak measurability for in the classic sense. Indeed by reflexivity the canonical injection is also surjective. Therefore for each there exists a unique with and we have , which is measurable since is weakly measurable. Now let and measurable with and such that Definition 3.2 is satisfied. Therefore for every and there exists such that . Therefore by (H1) we can estimate . Hence the points of are arbitrarily close to . Since is assumed to be essentially separable in , there exists a measurable set with such that is separable in . By defining we obtain that also is norm separable, hence is essentially separably valued in the classic sense. By the classic Pettis Theorem [26, Ch II.1, Thm 2] we conclude that is strongly measurable. Finally (H2) implies that . By [26, Ch II.2, Thm 2], it follows that is Bochner integrable. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Aharon, M. Elad, and A. Bruckstein. K 𝐾 K -SVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process. , 54(11):4311–4322, 2006.
- 2[2] G. Alberti, S. Bianchini, and G. Crippa. A uniqueness result for the continuity equation in two dimensions. J. Eur. Math. Soc. , 16(2):201–234, 2014.
- 3[3] C. D. Aliprantis and K. Border. Infinite Dimensional Analysis . Springer-Verlag Berlin Heidelberg, 2006.
- 4[4] A. Alphonse, C. Elliott, and B. Stinner. An abstract framework for parabolic PD Es on evolving spaces. Port. Math. , 72(1):1–46, 2015.
- 5[5] L. Ambrosio, N. Fusco, and D. Pallara. Functions of Bounded Variation and Free Discontinuity Problems . Oxford Science Publications. Clarendon Press, 2000.
- 6[6] L. Ambrosio, N. Gigli, and G. Savaré. Gradient Flows: In Metric Spaces and in the Space of Probability Measures . Lectures in Mathematics. ETH Zürich. Birkhäuser Basel, 2006.
- 7[7] J.-D. Benamou. Numerical resolution of an “unbalanced” mass transport problem. ESAIM Math. Model. Num. , 37(5):851–868, 2003.
- 8[8] J.-D. Benamou and Y. Brenier. A computational fluid mechanics solution to the Monge–Kantorovich mass transfer problem. Numer. Math. , 84(3):375–393, 2000.
