An optimal transport approach for solving dynamic inverse problems in   spaces of measures

Kristian Bredies; Silvio Fanzon

arXiv:1901.10162·math.FA·April 26, 2023

An optimal transport approach for solving dynamic inverse problems in spaces of measures

Kristian Bredies, Silvio Fanzon

PDF

TL;DR

This paper introduces a novel regularization method for dynamic inverse problems using optimal transport, enabling the recovery of measure-valued curves in time-dependent data spaces, with applications in dynamic MRI.

Contribution

It develops a functional-analytic framework for optimal transport-based regularization of dynamic inverse problems, proving existence, uniqueness, and regularization properties of solutions.

Findings

01

Established existence and uniqueness of minimizers in certain cases.

02

Applied the framework to dynamic MRI reconstruction with promising results.

03

Modeled time-varying acquisition, motion, and contrast agent effects.

Abstract

In this paper we propose and study a novel optimal transport based regularization of linear dynamic inverse problems. The considered inverse problems aim at recovering a measure valued curve and are dynamic in the sense that (i) the measured data takes values in a time dependent family of Hilbert spaces, and (ii) the forward operators are time dependent and map, for each time, Radon measures into the corresponding data space. The variational regularization we propose is based on dynamic (un-)balanced optimal transport which means that the measure valued curves to recover (i) satisfy the continuity equation, i.e., the Radon measure at time $t$ is advected by a velocity field $v$ and varies with a growth rate $g$ , and (ii) are penalized with the kinetic energy induced by $v$ and a growth energy induced by $g$ . We establish a functional-analytic framework for these regularized inverse…

Equations241

K_{t}^{*} ρ_{t} = f_{t} for a.e. t \in [0, 1] .

K_{t}^{*} ρ_{t} = f_{t} for a.e. t \in [0, 1] .

\partial_{t} ρ_{t} + div (v_{t} ρ_{t}) = g_{t} ρ_{t} in (0, 1) \times \overline{Ω},

\partial_{t} ρ_{t} + div (v_{t} ρ_{t}) = g_{t} ρ_{t} in (0, 1) \times \overline{Ω},

ρ_{t}, v_{t}, g_{t} min \frac{1}{2} \int_{0}^{1} ∥ K_{t}^{*} ρ_{t} - f_{t} ∥_{H_{t}}^{2} d t + \frac{α}{2} \int_{0}^{1} \int_{\overline{Ω}} ∣ v_{t} (x) ∣^{2} + δ^{2} ∣ g_{t} (x) ∣^{2} d ρ_{t} (x) d t + β \int_{0}^{1} ρ_{t} (\overline{Ω}) d t,

ρ_{t}, v_{t}, g_{t} min \frac{1}{2} \int_{0}^{1} ∥ K_{t}^{*} ρ_{t} - f_{t} ∥_{H_{t}}^{2} d t + \frac{α}{2} \int_{0}^{1} \int_{\overline{Ω}} ∣ v_{t} (x) ∣^{2} + δ^{2} ∣ g_{t} (x) ∣^{2} d ρ_{t} (x) d t + β \int_{0}^{1} ρ_{t} (\overline{Ω}) d t,

B_{δ} (ρ, m, μ) := \int_{X} Ψ_{δ} (\frac{d ρ}{d λ}, \frac{d m}{d λ}, \frac{d μ}{d λ}) d λ,

B_{δ} (ρ, m, μ) := \int_{X} Ψ_{δ} (\frac{d ρ}{d λ}, \frac{d m}{d λ}, \frac{d μ}{d λ}) d λ,

L^{2} ([0, 1]; H) := {f : [0, 1] \to H : f strongly measurable, \int_{0}^{1} ∥ f_{t} ∥_{H_{t}}^{2} d t < \infty} .

L^{2} ([0, 1]; H) := {f : [0, 1] \to H : f strongly measurable, \int_{0}^{1} ∥ f_{t} ∥_{H_{t}}^{2} d t < \infty} .

J (ρ, m, μ) := \frac{1}{2} ∥ K_{t}^{*} ρ_{t} - f_{t} ∥_{L^{2}}^{2} + α B_{δ} (ρ, m, μ) + β ∥ ρ ∥_{M (X)},

J (ρ, m, μ) := \frac{1}{2} ∥ K_{t}^{*} ρ_{t} - f_{t} ∥_{L^{2}}^{2} + α B_{δ} (ρ, m, μ) + β ∥ ρ ∥_{M (X)},

(ρ^{†}, m^{†}, μ^{†}) \in arg min α^{*} B_{δ} (ρ, m, μ) + β^{*} ∥ ρ ∥_{M (X)} .

(ρ^{†}, m^{†}, μ^{†}) \in arg min α^{*} B_{δ} (ρ, m, μ) + β^{*} ∥ ρ ∥_{M (X)} .

K_{t}^{*} ρ_{t} := (F (c_{1} ρ_{t}), \dots, F (c_{N} ρ_{t})),

K_{t}^{*} ρ_{t} := (F (c_{1} ρ_{t}), \dots, F (c_{N} ρ_{t})),

J (ρ, m, μ) := \frac{1}{2} j = 1 \sum N \int_{0}^{1} F (c_{j} ρ_{t}) - f_{t}^{j}_{L_{σ_{t}}^{2} (R^{2}; C)}^{2} d t + α B_{δ} (ρ, m, μ) + β ∥ ρ ∥_{M ((0, 1) \times \overline{Ω})},

J (ρ, m, μ) := \frac{1}{2} j = 1 \sum N \int_{0}^{1} F (c_{j} ρ_{t}) - f_{t}^{j}_{L_{σ_{t}}^{2} (R^{2}; C)}^{2} d t + α B_{δ} (ρ, m, μ) + β ∥ ρ ∥_{M ((0, 1) \times \overline{Ω})},

\partial_{t} ρ + div m = μ in X,

\partial_{t} ρ + div m = μ in X,

\int_{X} \partial_{t} φ d ρ + \int_{X} \nabla φ \cdot d m + \int_{X} φ d μ = 0 for all φ \in C_{c}^{\infty} (X) .

\int_{X} \partial_{t} φ d ρ + \int_{X} \nabla φ \cdot d m + \int_{X} φ d μ = 0 for all φ \in C_{c}^{\infty} (X) .

t \mapsto \int_{\overline{Ω}} φ (x) d ρ_{t} (x)

t \mapsto \int_{\overline{Ω}} φ (x) d ρ_{t} (x)

\int_{0}^{1} \int_{\overline{Ω}} ∣ v_{t} (x) ∣ d ρ_{t} (x) d t < \infty and \int_{0}^{1} \int_{\overline{Ω}} ∣ g_{t} (x) ∣ d ρ_{t} (x) d t < \infty .

\int_{0}^{1} \int_{\overline{Ω}} ∣ v_{t} (x) ∣ d ρ_{t} (x) d t < \infty and \int_{0}^{1} \int_{\overline{Ω}} ∣ g_{t} (x) ∣ d ρ_{t} (x) d t < \infty .

\int_{t_{1}}^{t_{2}} \int_{\overline{Ω}} (\partial_{t} φ + \nabla φ \cdot v_{t} + φ g_{t}) d ρ_{t} (x) d t = \int_{\overline{Ω}} φ (t_{2}, x) d \tilde{ρ}_{t_{2}} (x) - \int_{\overline{Ω}} φ (t_{1}, x) d \tilde{ρ}_{t_{1}} (x) .

\int_{t_{1}}^{t_{2}} \int_{\overline{Ω}} (\partial_{t} φ + \nabla φ \cdot v_{t} + φ g_{t}) d ρ_{t} (x) d t = \int_{\overline{Ω}} φ (t_{2}, x) d \tilde{ρ}_{t_{2}} (x) - \int_{\overline{Ω}} φ (t_{1}, x) d \tilde{ρ}_{t_{1}} (x) .

K_{δ} := {(a, b, c) \in R \times R^{d} \times R : a + \frac{1}{2} (∣ b ∣^{2} + \frac{c ^{2}}{δ ^{2}}) \leq 0},

K_{δ} := {(a, b, c) \in R \times R^{d} \times R : a + \frac{1}{2} (∣ b ∣^{2} + \frac{c ^{2}}{δ ^{2}}) \leq 0},

Ψ_{δ} (t, x, y) := ⎩ ⎨ ⎧ \frac{∣ x ∣ ^{2} + δ ^{2} y ^{2}}{2 t} 0 \infty if t > 0, if t = ∣ x ∣ = y = 0, otherwise,

Ψ_{δ} (t, x, y) := ⎩ ⎨ ⎧ \frac{∣ x ∣ ^{2} + δ ^{2} y ^{2}}{2 t} 0 \infty if t > 0, if t = ∣ x ∣ = y = 0, otherwise,

Ψ_{δ} (t, x, y) = (a, b, c) \in K_{δ} sup (a t + b \cdot x + cy) for each (t, x, y) \in R \times R^{d} \times R .

Ψ_{δ} (t, x, y) = (a, b, c) \in K_{δ} sup (a t + b \cdot x + cy) for each (t, x, y) \in R \times R^{d} \times R .

B_{δ} (ρ, m, μ) := sup {\int_{X} a d ρ + \int_{X} b \cdot d m + \int_{X} c d μ : (a, b, c) \in C_{0} (X; K_{δ})} .

B_{δ} (ρ, m, μ) := sup {\int_{X} a d ρ + \int_{X} b \cdot d m + \int_{X} c d μ : (a, b, c) \in C_{0} (X; K_{δ})} .

B_{δ} (ρ, m, μ) = \int_{X} Ψ_{δ} (\frac{d ρ}{d λ}, \frac{d m}{d λ}, \frac{d μ}{d λ}) d λ,

B_{δ} (ρ, m, μ) = \int_{X} Ψ_{δ} (\frac{d ρ}{d λ}, \frac{d m}{d λ}, \frac{d μ}{d λ}) d λ,

B_{δ} (ρ, m, μ) = \int_{X} Ψ_{δ} (1, v, g) d ρ = \frac{1}{2} \int_{X} (∣ v ∣^{2} + δ^{2} g^{2}) d ρ .

B_{δ} (ρ, m, μ) = \int_{X} Ψ_{δ} (1, v, g) d ρ = \frac{1}{2} \int_{X} (∣ v ∣^{2} + δ^{2} g^{2}) d ρ .

n lim ∥ i_{t} f_{n} (t) - f (t) ∥_{H_{t}} = 0 for a.e. t \in [0, 1],

n lim ∥ i_{t} f_{n} (t) - f (t) ∥_{H_{t}} = 0 for a.e. t \in [0, 1],

∥ i_{t} φ - f (t) ∥_{H_{t}} < ε .

∥ i_{t} φ - f (t) ∥_{H_{t}} < ε .

\int_{0}^{1} ∥ i_{t} f (t) ∥_{H_{t}}^{p} d t = j = 1 \sum N \int_{E_{j}} ∥ i_{t} φ_{j} ∥_{H_{t}}^{p} d t \leq C^{p} j = 1 \sum N ∣ E_{j} ∣ ∥ φ_{j} ∥_{D}^{p} < \infty .

\int_{0}^{1} ∥ i_{t} f (t) ∥_{H_{t}}^{p} d t = j = 1 \sum N \int_{E_{j}} ∥ i_{t} φ_{j} ∥_{H_{t}}^{p} d t \leq C^{p} j = 1 \sum N ∣ E_{j} ∣ ∥ φ_{j} ∥_{D}^{p} < \infty .

n lim ∥ i_{t} f_{n} (t) - f (t) ∥_{H_{t}} = 0

n lim ∥ i_{t} f_{n} (t) - f (t) ∥_{H_{t}} = 0

n lim \int_{0}^{1} ∥ i_{t} f_{n} (t) - f (t) ∥_{H_{t}} d t = 0 .

n lim \int_{0}^{1} ∥ i_{t} f_{n} (t) - f (t) ∥_{H_{t}} d t = 0 .

\int_{0}^{1} ∥ f (t) ∥_{H_{t}} d t \leq \int_{0}^{1} ∥ i_{t} f_{N} (t) - f (t) ∥_{H_{t}} d t + \int_{0}^{1} ∥ i_{t} f_{N} (t) ∥_{H_{t}} d t < 1 + \int_{0}^{1} ∥ i_{t} f_{N} (t) ∥_{H_{t}} d t

\int_{0}^{1} ∥ f (t) ∥_{H_{t}} d t \leq \int_{0}^{1} ∥ i_{t} f_{N} (t) - f (t) ∥_{H_{t}} d t + \int_{0}^{1} ∥ i_{t} f_{N} (t) ∥_{H_{t}} d t < 1 + \int_{0}^{1} ∥ i_{t} f_{N} (t) ∥_{H_{t}} d t

\int_{0}^{1} ∥ i_{t} f_{n} (t) - f (t) ∥_{H_{t}} d t \leq \int_{0}^{1} ∥ i_{t} g_{n} (t) - f (t) ∥_{H_{t}} d t + \int_{0}^{1} ∥ i_{t} f_{n} (t) - i_{t} g_{n} (t) ∥_{H_{t}} d t < \frac{2}{n},

\int_{0}^{1} ∥ i_{t} f_{n} (t) - f (t) ∥_{H_{t}} d t \leq \int_{0}^{1} ∥ i_{t} g_{n} (t) - f (t) ∥_{H_{t}} d t + \int_{0}^{1} ∥ i_{t} f_{n} (t) - i_{t} g_{n} (t) ∥_{H_{t}} d t < \frac{2}{n},

L^{p} ([0, 1]; H) := {f : [0, 1] \to H : f strongly measurable, \int_{0}^{1} ∥ f (t) ∥_{H_{t}}^{p} d t < \infty} .

L^{p} ([0, 1]; H) := {f : [0, 1] \to H : f strongly measurable, \int_{0}^{1} ∥ f (t) ∥_{H_{t}}^{p} d t < \infty} .

K_{t}^{*} ρ_{t} = f_{t}, for a.e. t \in [0, 1] .

K_{t}^{*} ρ_{t} = f_{t}, for a.e. t \in [0, 1] .

M := M (X) \times M (X; R^{d}) \times M (X),

M := M (X) \times M (X; R^{d}) \times M (X),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

An optimal transport approach for solving dynamic inverse problems in spaces of measures

Kristian Bredies

University of Graz, Institute of Mathematics and Scientific Computing, Heinrichstraße 36, 8010 Graz, Austria

[email protected]

and

Silvio Fanzon

University of Graz, Institute of Mathematics and Scientific Computing, Heinrichstraße 36, 8010 Graz, Austria

[email protected]

Abstract.

In this paper we propose and study a novel optimal transport based regularization of linear dynamic inverse problems. The considered inverse problems aim at recovering a measure valued curve and are dynamic in the sense that (i) the measured data takes values in a time dependent family of Hilbert spaces, and (ii) the forward operators are time dependent and map, for each time, Radon measures into the corresponding data space. The variational regularization we propose is based on dynamic (un-)balanced optimal transport which means that the measure valued curves to recover (i) satisfy the continuity equation, i.e., the Radon measure at time $t$ is advected by a velocity field $v$ and varies with a growth rate $g$ , and (ii) are penalized with the kinetic energy induced by $v$ and a growth energy induced by $g$ . We establish a functional-analytic framework for these regularized inverse problems, prove that minimizers exist and are unique in some cases, and study regularization properties. This framework is applied to dynamic image reconstruction in undersampled magnetic resonance imaging (MRI), modelling relevant examples of time varying acquisition strategies, as well as patient motion and presence of contrast agents.

Key words: dynamic inverse problems, optimal transport regularization, continuity equation, time dependent Bochner spaces, dynamic image reconstruction, dynamic MRI. 2010 Mathematics Subject Classification: 65J20, 49J20, 35F05, 46G12, 92C55.

1 Introduction
1.1 Outline of the mathematical setting and main theoretical results
1.2 Application to dynamic MRI
2 Dynamic optimal transport
2.1 Continuity equation
2.2 Optimal transport energy
3 Time dependent Bochner spaces
3.1 Functional setting
3.2 Measurability in time dependent spaces
3.3 Integration and $L^{p}$ spaces
4 Regularization of dynamic inverse problems
4.1 Well-definition
4.2 Existence of minimizers
4.3 Regularization properties
5 Application to dynamic undersampled MRI
6 Conclusions and perspectives
A Measure theory
A.1 Measure theory preliminaries
A.1.1 Absolute continuity, support and restriction
A.1.2 Push-forward
A.1.3 Convergences
A.1.4 Disintegration
A.2 Narrow continuity results
B Time-Dependent Bochner Spaces
B.1 Auxiliary results and proofs of Section 3
B.2 Comparison with classic Bochner theory

1. Introduction

In this paper we are concerned with solving ill-posed dynamic inverse problems where the sought unknown is a curve of Radon measures. We propose to regularize such problems via balanced and unbalanced dynamic optimal transport and establish a functional-analytic framework that takes the specificities of dynamic inverse problems, such as the time-varying nature of the measurement process, into account. Well-posedness as well as regularization properties are proven, and the application to magnetic resonance imaging (MRI) is discussed.

Our motivation to consider dedicated strategies for dynamic inverse problems arises from the shortcomings of static reconstruction strategies for inverse problems in the presence of motion during the measurement process. In the static case, measurement data is usually continuously collected such that sufficiently many data is available to enable the unique solution of the underlying inverse problem. In this context, one has to assume that no dynamics occur during the measurement process. However, this assumption is often violated for many applications, including medical imaging techniques such as MRI and computed tomography (CT) that image, e.g., the beating heart or the lung while breathing. Consequently, a consistent reconstruction is no longer possible and static approaches usually admit motion artifacts. A strategy to overcome this is to temporally resolve the dynamics, meaning that for each time instance during the measurement, one seeks to reconstruct a solution where only a small fraction of the necessary data is available. In addition to that, generally, in each time instance, a different part of the data set is acquired. This results in a dynamic inverse problem with time-variant forward operators and data spaces, where for each fixed time instance, the corresponding inverse problem is massively underdetermined. In order to solve such a challenging problem, both an appropriate dynamic regularization strategy as well as a suitable modelling of forward operators and data spaces is necessary.

We propose and study a regularization strategy that bases on optimal transport energies, both in a balanced and unbalanced context, see below for a detailed description. Such strategies are naturally linked with curves of Radon measures, inverse problems in the space of Radon measures and appropriate Radon-norm-based regularizers. Indeed, the fact that for each point in time, an inverse problem with underdetermined data has to be solved calls for dedicated regularization such as the intensively studied sparsity-promoting $\ell_{1}$ -type penalties. In the discrete setting, this leads to the celebrated theory of compressed-sensing [16, 27], in which one is able to reconstruct the unknown starting from very few random measurements, yielding better stability properties. The continuous, infinite-dimensional counterpart is given by the space of Radon measures [15, 17], where the regularization can be achieved penalizing the Radon norm and formulating the inverse problem in measure space. Recovering the unknown from very few observations is then possible since the data admits redundancies, particularly in applications to medical imaging.

In the dynamic setting, data redundancy additionally needs to be exploited by taking time correlation into account. Indeed, one can expect displacements between consecutive time samples to be small, and incorporate this information in the regularizer in order to achieve better reconstruction. In particular, the fluid-mechanics formulation of both balanced and unbalanced optimal transport, known as Wasserstein [6, 8] and Wasserstein–Fisher–Rao distance [20, 41, 42], respectively, are particularly well-suited to keep track of motion and possible mass change occurring in the ground truth. Further, this formulation is based on curves of Radon measures and is thus attractive for the regularization of dynamic inverse problems, where in each time instance, a Radon measure should be recovered. The regularization is then enforced by subjecting potential solutions of the dynamic problem, i.e., curves of Radon measures, to the continuity equation, possibly with source, while at the same time penalizing displacement field and growth rate. As we will see, this approach indeed establishes a convex regularization strategy that is sparsity-promoting in each time frame, exploits data redundancy in time and intrinsically recovers the velocity field associated with the motion as well as the rate of brightness changes. Let us emphasize that in particular, the approach allows for continuous measurement in time while providing spatial Radon-norm regularization. In contrast, a straightforward generalization of [15] to the space-time cylinder would, e.g., allow for measures that are singular in time and hence, not regular enough to consistently define global-in-time forward operators and data discrepancies.

Another aspect that has to be considered for dynamic inverse problems is a faithful modelling of the measurement process with respect to time, taking into account the time-varying nature of the measurements. In this paper, the latter is achieved by the construction of ad hoc Bochner-type spaces in which the data can take values in a time-dependent family of Hilbert spaces, which are correlated in time in a very weak way. This enables us to model the dynamic inverse problem with a time-dependent family of linear forward operators, mapping Radon measures to the associated data Hilbert space in each time instance, by a global-in-time forward operator that takes curves of Radon measures to the time-varying measurement data, making it thus possible to consistently define a data-mismatch term. In this respect, our model is truly dynamic and well-adapted to undersampled data as outlined above.

The overall approach then realizes a reconstruction by inverting the global forward operator subject to optimal transport penalization and continuity equation. Such an approach leads to a well-posed and convex variational problem of Tikhonov type, in which we are able to reconstruct the sought solution, along with the displacement field and mass growth rate. The main task of the paper is to establish a rigorous functional-analytic framework in which to set the problem and to obtain well-posedness and stability properties for the proposed variational optimal transport-based regularization. We then apply our theoretical results to dynamic medical imaging, focusing on the case of undersampled MRI, showing that we are able to treat an almost arbitrary variety of sampling strategies, as well as being principally able to reconstruct the image sequence, recover the motion displacement and track the possible presence of contrast agents.

Let us shortly review the existing literature on dynamic inverse problems and optimal transport approaches for inverse problems, image processing as well as computer vision. While the theory of regularization of dynamic inverse problems is a relatively new field of research [61], regularization theory for static ill-posed inverse problems dates back several decades. In this context, the approach in this paper can be classified as Tikhonov regularization in Banach space, a well-established technique where one penalizes the data mismatch by a convex regularizer and solves the corresponding variational problem [29, 60, 62, 63]. For computer vision and image processing applications, research in the last decades focused on specific convex regularizers [56] such as, for instance, edge-preserving functionals (total variation [18, 54], total generalized variation [12, 13]), or functionals that enforce sparsity with respect to a given basis, frame or learned dictionary [1, 16, 24, 34, 64]. In this context, Radon-norm penalties can also be interpreted as sparsity-promoting regularization [15, 17].

Concerning optimal transport, the classical theory deals with the problem of transporting mass from a probability distribution into a target one, while minimizing, e.g., the average squared displacement. Such a minimization problem defines a metric over the space of probability measures, called Wasserstein distance [6, 21, 55, 65]. In [8], the authors showed that the Wasserstein distance can be computed via solving a convex dynamic problem that corresponds to finding a geodesic path in the space of probabilities subjected to the continuity equation that minimizes the kinetic energy. This formulation is the basis for the balanced optimal-transport regularization studied in this paper. Such approach, however, intrinsically assumes mass constancy, which is not always desired in applications, e.g., in mathematical imaging. In the recent years, several ways to overcome this limitation were proposed, leading to so-called unbalanced optimal transport [7, 31, 32, 35, 45, 46, 50, 51]. In this context, common strategies are to add a source term to the continuity equation and consequently, to the kinetic energy, or to allow mass to escape/enter the domain, by interpreting the boundary as an infinite reservoir. In this paper, the energy introduced independently in [20, 41, 42], known as Wasserstein–Fisher–Rao or Hellinger–Kantorovich, is used to provide a unbalanced optimal-transport regularization. The remarkable feature of such formulation is that geodesics have a clear meaning, as they can be interpreted as joint displacement and change of mass, and therefore capture the dynamics of, e.g., image sequences.

Returning to inverse problems, as already mentioned, research in the dynamic framework recently gained some momentum [61], where convex regularizers that penalize the time derivative, interpret the space-time cylinder as a higher-dimensional set or enforce a spatio-temporal decomposition of low rank have been studied in the literature, most prominently in the context of medical imaging applications [23, 36, 44, 48, 57, 58, 66]. In comparison, such approaches, however, do only implicitly account for motion information in contrast to the proposed optimal-transport regularizer which explicitly yields a motion field. In this respect, the employment of optimal transport energies as regularizers for inverse problems is a very recent development. Here, existing literature mainly focuses on static inverse problems and static optimal transport leading, for instance, to Wasserstein-distance type regularization [38, 47]. In contrast, dynamic optimal transport has been utilized for specific image processing and computer vision tasks such as image interpolation [20, 37, 49]. We also mention the work [59], which appeared after the present work. In [59], the authors propose to regularize an inverse problem related to PET image reconstruction through balanced optimal transport, subsequently applying it to the problem of tracking radiolabelled cells. The regularizer they propose is similar to ours, however the forward operator they consider is static and application-specific, whereas we are able to deal with general dynamic inverse problems. Moreover, their analytical framework is greatly simplified, dealing only with discretized unknowns in space-time satisfying a discrete version of the continuity equation, rather than with actual curves of measures, which is the natural framework to obtain well-posedness, as proposed in the present paper. To the best knowledge of the authors, no other works employ dynamic optimal transport regularization for dynamic inverse problems. In particular, a framework for recovering curves of Radon measures from continuously acquired measurements does not exist to date. Let us also mention that the realization of the time-dependent Bochner spaces introduced in this paper is new. Indeed, existing approaches usually assume that almost every data space is isomorphic, which is sufficient to model, i.e., function spaces over time-varying domains [4, 22]. Such isomorphy is not required in our approach which can thus be used to model very general data acquisition strategies.

The paper is organized as follows. In the remainder of this section, we precise the mathematical setting employed for regularizing dynamic inverse problems and summarize the main theoretical results obtained (Section 1.1), including details on the MRI application (Section 1.2). In Section 2 we lay the theoretical foundations to rigorously define the optimal transport regularizer. In Section 3 we introduce and study the above mentioned class of time dependent Bochner spaces, which will be used to model the data measurements. After this preliminary part, in Section 4, we introduce the Tikhonov regularization for the dynamic inverse problem and show well-posedness as well as regularization properties. In Section 5 we apply our theoretical results to dynamic MRI, also providing examples of sampling strategies. Finally, Section 6 concludes with some perspectives for future research and some comments on the related paper [11] as well as forthcoming work, in which we perform numerical analysis for the model proposed in this paper.

1.1. Outline of the mathematical setting and main theoretical results

Let $\Omega\subset\mathbb{R}^{d}$ be an open and bounded domain, with $d\in\mathbb{N}$ , $d\geq 1$ , and consider a time variable $t\in[0,1]$ . Let $H_{t}$ be a time-dependent collection of Hilbert spaces modelling the data. The time regularity required for such family will be very mild, as specified below. At each time instance $t$ corresponds a given linear continuous forward operator $K_{t}^{*}$ , mapping from the space of Radon measures $\mathcal{M}(\overline{\Omega})$ into $H_{t}$ . We consider the following inverse problem: Given some data $f_{t}\in H_{t}$ for $t\in[0,1]$ , find a curve of Radon measures $t\in[0,1]\mapsto\rho_{t}\in\mathcal{M}(\overline{\Omega})$ such that

[TABLE]

We propose to regularize (1) by means of balanced/unbalanced optimal transport. This is enforced by subjecting $\rho_{t}$ to the continuity equation

[TABLE]

where $v_{t}(x)\colon(0,1)\times\overline{\Omega}\to\mathbb{R}^{d}$ is a flow field transporting the mass $\rho_{t}$ , while $g_{t}(x)\colon(0,1)\times\overline{\Omega}\to\mathbb{R}$ is a growth rate keeping track of mass creation and destruction, thus allowing for local mass change. We point out that no initial conditions are prescribed on $\rho_{t}$ in (2), since in the context of the inverse problem (1) we only have available indirect measurements on the whole time interval $[0,1]$ . We propose to regularize (1) by minimizing the Tikhonov functional

[TABLE]

subject to (2). Here, $\alpha,\beta>0$ are regularization parameters, $\delta\in(0,\infty]$ is a penalty parameter and the optimization is done for the triple $(\rho_{t},v_{t},g_{t})$ . The second term in (3) is known in the literature as Wasserstein–Fisher–Rao energy for unbalanced optimal transport [20, 41, 42], and as Benamou–Brenier energy [8] for balanced optimal transport when $\delta=\infty$ , enforcing $g_{t}=0$ and hence mass preservation.

Our main task is to establish problem (3) subject to (2) as a regularizer for (1) in a rigorous functional-analytic framework. In the following we provide some details on how to make the terms appearing in (3) rigorous, in particular providing suitable assumptions on $K_{t}^{*}$ and $H_{t}$ . The natural space in which to cast (3) is given by $\mathcal{M}:=\mathcal{M}(X)\times\mathcal{M}(X;\mathbb{R}^{d})\times\mathcal{M}(X)$ where $X:=(0,1)\times\overline{\Omega}$ . For $(\rho,m,\mu)\in\mathcal{M}$ define the transport energy as the 1-homogeneous convex functional

[TABLE]

where $\lambda\in\mathcal{M}^{+}(X)$ is any positive measure such that $\rho,m,\mu\ll\lambda$ and for $(t,x,y)\in\mathbb{R}\times\mathbb{R}^{d}\times\mathbb{R}$ we define $\Psi_{\delta}(t,x,y):=\frac{|x|^{2}+\delta^{2}y^{2}}{2t}$ if $t>0$ , $\Psi_{\delta}(0,0,0):=0$ and $\Psi_{\delta}(t,x,y):=\infty$ in all other cases. Introduce the affine set $\mathcal{D}:=\left\{(\rho,m,\mu)\in\mathcal{M}\,\colon\,\partial_{t}\rho+\operatorname{div}m=\mu\right\}$ where the continuity equation is in the distributional sense, without initial conditions (Definition 2.1). Whenever $(\rho,m,\mu)\in\mathcal{D}$ and $B_{\delta}(\rho,m,\mu)<\infty$ , it follows that $\rho\geq 0$ , $m=v\rho$ and $\mu=g\rho$ . Moreover $\rho=dt\otimes\rho_{t}$ with $t\mapsto\rho_{t}\in\mathcal{M}(\overline{\Omega})$ narrowly continuous, i.e., $t\mapsto\int_{\overline{\Omega}}\varphi(x)\,d\rho_{t}(x)$ is continuous for all $\varphi\in C(\overline{\Omega})$ (Proposition 2.4). By setting $\lambda=\rho$ in (4) we recover the second term in (3) (Proposition 2.6). Next, we outline how we define the space of measurements. Assume given a family of real Hilbert spaces $\{H_{t}\}_{t}$ for $t\in[0,1]$ , with inner products denoted by ${\langle\cdot,\cdot\rangle}_{H_{t}}$ , satisfying the following.

Assumption 1.1.

There exist a Banach space $D$ and linear continuous operators $i_{t}\colon D\to H_{t}$ with the properties:

(H1)

$\left\|i_{t}\right\|\leq C$ for some constant $C>0$ not depending on $t$ , 2. (H2)

$i_{t}(D)$ is dense in $H_{t}$ , 3. (H3)

the map $t\in[0,1]\mapsto{\langle i_{t}\varphi,i_{t}\psi\rangle}_{H_{t}}\in\mathbb{R}$ is Lebesgue measurable for every fixed $\varphi,\psi\in D$ .

In other words, we assume that each $H_{t}$ possesses a dense subset $i_{t}(D)$ , and such subsets are related by the time-measurability condition (H3). In particular, Assumption 1.1 allows us to define suitable notions of strong measurability and integrability for measurements $f\colon[0,1]\to H$ for $H:=\cup_{t\in[0,1]}H_{t}$ such that $f_{t}\in H_{t}$ for a.e. $t$ in $[0,1]$ (see Definitions 3.2, 3.8), leading to the definition of the measurements space

[TABLE]

In Theorem 3.13 we show that (5) is a Hilbert space with ${\langle f,g\rangle}_{L^{2}}:=\int_{0}^{1}{\langle f_{t},g_{t}\rangle}_{H_{t}}\,dt$ . Notice that our construction provides a natural extension of the classic Bochner theory to the case of varying codomains, in the sense that (5) coincides with the classical Bochner space when $H_{t}=H$ for all $t$ , with $H$ given Hilbert space. Details about the above construction are contained in Section 3. We now address the assumptions we make on the forward operators $K_{t}^{*}$ appearing in (1).

Assumption 1.2.

For a.e. $t\in[0,1]$ the linear continuous operators $K_{t}^{*}\colon\mathcal{M}(\overline{\Omega})\to H_{t}$ satisfy:

(K1)

$K_{t}^{*}$ is the adjoint of a linear continuous operator $K_{t}\colon H_{t}\to C(\overline{\Omega})$ , 2. (K2)

$\left\|K_{t}\right\|\leq C$ for some constant $C>0$ not depending on $t$ , 3. (K3)

the map $t\in[0,1]\mapsto K_{t}^{*}\rho\in H_{t}$ is strongly measurable for every fixed $\rho\in\mathcal{M}(\overline{\Omega})$ .

Under (K1)–(K3), (H1)–(H3) we have the following: if $t\mapsto\rho_{t}\in\mathcal{M}(\overline{\Omega})$ is narrowly continuous then the map $t\mapsto K_{t}^{*}\rho_{t}$ belongs to $L^{2}([0,1];H)$ (Lemma 4.2). At this point, we are ready to rigorously define the regularization functional anticipated in (3) as $J\colon\mathcal{M}\to[0,\infty]$ , where

[TABLE]

if $(\rho,m,\mu)\in\mathcal{D}$ and $J:=\infty$ otherwise. The discrepancy term in $J$ is well defined since, if $(\rho,m,\mu)\in\mathcal{D}$ and $B_{\delta}(\rho,m,\mu)<\infty$ , then $\rho=dt\otimes\rho_{t}$ with $t\mapsto\rho_{t}$ narrowly continuous, so that $t\mapsto K_{t}^{*}\rho_{t}$ belongs to $L^{2}([0,1];H)$ (Proposition 4.3). Notice that, in addition to the regularizer $B_{\delta}(\rho,m,\mu)$ , we also included $\left\|\rho\right\|_{\mathcal{M}(X)}$ in the definition of $J$ : This serves the purpose of enforcing weak* coercivity on $J$ , since no initial data on $\rho$ is prescribed. Our main theoretical results concerning existence of minimizers for $J$ and regularization properties are summarized in the following statements, which are contained in Theorem 4.4 and Theorems 4.7, 4.10, respectively.

Theorem 1.3.

Assume (H1)–(H3), (K1)–(K3). Let $f\in L^{2}([0,1];H)$ , $\alpha,\beta>0$ . Then $J$ admits a minimizer $(\rho,m,\mu)\in\mathcal{D}$ satisfying $\rho\geq 0$ , $\rho=dt\otimes\rho_{t}$ with $t\mapsto\rho_{t}$ narrowly continuous. If in addition the operators $K_{t}^{*}$ are injective for a.e. $t\in[0,1]$ , then the minimizer is unique.

In the next theorem, $J_{\alpha,\beta,f}$ denotes the functional $J$ in (6) for $\alpha,\beta>0$ and $f\in L^{2}([0,1];H)$ .

Theorem 1.4 (Regularization).

Assume (H1)–(H3), (K1)–(K3). Let $f^{\gamma},f^{\dagger}\in L^{2}([0,1];H)$ be noisy and exact data respectively, for noise level $\gamma>0$ .

i)

Suppose that $f^{n}\to f^{\gamma}$ strongly in $L^{2}$ , $\alpha,\beta>0$ and $(\rho^{n},m^{n},\mu^{n})\in\operatorname*{arg\,min}J_{\alpha,\beta,f^{n}}$ ., Then, up to subsequences, $(\rho^{n},m^{n},\mu^{n})$ converges weakly to $(\rho,m,\mu)\in\operatorname*{arg\,min}J_{\alpha,\beta,f^{\gamma}}$ .* 2. ii)

Assume that $\left\|f^{\gamma_{n}}-f^{\dagger}\right\|_{L^{2}}\leq\gamma_{n}$ and $\alpha_{n},\beta_{n}\searrow 0$ , such that $\gamma_{n}^{2}/\min\{\alpha_{n},\beta_{n}\}\to 0$ . If $(\rho^{n},m^{n},\mu^{n})\in\operatorname*{arg\,min}J_{\alpha_{n},\beta_{n},f^{\gamma_{n}}}$ then, up to subsequences, $(\rho^{n},m^{n},\mu^{n})$ converges weakly to $(\rho^{\dagger},m^{\dagger},\mu^{\dagger})\in\mathcal{D}$ solving (1) and there exist $\alpha^{*},\beta^{*}\in[1,\infty]$ such that*

[TABLE]

1.2. Application to dynamic MRI

We apply the model (1) and its regularization (3) to undersampled dynamic MRI, yielding a reconstruction approach via convex optimization which is principally capable of capturing motion during the acquisition. A common limiting factor to medical imaging techniques and MRI in particular is acquisition speed such that, for instance, data cannot be collected sufficiently fast in order to temporally resolve the beating heart or the lung while breathing. Consequently, static reconstruction approaches lead to severe artifacts. Thus, motion has to be taken into account by considering the dynamic setting in which at each time instance, data is severely undersampled and temporal data redundancies have to be exploited. For this purpose, we show that the optimal-transport regularization framework developed in this paper can be applied, leading to a regularizer that penalizes the displacements caused by motion and intrinsically recovers the motion field as well as the growth rate.

The forward problem in undersampled dynamic MRI in two dimensions is commonly stated as follows: In each time instance $t$ , the proton density $\rho_{t}$ , a non-negative quantity, needs to be recovered from the measured data $f_{t}$ . Taking coil sensitivities into account, $\rho_{t}$ and $f_{t}$ are linked via the Fourier transform. However, for each $t$ , the Fourier data is only acquired on subsets specified by the sampling strategy, leading to each $f_{t}$ generally living on a different subset of the so-called $k$ -space and hence, being contained in a time-varying data Hilbert space $H_{t}$ . Modelling the proton density $\rho_{t}$ as a positive measure on the image domain $\Omega\subset\mathbb{R}^{2}$ , denoting by $K_{t}^{*}$ an appropriately masked Fourier transform and considering the unit time interval $[0,1]$ , the forward problem then indeed reads as $K_{t}^{*}\rho_{t}=f_{t}$ in $H_{t}$ for $t\in[0,1]$ . This is made precise in the following.

Adopting the common model for parallel data acquisition (see, e.g., [53, 40, 39, 57]), let $\Omega\subset\mathbb{R}^{2}$ be an open bounded domain representing the image domain and let the complex coil sensitivities $c_{j}\in C_{0}(\mathbb{R}^{2};\mathbb{C})$ for $j=1,\dots,N$ with $N\geq 1$ to each of the $N$ receiver coils be given. The time-dependent sampling method is represented by a family of measures $\sigma_{t}\in\mathcal{M}^{+}(\mathbb{R}^{2})$ for $t\in[0,1]$ . Such measures are required to satisfy some mild regularity assumptions, namely,

(M1)

$\left\|\sigma_{t}\right\|_{\mathcal{M}(\mathbb{R}^{2})}\leq C$ for a.e. $t\in[0,1]$ , where $C>0$ does not depend on $t$ , 2. (M2)

the map $t\mapsto\int_{\mathbb{R}^{2}}\varphi(x)\,d\sigma_{t}(x)$ is measurable for each $\varphi\in C_{0}(\mathbb{R}^{2};\mathbb{C})$ ,

allowing for a variety of sampling methods, see Section 5 for details. The data space of measurements is then defined by $H_{t}:=L^{2}_{\sigma_{t}}(\mathbb{R}^{2};\mathbb{C}^{N})$ , interpreted as a real Hilbert space and equipped with the norm ${\left\|h\right\|}_{H_{t}}^{2}:=\sum_{j=1}^{N}\int_{\mathbb{R}^{2}}|h^{j}(x)|^{2}\,d\sigma_{t}(x)$ , where we denote $h=(h^{1},\dots,h^{N})$ . The forward operators are given by $K_{t}^{*}\colon\mathcal{M}(\overline{\Omega})\to H_{t}$ defined via

[TABLE]

where $\mathscr{F}$ is the Fourier transform and we interpret each $\mathscr{F}(c_{j}\rho_{t})$ as an element of $L^{2}_{\sigma_{t}}(\mathbb{R}^{2},\mathbb{C})$ . In Lemma 5.4 we show that under (M1)–(M2), the spaces $H_{t}$ and the forward operators $K_{t}^{*}$ fulfill (H1)–(H3) and (K1)–(K3), respectively. In this way the hypotheses of Theorems 1.3, 1.4 are satisfied, and we can regularize the reconstruction problem (1) with the functional $J\colon\mathcal{M}\to[0,\infty]$ defined in (6), which in this framework corresponds to

[TABLE]

where the measurements $f_{t}=(f_{t}^{1},\dots,f_{t}^{N})$ belong to $L^{2}([0,1];H)$ . This shows in particular that optimal-transport regularization for undersampled dynamic MRI leads to well-posed convex optimization problems. These are accessible to analysis as well as efficient and stable numerical minimization algorithms. Section 6 provides some perspectives for the latter.

2. Dynamic optimal transport

The aim of this section is to provide the essential elements to define the optimal transport regularizer appearing in (6). We refer the reader to Appendix A.1 for measure theory definitions and results which will be needed in the following. Throughout the section, $\Omega\subset\mathbb{R}^{d}$ is an open bounded domain, with $d\in\mathbb{N}$ , $d\geq 1$ and we set $X:=(0,1)\times\overline{\Omega}$ to be the time-space cylinder. We also define the space $\mathcal{M}:=\mathcal{M}(X)\times\mathcal{M}(X;\mathbb{R}^{d})\times\mathcal{M}(X)$ . In Section 2.1 we introduce the concept of measure solution to the continuity equation with source

[TABLE]

where $(\rho,m,\mu)\in\mathcal{M}$ . Here $\rho$ represents a density, $m$ a momentum field advecting $\rho$ and $\mu$ a source term, accounting for local mass change. We then investigate properties of solutions $\rho\in\mathcal{M}(X)$ of (7). In particular in Proposition 2.2 we show that positive solutions to (7) disintegrate as $\rho=dt\otimes\rho_{t}$ with $\{\rho_{t}\}_{t\in[0,1]}$ Borel family of positive measures over $\overline{\Omega}$ . In Proposition 2.4 we prove that, under some growth assumptions on $m$ and $\mu$ , the curve $t\mapsto\rho_{t}$ is actually narrowly continuous. Finally, in Section 2.2 we introduce the optimal transport energy $B_{\delta}$ at (6), and list some of its properties in Proposition 2.6.

2.1. Continuity equation

We want to consider measure valued (distributional) solutions to the continuity equation (7) with suitable boundary conditions. The precise definition is as follows.

Definition 2.1.

We say that $(\rho,m,\mu)\in\mathcal{M}$ is a measure solution to (7) if

[TABLE]

We remark that the above weak formulation includes zero flux boundary conditions for the momentum $m$ on $\partial\Omega$ , and no initial and final data is prescribed on $\rho$ . Moreover one can test (8) against functions in $C^{1}_{c}(X)$ (see [6, Remark 8.1.1]). In the following proposition we show that positive solutions to (7) can be disintegrated with respect to the Lebesgue measure $dt$ on $(0,1)$ (see Section A.1.4 for details on disintegration). To this end, let $\pi\colon X\to(0,1)$ be the projection on the time coordinate.

Proposition 2.2.

Assume that $(\rho,m,\mu)\in\mathcal{M}$ satisfies (8), with $\rho\in\mathcal{M}^{+}(X)$ . Then $\rho$ disintegrates, with respect to $dt$ , as $\rho=dt\otimes\rho_{t}$ , where $\rho_{t}\in\mathcal{M}^{+}(\overline{\Omega})$ for a.e. $t$ . Moreover $t\mapsto\rho_{t}(\overline{\Omega})$ is a function of bounded variation, with distributional derivative $\pi_{\#}\mu$ . In particular, if the source $\mu=0$ , then the total mass $\rho_{t}(\overline{\Omega})$ is constant in time.

Proof.

In order to apply Theorem A.1 we need to show that $\pi_{\#}\rho\ll dt$ . Let $\tilde{\varphi}\in C^{\infty}_{c}((0,1))$ and define $\varphi:=\tilde{\varphi}\circ\pi\in C^{\infty}_{c}(X)$ . By plugging $\varphi$ in (8) we get $\int_{0}^{1}\tilde{\varphi}^{\prime}\,d(\pi_{\#}\rho)=-\int_{0}^{1}\tilde{\varphi}\,d(\pi_{\#}\mu)$ , so that $(\pi_{\#}\rho)^{\prime}=\pi_{\#}\mu$ in the sense of distributions. Since $\pi_{\#}\mu\in\mathcal{M}((0,1))$ , there exists $u\in BV((0,1))$ such that $\pi_{\#}\mu=u^{\prime}$ . Therefore $\pi_{\#}\rho\ll dt$ and there exists a Borel family $\rho_{t}\in\mathcal{M}(\overline{\Omega})$ such that $\rho=dt\otimes\rho_{t}$ . In particular, since $\rho_{t}$ is a Borel family, the map $t\mapsto\rho_{t}(\overline{\Omega})$ is measurable. Moreover it belongs to $L^{1}((0,1))$ , since $\int_{0}^{1}|\rho_{t}(\overline{\Omega})|\,dt=\rho(X)$ which is finite by assumption. Finally notice that $\pi_{\#}(dt\otimes\rho_{t})=\rho_{t}(\overline{\Omega})\,dt$ , which together with $(\pi_{\#}\rho)^{\prime}=\pi_{\#}\mu$ implies that $t\mapsto\rho_{t}(\overline{\Omega})$ belongs to $BV((0,1))$ , with distributional derivative given by $\pi_{\#}\mu$ . ∎

Definition 2.3.

A curve $t\in[0,1]\mapsto\rho_{t}\in\mathcal{M}(\overline{\Omega})$ is narrowly continuous if the map

[TABLE]

is continuous for every fixed $\varphi\in C(\overline{\Omega})$ . We denote by $C_{\rm w}([0,1];\mathcal{M}(\overline{\Omega}))$ the set of such curves, and by $C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ the set of narrowly continuous curves of positive measures.

In the next proposition we show that if $(\rho,m,\mu)$ solves the continuity equation with appropriate energy bounds, the disintegration measures $\rho_{t}$ are defined for every $t$ and are narrowly continuous. This is a well-known result for $\mu=0$ . For completeness we will carry out the proof in Appendix A.2, by adapting the argument used to prove the homogeneous version (see Lemma 8.1.2 in [6]).

Proposition 2.4 (Continuous representative).

Let $(\rho,m,\mu)\in\mathcal{M}$ be a solution of (8), with $\rho\in\mathcal{M}^{+}(X)$ . Let $\rho_{t}\in\mathcal{M}^{+}(\overline{\Omega})$ be the disintegration of $\rho$ with respect to $dt$ . Assume that $m=dt\otimes v_{t}\rho_{t}$ and $\mu=dt\otimes g_{t}\rho_{t}$ with $v_{t}\colon X\to\mathbb{R}^{d}$ , $g_{t}\colon X\to\mathbb{R}$ measurable functions such that

[TABLE]

Then there exists a narrowly continuous curve $(t\mapsto\tilde{\rho}_{t})\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ such that $\rho_{t}=\tilde{\rho}_{t}$ a.e. in $(0,1)$ . Moreover for each $\varphi\in C^{1}_{c}([0,1]\times\overline{\Omega})$ and $0\leq t_{1}\leq t_{2}\leq 1$ we have

[TABLE]

In the rest of the paper we will identify $\rho_{t}$ with its narrowly continuous representative $\tilde{\rho}_{t}$ whenever the assumptions of Proposition 2.4 hold, and use the notation $\rho_{t}$ .

2.2. Optimal transport energy

We want to introduce the Wasserstein–Fisher–Rao energy (or Hellinger–Kantorovich) as originally done in [19, 20, 42, 43, 41]. First, define the convex set

[TABLE]

with $\delta\in(0,\infty]$ fixed parameter and $\frac{c^{2}}{\infty}=0$ for every $c\in\mathbb{R}$ . For $(t,x,y)\in\mathbb{R}\times\mathbb{R}^{d}\times\mathbb{R}$ set

[TABLE]

where $\infty y^{2}=\infty$ for $y\neq 0$ and $\infty y^{2}=0$ for $y=0$ . We have that $\Psi_{\delta}$ is the Legendre conjugate of the characteristic function ${\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{K_{\delta}}$ [20], that is,

[TABLE]

In particular $f$ is convex, lower semicontinuous and 1-homogeneous.

Definition 2.5 (Transport energy).

Let $(\rho,m,\mu)\in\mathcal{M}$ . We define the transport energy as

[TABLE]

We summarize some of the properties of the functional $B_{\delta}$ that will be needed throughout this paper. The proof is omitted, and it can be easily adapted from the one in [55, Prop 5.18].

Proposition 2.6.

The functional $B_{\delta}$ defined in (12) is convex and lower semicontinuous for the weak convergence. Moreover it satisfies the following properties:*

(i)

$B_{\delta}(\rho,m,\mu)\geq 0$ , 2. (ii)

assume that $\rho,m,\mu\ll\lambda$ for some $\lambda\in\mathcal{M}^{+}(X)$ . Then

[TABLE] 3. (iii)

if $B_{\delta}(\rho,m,\mu)<\infty$ then $\rho\geq 0$ and $m,\mu\ll\rho$ , 4. (iv)

if $\rho\geq 0$ and $m,\mu\ll\rho$ , then $m=v\rho$ , $\mu=g\rho$ for measurable $v\colon X\to\mathbb{R}^{d}$ , $g\colon X\to\mathbb{R}$ and

[TABLE]

3. Time dependent Bochner spaces

In this section we construct a class of Bochner spaces of Hilbert spaces valued functions, where the Hilbert space can vary in time. Here the underlying measure space is the unit interval with the Lebesgue measure. This can however be easily generalized to arbitrary measure spaces. Moreover a generalization to Banach spaces valued functions seems possible, however, it is out of the scope of this paper. More precisely, we want to define a concept of integrability for functions $f\colon[0,1]\to\{H_{t}\}_{t}$ , where $H_{t}$ is a Hilbert space for each time $t$ , and $f(t)\in H_{t}$ for all $t$ . In order to do that we will closely follow the approach to define classic Bochner spaces (see [26, Ch II], [3, Ch 11]). In Section 3.1 we establish the functional analytic setting and assumptions under which we carry out the construction. In Section 3.2 we define suitable notions of measurability and provide the equivalent of the classic Pettis measurability theorem (see Theorem 3.5). Such result is instrumental to the following analysis, as it provides a practical characterization of strong measurability. In Section 3.3 we define integrability for maps $f\colon[0,1]\to H$ and characterize it in Theorem 3.9. We then proceed to define the time dependent Bochner spaces $L^{p}([0,1];H)$ . Notice that, in contrast to the classic Bochner theory, we will not define a notion of integral for integrable maps $f\colon[0,1]\to H$ , but only of integrability. However, a comparison with the classical theory is possible, and it will be carried out in Appendix B.2.

3.1. Functional setting

Let $\{H_{t}\}$ for $t\in[0,1]$ be a family of real Hilbert spaces with norms and scalar products denoted by ${\left\|\cdot\right\|}_{H_{t}}$ and ${\langle\cdot,\cdot\rangle}_{H_{t}}$ , respectively. The interval $[0,1]$ is equipped with the Lebesgue measure. As usual, we denote by $|E|$ the measure of a set $E\subset[0,1]$ and by ${\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E}$ its characteristic function, defined as ${\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E}(t):=1$ if $t\in E$ and ${\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E}(t):=0$ otherwise. Let $H:=\cup_{t\in[0,1]}H_{t}$ . We will denote by $f\colon[0,1]\to H$ maps such that $f(t)\in H_{t}$ for a.e. $t\in[0,1]$ . Let $D$ be a real Banach space with norm denoted by $\left\|\cdot\right\|_{D}$ and duality by ${\langle\cdot,\cdot\rangle}_{D^{*},D}$ . Assume that for a.e. $t\in[0,1]$ there exists a linear continuous operator $i_{t}\colon D\to H_{t}$ with the following properties:

(H1)

$\left\|i_{t}\right\|\leq C$ for some constant $C>0$ not depending on $t$ , 2. (H2)

$i_{t}(D)$ is dense in $H_{t}$ , 3. (H3)

the map $t\mapsto{\langle i_{t}\varphi,i_{t}\psi\rangle}_{H_{t}}$ is Lebesgue measurable for every fixed $\varphi,\psi\in D$ .

The adjoint of $i_{t}$ is $i_{t}^{*}\colon H_{t}\to D^{*}$ , defined by ${\langle i_{t}^{*}h,\varphi\rangle}_{D^{*},D}:={\langle h,i_{t}\varphi\rangle}_{H_{t}}$ for all $h\in H_{t},\varphi\in D$ (here we identified $H_{t}$ with its dual). Notice that from (H1) it follows that $i_{t}^{*}\colon H_{t}\to D^{*}$ is linear continuous and such that $\left\|i_{t}^{*}\right\|\leq C$ . Moreover from (H2) we have that $i_{t}^{*}$ is injective. Throughout the section, we say that $g=f$ if the equality holds a.e. in $[0,1]$ . Moreover we say that $f_{n}\to f$ a.e. if $\lim_{n}{\left\|f_{n}(t)-f(t)\right\|}_{H_{t}}=0$ for a.e. $t\in[0,1]$ .

3.2. Measurability in time dependent spaces

In this section we introduce suitable measurability notions for maps $f\colon[0,1]\to H$ , and prove our version of Pettis’ Theorem. We refer the reader to [26, Ch II] for classic measurability definitions.

Definition 3.1 (Step function).

A map $f\colon[0,1]\to D$ is a step function if $f=\sum_{j=1}^{N}{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{j}}\varphi_{j}$ with $N\in\mathbb{N}$ , $\varphi_{j}\in D$ and $E_{j}$ Lebesgue measurable pairwise disjoint subsets of $[0,1]$ .

Definition 3.2 (Measurability).

Let $f\colon[0,1]\to H$ . We say that

i)

$f$ is strongly measurable if there exists a sequence of step functions $f_{n}$ such that

[TABLE] 2. ii)

$f$ is weakly measurable if $t\mapsto{\langle i_{t}\varphi,f(t)\rangle}_{H_{t}}$ is Lebesgue measurable for each $\varphi\in D$ , 3. iii)

$f$ is essentially separably valued if there exist a measurable set $E\subset[0,1]$ with $|E|=0$ and a countable subset $S\subset D$ with the following property: for every $\varepsilon>0$ and $t\in[0,1]\smallsetminus E$ , there exists an element $\varphi\in S$ such that

[TABLE]

Notice that, if $H_{t}=H$ for each $t\in[0,1]$ , with $H$ fixed Hilbert space, $D=H$ and $i_{t}\varphi:=\varphi$ , then Definitions 3.1 and 3.2 are equivalent to the classic ones given in Chapter II of [26].

Remark 3.3.

Let $p\geq 1$ and $f=\sum_{j=1}^{N}{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{j}}\varphi_{j}$ be a step function. Then the map $t\to{\left\|i_{t}f(t)\right\|}_{H_{t}}^{p}$ is measurable and $\int_{0}^{1}{\left\|i_{t}f(t)\right\|}_{H_{t}}^{p}\,dt<\infty$ . Indeed, ${\left\|i_{t}f(t)\right\|}_{H_{t}}^{2}=\sum_{j=1}^{N}{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{j}}(t){\langle i_{t}\varphi_{j},i_{t}\varphi_{j}\rangle}_{H_{t}}$ , so that $t\mapsto{\left\|i_{t}f(t)\right\|}_{H_{t}}^{2}$ is measurable by (H3), and hence also $t\mapsto{\left\|i_{t}f(t)\right\|}_{H_{t}}^{p}$ is. Moreover by (H1)

[TABLE]

Remark 3.4.

It is easy to check that strong measurability is stable under sums, scalar multiplication and pointwise a.e. convergence. Moreover if $f\colon[0,1]\to H$ is strongly measurable then the map $t\mapsto{\left\|f(t)\right\|}_{H_{t}}$ is Lebesgue measurable, since $f$ can be approximated a.e. by step functions $f_{n}$ and $t\mapsto{\left\|i_{t}f_{n}(t)\right\|}_{H_{t}}$ is measurable for every fixed $n$ by Remark 3.3.

The above definitions are linked together by the analogous of the classic Pettis measurability Theorem (see [26, Ch II.1, Thm 2]).

Theorem 3.5 (Pettis).

Let $f\colon[0,1]\to H$ . Then $f$ is strongly measurable if and only if $f$ is weakly measurable and essentially separably valued.

For a proof of this theorem, see Appendix B.1. By inspecting the proof, one can see that the following corollary holds.

Corollary 3.6.

Let $f\colon[0,1]\to H$ . Then $f$ is strongly measurable if and only if it is the a.e. uniform limit of a sequence of countably valued functions $f_{n}\colon[0,1]\to D$ , that is, if

[TABLE]

uniformly for a.e. $t\in[0,1]$ , for some $f_{n}=\sum_{j=1}^{\infty}{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{n,j}}\varphi_{n_{j}}$ with $\varphi_{n_{j}}\in D$ and $\{E_{n,j}\}_{j\in\mathbb{N}}$ measurable and pairwise disjoint subsets of $[0,1]$ .

Proposition 3.7 (Separable case).

Assume that $D$ is separable. Then strong measurability is equivalent to weak measurability.

Proof.

Let $S=\{\varphi_{n}\}\subset D$ be countable and dense. Then $\{i_{t}\varphi_{n}\}$ is dense in $H_{t}$ : indeed fix $\varepsilon>0$ and $h\in H_{t}$ . By (H2) there exists $\varphi\in D$ such that ${\left\|h-i_{t}\varphi\right\|}_{H_{t}}<\varepsilon/2$ . Let $\varphi_{n}$ be such that $\left\|\varphi-\varphi_{n}\right\|_{D}<\varepsilon/2C$ where $C$ is the constant in (H1). Then ${\left\|h-i_{t}\varphi_{n}\right\|}_{H_{t}}<\varepsilon$ by triangle inequality and (H1). Therefore $H_{t}$ is separable and it is immediate to check that in this case every function $f\colon[0,1]\to H$ is essentially separably valued. By Theorem 3.5 the thesis follows. ∎

3.3. Integration and $L^{p}$ spaces

Definition 3.8 (Integrability).

Let $f\colon[0,1]\to H$ be strongly measurable according to Definition 3.2. We say that $f$ is integrable if there exists a sequence $\{f_{n}\}$ of step functions $f_{n}\colon[0,1]\to D$ such that

[TABLE]

Notice that the definition is well posed, since the map $t\to{\left\|i_{t}f_{n}(t)-f(t)\right\|}_{H_{t}}$ is Lebesgue measurable for each fixed $n$ (see Remark 3.4), hence its integral is well defined. Analogously to the classic case ([26, Ch II.2, Thm 2]), we can characterize integrability as stated in the following theorem.

Theorem 3.9 (Characterization of integrability).

Let $f\colon[0,1]\to H$ be strongly measurable. Then $f$ is integrable if and only if $\int_{0}^{1}{\left\|f(t)\right\|}_{H_{t}}\,dt<\infty$ .

Proof.

Assume that $f$ is integrable. By (13), for sufficiently large $N$ we have

[TABLE]

and the thesis follows by Remark 3.3, since $f_{N}$ is a step function. Conversely, assume that $\int_{0}^{1}{\left\|f(t)\right\|}_{H_{t}}\,dt<\infty$ . Since $f$ is strongly measurable, by Corollary 3.6 there exists a sequence $\{g_{n}\}$ of countably valued maps $g_{n}\colon[0,1]\to D$ such that ${\left\|i_{t}g_{n}(t)-f(t)\right\|}_{H_{t}}<1/n$ for a.e. $t\in[0,1]$ . In particular ${\left\|i_{t}g_{n}(t)\right\|}_{H_{t}}\leq{\left\|f(t)\right\|}_{H_{t}}+1/n$ , so that $\int_{0}^{1}{\left\|i_{t}g_{n}(t)\right\|}_{H_{t}}\,dt<\infty$ . By construction $g_{n}(t)=\sum_{j=1}^{\infty}{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{n,j}}\varphi_{n,j}$ with sets $E_{n,j}\subset[0,1]$ measurable and pairwise disjoint. Hence there exists a sequence $\{k_{n}\}$ in $\mathbb{N}$ such that $\sum_{j=k_{n}+1}^{\infty}\int_{E_{n,j}}{\left\|i_{t}g_{n}(t)\right\|}_{H_{t}}\,dt<n^{-1}$ . Therefore by setting $f_{n}(t):=\sum_{j=1}^{k_{n}}{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{n,j}}\varphi_{n,j}$ we obtain

[TABLE]

proving that $f$ is integrable. ∎

As a corollary of the above theorem, we obtain that a suitable version of Lebesgue’s dominated convergence theorem holds in our setting. We postpone its proof to Appendix B.1.

Theorem 3.10 (Dominated convergence).

Let $f_{n}\colon[0,1]\to H$ be a sequence of integrable functions such that $f_{n}\to f$ a.e. in $[0,1]$ and that there exists $g\in L^{1}([0,1])$ satisfying ${\left\|f_{n}(t)\right\|}_{H_{t}}\leq g(t)$ a.e. in $[0,1]$ . Then $f$ is integrable and $f_{n}\to f$ strongly in $L^{1}([0,1];H)$ .

Definition 3.11 ( $L^{p}$ space).

Fix $1\leq p<\infty$ . We define the space of the $p$ -integrable functions

[TABLE]

In (14) we identify functions that coincide almost everywhere. Notice that (14) is well posed: Indeed since $f$ is strongly measurable, then the map $t\mapsto{\left\|f(t)\right\|}_{H_{t}}^{p}$ is measurable (Remark 3.4), and hence $\int_{0}^{1}{\left\|f(t)\right\|}_{H_{t}}^{p}\,dt$ is well defined (possibly infinite).

Remark 3.12.

If $H_{t}\equiv H$ with $H$ fixed Hilbert space and $D=H$ , then $L^{p}([0,1];H)$ coincides with the respective classic Bochner spaces (see [26, Ch II]).

Theorem 3.13.

Let $1\leq p<\infty$ . We have that $L^{p}([0,1];H)$ is a Banach space with the norm $\left\|f\right\|_{L^{p}}^{p}:=\int_{0}^{1}{\left\|f(t)\right\|}_{H_{t}}^{p}\,dt$ . Moreover $L^{2}([0,1];H)$ is a Hilbert space with the inner product ${\langle f,g\rangle}_{L^{2}}:=\int_{0}^{1}{\langle f(t),g(t)\rangle}_{H_{t}}\,dt$ .

The proof of the above theorem is postponed to Appendix B.1.

Remark 3.14 ( $p=\infty$ ).

It is is possible to treat the case $p=\infty$ by defining $L^{\infty}([0,1];H)$ as the set of strongly measurable functions $f\colon[0,1]\to H$ such that $\operatorname*{ess\,sup}_{t\in[0,1]}{\left\|f(t)\right\|}_{H_{t}}<\infty$ . By adapting the proof for classical Bochner spaces, one can show that $L^{\infty}([0,1];H)$ is a Banach space with the norm $\left\|f\right\|_{L^{\infty}}:=\operatorname*{ess\,sup}_{t\in[0,1]}{\left\|f(t)\right\|}_{H_{t}}$ .

Remark 3.15 (Dual spaces).

The space $L^{2}([0,1];H)$ is self-dual, being a Hilbert space. We believe that for any $p\geq 1$ one has the isometry $L^{p}([0,1];H)^{*}=L^{q}([0,1];H)$ with $1/p+1/q=1$ .

Example 3.16 (Narrowly continuous curves).

Let $\Omega\subset\mathbb{R}^{d}$ be an open bounded domain with $d\in\mathbb{N}$ , $d\geq 1$ . Let $\rho_{t}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ , $D:=C(\overline{\Omega})$ with the supremum norm, $H_{t}:=L^{2}_{\rho_{t}}(\Omega)$ with the norm ${\left\|h\right\|}_{H_{t}}^{2}:=\int_{\Omega}|h(x)|^{2}\,d\rho_{t}(x)$ . Define $i_{t}\colon D\to H_{t}$ by $i_{t}\varphi:=\varphi$ . It is left to the reader to check that (H1)–(H3) are satisfied. Therefore we can define the space $L^{p}([0,1];H)$ for each $p\geq 1$ . Notice that in this case $L^{2}([0,1];H)$ is isometric to $L^{2}_{\rho}([0,1]\times\overline{\Omega})$ , where $\rho:=dt\otimes\rho_{t}$ .

Finally, we would like to mention that, although we do not define a notion of integral for maps in $L^{p}([0,1];H)$ , a comparison to the classical Bochner theory is still possible. Indeed, if $f\colon[0,1]\to H$ , then by definition $i_{t}^{*}f\colon[0,1]\to D^{*}$ and the codomain is a fixed Banach space. In Appendix B.2 we show that, assuming $f\in L^{1}([0,1];H)$ , we always have that $i_{t}^{*}f$ is weakly* integrable (see Proposition B.2). However, Bochner integrability fails in general, as shown in Example B.3. Nevertheless, under suitable additional assumptions, one can show that Bochner integrability can be guaranteed (Proposition B.4).

4. Regularization of dynamic inverse problems

In this section we define and study the properties of the optimal transport based functional (6), and we establish it as a regularizer for the dynamic inverse problem (1). Throughout the section, the functional analytic setting will be the following. Let $\Omega\subset\mathbb{R}^{d}$ be an open bounded domain, where $d\in\mathbb{N}$ , $d\geq 1$ , and define again the time space domain $X:=(0,1)\times\overline{\Omega}$ . Let $\{H_{t}\}$ be a family of Hilbert spaces for $t\in[0,1]$ , $D$ a Banach space and $i_{t}\colon D\to H_{t}$ linear operators which satisfy the assumptions (H1)–(H3) of Section 3.1. Assume given a family of linear continuous operators $K^{*}_{t}\colon\mathcal{M}(\overline{\Omega})\to H_{t}$ such that, for a.e. $t\in[0,1]$ ,

(K1)

$K^{*}_{t}$ is the adjoint of a linear continuous operator $K_{t}\colon H_{t}\to C(\overline{\Omega})$ , 2. (K1’)

$K_{t}^{*}$ is weak*-to-weak continuous, 3. (K2)

$\left\|K_{t}^{*}\right\|\leq C$ for some constant $C>0$ that does not depend on $t$ , 4. (K3)

the map $t\mapsto K^{*}_{t}\rho$ is strongly measurable in the sense of Definition 3.2, for all $\rho\in\mathcal{M}(\overline{\Omega})$ .

We remark that conditions (K1) and (K1’) are equivalent. As before, the space of narrowly continuous curves with values in the measures and in the positive measures are denoted by $C_{\rm w}([0,1];\mathcal{M}(\overline{\Omega}))$ and $C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ respectively. The dynamic inverse problem we aim to regularize is the following: Given some data $f\in L^{2}([0,1];H)$ , find a narrowly continuous curve $t\mapsto\rho_{t}$ such that

[TABLE]

We regularize (15) as follows. Let $\mathcal{M}$ be defined by

[TABLE]

and introduce the convex linear space of triples in $\mathcal{M}$ satisfying the continuity equation

[TABLE]

Definition 4.1 (Regularized problem).

Let $f\in L^{2}([0,1];H)$ and $\alpha,\beta>0$ , $\delta\in(0,\infty]$ . The regularizer of (15) is the functional $J\colon\mathcal{M}\to[0,\infty]$ defined by

[TABLE]

if $(\rho,m,\mu)\in\mathcal{D}$ and $J=\infty$ otherwise. Here $\rho=dt\otimes\rho_{t}$ is the disintegration of $\rho$ with respect to time and $B_{\delta}$ is the transport energy defined in (12).

We will proceed as follows. First, in Section 4.1 we show that the inverse problem in (15) and the functional $J$ in (16) are well defined, in the following sense: Given a triple $(\rho,m,\mu)\in\mathcal{D}$ with finite transport energy $B_{\delta}(\rho,m,\mu)$ , then $\rho\geq 0$ and it disintegrates into $\rho=dt\otimes\rho_{t}$ with $t\mapsto\rho_{t}$ narrowly continuous. For such curves, we show in Lemma 4.2, that $t\mapsto K_{t}^{*}\rho_{t}$ is a measurement belonging to $L^{2}([0,1];H)$ , providing well-definition for (15)–(16). In Section 4.2 we show that

[TABLE]

admits at least one solution, and the minimizer is unique under additional assumptions on the operators $K_{t}^{*}$ . This will be the content of Theorem 4.4. Finally, in Section 4.3, we investigate stability of the solutions to (17) and convergence for vanishing noise level.

4.1. Well-definition

In this section we want to show that the definition of the functional $J$ at (16) is well posed. The first step is to ensure that the fidelity term is well defined, namely, that given $\rho_{t}\in C_{\rm w}([0,1];\mathcal{M}(\overline{\Omega}))$ then the map $t\in[0,1]\mapsto K_{t}^{*}\rho_{t}\in H_{t}$ belongs to $L^{2}([0,1];H)$ . This fact will be established in the following Lemma.

Lemma 4.2.

If $\rho_{t}\in C_{\rm w}([0,1];\mathcal{M}(\overline{\Omega}))$ then $(t\mapsto K_{t}^{*}\rho_{t})\in L^{2}([0,1];H)$ .

Let us postpone the proof for a moment. As a consequence of the Lemma we have the following.

Proposition 4.3.

The functional $J$ at (16) is well defined.

Proof of Lemma 4.2.

Part 1. Let $\rho_{t}\in C_{\rm w}([0,1];\mathcal{M}(\overline{\Omega}))$ . First we show that the map $t\mapsto K_{t}^{*}\rho_{t}$ is strongly measurable according to Definition 3.2. We do so by means of Theorem 3.5, by proving that $t\mapsto K_{t}^{*}\rho_{t}$ is weakly measurable and essentially separably valued.

Claim 1: the map $t\mapsto K_{t}^{*}\rho_{t}$ is weakly measurable as per Definition 3.2, that is, $t\mapsto{\langle K_{t}^{*}\rho_{t},i_{t}\varphi\rangle}_{H_{t}}$ is measurable for every fixed $\varphi\in D$ .

Proof of Claim 1. By definition and (K1) we have ${\langle K_{t}^{*}\rho_{t},i_{t}\varphi\rangle}_{H_{t}}={\langle\rho_{t},K_{t}i_{t}\varphi\rangle}_{\mathcal{M}(\overline{\Omega}),C(\overline{\Omega})}\,.$ Notice that the map $t\in[0,1]\mapsto K_{t}i_{t}\varphi\in C(\overline{\Omega})$ is strongly measurable in the classic sense ([26, Ch II]). To see this, since $C(\overline{\Omega})$ is separable, by the classic Pettis Theorem ([26, Ch II.1, Thm 2]), it is enough to prove that $t\mapsto K_{t}i_{t}\varphi$ is weakly measurable, meaning that $t\mapsto{\langle\rho,K_{t}i_{t}\varphi\rangle}_{\mathcal{M}(\overline{\Omega}),C(\overline{\Omega})}$ is measurable for every fixed $\rho\in\mathcal{M}(\overline{\Omega})$ . The latter holds because ${\langle\rho,K_{t}i_{t}\varphi\rangle}_{\mathcal{M}(\overline{\Omega}),C(\overline{\Omega})}={\langle K_{t}^{*}\rho,i_{t}\varphi\rangle}_{H_{t}}$ and the map $t\mapsto K^{*}_{t}\rho$ is strongly measurable by assumption (K3), and hence weakly measurable by Theorem 3.5. By definition of classic strong measurability, there exists a sequence $\{f_{n}\}$ of step functions $f_{n}\colon[0,1]\to C(\overline{\Omega})$ , such that $f_{n}(t)=\sum_{j=1}^{N_{n}}{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{j,n}}f_{j,n}$ with $\{E_{j,n}\}_{j=1}^{N_{n}}$ measurable partition of $[0,1]$ , $f_{j,n}\in C(\overline{\Omega})$ , and such that

[TABLE]

for a.e. $t\in[0,1]$ . We have that the map $t\mapsto{\langle\rho_{t},f_{n}(t)\rangle}_{\mathcal{M}(\overline{\Omega}),C(\overline{\Omega})}$ is measurable for each fixed $n\in\mathbb{N}$ , since ${\langle\rho_{t},f_{n}(t)\rangle}_{\mathcal{M}(\overline{\Omega}),C(\overline{\Omega})}=\sum_{j=1}^{N_{n}}{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{j,n}}\int_{\overline{\Omega}}f_{j,n}\,d\rho_{t}$ and the maps $t\mapsto\int_{\overline{\Omega}}f_{j,n}\,d\rho_{t}$ are continuous by narrow continuity of $t\mapsto\rho_{t}$ . By Proposition A.3 we have that $\sup_{t\in[0,1]}\left\|\rho_{t}\right\|_{\mathcal{M}(\overline{\Omega})}<\infty$ . Combining this with (18) yields

[TABLE]

as $n\to\infty$ , for a.e. $t\in[0,1]$ . Hence $t\mapsto{\langle\rho_{t},K_{t}i_{t}\varphi\rangle}_{\mathcal{M}(\overline{\Omega}),C(\overline{\Omega})}$ is measurable, being the a.e. limit of measurable maps, and the claim follows.

Claim 2: the map $t\mapsto K_{t}^{*}\rho_{t}$ is essentially separably valued, that is, there exists a measurable set $E\subset[0,1]$ such that $|E|=0$ and a countable set $S\subset D$ with the following property: for every $\varepsilon>0$ and $t\in[0,1]\smallsetminus E$ there exists $\varphi\in S$ such that ${\left\|K_{t}^{*}\rho_{t}-i_{t}\varphi\right\|}_{H_{t}}<\varepsilon$ .

Proof of Claim 2. Let $T\subset[0,1]$ be a countable dense subset. Fix $t\in T$ . By (K3) the map $s\mapsto K_{s}^{*}\rho_{t}$ is strongly measurable and hence essentially separably valued by Theorem 3.5. Therefore there exists a measurable set $E_{t}\subset[0,1]$ with $|E_{t}|=0$ and a countable subset $S_{t}\subset D$ with the following property: for every $\varepsilon>0$ and $s\in[0,1]\smallsetminus E_{t}$ , there exists $\varphi\in S_{t}$ such that

[TABLE]

Denote by $E:=\cup_{t\in T}E_{t}$ . Since $T$ is countable, the set $E$ is measurable, and $|E|=0$ . Moreover let $S^{0}:=\cup_{t\in T}S_{t}$ , so that $S^{0}\subset D$ is countable. Define the set of averages of elements of $S^{0}$ as

[TABLE]

We have that $S\subset D$ is countable. Fix $\varepsilon>0$ , $t\in[0,1]\smallsetminus E$ . The claim follows by showing there exists $\varphi\in S$ such that

[TABLE]

Indeed, by density, there exists a sequence $\{t_{n}\}$ in $T$ such that $t_{n}\to t$ as $n\to\infty$ . Since $\rho_{t}\in C_{\rm w}([0,1];\mathcal{M}(\overline{\Omega}))$ it follows that $\rho_{t_{n}}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\rho_{t}$ in $\mathcal{M}(\overline{\Omega})$ . By weak*-to-weak continuity of $K_{t}^{*}$ we have $K_{t}^{*}\rho_{t_{n}}\rightharpoonup K_{t}^{*}\rho_{t}$ weakly in $H_{t}$ as $n\to\infty$ . By the Banach–Saks property in $H_{t}$ [25, Ch VIII, Thm 1], there exists a subsequence (not relabelled) such that $\frac{1}{n}\sum_{j=1}^{n}K_{t}^{*}\rho_{t_{j}}\to K_{t}^{*}\rho_{t}$ strongly in $H_{t}$ . Hence we can choose $N\in\mathbb{N}$ such that

[TABLE]

Since $\{t_{n}\}$ is a sequence in $T$ , by (19) and the definitions of $S^{0}$ and $E$ , we have that for every $n\in\mathbb{N}$ there exists $\varphi_{n}\in S^{0}$ such that

[TABLE]

Define $\varphi:=\frac{1}{N}\sum_{j=1}^{N}\varphi_{j}$ , so that $\varphi\in S$ . By triangle inequality, linearity of $i_{t}$ , and (21)–(22),

[TABLE]

which yields (20).

Part 2. Since $t\mapsto K_{t}^{*}\rho_{t}$ is strongly measurable, also $t\mapsto{\left\|K_{t}^{*}\rho_{t}\right\|}_{H_{t}}$ is measurable. By (K2) and Proposition A.3 we have

[TABLE]

Hence by Theorem 3.9 we conclude that $K_{t}^{*}\rho_{t}$ is integrable and it belongs to $L^{2}([0,1];H)$ . ∎

Proof of Proposition 4.3.

If $J(\rho,m,\mu)<\infty$ then also $B_{\delta}(\rho,m,\mu)<\infty$ , hence by Proposition 2.6 we have that $\rho\geq 0$ , $m=v_{t}\rho$ , $\mu=g_{t}\rho$ for Borel maps $v_{t}\colon X\to\mathbb{R}^{d}$ , $g_{t}\colon X\to\mathbb{R}$ such that

[TABLE]

By assumption $(\rho,m,\mu)$ solves the continuity equation, hence (Proposition 2.2) $\rho=dt\otimes\rho_{t}$ for some Borel family $\rho_{t}\in\mathcal{M}^{+}(\overline{\Omega})$ . In particular we have $m=dt\otimes(v_{t}\rho_{t})$ and $\mu=dt\otimes(g_{t}\rho_{t})$ . By (23) and Proposition 2.4 we have that $\rho_{t}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ . Therefore $K_{t}^{*}\rho_{t}\in L^{2}([0,1];H)$ by Lemma 4.2, and the first term in $J$ is finite. ∎

4.2. Existence of minimizers

The aim of this section is to prove that the functional $J$ defined in (16) admits at least one minimizer. Such minimizer is unique under suitable hypotheses on the operators $K_{t}^{*}$ . The precise statement is the following.

Theorem 4.4.

Let $f\in L^{2}([0,1];H)$ and $\alpha,\beta>0$ , $\delta\in(0,\infty]$ . Then there exists $(\rho^{*},m^{*},\mu^{*})\in\mathcal{D}$ with $\rho^{*}=dt\otimes\rho_{t}^{*}$ , $(t\mapsto\rho^{*}_{t})\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ , that solves the minimization problem

[TABLE]

If in addition $K_{t}^{*}$ is injective for a.e. $t\in[0,1]$ , then the minimizer is unique.

The proof of the above theorem is based on the direct method of calculus of variations. Before proceeding to the proof, we will establish compactness and lower semicontinuity properties for the functional $J$ . This is the object of the following two lemmas.

Lemma 4.5 (Compactness for $J$ ).

Let $f\in L^{2}([0,1];H)$ and $\alpha,\beta>0$ , $\delta\in(0,\infty]$ . Assume that there exists a constant $E\geq 0$ such that the sequence $\{(\rho^{n},m^{n},\mu^{n})\}$ in $\mathcal{M}$ satisfies

[TABLE]

Then $\rho^{n}=dt\otimes\rho_{t}^{n}$ for some $(t\mapsto\rho_{t}^{n})\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ . Moreover there exists $(\rho,m,\mu)\in\mathcal{D}$ with $\rho=dt\otimes\rho_{t}$ , $(t\mapsto\rho_{t})\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ such that, up to subsequences,

[TABLE]

Proof.

By the energy bound (24), there exists $\rho\in\mathcal{M}(X)$ such that, up to subsequences, $\rho^{n}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\rho$ weakly* in $\mathcal{M}(X)$ . From (24) we also have

[TABLE]

so that Proposition 2.6 implies $\rho^{n}\geq 0$ , $m^{n}=v_{t}^{n}\rho^{n}$ , $\mu^{n}=g_{t}^{n}\rho^{n}$ for Borel maps $v_{t}^{n}\colon X\to\mathbb{R}^{d}$ , $g_{t}^{n}\colon X\to\mathbb{R}$ such that

[TABLE]

By (24) we have $(\rho^{n},m^{n},\mu^{n})\in\mathcal{D}$ . Hence Proposition 2.2 implies that $\rho^{n}=dt\otimes\rho^{n}_{t}$ for some Borel family $\{\rho_{t}^{n}\}_{t\in[0,1]}$ in $\mathcal{M}^{+}(\overline{\Omega})$ . In particular we have $m^{n}=dt\otimes(v_{t}^{n}\rho_{t}^{n})$ and $\mu=dt\otimes(g_{t}^{n}\rho_{t}^{n})$ . Hence by (27) and Proposition 2.4 we get that $(t\mapsto\rho_{t}^{n})\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ . Now notice that if $\rho\in\mathcal{M}^{+}(X)$ , then by definition of $B_{\delta}$ (see (12)) we infer

[TABLE]

where $K_{\delta}$ is defined in (11) and

[TABLE]

Therefore, by taking $0<\varepsilon<\sqrt{2\beta}\min\{1,\delta\}$ we conclude

[TABLE]

By combining (24) with the above estimates we conclude that (up to subsequences) $m^{n}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}m$ and $\mu^{n}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\mu$ for some $m\in\mathcal{M}(X;\mathbb{R}^{d}),\mu\in\mathcal{M}(X)$ . Since $\mathcal{D}$ is weak* closed, we get $(\rho,m,\mu)\in\mathcal{D}$ . By Proposition 2.6 the functional $B_{\delta}$ is weak* lower semicontinuous. Therefore (26) implies $B_{\delta}(\rho,m,\mu)<\infty$ and by repeating the arguments above, we get that $\rho=dt\otimes\rho_{t}$ , $m=dt\otimes(v_{t}\rho_{t})$ , $\mu=dt\otimes(g_{t}\rho_{t})$ with $\rho_{t}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ and

[TABLE]

In particular $(\rho,m,\mu)\in\mathcal{D}$ . We will now show the second condition in (25). Since $(\rho^{n},m^{n},\mu^{n})$ solves the continuity equation, by Proposition 2.2 we have that the map $t\mapsto\rho_{t}^{n}(\overline{\Omega})$ belongs to $BV((0,1))$ with distributional derivative given by $\pi_{\#}\mu^{n}$ , where we recall that $\pi\colon X\to(0,1)$ is the projection on the time coordinate. Therefore, by the embedding of $BV((0,1))$ into $L^{\infty}((0,1))$

[TABLE]

where we used (28) and (24). Hence the set $\{\rho_{t}^{n}\}_{t,n}$ is uniformly bounded in $\mathcal{M}(\overline{\Omega})$ , so it belongs to some set $K\subset\mathcal{M}(\overline{\Omega})$ which is weak* sequentially compact. Moreover as a consequence of Lemma A.2 we have that for every $t,s\in[0,1]$

[TABLE]

where $E_{n},C_{n}$ are the constants defined in the same lemma. The last inequality follows from (27) and the fact that $\{\rho_{t}^{n}(\overline{\Omega})\}_{t,n}$ is uniformly bounded, so that the constant $C>0$ does not depend on $n$ . Hence by Proposition A.4 there exists a subsequence (not relabelled) and a $C^{1}(\overline{\Omega})^{*}$ -continuous curve $\tilde{\rho}_{t}\colon[0,1]\to\mathcal{M}(\overline{\Omega})$ such that

[TABLE]

In particular $\tilde{\rho}_{t}$ is narrowly continuous, since it is $C^{1}(\overline{\Omega})^{*}$ -continuous (this fact can be obtained by repeating the same argument given in the proof of Proposition 2.4). Notice that (30) implies that $\rho^{n}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\tilde{\rho}$ where $\tilde{\rho}:=dt\otimes\tilde{\rho}_{t}$ . Hence $\tilde{\rho}=\rho$ . By uniqueness of the disintegration we also get that $\rho_{t}=\tilde{\rho}_{t}$ and the thesis follows. ∎

Lemma 4.6 (Lower semicontinuity for $J$ ).

Let $f\in L^{2}([0,1];H)$ and $\alpha,\beta>0$ , $\delta\in(0,\infty]$ . Assume that $(\rho^{n},m^{n},\mu^{n})\in\mathcal{D}$ with $\rho^{n}=dt\otimes\rho^{n}_{t}$ , $(t\mapsto\rho_{t}^{n})\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ is such that $(\rho^{n},m^{n},\mu^{n})$ converges to $(\rho,m,\mu)$ in the sense of (25), where $\rho=dt\otimes\rho_{t}$ , $(t\mapsto\rho_{t})\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ . Then we have

[TABLE]

In particular $J$ is lower semicontinuous with respect to the convergence in (25), that is,

[TABLE]

Proof.

Let us start by showing (31). To this end, fix $g\in L^{2}([0,1];H)$ . By assumption we have that $\rho_{t}^{n}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\rho_{t}$ weakly* in $\mathcal{M}(\overline{\Omega})$ , for every $t\in[0,1]$ . In particular, by (K1’), we have $K_{t}^{*}\rho_{t}^{n}\rightharpoonup K_{t}^{*}\rho_{t}$ weakly in $H_{t}$ for a.e. $t\in[0,1]$ , so that

[TABLE]

as $n\to\infty$ . By proceeding as in (29) we obtain

[TABLE]

for some constant $C\geq 0$ , since $\rho^{n}$ and $\mu^{n}$ are uniformly bounded by weak* convergence in $\mathcal{M}(X)$ . Hence by Cauchy–Schwarz and by (K2) we have

[TABLE]

Since $g\in L^{2}([0,1];H)$ we have that $t\mapsto{\left\|g(t)\right\|}_{H_{t}}$ belongs to $L^{1}((0,1))$ . By combining the above estimate with (33) and invoking the classic dominated convergence theorem we conclude (31).

Let us now prove the remaining part of the Lemma. From (31) we have that $K_{t}^{*}\rho_{t}^{n}-f_{t}$ converges weakly to $K_{t}^{*}\rho_{t}-f_{t}$ in $L^{2}([0,1];H)$ , therefore by lower semicontinuity of the norm with respect to the weak convergence we have

[TABLE]

Moreover $B_{\delta}$ is weak* lower semicontinuous by Proposition 2.6, thus

[TABLE]

and (32) follows. ∎

Proof of Theorem 4.4.

Existence: Set $\rho:=dt\otimes\sigma$ with $\sigma\in\mathcal{M}^{+}(\overline{\Omega})$ . Then $(\rho,0,0)\in\mathcal{D}$ , and $B_{\delta}(\rho,0,0)=0$ by (iv) in Proposition 2.6. Moreover $(t\mapsto K_{t}^{*}\sigma)\in L^{2}([0,1];H)$ by Lemma 4.2, so that $J(\rho,0,0)<\infty$ and the infimum is finite. Let $(\rho^{n},m^{n},\mu^{n})$ be a minimizing sequence, that is, $J(\rho^{n},m^{n},\mu^{n})\to\inf_{\mathcal{M}}J$ as $n\to\infty$ . Therefore $\sup_{n}J(\rho^{n},m^{n},\mu^{n})\leq E$ for some constant $E\geq 0$ . From Lemma 4.5 we have $\rho^{n}=dt\otimes\rho_{t}^{n}$ and $\rho_{t}^{n}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ . Moreover there exists $(\rho^{*},m^{*},\mu^{*})\in\mathcal{D}$ with $\rho^{*}=dt\otimes\rho_{t}^{*}$ , $(t\mapsto\rho_{t}^{*})\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ such that, up to subsequences, $(\rho^{n},m^{n},\mu^{n})$ converges to $(\rho^{*},m^{*},\mu^{*})$ in the sense of (25). By Lemma 4.6 and the fact that $(\rho^{n},m^{n},\mu^{n})$ is a minimizing sequence we conclude that $(\rho^{*},m^{*},\mu^{*})$ is a minimizer for $J$ .

Uniqueness: Assume that $K_{t}^{*}$ is injective for a.e. $t\in[0,1]$ . The term $B_{\delta}$ is convex by Proposition 2.6. Also the term $\left\|\cdot\right\|_{\mathcal{M}(X)}$ is convex as it is a norm. Since minimizers are necessarily of the form $(\rho,m,\mu)\in\mathcal{D}$ with $\rho\geq 0$ , in order to prove uniqueness it is sufficient to show that

[TABLE]

is strictly convex for $\rho_{t}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ . First, consider $\rho_{t}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ such that $\rho_{t}\not\equiv 0$ . As a consequence the set $E:=\{t\in[0,1]\,\colon\,\rho_{t}(\overline{\Omega})\neq 0\}$ is open (by continuity of $t\mapsto\rho_{t}(\overline{\Omega})$ ) and not empty, so $|E|>0$ . Let $F:=\left\{t\in[0,1]\,\colon\,K_{t}^{*}\,\text{ is injective}\right\}$ . By assumption we have $|[0,1]\smallsetminus F|=0$ . Therefore

[TABLE]

since $K_{t}^{*}\rho_{t}\neq 0$ for $t\in E\cap F$ . Now let $\rho_{t}^{1},\rho_{t}^{2}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ with $\rho^{1}_{t}\not\equiv\rho^{2}_{t}$ and $\lambda\in(0,1)$ . The coefficient of the leading term of $\lambda\mapsto\left\|K_{t}^{*}(\lambda\rho^{1}_{t}+(1-\lambda)\rho^{2}_{t})\right\|_{L^{2}}^{2}$ is $\left\|K_{t}^{*}(\rho_{t}^{1}-\rho_{t}^{2})\right\|_{L^{2}}^{2}$ , which is non zero by (35), since $\rho_{t}^{1}\not\equiv\rho_{t}^{2}$ . Hence the map in (34) is strictly convex and we conclude. ∎

4.3. Regularization properties

In this section we denote by $f^{\dagger}\in L^{2}([0,1];H)$ the exact data and by $f^{\gamma}\in L^{2}([0,1];H)$ the noisy data for the noise level $\gamma>0$ , that is, $\left\|f^{\gamma}-f^{\dagger}\right\|_{L^{2}}\leq\gamma$ . For a datum $f\in L^{2}([0,1];H)$ and parameters $\alpha,\beta>0$ we adopt the following notation:

[TABLE]

if $(\rho,m,\mu)\in\mathcal{D}$ and $J_{\alpha,\beta,f}(\rho,m,\mu)=\infty$ otherwise.

Theorem 4.7 (Stability).

Assume that $f^{n}\to f^{\gamma}$ strongly in $L^{2}([0,1];H)$ and that

[TABLE]

Then $(\rho^{n},m^{n},\mu^{n})\in\mathcal{D}$ with $\rho^{n}=dt\otimes\rho_{t}^{n}$ , $\rho_{t}^{n}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ . Moreover $(\rho^{n},m^{n},\mu^{n})$ admits a subsequence converging in the sense of (25). The limit of each converging subsequence of $(\rho^{n},m^{n},\mu^{n})$ is a minimizer of $J_{\alpha,\beta,f^{\gamma}}$ .

Proof.

A sequence $(\rho^{n},m^{n},\mu^{n})\in\mathcal{D}$ satisfying (36) exists by Theorem 4.4. By the same theorem it also follows that $\rho^{n}=dt\otimes\rho_{t}^{n}$ with $\rho_{t}^{n}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ . We have

[TABLE]

Since $(\rho^{n},m^{n},\mu^{n})$ is a minimizer for $J_{\alpha,\beta,f^{n}}$ we can test (36) against $(0,0,0)\in\mathcal{D}$ to obtain

[TABLE]

where last inequality follows from the convergence $f^{n}\to f^{\gamma}$ in $L^{2}([0,1];H)$ . By applying Lemma 4.5 to $J_{\alpha,\beta,f^{\gamma}}$ , there exists $(\tilde{\rho},\tilde{m},\tilde{\mu})\in\mathcal{D}$ , with $\tilde{\rho}=dt\otimes\tilde{\rho}_{t}$ , $\tilde{\rho}_{t}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ such that $(\rho^{n},m^{n},\mu^{n})$ converges to $(\tilde{\rho},\tilde{m},\tilde{\mu})$ in the sense of (25). We are left to show that $(\tilde{\rho},\tilde{m},\tilde{\mu})$ is a minimizer for $J_{\alpha,\beta,f^{\gamma}}$ . Since $\rho_{t}^{n}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\tilde{\rho}_{t}$ for every $t\in[0,1]$ , by Lemma 4.6 and the convergence $f^{n}\to f^{\gamma}$ we have $(K_{t}^{*}\rho_{t}^{n}-f^{n}_{t})\rightharpoonup(K_{t}^{*}\tilde{\rho}_{t}-f^{\gamma}_{t})$ weakly in $L^{2}([0,1];H)$ . Also by Lemma 4.6,

[TABLE]

for every $(\rho,m,\mu)\in\mathcal{M}$ , since (36) holds and $f^{n}\to f^{\gamma}$ . Hence $(\tilde{\rho},\tilde{m},\tilde{\mu})$ is a minimizer. ∎

We are now interested in studying properties of the minimizers of $J_{\alpha,\beta,f^{\gamma}}$ for vanishing noise level, that is, for data such that $\left\|f^{\gamma}-f^{\dagger}\right\|_{L^{2}}\leq\gamma$ for every $\gamma\geq 0$ . To this end, we need to understand how the regularization term

[TABLE]

behaves for fixed argument $(\rho,m,\mu)\in\mathcal{D}$ . Since multiple parameters are involved, we will also allow $\alpha$ and $\beta$ to take the value $\infty$ . We define

[TABLE]

where $I_{\{0\}}$ denotes the convex indicator function of the set ${\{0\}}$ . In order to give a similar definition for the case $\alpha=\infty$ we first need to characterize the subset of $\mathcal{D}$ where $B_{\delta}(\rho,m,\mu)=0$ .

Proposition 4.8.

Assume that $(\rho,m,\mu)\in\mathcal{D}$ . We have that $B_{\delta}(\rho,m,\mu)=0$ if and only if $m=0$ , $\mu=0$ and $\rho=dt\otimes\sigma$ for some $\sigma\in\mathcal{M}^{+}(\overline{\Omega})$ .

Proof.

By Proposition 2.6 point (iv) we have that $B_{\delta}(dt\otimes\sigma,0,0)=0$ for any $\sigma\in\mathcal{M}^{+}(\overline{\Omega})$ . Conversely, assume that $(\rho,m,\mu)\in\mathcal{D}$ is such that $B_{\delta}(\rho,m,\mu)=0$ . In particular the energy is finite, so points (iii)–(iv) of Proposition 2.6 imply that $\rho\geq 0$ , $m=v\rho$ , $\mu=g\rho$ for some Borel maps $v\colon X\to\mathbb{R}^{d}$ , $g\colon X\to\mathbb{R}$ , and we have $B_{\delta}(\rho,m,\mu)=\frac{1}{2}\int_{X}(|v|^{2}+\delta^{2}g^{2})\,d\rho$ . Since $B_{\delta}(\rho,m,\mu)=0$ and $\rho\geq 0$ , we conclude that $m=0$ and $\mu=0$ . By assumption $(\rho,0,0)$ solves the continuity equation in the sense of (8), therefore Proposition 2.2 guarantees that $\rho=dt\otimes\rho_{t}$ for some Borel family $\rho_{t}\in\mathcal{M}^{+}(\overline{\Omega})$ . Since $v=0$ and $g=0$ a.e. in $X$ , we can apply Proposition 2.4 and conclude that $(t\mapsto\rho_{t})\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ . In particular, for every $0\leq t_{1}\leq t_{2}\leq 1$ and $\varphi(t,x):=a(x)$ with $a\in C^{1}(\overline{\Omega})$ , formula (10) reads $\int_{\overline{\Omega}}a(x)\,d\rho_{t_{1}}(x)=\int_{\overline{\Omega}}a(x)\,d\rho_{t_{2}}(x)$ . By a density argument one can show that the previous holds for all $a\in C(\overline{\Omega})$ , and hence $\rho_{t}=\rho_{0}$ for every $t\in[0,1]$ . The thesis follows by setting $\sigma:=\rho_{0}$ . ∎

Proposition 4.8 motivates the following definition:

[TABLE]

where $Z:=\left\{(\rho,0,0)\in\mathcal{D}\,\colon\,\rho=dt\otimes\sigma\,,\,\,\sigma\in\mathcal{M}^{+}(\overline{\Omega})\right\}\,.$ We are now in the position to define minimal energy solutions of the inverse problem

[TABLE]

Definition 4.9 (Minimal energy solution).

Let $f^{\dagger}\in L^{2}([0,1];H)$ and $\alpha^{*},\beta^{*}\in[1,\infty]$ , $\delta\in(0,\infty]$ . We say that $(\rho^{\dagger},m^{\dagger},\mu^{\dagger})\in\mathcal{M}$ is a minimal energy solution of (39) with parameters $\alpha^{*},\beta^{*}$ if

[TABLE]

In the following theorem we show that the minimizers for vanishing noise level converge in the sense of (25) to an energy minimizing solution of the inverse problem (39).

Theorem 4.10 (Convergence for vanishing noise level).

Let $f^{\dagger}\in L^{2}([0,1];H)$ be the exact data and $\{f^{n}\}$ be a sequence of noisy data such that $\left\|f^{n}-f^{\dagger}\right\|_{L^{2}}\leq\gamma_{n}$ . Let $\alpha_{n},\beta_{n}>0$ be such that

[TABLE]

Let $c_{n}:=\min\{\alpha_{n},\beta_{n}\}$ , $\tilde{\alpha}_{n}:=\alpha_{n}/c_{n}$ , $\tilde{\beta}_{n}:=\beta_{n}/c_{n}$ so that, up to subsequences, $\tilde{\alpha}_{n}\to\alpha^{*}$ and $\tilde{\beta}_{n}\to\beta^{*}$ as $n\to\infty$ , with $\alpha^{*},\beta^{*}\in[1,\infty]$ . Assume there exists $(\rho^{\dagger},m^{\dagger},\mu^{\dagger})\in\mathcal{D}$ satisfying $\rho^{\dagger}=dt\otimes\rho^{\dagger}_{t}$ , $\rho^{\dagger}_{t}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ , (39) and

[TABLE]

Let $(\rho^{n},m^{n},\mu^{n})\in\mathcal{M}$ be such that

[TABLE]

Then $\rho^{n}=dt\otimes\rho_{t}^{n}$ with $\rho_{t}^{n}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ , and $(\rho^{n},m^{n},\mu^{n})$ converges to $(\rho^{*},m^{*},\mu^{*})$ in the sense of (25), up to subsequences. Moreover, every such weak limit of $(\rho^{n},m^{n},\mu^{n})$ is a minimal energy solution of (39) with parameters $\alpha^{*}$ and $\beta^{*}$ .*

Proof.

First notice that $c_{n}\to 0$ and $\gamma^{2}_{n}/c_{n}\to 0$ as $n\to\infty$ by (40). If $\tilde{\alpha}_{n}\to\infty$ or $\tilde{\beta}_{n}\to\infty$ , we set $\alpha^{*}:=\infty$ and $\beta^{*}:=\infty$ respectively. If either of the sequences do not diverge to $\infty$ , it is possible to find accumulation points $\alpha^{*},\beta^{*}\in[1,\infty)$ . In particular, up to extracting a subsequence, we can assume that $\tilde{\alpha}_{n}\to\alpha^{*}$ and $\tilde{\beta}_{n}\to\beta^{*}$ as $n\to\infty$ . A sequence $(\rho^{n},m^{n},\mu^{n})$ satisfying (42) exists by Theorem 4.4. By the same theorem it also follows that $\rho^{n}=dt\otimes\rho_{t}^{n}$ with $\rho_{t}^{n}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ . By testing (42) against $(\rho^{\dagger},m^{\dagger},\mu^{\dagger})$ and using (39)–(40) we get

[TABLE]

In particular $(K_{t}^{*}\rho_{t}^{n}-f_{t}^{n})\to 0$ in $L^{2}([0,1];H)$ . Since by assumption $f^{n}\to f^{\dagger}$ , we obtain $K_{t}^{*}\rho^{n}_{t}\to f_{t}^{\dagger}$ in $L^{2}([0,1];H)$ . Dividing the inequality at (43) by $c_{n}$ , taking the limes superior and keeping (40) intro account yields

[TABLE]

Notice that the right hand side in (44) is always bounded, thanks to definitions (37), (38) and assumption (41). By definition we have $\tilde{\alpha}_{n},\tilde{\beta}_{n}\geq 1$ , thus

[TABLE]

where we used (44) and the convergence $K_{t}^{*}\rho^{n}_{t}\to f_{t}^{\dagger}$ . Therefore an application of Lemma 4.5 guarantees the existence of $(\rho^{*},m^{*},\mu^{*})\in\mathcal{D}$ with $\rho^{*}=dt\otimes\rho^{*}_{t}$ , $\rho^{*}_{t}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ , such that, up to subsequences, $(\rho^{n},m^{n},\mu^{n})$ converges to $(\rho^{*},m^{*},\mu^{*})$ in the sense of (25). In particular by Lemma 4.6 we have $K_{t}^{*}\rho^{n}_{t}\rightharpoonup K_{t}^{*}\rho^{*}_{t}$ weakly in $L^{2}([0,1];H)$ . Since we already proved that $K_{t}^{*}\rho^{n}_{t}\to f^{\dagger}_{t}$ , by uniqueness of the weak limit we have

[TABLE]

We are left to show that $(\rho^{*},m^{*},\mu^{*})$ is an energy minimizing solution of (39). By Lemma 4.6

[TABLE]

where we used (44) and that $\tilde{\alpha}_{n}\to\alpha^{*}$ , $\tilde{\beta}_{n}\to\beta^{*}$ . Replacing $(\rho^{\dagger},m^{\dagger},\mu^{\dagger})$ by an arbitrary solution of (39) with finite energy $B_{\delta}$ , the argument can be repeated, and from (45)–(46) we conclude that $(\rho^{*},m^{*},\mu^{*})$ is an energy minimizing solution of (39). ∎

5. Application to dynamic undersampled MRI

We will now detail on the application of the above results to dynamic magnetic resonance imaging as outlined in the introduction. Let $\Omega\subset\mathbb{R}^{2}$ be an open bounded domain representing the image frame, and $c_{j}\in C_{0}(\mathbb{R}^{2};\mathbb{C})$ for $j=1,\dots,N$ with $N\geq 1$ be the coil sensitivities. Let $\sigma_{t}\in\mathcal{M}^{+}(\mathbb{R}^{2})$ for $t\in[0,1]$ be a family of measures such that

(M1)

$\left\|\sigma_{t}\right\|_{\mathcal{M}(\mathbb{R}^{2})}\leq C$ for a.e. $t\in[0,1]$ , where $C>0$ does not depend on $t$ , 2. (M2)

the map $t\mapsto\int_{\mathbb{R}^{2}}\varphi(x)\,d\sigma_{t}(x)$ is measurable for each $\varphi\in C_{0}(\mathbb{R}^{2};\mathbb{C})$ .

Let $D:=C_{0}(\mathbb{R}^{2};\mathbb{C}^{N})$ be the Banach space normed by $\left\|\varphi\right\|_{\infty}:=\max_{\{j=1,\dots,N\}}\max_{x\in\mathbb{R}^{2}}|\varphi^{j}(x)|$ , where $\varphi=(\varphi^{1},\dots,\varphi^{N})$ . Define Hilbert spaces $H_{t}:=L^{2}_{\sigma_{t}}(\mathbb{R}^{2};\mathbb{C}^{N})$ , equipped with the norm ${\left\|h\right\|}_{H_{t}}^{2}:=\sum_{j=1}^{N}\int_{\mathbb{R}^{2}}|h^{j}(x)|^{2}\,d\sigma_{t}(x)$ , where $h=(h^{1},\dots,h^{N})$ . Define $i_{t}\colon D\to H_{t}$ as the identity map, acting component-wise. Note that here we are interpreting $D$ and $H_{t}$ as real vector spaces. For a measure $\rho\in\mathcal{M}(\overline{\Omega};\mathbb{C})$ we denote its Fourier transform as

[TABLE]

so that $\mathscr{F}\rho\in C(\mathbb{R}^{2};\mathbb{C})$ . Notice that in the above definition we extend $\rho$ to be zero outside of $\overline{\Omega}$ . For each $t\in[0,1]$ , define the linear operator $K_{t}^{*}\colon\mathcal{M}(\overline{\Omega})\to H_{t}$ as

[TABLE]

In the MRI context, the family of measures $\rho_{t}\in\mathcal{M}^{+}(\overline{\Omega})$ for $t\in[0,1]$ represents the proton density at each time step. Given some data $f\in L^{2}([0,1];H)$ , we want to reconstruct a solution to the dynamic inverse problem

[TABLE]

As proposed in the previous sections, we relax the problem to measures $\rho\in\mathcal{M}(X)$ , with $X:=(0,1)\times\overline{\Omega}$ , and minimize the functional $J$ introduced in (16). Under the assumptions (M1)–(M2) the functional $J$ admits at least one minimizer, and minimizers are unique under suitable additional assumptions. This claim is the object of the following theorem.

Theorem 5.1.

Let $\alpha,\beta>0$ , $\delta\in(0,\infty]$ , $f\in L^{2}([0,1];H)$ . Let $\{\sigma_{t}\}$ for $t\in[0,1]$ be a family of Radon measures in $\mathcal{M}^{+}(\mathbb{R}^{2})$ satisfying (M1)–(M2). Let $c_{1},\dots,c_{N}\in C_{0}(\mathbb{R}^{2};\mathbb{C})$ be coil sensitivities. Then the regularization of (48) according to

[TABLE]

admits a solution $(\rho^{*},m^{*},\mu^{*})\in\mathcal{D}$ with $\rho^{*}=dt\otimes\rho_{t}^{*}$ and the curve $t\mapsto\rho_{t}^{*}$ belonging to $C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ . If in addition the supports of the measures $\sigma_{t}$ have non empty interior for a.e. $t\in[0,1]$ , and the vector of coil sensitivities satisfies $c(x)\neq 0$ for every $x\in\overline{\Omega}$ , then the minimizer is unique.

Before proceeding with the proof, we want to show how this analytical framework allows us to treat a wide variety of sampling patterns. We will give two examples.

Example 5.2 (Continuous sampling).

Let $\Omega:=(-\frac{1}{2},\frac{1}{2})^{2}$ and for $t\in[0,1]$ define the line $L_{t}:=(-\frac{1}{2},\frac{1}{2})\times\{t-\frac{1}{2}\}$ . Set $\sigma_{t}:=\mathcal{H}^{1}\mathbin{\vrule height=6.88889pt,depth=0.0pt,width=0.55974pt\vrule height=0.55974pt,depth=0.0pt,width=5.59721pt}L_{t}$ , that is, the restriction of the $1$ -dimensional Hausdorff measure to the lines $L_{t}$ . It is immediate to check that $\sigma_{t}\in\mathcal{M}^{+}(\mathbb{R}^{2})$ satisfies (M1)–(M2): indeed $\left\|\sigma_{t}\right\|_{\mathcal{M}^{+}(\mathbb{R}^{2})}=\mathcal{H}^{1}(L_{t})=1$ , while the map $t\mapsto\int_{\mathbb{R}^{2}}\varphi(x)d\sigma_{t}(x)$ is continuous for $\varphi$ in $C_{0}(\mathbb{R}^{2};\mathbb{C})$ . In the same way we can treat radial sampling, by setting $L_{t}$ to be a collection of diameters through the origin, evolving in time (see Example B.3).

Example 5.3 (Compressed-sensing sampling).

In this example we propose to sample along a finite collection of moving points in an open bounded domain $\Omega\subset\mathbb{R}^{2}$ . To be more specific, fix $M\in\mathbb{N}$ , $M\geq 1$ and for every $j=1,\dots,M$ let $t\in[0,1]\mapsto x_{t}^{j}\in\Omega$ be a measurable curve. For a.e. $t\in[0,1]$ define $P_{t}:=\{x_{t}^{1},\dots,x_{t}^{M}\}$ and $\sigma_{t}:=\mathcal{H}^{0}\mathbin{\vrule height=6.88889pt,depth=0.0pt,width=0.55974pt\vrule height=0.55974pt,depth=0.0pt,width=5.59721pt}P_{t}=\sum_{j=1}^{M}\delta_{x_{t}^{j}}$ . Notice that (M1) is satisfied since $\left\|\sigma_{t}\right\|_{\mathcal{M}(\mathbb{R}^{2})}=M$ . Given a map $\varphi\in C_{0}(\mathbb{R}^{2};\mathbb{C})$ we have that $t\mapsto\int_{\mathbb{R}^{2}}\varphi(x)\,d\sigma_{t}(x)=\sum_{j=1}^{M}\varphi(x_{t}^{j})$ is measurable by construction. Therefore also (M2) is satisfied.

We now want to prove Theorem 5.1. Before that, we need a preliminary lemma, stating that under (M1)–(M2) the above definitions of $D,H_{t},i_{t},K_{t}^{*}$ satisfy assumptions (H1)–(H3) and (K1)–(K3).

Lemma 5.4.

Assume that (M1)–(M2) hold. The spaces $D$ , $H_{t}$ and the operators $i_{t}$ satisfy (H1)–(H3). Moreover the operators $K_{t}^{*}$ in (47) satisfy (K1)–(K3).

Proof.

Notice that $i_{t}$ is linear and continuous, with $\left\|i_{t}\right\|^{2}\leq N\left\|\sigma_{t}\right\|_{\mathcal{M}(\mathbb{R}^{2})}$ . In particular (H1) follows from (M1). Moreover (H2) is trivially satisfied. Finally for $\varphi,\psi\in D$ we have

[TABLE]

which is measurable by (M2), as it is the real part of a sum of measurable maps. Hence (H3) is also satisfied. Let us now show that (K1)–(K3) hold. For $\rho\in\mathcal{M}(\overline{\Omega})$ we have

[TABLE]

Hence, each $\mathscr{F}(c_{j}\rho)$ is square integrable with respect to $\sigma_{t}$ , so that $K_{t}^{*}$ maps $\mathcal{M}(\overline{\Omega})$ into $H_{t}$ . Moreover, by the above estimate we also have

[TABLE]

where $c:=(c_{1},\dots,c_{N})$ is the vector of coil sensitivities. Therefore $K^{*}_{t}$ is continuous, with $\left\|K^{*}_{t}\right\|^{2}\leq\frac{N}{4\pi^{2}}\left\|c\right\|_{\infty}^{2}\left\|\sigma_{t}\right\|_{\mathcal{M}(\mathbb{R}^{2})}$ and (K2) is satisfied because of assumption (M1). Let us show that $K_{t}^{*}$ is weak*-to-weak continuous. To this end, let $\{\rho_{n}\}$ in $\mathcal{M}(\overline{\Omega})$ be such that $\rho_{n}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\rho$ . Since $\rho_{n},\rho$ are supported in the compact set $\overline{\Omega}$ , it follows that

[TABLE]

as $n\to\infty$ . Moreover, by weak* convergence we have $\sup_{n}\left\|\rho_{n}\right\|_{\mathcal{M}(\overline{\Omega})}<\infty$ . As a consequence of (49), there exists a constant $C\geq 0$ such that

[TABLE]

By invoking the dominated convergence theorem in conjunction with (50)–(51) we conclude that $K_{t}^{*}\rho_{n}\rightharpoonup K_{t}^{*}\rho$ weakly in $L^{2}_{\sigma_{t}}(\mathbb{R}^{2};\mathbb{C}^{N})$ . Hence (K1’) is satisfied and, as a consequence, $K_{t}^{*}$ is the adjoint of some linear continuous operator $K_{t}\colon H_{t}\to C(\overline{\Omega})$ . Finally we need to show (K3): that the map $t\mapsto K_{t}^{*}\rho$ is strongly measurable according to Definition 3.2, for every fixed $\rho\in\mathcal{M}(\overline{\Omega})$ . Notice that the space $D=C_{0}(\mathbb{R}^{2};\mathbb{C}^{N})$ is separable, hence by Proposition 3.7 it is sufficient to show that $t\mapsto K_{t}^{*}\rho$ is weakly measurable according to Definition 3.2. However, since $\mathscr{F}(c_{j}\rho)\in C(\mathbb{R}^{2};\mathbb{C})$ is bounded (see (49)) and maps in $D$ are continuous, this is an immediate consequence of (M2). ∎

Proof of Theorem 5.1.

The existence of a minimizer follows from Proposition 5.4 and of Theorem 4.4. For the uniqueness, from Theorem 4.4, it is sufficient to check that the operators $K_{t}^{*}\colon\mathcal{M}(\overline{\Omega})\to H_{t}$ are injective for a.e. $t\in[0,1]$ . To this end, choose $t\in[0,1]$ such that $\operatorname{supp}\sigma_{t}$ has non empty interior, and let $\rho\in\mathcal{M}(\overline{\Omega})$ be such that $K_{t}^{*}\rho=0$ . In particular $\mathscr{F}(c_{j}\rho)=0$ in $\operatorname{supp}\sigma_{t}$ , for every $j=1,\dots,N$ . Since $\mathscr{F}(c_{j}\rho)$ is analytic and $\operatorname{supp}\sigma_{t}$ contains an open ball, we conclude that $\mathscr{F}(c_{j}\rho)=0$ in $\mathbb{R}^{2}$ . By injectivity of the Fourier transform we have that $c_{j}\rho=0$ , and since we are assuming that $c(x)\neq 0$ for every $x\in\overline{\Omega}$ , we conclude that $\rho=0$ . ∎

6. Conclusions and perspectives

In the paper, we have shown that it is possible to successfully use energy functionals that are associated with a dynamic formulation of optimal transport as regularization functionals for dynamic inverse problems that aim at the recovery of measure-valued curves. Let us point out some future directions of research. On the one hand, the focus of the paper is on regularizers that penalize mass transport by the squared distance as well as mass growth in terms of quadratic costs for growth rate. Thus, a generalization to other convex optimal transport energies (such as, e.g., the $p$ -th power of the euclidean distance) in an appropriate dynamic context (i.e., where the dynamic formulation involves a continuity type equation), would be interesting and seems to be possible. On the other hand, the regularized problems involve, in addition to the transport energy, a Radon-norm term which corresponds to a penalization of the total mass. Also here, a generalization to other regularization functionals should be possible, provided that one can still ensure boundedness of the total mass. This way, it might be possible to impose, e.g., spatial smoothness of the solution curve.

Finally, we would like to mention that a numerical optimization algorithm for the solution of the regularized problem is currently under preparation, in the general setting and also with focus on the application to dynamic MRI. The numerical solution to the dynamic optimal transport problem outside the inverse problems context is addressed for example in [8, 20, 49], where the authors employ proximal splitting algorithms, after a careful discretization of the continuity equation constraint by means of staggered grids, and, subsequently, of the energy. The recent application to dynamic PET imaging [59] also builds on these algorithms. The method we propose, based on Frank-Wolfe-type algorithms [14, 28, 33], is inherently discretization-free and therefore genuinely cast in the space of Radon measures, in the spirit of [15, 52]. Generally, such algorithms are attractive for obtaining sparse solutions for inverse problems in terms of extremal points of the regularizer [9, 10]. In our setting, the idea is to linearize the fidelity term in (3) around some initial guess, and then proceed to minimize (a suitably modified version) of the functional obtained. The key part of the analysis is that such linearized problem admits a solution which is an extremal point of the unit ball of the regularizer, making the minimization problem numerically accessible, although non-convex in general. Indeed, as shown in [11], the extremal points of the Benamou-Brenier regularizer are given by measures concentrated on curves (with a certain regularity), yielding an optimization problem in some Sobolev space. It is then possible to show convergence of the algorithm in the space of measures. Such an approach is then particularly well-suited to recover sparse solutions, given by travelling Dirac deltas with the addition of noise, making it attractive for medical applications in which the tracking of point-sources is relevant. With this approach one can, in principle, deal also with the unbalanced optimal transport case. The key ingredient is of course the characterization of the extremal points of the Wasserstein–Fisher–Rao energy, which is currently under preparation by the authors, and bases on a measure-theoretic superposition principle for the non-homogeneous continuity equation, which is in itself a novel result.

Acknowledgments

The authors gratefully acknowledge support by the Christian Doppler Research Association (CDG) and Austrian Science Fund (FWF) through the Partnership in Research project PIR-27 “Mathematical methods for motion-aware medical imaging”. The Institute of Mathematics and Scientific Computing, to which the authors are affiliated, is a member of NAWI Graz (http://www.nawigraz.at/en/). The authors are further members of/associated with BioTechMed Graz (https://biotechmedgraz.at/en/).

Appendix A Measure theory

A.1. Measure theory preliminaries

In this paper we follow the definitions and notations of [5]. In particular, scalar or vectorial measures will always be defined on the Borel $\sigma$ -algebra $\mathcal{B}(X)$ of some locally compact, separable metric space $X$ . Given a measure $\mu$ , we denote with $|\mu|$ its total variation. We always assume that $|\mu|$ is at least locally finite. The set of $\mathbb{R}^{m}$ -valued measures for which $|\mu|(X)<\infty$ is denoted by $\mathcal{M}(X;\mathbb{R}^{m})$ and $\mathcal{M}(X):=\mathcal{M}(X;\mathbb{R})$ , while the set of positive finite measures is denoted by $\mathcal{M}^{+}(X)$ .

A.1.1. Absolute continuity, support and restriction

Given $\mu\in\mathcal{M}(X)$ and $\nu\in\mathcal{M}(X;\mathbb{R}^{m})$ , we say that $\nu$ is absolutely continuous with respect to $\mu$ , in symbols $\nu\ll\mu$ , if $|\nu|(E)=0$ whenever $\mu(E)=0$ , $E\in\mathcal{B}(X)$ . If $\mu$ and $\sigma$ are real or vector valued measures, we say that they are mutually singular, $\mu\perp\nu$ , if there exists a set $E\in\mathcal{B}(X)$ such that $|\mu|(E)=0$ and $|\nu|(X\smallsetminus E)=0$ . For a measure $\mu$ its support, denoted by $\operatorname{supp}\mu$ , is the closure of the set of points $x\in X$ such that $|\mu|(U)>0$ for every neighbourhood $U$ of $x$ . If $A\in\mathcal{B}(X)$ and $\mu\in\mathcal{M}(X;\mathbb{R}^{m})$ , the restriction of $\mu$ to $A$ is the measure $\mu\mathbin{\vrule height=6.88889pt,depth=0.0pt,width=0.55974pt\vrule height=0.55974pt,depth=0.0pt,width=5.59721pt}A$ defined as $\mu\mathbin{\vrule height=6.88889pt,depth=0.0pt,width=0.55974pt\vrule height=0.55974pt,depth=0.0pt,width=5.59721pt}A(E):=\mu(A\cap E)$ for every $E\in\mathcal{B}(X)$ .

A.1.2. Push-forward

Let $Y$ be a locally compact, separable metric space. A map $f\colon X\to Y$ is Borel if $f^{-1}(E)\in\mathcal{B}(X)$ for each $E\in\mathcal{B}(Y)$ . If $\mu$ is a real or vector valued measure on $X$ we define the push-forward of $\mu$ through $f$ as the measure $f_{\#}\mu$ on $Y$ , defined by $f_{\#}\mu(E):=\mu(f^{-1}(E))$ for each $E\in\mathcal{B}(Y)$ . If $\varphi$ is a real or vector valued map on $Y$ integrable with respect to $f_{\#}\mu$ then $\int_{Y}\varphi\,d(f_{\#}\mu)=\int_{X}\varphi\circ f\,d\mu\,.$ We recall that is $f$ is continuous and proper ( $f^{-1}(K)$ is compact if $K\subset Y$ is compact) and $\mu$ is finite, then also $f_{\#}\mu$ is finite.

A.1.3. Convergences

Let $\{\mu_{n}\}$ be a sequence of measures on $X$ . We say that $\mu_{n}$ narrowly converges to $\mu$ , in symbols $\mu_{n}\rightharpoonup\mu$ , if $\int_{X}\varphi\,d\mu_{n}\to\int_{X}\varphi\,d\mu$ for all $\varphi\in C_{b}(X)$ , i.e., $\varphi$ continuous and bounded. We say that $\mu_{n}$ weak converges* to $\mu$ , in symbols $\mu_{n}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\mu$ , if $\int_{X}\varphi\,d\mu_{n}\to\int_{X}\varphi\,d\mu$ for all $\varphi\in C_{0}(X)$ . We recall that $\varphi\in C_{0}(X)$ if $\varphi\in C(X)$ and for each $\varepsilon>0$ there exists $K\subset X$ compact such that $|\varphi|<\varepsilon$ in $X\smallsetminus K$ . Note that if $X$ is compact, then narrow convergence and weak* convergence coincide.

A.1.4. Disintegration

Let $X$ , $Y$ be locally compact, separable metric spaces. Consider $\{\mu_{x}\,\colon\,x\in X\}$ family of measures on $Y$ . We say that the family $\{\mu_{x}\}$ is Borel if the map $x\in X\mapsto\mu_{x}(B)$ is Borel measurable for every $B\subset Y$ measurable. Such condition implies that for every bounded Borel function $\varphi\colon X\times Y\to\mathbb{R}$ , the map $x\in X\mapsto\int_{Y}\varphi(x,y)\,d\mu_{x}(y)$ is Borel measurable (see [5, Prop 2.26]). We now state the disintegration theorem. For this version and the following properties, see [2, Sections 2.3, 2.4] and [5, Thm 2.28].

Theorem A.1 (Disintegration).

Let $X$ , $Y$ be locally compact, separable metric spaces. Let $\mu$ be a real (resp. vector valued) measure on $X\times Y$ , $\pi\colon X\times Y\to X$ the projection on the first factor, and $\nu$ a positive measure on $X$ , with the property that $\pi_{\#}|\mu|\ll\nu$ . Then there exists a Borel family $\{\mu_{x}\,\colon\,x\in X\}$ of real (resp. vector valued) measures on $Y$ such that $\mu=\nu\otimes\mu_{x}$ , that is,

[TABLE]

for every $f\in L^{1}(X\times Y;|\mu|)$ . A family $\{\mu_{x}\}$ such that $\mu=\nu\otimes\mu_{x}$ is called a disintegration of $\mu$ with respect to $\nu$ .

In the setting of Theorem A.1 the following properties hold:

(i)

If $\{\tilde{\mu}_{x}\}$ is another disintegration of $\mu$ with respect to $\nu$ , then $\mu_{x}=\tilde{\mu}_{x}$ for $\nu$ -a.e. $x$ , 2. (ii)

If $\mu$ is finite, then also $\mu_{x}$ is finite for $\nu$ -a.e. $x$ , 3. (iii)

Let $E\in\mathcal{B}(X)$ , $F\in\mathcal{B}(Y)$ . Then $\mu(E\times F)=0$ if and only if $\mu_{x}(F)=0$ for $\nu$ -a.e. $x$ .

A.2. Narrow continuity results

As in Section 2.1 we denote by $C_{\rm w}([0,1];\mathcal{M}(\overline{\Omega}))$ the set of narrowly continuous curves $t\mapsto\rho_{t}$ and by $C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ the set of positive narrowly continuous curves. The remaining notations are the same as in Section 2.

Lemma A.2.

Let $\rho_{t}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ , $\rho:=dt\otimes\rho_{t}$ , $m=dt\otimes(v_{t}\rho_{t})$ , $\mu=dt\otimes(g_{t}\rho_{t})$ , with $v_{t}\colon X\to\mathbb{R}^{d}$ , $g_{t}\colon X\to\mathbb{R}$ measurable. Assume that $\partial_{t}\rho+\operatorname{div}m=\mu$ in the sense of (8) and

[TABLE]

Set $m:=\min_{t\in[0,1]}\rho_{t}(\overline{\Omega}),M:=\max_{t\in[0,1]}\rho_{t}(\overline{\Omega})$ and $C:=4(m+E)$ . Then $M\leq C$ and

[TABLE]

We remark that the above lemma is an easy generalization of Lemma 2.2 in [41], where the authors prove the same result under the restriction that $v_{t}=\nabla_{x}\,g_{t}$ .

Proof.

Since $\rho_{t}\in C_{\rm w}([0,1];\mathcal{M}^{+}(\overline{\Omega}))$ , we have $m,M<\infty$ . Arguing as in the proof of Proposition 2.4, we obtain that for all $\varphi\in C^{1}(\overline{\Omega})$ , the weak derivative of $\rho_{t}(\varphi):=\int_{\overline{\Omega}}\varphi\,d\rho_{t}$ satisfies

[TABLE]

In particular by applying twice the Cauchy–Schwarz inequality we get

[TABLE]

The proof is concluded if we show that $M\leq C$ . By applying the above estimate to $\varphi\equiv 1$ we get $|\rho_{t}(\overline{\Omega})-\rho_{s}(\overline{\Omega})|\leq\sqrt{2ME}$ . If we pick $s$ and $t$ such that $\rho_{s}(\overline{\Omega})=m$ and $\rho_{t}(\overline{\Omega})=M$ , then we get $M\leq m+\sqrt{2ME}\leq m+\sqrt{2}(M+E)/2$ , from which follows $M\leq 4(m+E)=C$ . ∎

Proposition A.3.

If $\rho_{t}\in C_{\rm w}([0,1];\mathcal{M}(\overline{\Omega}))$ then $\sup_{t\in[0,1]}\left\|\rho_{t}\right\|_{\mathcal{M}(\overline{\Omega})}<\infty\,.$

Proof.

The curve $\rho_{t}$ defines a family $\{\rho_{t}\}_{t\in[0,1]}$ of functionals in $\mathcal{M}(\overline{\Omega})$ , via $\varphi\mapsto\int_{\overline{\Omega}}\varphi\,d\rho_{t}$ . By narrow continuity, the map $t\mapsto\int_{\overline{\Omega}}\varphi\,d\rho_{t}$ is continuous for each $\varphi\in C(\overline{\Omega})$ , yielding that $\sup_{t\in[0,1]}\int_{\overline{\Omega}}\varphi\,d\rho_{t}<\infty$ . The principle of uniform boundedness then implies the thesis. ∎

Proposition A.4 (A refined version of Ascoli–Arzelà’s Theorem).

Let $K\subset\mathcal{M}(\overline{\Omega})$ be sequentially weak compact and $\rho^{n}_{t}\colon[0,1]\to\mathcal{M}(\overline{\Omega})$ be such that $\rho_{t}^{n}\in K$ for all $t\in[0,1]$ , $n\in\mathbb{N}$ and*

[TABLE]

where $\omega\colon[0,1]\times[0,1]\to[0,\infty)$ is continuous, symmetric and such that $\omega(t,t)=0$ for every $t\in[0,1]$ . Then there exists a $C^{1}(\overline{\Omega})^{*}$ -continuous curve $\rho_{t}\colon[0,1]\mapsto\mathcal{M}(\overline{\Omega})$ and a subsequence $\{\rho_{t}^{n_{k}}\}$ such that $\rho_{t}^{n_{k}}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\rho_{t}$ for every $t\in[0,1]$ .

The above statement is a particular case of [6, Prop 3.3.1], since $(\mathcal{M}(\overline{\Omega}),\|\cdot\|_{C^{1}(\overline{\Omega})^{*}})$ is a metric space and the $C^{1}(\overline{\Omega})^{*}$ -norm is weak* sequentially lower semicontinuous.

Proof of Proposition 2.4. Let $a\in C^{\infty}_{c}((0,1))$ , $b\in C^{\infty}(\overline{\Omega})$ . Set $\varphi(t,x):=a(t)b(x)$ , so that $\varphi$ is a test function for (8). Define $\rho_{t}(b):=\int_{\overline{\Omega}}b(x)\,d\rho_{t}(x)$ . Notice that $t\mapsto\rho_{t}(b)$ is measurable since $\rho_{t}$ is a Borel family. Moreover $t\mapsto\rho_{t}(b)$ belongs to $L^{1}((0,1))$ since $\int_{0}^{1}|\rho_{t}(b)|\,dt\leq\left\|b\right\|_{C^{1}(\overline{\Omega})}\left\|\rho\right\|_{\mathcal{M}(X)}$ where $\left\|b\right\|_{C^{1}(\overline{\Omega})}:=\max\{\left\|b\right\|_{\infty},\left\|\nabla b\right\|_{\infty}\}$ . Testing (8) against $\varphi$ yields that $\rho_{t}^{\prime}(b)=\int_{\overline{\Omega}}\left[\nabla b(x)\cdot v_{t}(x)+b(x)g_{t}(x)\right]\,d\rho_{t}(x)$ weakly. Notice that for a.e. $t\in[0,1]$ we have

[TABLE]

In particular $V\in L^{1}((0,1))$ by assumption (9), so that $\rho_{t}(b)\in W^{1,1}((0,1))$ with

[TABLE]

By the embedding $W^{1,1}((0,1))\hookrightarrow C([0,1])$ , there exists a unique $\tilde{\rho}_{t}(b)\in C([0,1])$ such that $\tilde{\rho}_{t}(b)=\rho_{t}(b)$ for a.e. $t\in[0,1]$ , and $\left\|\tilde{\rho}_{t}(b)\right\|_{\infty}\leq C\left\|b\right\|_{C^{1}(\overline{\Omega})}$ where $C>0$ does not depend on $b$ , thanks to (53). Moreover

[TABLE]

By density of $C^{\infty}(\overline{\Omega})$ in $C^{1}(\overline{\Omega})$ , for each $t\in[0,1]$ the map $b\mapsto\tilde{\rho}_{t}(b)$ can be uniquely extended to an element of ${C^{1}(\overline{\Omega})}^{*}$ , since the maps $b\mapsto\rho_{t}(b)$ are linear and the extensions $\tilde{\rho}_{t}(b)$ are unique. This defines a bounded curve $t\mapsto\tilde{\rho}_{t}$ in ${C^{1}(\overline{\Omega})}^{*}$ . Such curve is uniformly continuous in $[0,1]$ , since (52) and (54) imply $|\tilde{\rho}_{t}(b)-\tilde{\rho}_{s}(b)|\leq\left\|b\right\|_{C^{1}(\overline{\Omega})}\,\int_{s}^{t}V(\lambda)\,d\lambda$ for every $b\in C^{1}(\overline{\Omega})$ . We are left to prove that $\tilde{\rho}_{t}\in\mathcal{M}^{+}(\overline{\Omega})$ for each $t\in[0,1]$ . This follows from the fact that there is a Borel set $E\subset[0,1]$ with $|[0,1]\smallsetminus E|=0$ such that $\rho_{t}\in\mathcal{M}^{+}(\overline{\Omega})$ for every $t\in E$ and $\{\rho_{t}\}_{t\in E}$ is weak* sequentially precompact in $\mathcal{M}^{+}(\overline{\Omega})$ , since by Proposition 2.2 we have that $\{\rho_{t}(\overline{\Omega})\}$ is a.e. bounded as $t\mapsto\rho_{t}(\overline{\Omega})$ belongs to $BV((0,1))$ . The weak* continuity of the curve $t\mapsto\tilde{\rho}_{t}$ in $\mathcal{M}^{+}(\overline{\Omega})$ automatically follows from the one in ${C^{1}(\overline{\Omega})}^{*}$ : indeed let $\varphi\in C(\overline{\Omega})$ and let $\{\varphi_{n}\}$ be a sequence in $C^{1}(\overline{\Omega})$ such that $\left\|\varphi_{n}-\varphi\right\|_{\infty}\to 0$ as $n\to\infty$ . Then it is immediate to check that $\sup_{t\in[0,1]}|\tilde{\rho}_{t}(\varphi_{n})-\tilde{\rho}_{t}(\varphi)|\to 0$ so that $t\mapsto\tilde{\rho}_{t}(\varphi)$ is continuous, since it is uniform limit of continuous maps $t\mapsto\tilde{\rho}_{t}(\varphi_{n})$ . Finally let $\varphi\in C^{1}_{c}([0,1]\times\overline{\Omega})$ , $0\leq t_{1}\leq t_{2}\leq 1$ and define $\varphi_{\varepsilon}(t,x):=a_{\varepsilon}(t)\varphi(t,x)$ , where $a_{\varepsilon}\in C^{\infty}_{c}((t_{1},t_{2}))$ is such that $0\leq a_{\varepsilon}(t)\leq 1$ , $\lim_{\varepsilon\to 0}a_{\varepsilon}(t)={\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{(t_{1},t_{2})}(t)$ for almost every $t\in[0,1]$ and $\lim_{\varepsilon\to 0}a^{\prime}_{\varepsilon}=\delta_{t_{1}}-\delta_{t_{2}}$ weakly* in $\mathcal{M}([0,1])$ . Testing (8) against $\varphi_{\varepsilon}$ and passing to the limit as $\varepsilon\to 0$ (by continuity of $t\mapsto\int_{\overline{\Omega}}\varphi(t,x)\,d\tilde{\rho}_{t}(x)$ and (9)) yields (10).

Appendix B Time-Dependent Bochner Spaces

In this appendix we assume (H1)–(H3) as in Section 3.1. Definitions of step functions, strong measurability, weak measurability, separably valued and integrability are as in Sections 3.2, 3.3.

B.1. Auxiliary results and proofs of Section 3

Here we state and prove a suitable version of Egoroff’s Theorem, as well as present the proofs of Theorems 3.5, 3.10, 3.13.

Proposition B.1 (Egoroff).

Let $f_{n},f\colon[0,1]\to H$ be strongly measurable and such that, for a.e. $t\in[0,1]$ , $\lim_{n}{\left\|f_{n}(t)-f(t)\right\|}_{H_{t}}=0$ . Then for each fixed $\varepsilon>0$ there exists a Lebesgue measurable set $E\subset[0,1]$ with $|E|<\varepsilon$ and such that $f_{n}\to f$ uniformly in $[0,1]\smallsetminus E$ , that is,

[TABLE]

Proof.

The proof follows by replacing absolute values with the $H_{t}$ norms in the proof of the classic Egoroff Theorem. Indeed, since $f_{n},f\colon[0,1]\to H$ are assumed to be strongly measurable, the map $t\mapsto{\left\|f_{n}(t)-f(t)\right\|}_{H_{t}}$ is Lebesgue measurable (see Remark 3.4). Then the sets $E_{k}(n):=\cup_{m\geq k}\{t\in[0,1]\,\colon\,{\left\|f_{m}(t)-f(t)\right\|}_{H_{t}}\geq 1/n\}$ are measurable for each fixed $n,k\in\mathbb{N}$ . Moreover, for $n$ fixed, we have that $E_{k+1}(n)\subset E_{k}(n)$ and $|E_{k}(n)|\searrow 0$ as $k\to\infty$ , since we are assuming that $f_{n}\to f$ a.e. in $[0,1]$ . Let $\{k_{n}\}$ be an increasing sequence of indices such that $|E_{k_{n}}(n)|<\varepsilon/2^{n}$ . It is immediate to see that the measurable set $E:=\cup_{n=1}^{\infty}E_{k_{n}}(n)$ satisfies (55). ∎

Proof of Theorem 3.5. Part 1. Assume that $f$ is strongly measurable. Hence there exists a sequence $\{f_{n}\}$ of step functions $f_{n}\colon[0,1]\to D$ with $f_{n}=\sum_{j=1}^{N_{n}}{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{n,j}}\varphi_{n,j}$ such that ${\left\|i_{t}f_{n}(t)-f(t)\right\|}_{H_{t}}\to 0$ a.e. in $[0,1]$ . We claim that $f$ is weakly measurable: Indeed fix $\varphi\in D$ and define $\theta(t):={\langle i_{t}\varphi,f(t)\rangle}_{H_{t}}$ , $\theta_{n}(t):={\langle i_{t}\varphi,i_{t}f_{n}(t)\rangle}_{H_{t}}$ for $t\in[0,1]$ . Clearly $\theta_{n}$ is measurable for fixed $n$ , by (H3). Moreover using Cauchy–Schwarz and (H1) yields $|\theta_{n}(t)-\theta(t)|\leq C\left\|\varphi\right\|_{D}{\left\|i_{t}f_{n}(t)-f(t)\right\|}_{H_{t}}$ , so that $\theta_{n}\to\theta$ for a.e. $t\in[0,1]$ , implying that $\theta$ is measurable, and hence $f$ is weakly measurable. We will now show that $f$ is essentially separably valued. By definition $i_{t}f_{n}$ is strongly measurable and $i_{t}f_{n}\to f$ a.e., hence Proposition B.1 implies that for every $n\in\mathbb{N}$ there exists a measurable set $E_{n}\subset[0,1]$ with $|E_{n}|\leq 1/n$ and such that ${\left\|i_{t}f_{n}(t)-f(t)\right\|}_{H_{t}}\to 0$ uniformly on $[0,1]\smallsetminus E_{n}$ . Define the countable set $S:=\cup_{n=1}^{\infty}f_{n}([0,1])\subset D$ . Let $E:=\cap_{n=1}^{\infty}E_{n}$ so that $|E|=0$ . Fix $\varepsilon>0$ and $t\in[0,1]\smallsetminus E$ . Hence there exists an index $n\in\mathbb{N}$ such that $t\in[0,1]\smallsetminus E_{n}$ . By uniform convergence we conclude that ${\left\|i_{t}f_{n}(t)-f(t)\right\|}_{H_{t}}<\varepsilon$ , for sufficiently large $n$ . Therefore Definition 3.2 iii) is satisfied by setting $\varphi:=f_{n}(t)$ .

Part 2. Let $f$ be weakly measurable and essentially separably valued. Let $S=\{\varphi_{n}\}\subset D$ be countable and $E\subset[0,1]$ with $|E|=0$ satisfying Definition 3.2. For $t\in[0,1]$ define $\psi_{n}(t):=1/{\left\|i_{t}\varphi_{n}\right\|}_{H_{t}}$ if $i_{t}\varphi_{n}\neq 0$ and $\psi_{n}(t):=0$ otherwise. Notice that $\psi_{n}$ is Lebesgue measurable for each fixed $n$ , since $t\mapsto{\left\|i_{t}\varphi_{n}\right\|}_{H_{t}}$ is measurable by (H3). We will now show that

[TABLE]

Indeed, the supremum never exceeds ${\left\|f(t)\right\|}_{H_{t}}$ by the Cauchy–Schwarz inequality. Conversely, if $f(t)=0$ the equality is trivial, hence assume $f(t)\neq 0$ . Fix $0<\varepsilon<{\left\|f(t)\right\|}_{H_{t}}/2$ . Since $f$ is essentially separably valued, there exists $\varphi_{n}\in S$ such that ${\left\|i_{t}\varphi_{n}-f(t)\right\|}_{H_{t}}<\varepsilon$ . In particular $i_{t}\varphi_{n}\neq 0$ . Then

[TABLE]

and since $\varepsilon$ is arbitrarily small we conclude. Notice that the map $t\mapsto|{\langle i_{t}\varphi_{n},f(t)\rangle}_{H_{t}}|$ is measurable by weak measurability of $f$ . Thus $t\mapsto\psi_{n}(t)|{\langle i_{t}\varphi_{n},f(t)\rangle}_{H_{t}}|$ is measurable, being product of measurable maps. Since the countable pointwise supremum of measurable functions is measurable, by (56) we conclude that $t\mapsto{\left\|f(t)\right\|}_{H_{t}}$ is measurable. Also the map $\theta_{n}(t):={\left\|f(t)-i_{t}\varphi_{n}\right\|}_{H_{t}}$ is measurable at $n$ fixed, as

[TABLE]

is a sum of measurable functions, where the second element is measurable by weak measurability of $f$ and the third by (H3). Fix $\varepsilon>0$ and define the measurable sets $E_{n}:=\{t\in[0,1]\,\colon\,\theta_{n}(t)<\varepsilon\}$ and the map $g\colon[0,1]\to D$ by setting $g(t):=\varphi_{n}$ if $t\in E_{n}\smallsetminus\cup_{j=1}^{n-1}E_{j}$ for some $n$ , and $g(t)=0$ otherwise. Note that for $t\in[0,1]\setminus E$ there exists some index $n$ such that ${\left\|f(t)-i_{t}\varphi_{n}\right\|}_{H_{t}}<\varepsilon$ . Therefore, by picking the smallest $n$ such that this condition is verified, we have $g(t)=\varphi_{n}$ . Since this is true for each $t\in[0,1]\setminus E$ , this means that ${\left\|f(t)-i_{t}g(t)\right\|}_{H_{t}}<\varepsilon$ a.e. in $[0,1]$ . Hence we can approximate $f$ essentially uniformly by $i_{t}g(t)$ with $g$ countably valued. By choosing $\varepsilon=1/n$ we obtain countably valued functions $g_{n}\colon[0,1]\to D$ such that

[TABLE]

where $|[0,1]\setminus E|=0$ . Note that by definition $g_{n}=\sum_{j=1}^{\infty}{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{n,j}}\varphi_{n,j}$ with $\{E_{n,j}\}_{j\in\mathbb{N}}$ measurable partition of $[0,1]$ . Therefore for every $n\in\mathbb{N}$ , there exists $k_{n}$ such that the set $\cup_{j=1}^{k_{n}}E_{n,j}$ satisfies

[TABLE]

Set $F_{n}:=\cap_{s=n}^{\infty}\cup_{j=1}^{k_{s}}E_{s,j}$ , $F:=\cup_{n=1}^{\infty}F_{n}$ . In this way $|[0,1]\smallsetminus F|=0$ by (58), since $\left|[0,1]\smallsetminus F_{n}\right|\leq\sum_{s=n}^{\infty}\frac{1}{s^{2}}\to 0$ as $n\to\infty$ . Now define step functions $f_{n}\colon[0,1]\to D$ obtained by truncating $g_{n}$ , that is, $f_{n}:=\sum_{j=1}^{k_{n}}{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{n,j}}\varphi_{n,j}$ . If we prove that

[TABLE]

then we conclude that $f$ is strongly measurable, since $|[0,1]\smallsetminus(F\cap E)|=0$ . In order to show (59), fix $\varepsilon>0$ and $t\in F\cap E$ . By using (57) and (H1) we have that for all $n\in\mathbb{N}$

[TABLE]

Since $t\in F$ , by definition there exists an index $N_{t}$ such that $t\in F_{N_{t}}$ . Hence $t\in\cup_{j=1}^{k_{n}}E_{n,j}$ for every $n\geq N_{t}$ , so that $\sum_{j=k_{n}+1}^{\infty}{\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{n,j}}(t)\,\varphi_{n,j}=0$ for each $n\geq N_{t}$ . Set $n_{\varepsilon,t}:=\max\{N_{t},1/\varepsilon\}$ . From (60) we have ${\left\|f(t)-i_{t}f_{n}(t)\right\|}_{H_{t}}<\varepsilon$ for every $n\geq n_{\varepsilon,t}$ , implying (59).

Proof of Theorem 3.10. Since $f_{n}\to f$ a.e., the map $f$ in strongly measurable and $\theta_{n}(t):={\left\|f_{n}(t)-f(t)\right\|}_{H_{t}}$ is Lebesgue measurable. By assumption we have that $\theta_{n}\to 0$ and $\theta_{n}\leq 2g$ a.e. in $[0,1]$ . Therefore, by the classic dominated convergence theorem, each $\theta_{n}$ is integrable and $f_{n}\to f$ strongly in $L^{1}([0,1];H)$ . To conclude integrability for $f$ it is sufficient to employ triangle inequality, integrability of $\theta_{n}$ and Theorem 3.9.

Proof of Theorem 3.13. The fact that $\left\|\cdot\right\|_{L^{p}}$ is a norm follows immediately from the classic case as well as the fact that the map $t\mapsto{\left\|f(t)\right\|}_{H_{t}}^{p}$ is measurable for each $p\geq 1$ , when $f$ is assumed to be strongly measurable (see Remark 3.4). Moreover ${\langle\cdot,\cdot\rangle}_{L^{2}}$ is an inner product, since the spaces $H_{t}$ are Hilbert. In order to show completeness, it is sufficient to follow the lines of the proof of the classic Riesz–Fischer theorem. Let $1\leq p<\infty$ and let $\{f_{n}\}$ be a Cauchy sequence in $L^{p}([0,1];H)$ . In any normed linear space, a Cauchy sequence having a convergent subsequence converges to the same limit. Therefore, up to extracting a subsequence, we can assume that

[TABLE]

For every $n\in\mathbb{N}$ define measurable sets $E_{n}:=\{t\in[0,1]\,\colon\,{\left\|f_{n+1}(t)-f_{n}(t)\right\|}_{H_{t}}\geq 1/n^{2}\}$ , so that ${\mathchoice{\raisebox{0.0pt}{$ \displaystyle\chi $}}{\raisebox{0.0pt}{$ \textstyle\chi $}}{\raisebox{0.0pt}{$ \scriptstyle\chi $}}{\raisebox{0.0pt}{$ \scriptscriptstyle\chi $}}}_{E_{n}}(t)/n^{2}\leq{\left\|f_{n+1}(t)-f_{n}(t)\right\|}_{H_{t}}$ a.e. in $[0,1]$ and $|E_{n}|<n^{2p}/2^{np}$ by (61). In particular one has $\sum_{n}|E_{n}|<\sum_{n}n^{2p}/2^{np}<\infty$ . Define $F_{n}:=\cup_{m\geq n}E_{m}$ , so that $\{F_{n}\}$ is a nested sequence of measurable sets, with $|F_{n}|\leq\sum_{m\geq n}m^{2p}/2^{mp}\to 0$ as $n\to\infty$ . Finally set $F:=\cap_{n}F_{n}$ , which satisfies $|F|=0$ . By definition, if $t\in[0,1]\smallsetminus F$ , then ${\left\|f_{n+1}(t)-f_{n}(t)\right\|}_{H_{t}}<n^{-2}$ for $n$ sufficiently large. Hence for $t\in[0,1]\smallsetminus F$ , $m>n$ and $n$ sufficiently large one has ${\left\|f_{m}(t)-f_{n}(t)\right\|}_{H_{t}}\leq\sum_{j\geq n}{\left\|f_{j+1}(t)-f_{j}(t)\right\|}_{H_{t}}<\sum_{j\geq n}1/j^{2}$ , and since $\sum_{j\geq n}1/j^{2}\to 0$ as $n\to\infty$ , we conclude that $\{f_{n}(t)\}$ is a Cauchy sequence in $H_{t}$ . For $t\in[0,1]\smallsetminus F$ denote by $f(t)\in H_{t}$ the strong limit of $\{f_{n}(t)\}$ , which exists since $H_{t}$ is complete. For $t\in F$ set $f(t)=0$ . This defines a map $f\colon[0,1]\to H$ , which is strongly measurable since it is the a.e. pointwise limit of a sequence of strongly measurable maps (see Remark 3.4). Moreover, by the a.e. pointwise convergence, we also have that ${\left\|f_{n}(t)\right\|}_{H_{t}}\to{\left\|f(t)\right\|}_{H_{t}}$ as $n\to\infty$ for a.e. $t$ . Since the maps $t\mapsto{\left\|f_{n}(t)\right\|}_{H_{t}},t\mapsto{\left\|f(t)\right\|}_{H_{t}}$ are measurable, we can apply Fatou’s Lemma and conclude that $\int_{0}^{1}{\left\|f(t)\right\|}_{H_{t}}^{p}\,dt\leq\liminf_{n}\int_{0}^{1}{\left\|f_{n}(t)\right\|}_{H_{t}}^{p}\,dt$ , which is bounded since $\{f_{n}\}$ is a Cauchy sequence in $L^{p}([0,1];H)$ . Hence $f\in L^{p}([0,1];H)$ . Finally, one more application of Fatou’s Lemma combined with (61) yields $\left\|f_{n}-f\right\|_{L^{p}}\to 0$ as $n\to\infty$ .

B.2. Comparison with classic Bochner theory

In this section we will investigate integrability properties for $i_{t}^{*}f\colon[0,1]\to D^{*}$ when $f\in L^{1}([0,1];H)$ . Since the codomain of $i_{t}^{*}f$ is the fixed space $D^{*}$ , it makes sense to check whether $i_{t}^{*}f$ is integrable in a classic sense. Specifically, in Proposition B.2, we will see that $i_{t}^{*}f$ is always Gelfand integrable. On the other hand, $i_{t}^{*}f$ is not always Bochner integrable, as we show in Example B.3. The main impediment is that $i_{t}^{*}f$ is not strongly measurable in general. Finally in Proposition B.4 we will give sufficient conditions under which $i_{t}^{*}f$ is Bochner integrable.

Proposition B.2.

Assume that $f\in L^{1}([0,1];H)$ . Then $i_{t}^{*}f$ is Gelfand integrable in $D^{*}$ , that is, for each measurable set $E\subset[0,1]$ there exists an element $I_{E}(i_{t}^{*}f)\in D^{*}$ such that

[TABLE]

Proof.

Let $\varphi\in D$ . By duality one has ${\langle i_{t}^{*}f(t),\varphi\rangle}_{D^{*},D}={\langle f(t),i_{t}\varphi\rangle}_{H_{t}}$ . Therefore $i_{t}^{*}f$ is weak* measurable since $f\colon[0,1]\to H$ is weak measurable. By (H1),

[TABLE]

since $f$ is integrable. This shows that $t\mapsto{\langle i_{t}^{*}f(t),\varphi\rangle}_{D^{*},D}$ belongs to $L^{1}([0,1])$ for each $\varphi\in D$ . Hence $i_{t}^{*}f$ is Gelfand integrable by Theorem 11.52 in [3], and (62) holds. ∎

Example B.3 (Radial sampling).

Let $\Omega:=B_{1}(0)=\{x\in\mathbb{R}^{2}\,\colon\,|x|<1\}$ and for $t\in[0,1]$ define the lines through the origin $S_{t}:=\{(\cos(\pi t)s,\sin(\pi t)s)\,\colon\,|s|<1\}$ , so that $S_{t}\subset\Omega$ . Define $D=C_{0}(\Omega)$ equipped with the supremum norm. Hence $D^{*}=\mathcal{M}(\Omega)$ . Define $H_{t}:=L^{2}_{\sigma_{t}}(S_{t})$ with inner product ${\langle h_{1},h_{2}\rangle}_{H_{t}}:=\int_{S_{t}}h_{1}h_{2}\,d\sigma_{t}$ , $\sigma_{t}:=\mathcal{H}^{1}\mathbin{\vrule height=6.88889pt,depth=0.0pt,width=0.55974pt\vrule height=0.55974pt,depth=0.0pt,width=5.59721pt}S_{t}$ where $\mathcal{H}^{1}$ is the 1-dimensional Hausdorff measure. Finally define $i_{t}\colon D\to H_{t}$ by $i_{t}\varphi=\varphi|_{S_{t}}$ . It is straightforward to check that (H1)-(H3) are satisfied, so that we can consider the space $L^{2}([0,1];H)$ defined as in (14).

We will now construct a map $f$ belonging to $L^{2}([0,1];H)$ , compute the Gelfand integral of $i_{t}^{*}f\colon[0,1]\to D^{*}$ and show that $i_{t}^{*}f$ is not Bochner integrable. To this end, notice that for a map $h\colon\Omega\to\mathbb{R}$ such that $h/|x|\in L^{1}(\Omega)$ we have that

[TABLE]

Note that (63) is an easy consequence of the classical coarea formula [30, Thm 3.11], and its proof is left to the reader. Let now $\tilde{f}\colon\Omega\to\mathbb{R}$ be such that $\tilde{f}\not\equiv 0$ and $\tilde{f}/|x|\in L^{2}(\Omega)$ . By applying (63) to $|\tilde{f}|^{2}$ and by the assumptions on $\tilde{f}$ we have that $\tilde{f}|_{S_{t}}$ belongs to $H_{t}$ for a.e. $t\in[0,1]$ . Define $f\colon[0,1]\to H$ by $f(t):=\tilde{f}|_{S_{t}}$ . Notice that $f$ is strongly measurable, since $\tilde{f}$ can be approximated in $\Omega$ by $C_{0}$ functions. Moreover by (63) we infer

[TABLE]

which is finite by assumption on $\tilde{f}$ . Hence $f$ is integrable by Theorem 3.9, and it belongs to $L^{2}([0,1];H)$ . The Gelfand integral of $i_{t}^{*}f$ , which exists by Proposition B.2, is given by

[TABLE]

The above follows immediately by applying (63) to $\varphi\tilde{f}$ with $\varphi\in D$ . However $i_{t}^{*}f$ is not Bochner integrable, since it is not strongly measurable in the classic sense [26, Ch II]: For every $E\subset[0,1]$ with $|E|=0$ we have that the set $i_{t}^{*}f([0,1]\smallsetminus E)$ is not norm separable in $\mathcal{M}(\Omega)$ . Indeed, it is easy to show that for a.e. $w\neq t$

[TABLE]

Since $\tilde{f}\not\equiv 0$ , we infer that $i_{t}^{*}f(F)$ is a discrete set for any $F\subset[0,1]$ with $|F|>0$ . Therefore $i_{t}^{*}f(F)$ is norm separable if and only if $F$ is countable, which is never the case. Hence $i_{t}^{*}f$ is not essentially separably valued, and the classic Pettis Theorem [26, Ch II.1, Thm 2] implies that $i_{t}^{*}f$ is not strongly measurable and hence not Bochner integrable.

Proposition B.4.

Assume that $i_{t}^{*}i_{t}(S)\subset D^{*}$ is essentially norm separable for each countable set $S\subset D$ and that $D$ is reflexive. If $f\in L^{1}([0,1];H)$ then $i_{t}^{*}f\colon[0,1]\to D^{*}$ is Bochner integrable.

Proof.

Since $f\colon[0,1]\to H$ is strongly measurable, it is also weakly measurable and essentially separably valued by Theorem 3.5. We start by showing that weak measurability for $f$ implies weak measurability for $i_{t}^{*}f$ in the classic sense. Indeed by reflexivity the canonical injection $j\colon D\to D^{**}$ is also surjective. Therefore for each $\varphi^{**}\in D^{**}$ there exists a unique $\varphi\in D$ with $j(\varphi)=\varphi^{**}$ and we have ${\langle\varphi^{**},i^{*}_{t}f(t)\rangle}_{D^{**},D^{*}}={\langle f(t),i_{t}\varphi\rangle}_{H_{t}}$ , which is measurable since $f$ is weakly measurable. Now let $S=\{\varphi_{n}\}\subset D$ and $E\subset[0,1]$ measurable with $|E|=0$ and such that Definition 3.2 is satisfied. Therefore for every $\varepsilon>0$ and $t\in[0,1]\smallsetminus E$ there exists $\varphi_{n}\in S$ such that ${\left\|i_{t}\varphi_{n}-f(t)\right\|}_{H_{t}}<\varepsilon/C$ . Therefore by (H1) we can estimate $\left\|i_{t}^{*}i_{t}\varphi_{n}-i_{t}^{*}f(t)\right\|_{D^{*}}\leq C\,{\left\|i_{t}\varphi_{n}-f(t)\right\|}_{H_{t}}<\varepsilon$ . Hence the points of $i_{t}^{*}f([0,1]\smallsetminus E)$ are arbitrarily close to $\{i_{t}^{*}i_{t}\varphi_{n}\,,\,n\in\mathbb{N},\,t\in[0,1]\smallsetminus E\}$ . Since $i_{t}^{*}i_{t}(S)$ is assumed to be essentially separable in $D^{*}$ , there exists a measurable set $F\subset[0,1]$ with $|F|=0$ such that $\{i_{t}^{*}i_{t}\varphi_{n}\,,\,n\in\mathbb{N},\,t\in[0,1]\smallsetminus F\}$ is separable in $D^{*}$ . By defining $\tilde{E}:=E\cup F$ we obtain that also $i_{t}^{*}f([0,1]\smallsetminus\tilde{E})$ is norm separable, hence $i_{t}^{*}f$ is essentially separably valued in the classic sense. By the classic Pettis Theorem [26, Ch II.1, Thm 2] we conclude that $i_{t}^{*}f$ is strongly measurable. Finally (H2) implies that $\int_{0}^{1}\left\|i_{t}^{*}f(t)\right\|_{D^{*}}\,dt\leq C\int_{0}^{1}{\left\|f(t)\right\|}_{H_{t}}\,dt<\infty$ . By [26, Ch II.2, Thm 2], it follows that $i_{t}^{*}f$ is Bochner integrable. ∎

Bibliography66

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. Aharon, M. Elad, and A. Bruckstein. K 𝐾 K -SVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process. , 54(11):4311–4322, 2006.
2[2] G. Alberti, S. Bianchini, and G. Crippa. A uniqueness result for the continuity equation in two dimensions. J. Eur. Math. Soc. , 16(2):201–234, 2014.
3[3] C. D. Aliprantis and K. Border. Infinite Dimensional Analysis . Springer-Verlag Berlin Heidelberg, 2006.
4[4] A. Alphonse, C. Elliott, and B. Stinner. An abstract framework for parabolic PD Es on evolving spaces. Port. Math. , 72(1):1–46, 2015.
5[5] L. Ambrosio, N. Fusco, and D. Pallara. Functions of Bounded Variation and Free Discontinuity Problems . Oxford Science Publications. Clarendon Press, 2000.
6[6] L. Ambrosio, N. Gigli, and G. Savaré. Gradient Flows: In Metric Spaces and in the Space of Probability Measures . Lectures in Mathematics. ETH Zürich. Birkhäuser Basel, 2006.
7[7] J.-D. Benamou. Numerical resolution of an “unbalanced” mass transport problem. ESAIM Math. Model. Num. , 37(5):851–868, 2003.
8[8] J.-D. Benamou and Y. Brenier. A computational fluid mechanics solution to the Monge–Kantorovich mass transfer problem. Numer. Math. , 84(3):375–393, 2000.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

An optimal transport approach for solving dynamic inverse problems in spaces of measures

Abstract.

Contents

1. Introduction

1.1. Outline of the mathematical setting and main theoretical results

Assumption 1.1**.**

Assumption 1.2**.**

Theorem 1.3**.**

Theorem 1.4** (Regularization).**

1.2. Application to dynamic MRI

2. Dynamic optimal transport

2.1. Continuity equation

Definition 2.1**.**

Proposition 2.2**.**

Proof.

Definition 2.3**.**

Proposition 2.4** (Continuous representative).**

2.2. Optimal transport energy

Definition 2.5** (Transport energy).**

Proposition 2.6**.**

3. Time dependent Bochner spaces

3.1. Functional setting

3.2. Measurability in time dependent spaces

Definition 3.1** (Step function).**

Definition 3.2** (Measurability).**

Remark 3.3**.**

Remark 3.4**.**

Theorem 3.5** (Pettis).**

Corollary 3.6**.**

Proposition 3.7** (Separable case).**

Proof.

3.3. Integration and LpL^{p}Lp spaces

Definition 3.8** (Integrability).**

Theorem 3.9** (Characterization of integrability).**

Proof.

Theorem 3.10** (Dominated convergence).**

Definition 3.11** (LpL^{p}Lp space).**

Remark 3.12**.**

Theorem 3.13**.**

Remark 3.14** (p=∞p=\inftyp=∞).**

Remark 3.15** (Dual spaces).**

Example 3.16** (Narrowly continuous curves).**

4. Regularization of dynamic inverse problems

Definition 4.1** (Regularized problem).**

4.1. Well-definition

Lemma 4.2**.**

Proposition 4.3**.**

Proof of Lemma 4.2.

Proof of Proposition 4.3.

4.2. Existence of minimizers

Theorem 4.4**.**

Lemma 4.5** (Compactness for JJJ).**

Proof.

Lemma 4.6** (Lower semicontinuity for JJJ).**

Proof.

Proof of Theorem 4.4.

4.3. Regularization properties

Theorem 4.7** (Stability).**

Proof.

Proposition 4.8**.**

Proof.

Definition 4.9** (Minimal energy solution).**

Theorem 4.10** (Convergence for vanishing noise level).**

Proof.

5. Application to dynamic undersampled MRI

Theorem 5.1**.**

Example 5.2** (Continuous sampling).**

Example 5.3** (Compressed-sensing sampling).**

Lemma 5.4**.**

Proof.

Proof of Theorem 5.1.

6. Conclusions and perspectives

Acknowledgments

Appendix A Measure theory

Assumption 1.1.

Assumption 1.2.

Theorem 1.3.

Theorem 1.4 (Regularization).

Definition 2.1.

Proposition 2.2.

Definition 2.3.

Proposition 2.4 (Continuous representative).

Definition 2.5 (Transport energy).

Proposition 2.6.

Definition 3.1 (Step function).

Definition 3.2 (Measurability).

Remark 3.3.

Remark 3.4.

Theorem 3.5 (Pettis).

Corollary 3.6.

Proposition 3.7 (Separable case).

3.3. Integration and $L^{p}$ spaces

Definition 3.8 (Integrability).

Theorem 3.9 (Characterization of integrability).

Theorem 3.10 (Dominated convergence).

Definition 3.11 ( $L^{p}$ space).

Remark 3.12.

Theorem 3.13.

Remark 3.14 ( $p=\infty$ ).

Remark 3.15 (Dual spaces).

Example 3.16 (Narrowly continuous curves).

Definition 4.1 (Regularized problem).

Lemma 4.2.

Proposition 4.3.

Theorem 4.4.

Lemma 4.5 (Compactness for $J$ ).

Lemma 4.6 (Lower semicontinuity for $J$ ).

Theorem 4.7 (Stability).

Proposition 4.8.

Definition 4.9 (Minimal energy solution).

Theorem 4.10 (Convergence for vanishing noise level).

Theorem 5.1.

Example 5.2 (Continuous sampling).

Example 5.3 (Compressed-sensing sampling).

Lemma 5.4.

Theorem A.1 (Disintegration).

Lemma A.2.

Proposition A.3.

Proposition A.4 (A refined version of Ascoli–Arzelà’s Theorem).

Proposition B.1 (Egoroff).

Proposition B.2.

Example B.3 (Radial sampling).

Proposition B.4.