Applications of microlocal analysis to inverse problems
Mikko Salo

TL;DR
This paper discusses how microlocal analysis techniques are applied to solve inverse problems, providing insights and methods for improving problem-solving strategies in this area.
Contribution
It introduces new applications of microlocal analysis specifically tailored for inverse problems, expanding the toolkit for researchers.
Findings
Enhanced understanding of wavefront set analysis in inverse problems
Development of new microlocal techniques for imaging reconstruction
Improved stability estimates for inverse problem solutions
Abstract
These are lecture notes for a minicourse on applications of microlocal analysis in inverse problems, given in Helsinki and Shanghai in June 2019.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Applications of microlocal analysis to inverse problems
Mikko Salo
Department of Mathematics and Statistics, University of Jyväskylä
Abstract.
These are lecture notes for a minicourse on applications of microlocal analysis in inverse problems, to be given in Helsinki and Shanghai in June 2019.
Preface
Microlocal analysis originated in the 1950s, and by now it is a substantial mathematical theory with many different facets and applications. One might view microlocal analysis as
- •
a kind of ”variable coefficient Fourier analysis” for solving variable coefficient PDEs; or
- •
as a theory of pseudodifferential operators (DOs) and Fourier integral operators (FIOs); or
- •
as a phase space (or time-frequency) approach to studying functions, operators and their singularities (wave front sets).
DOs were introduced by Kohn and Nirenberg [KN65], and FIOs and wave front sets were studied systematically by Hörmander [Hö71]. Much of the theory up to the early 1980s is summarized in the four volume treatise of Hörmander [Hö85]. There are remarkable applications of microlocal analysis and related ideas in many fields of mathematics. Classical examples include spectral theory and the Atiyah-Singer index theorem, and more recent examples include scattering theory, behavior of chaotic systems, inverse problems, and general relativity.
In this minicourse we will try to describe some classical applications of microlocal analysis to inverse problems, together with a very rough non-technical overview of relevant parts of microlocal analysis. In a nutshell, here are a few typical applications:
Computed tomography / X-ray transform: the X-ray transform is an FIO, and under certain conditions its normal operator is an elliptic DO. Microlocal analysis can be used to predict which sharp features (singularities) of the image can be reconstructed in a stable way from limited data measurements. Microlocal analysis is also a powerful tool in the study of geodesic X-ray transforms related to seismic imaging applications. 2. 2.
Calderón problem / Electrical Impedance Tomography: the boundary measurement map (Dirichlet-to-Neumann map) is a DO, and the boundary values of the conductivity as well as its derivatives can be computed from the symbol of this DO. 3. 3.
Gel’fand problem / seismic imaging: the boundary measurement operator (hyperbolic Dirichlet-to-Neumann map) is an FIO, and the scattering relation of the sound speed as well as certain X-ray transforms of the coefficients can be computed from the canonical relation and the symbol of this FIO.
These notes are organized as follows. In Section 1, we will motivate the theory of DOs and discuss some of its properties without giving proofs. Section 2 will continue with a brief introduction to wave front sets and FIOs (again with no proofs). The rest of the notes is concerned with applications to inverse problems. Section 3 considers the Radon transform in and its normal operator, and describes what kind of information about the singularities of can be stably recovered from the Radon transform. Sections 4 and 5 discuss the Gel’fand and Calderón problems, and prove results related to recovering X-ray transforms or boundary determination. The treatment is motivated by DO and FIO theory, but we give direct and (in principle) elementary proofs based on a quasimode constructions. The results discussed in these notes are classical. For more recent results, we refer to the surveys [IM19, KQ15, La18, Uh14].
Notation
We will use multi-index notation. Let be the set natural numbers. Then consists of all -tuples where the are nonnegative integers. Such an -tuple is called a multi-index. We write and for . For partial derivatives, we will write
[TABLE]
If is a bounded domain with boundary, we denote by the set of infinitely differentiable functions in whose all derivatives extend continuously to . The space consist of functions having compact support in . The standard based Sobolev spaces are denoted by with norm , with denoting the Fourier transform. We also write . The notation means that for some uniform (with respect to the relevant parameters) constant . In general, all coefficients, boundaries etc are assumed to be for ease of presentation.
1. Pseudodifferential operators
In this minicourse we will try to give a very brief idea of the different points of view to microlocal analysis mentioned in the introduction (and repeated below), as
- (1)
a kind of ”variable coefficient Fourier analysis” for solving variable coefficient PDEs; or 2. (2)
a theory of DOs and FIOs; or 3. (3)
a phase space (or time-frequency) approach to studying functions, operators and their singularities (wave front sets).
In this section we will discuss (1) and (2) in the context of DOs (we will continue with (2) and (3) in the context of FIOs in Section 2). The treatment is mostly formal and we will give no proofs whatsoever. A complete reference for the results in this section is [Hö85, Section 18.1].
1.1. Constant coefficient PDEs
We recall the following facts about the Fourier transform (valid for sufficiently nice functions):
If is a function in , its Fourier transform is the function
[TABLE] 2. 2.
The Fourier transform converts derivatives to polynomials (this is why it is useful for solving PDEs):
[TABLE] 3. 3.
A function can be recovered from by the Fourier inversion formula , where
[TABLE]
is the inverse Fourier transform.
As a motivating example, let us solve formally (i.e. without worrying about how to precisely justify each step) the equation
[TABLE]
This is a constant coefficient PDE, and such equations can be studied with the help of the Fourier transform. We formally compute
[TABLE]
The same formal argument applies to a general constant coefficient PDE
[TABLE]
where . Then where is the symbol of . Moreover, one has
[TABLE]
The argument leading to (1.1) gives a formal solution of :
[TABLE]
Thus formally can be solved by dividing by the symbol on the Fourier side. Of course, to make this precise one would need to show that the division by (which may have zeros) is somehow justified.
1.2. Variable coefficient PDEs
We now try to use a similar idea to solve the variable coefficient PDE
[TABLE]
where and for all multi-indices . Since the coefficients depend on , Fourier transforming the equation is not immediately helpful. However, we can compute an analogue of (1.2):
[TABLE]
where
[TABLE]
is the (full) symbol of .
Now, we could try to obtain a solution to in by dividing by the symbol as in (1.3):
[TABLE]
Again, this is only formal since the division by needs to be justified. However, this can be done in a certain sense if is elliptic:
Definition**.**
The principal symbol (i.e. the part containing the highest order derivatives) of the differential operator is
[TABLE]
We say that is elliptic if its principal symbol is nonvanishing for .
A basic result of microlocal analysis states that the function
[TABLE]
with
[TABLE]
where is a cutoff with in a sufficiently large neighborhood of (so that does not vanish outside this neighborhood), is an approximate solution of in the sense that
[TABLE]
where is one derivative smoother than . Moreover, it is possible to construct an approximate solution so that
[TABLE]
1.3. Pseudodifferential operators
In analogy with the formula (1.4), a pseudodifferential operator (DO) is an operator of the form
[TABLE]
where is a symbol with certain properties. The most standard symbol class is defined as follows:
Definition**.**
The symbol class consists of functions such that for any there is with
[TABLE]
If , the corresponding DO is defined by (1.7). We denote by the set of DOs corresponding to .
Note that symbols in behave roughly like polynomials of order in the -variable. In particular, the symbols in (1.5) belong to and the corresponding differential operators belong to . Moreover, if is elliptic, then the symbol as in (1.6) belongs to . Thus the class of DOs is large enough to include differential operators as well as approximate inverses of elliptic operators. Also normal operators of the X-ray transform or Radon transform in are DOs.
Remark 1.1** (Homogeneous symbols).**
We saw in Section 1.1 that the elliptic operator has the inverse
[TABLE]
The symbol is not in , since it is not smooth near [math]. However, one often thinks of as a DO by writing
[TABLE]
where satisfies near [math]. Now is a DO in , since , and is smoothing in the sense that it maps any function into a function (at least if ).
In general, in DO theory smoothing operators are considered to be negligible (since at least they do not introduce new singularities), and many computations in DO calculus are made only modulo smoothing error terms. In this sense one often views as a DO by identifying it with . The same kind of identification is done for operators whose symbol is homogenous of some order in . More generally one can consider polyhomogeneous symbols having the form
[TABLE]
where each is homogeneous of order in , and is a certain asympotic summation. Corresponding DOs are called classical DOs.
It is very important that one can compute with DOs in much the same way as with differential operators. One often says that DOs have a calculus. The following theorem lists typical rules of computation (it is instructive to think first why such rules are valid for differential operators):
Theorem 1.2** (DO calculus).**
**
- (a)
(Principal symbol) There is a one-to-one correspondence between operators in and (full) symbols in , and each operator has a well defined principal symbol . The principal symbol may be computed by testing against highly oscillatory functions111This is valid if is a classical DO.:
[TABLE] 2. (b)
(Composition) If and , then and ; 3. (c)
(Sobolev mapping properties) Each is a bounded operator for any ; 4. (d)
(Elliptic operators have approximate inverses) If is elliptic, there is so that and where , i.e. are smoothing (they map any function to for any , hence also to by Sobolev embedding).
The above properties are valid in the standard DO calculus in . However, motivated by different applications, DOs have been considered in various other settings. Each of these settings comes with an associated calculus whose rules of computation are similar but adapted to the situation at hand. Examples of different settings for DO calculus include
- (1)
open sets in (local setting); 2. (2)
compact manifolds without boundary, possibly acting on sections of vector bundles; 3. (3)
compact manifolds with boundary (transmission condition / Boutet de Monvel calculus); 4. (4)
non-compact manifolds (e.g. Melrose scattering calculus); and 5. (5)
operators with a small or large parameter (semiclassical calculus).
2. Wave front sets and Fourier integral operators
For a reference to wave front sets, see [Hö85, Chapter 8]. Sobolev wave front sets are considered in [Hö85, Section 18.1]. FIOs are discussed in [Hö85, Chapter 25].
2.1. The role of singularities
We first discuss the singular support of , which consists of those points such that is not a smooth function in any neighborhood of . We also consider the Sobolev singular support, which also measures the ”strength” of the singularity (in the Sobolev scale).
Definition** (Singular support).**
We say that a function or distribution is (resp. ) near if there is with near such that is in (resp. in ). We define
[TABLE]
Example 2.1**.**
Let be bounded domains with boundary in so that for , and define
[TABLE]
where are constants, and is the characteristic function of . Then
[TABLE]
since for , but
[TABLE]
since is not near any boundary point. Thus in this case the singularities of are exactly at the points where has a jump discontinuity, and their strength is precisely . Knowing the singularities of can already be useful in applications. For instance, if represents some internal medium properties in medical imaging, the singularities of could determine the location of interfaces between different tissues. On the other hand, if represents an image, then the singularities in some sense determine the ”sharp features” of the image.
Next we discuss the wave front set which is a more refined notion of a singularity. For example, if is the characteristic function of a bounded strictly convex domain and if , one could think that is in some sense smooth in tangential directions at (since restricted to a tangent hyperplane is identically zero, except possibly at ), but that is not smooth in normal directions at since in these directions there is a jump. The wave front set is a subset of , the cotangent space with the zero section removed:
[TABLE]
Definition** (Wave front set).**
Let be a distribution in . We say that is (microlocally) (resp. ) near if there exist with near and so that near and is homogeneous of degree [math], such that
[TABLE]
(resp. ). The wave front set (resp. wave front set ) consists of those points where is not microlocally (resp. ).
Example 2.2**.**
The wave front set of the function in Example 2.1 is
[TABLE]
where is the conormal bundle of ,
[TABLE]
The wave front set describes singularities more precisely than the singular support, since one always has
[TABLE]
where is the projection to -space.
It is an important fact that applying a DO to a function or distribution never creates new singularities:
Theorem 2.3** (Pseudolocal/microlocal property of DOs).**
Any has the pseudolocal property
[TABLE]
and the microlocal property
[TABLE]
Elliptic operators are those that completely preserve singularities:
Theorem 2.4**.**
(Elliptic regularity) Let be elliptic. Then, for any ,
[TABLE]
Thus any solution of is singular precisely at those points where is singular. There are corresponding statements for Sobolev singularities.
Proof.
First note that by Theorem 2.3,
[TABLE]
Conversely, since is elliptic, by Theorem 1.2(d) there is so that
[TABLE]
Thus for any one has
[TABLE]
Since is smoothing, , which implies that modulo . Thus it follows that
[TABLE]
Thus . The claim for singular supports follows by (2.1). ∎
2.2. Fourier integral operators
We have seen in Section 1.3 that the class of pseudodifferential operators includes approximate inverses of elliptic operators. In order to handle approximate inverses for hyperbolic and transport equations, it is required to work with a larger class of operators.
Motivation**.**
Consider the initial value problem for the wave equation,
[TABLE]
This is again a constant coefficient PDE, and we will solve this formally by taking the Fourier transform in space,
[TABLE]
After taking Fourier transforms in space, the above equation becomes
[TABLE]
For each fixed this is an ODE in , and the solution is
[TABLE]
Taking inverse Fourier transforms in space, we obtain
[TABLE]
Generalizing (2.2), we can consider operators of the form
[TABLE]
where is a symbol (for instance in ), and is a real valued phase function. Such operators are examples of Fourier integral operators (more precisely, FIOs whose canonical relation is locally the graph of a canonical transformation, see [Hö85, Section 25.3]). For DOs the phase function is always , but for FIOs the phase function can be quite general, though it is usually required to be homogeneous of degree in , and to satisfy the non-degeneracy condition .
We will not go into precise definitions, but only remark that the class of FIOs includes pseudodifferential operators as well as approximate inverses of hyperbolic and transport operators (or more generally real principal type operators). There is a calculus for FIOs, analogous to the pseudodifferential calculus, under certain conditions in various settings. An important property of FIOs is that they, unlike pseudodifferential operators, can move singularities. This aspect will be discussed next.
2.3. Propagation of singularities
Example 2.5**.**
Let be fixed, and consider the operators from (2.2),
[TABLE]
Then
[TABLE]
Using FIO theory, since the phase functions are , it follows that
[TABLE]
where is the canonical transformation (i.e. diffeomorphism of that preserves the symplectic structure) given by
[TABLE]
This means that the FIO takes a singularity of the initial data and moves it along the line through in direction to . Thus singularities of solutions of the wave equation propagate along straight lines with constant speed one.
Remark 2.6**.**
In general, any FIO has an associated canonical relation that describes what the FIO does to singularities. The canonical relation of the FIO defined in (2.3) is (see [Hö85, Section 25.3])
[TABLE]
and moves singularities according to the rule
[TABLE]
where
[TABLE]
Using these formulas, it is easy to check that the canonical relation of in Example 2.5 is the graph of in the sense that
[TABLE]
and one indeed has .
There is a far reaching extension of Example 2.5, which shows that the singularities of a solution of propagate along certain curves in phase space (so called null bicharacteristic curves) as long as has real valued principal symbol.
Theorem 2.7** (Propagation of singularities).**
Let have real principal symbol that is homogeneous of degree in . If
[TABLE]
then is contained in the characteristic set . Moreover, if , then the whole null bicharacteristic curve through is in , where
[TABLE]
Example 2.8**.**
We compute the null bicharacteristic curves for the wave operator . The principal symbol of is
[TABLE]
The characteristic set is
[TABLE]
which consists of light-like cotangent vectors on . The equations for the null bicharacteristic curves are
[TABLE]
Thus, if , then the null bicharacteristic curve through is
[TABLE]
The result of Example 2.5 may thus be interpreted so that singularities of solutions of the wave equation propagate along null bicharacteristic curves for the wave operator.
3. The Radon transform in the plane
In this section we outline some applications of microlocal analysis to the study of the Radon transform in the plane. Similar ideas apply to X-ray and Radon transforms in higher dimensions and Riemannian manifolds as well. The microlocal approach to Radon transforms was introduced by Guillemin [Gu75]. We refer to [Qu06], [KQ15] and references therein for a more detailed treatment of the material in this section.
3.1. Basic properties of the Radon transform
The X-ray transform of a function in encodes the integrals of over all straight lines, whereas the Radon transform encodes the integrals of over -dimensional planes. We will focus on the case , where the two transforms coincide. There are many ways to parametrize the set of lines in . We will parametrize lines by their direction vector and distance from the origin.
Definition**.**
If , the Radon transform of is the function
[TABLE]
Here is the vector in obtained by rotating counterclockwise by .
There is a well-known relation between and the Fourier transform . We denote by the Fourier transform of with respect to .
Theorem 3.1**.**
(Fourier slice theorem)
[TABLE]
Proof.
Parametrizing by , we have
[TABLE]
This result gives the first proof of injectivity of the Radon transform:
Corollary 3.2**.**
If is such that , then .
Proof.
If , then by Theorem 3.1 and consequently . ∎
To obtain a different inversion method, and for later purposes, we will consider the adjoint of . The formal adjoint of is the backprojection operator222The formula for is obtained as follows: if , one has
[TABLE]
The following result shows that the normal operator is a classical DO of order in , and also gives an inversion formula.
Theorem 3.3**.**
(Normal operator) One has
[TABLE]
and can be recovered from by the formula
[TABLE]
Remark 3.4**.**
Above we have written, for ,
[TABLE]
The notation is also used.
Proof.
The proof is based on computing using the Parseval identity, Fourier slice theorem, symmetry and polar coordinates:
[TABLE]
The same argument, based on computing instead of , leads to the famous filtered backprojection (FBP) inversion formula:
[TABLE]
where . This formula is efficient to implement and gives good reconstructions when one has complete X-ray data and relatively small noise, and hence FBP (together with its variants) has been commonly used in X-ray CT scanners.
However, if one is mainly interested in the singularities (i.e. jumps or sharp features) of the image, it is possible to use the even simpler backprojection method: just apply the backprojection operator to the data . Since is an elliptic DO, Theorem 2.4 guarantees that the singularities are recovered:
[TABLE]
Moreover, since is a DO of order , hence smoothing of order , one expects that gives a slightly blurred version of where the main singularities should still be visible.
3.2. Visible singularities
There are various imaging situations where complete X-ray data (i.e. the function for all and ) is not available. This is the case for limited angle tomography (e.g. in luggage scanners at airports, or dental applications), region of interest tomography, or exterior data tomography. In such cases explicit inversion formulas such as FBP are usually not available, but microlocal analysis (for related normal operators or FIOs) still provides a powerful paradigm for predicting which singularities can be recovered stably from the measurements.
We will try to explain this paradigm a little bit more, starting with an example:
Example 3.5**.**
Let be the characteristic function of the unit disc , i.e. if and for . Then is singular precisely on the unit circle (in normal directions). We have
[TABLE]
Thus is singular precisely at those points with , which correspond to those lines that are tangent to the unit circle.
There is a similar relation between the singularities of and in general, and this is explained by microlocal analysis:
Theorem 3.6**.**
The operator is an elliptic FIO of order . There is a precise relationship between the singularities of and singularities of .
We will not spell out the precise relationship here, but only give some consequences. It will be useful to think of the Radon transform as defined on the set of (non-oriented) lines in . If is an open subset of lines in , we consider the Radon transform restricted to lines in . Recovering (or some properties of ) from is a limited data tomography problem. Examples:
- •
If \mathcal{A}=\{\text{lines not meeting \overline{\mathbb{D}}}\}, then is called exterior data.
- •
If and \mathcal{A}=\{\text{lines whose angle with x<a}\}, then is called limited angle data.
It is known that any is uniquely determined by exterior data (Helgason support theorem), and any is uniquely determined by limited angle data (Fourier slice and Paley-Wiener theorems). However, both inverse problems are very unstable (inversion is not Lipschitz continuous in any Sobolev norms, but one has conditional logarithmic stability).
Definition**.**
A singularity at is called visible from if the line through in direction is in .
One has the following dichotomy:
- •
If is visible from , then from the singularities of one can determine for any whether or not . If uniquely determines , one expects the reconstruction of visible singularities to be stable.
- •
If is not visible from , then this singularity is smoothed out in the measurement . Even if would determine uniquely, the inversion is not Lipschitz stable in any Sobolev norms.
4. Gel’fand problem
Seismic imaging gives rise to various inverse problems related to determining interior properties, e.g. oil deposits or deep structure, of the Earth. Often this is done by using acoustic or elastic waves. We will consider the following problem, also known as the inverse boundary spectral problem (see the monograph [KKL01]):
Gel’fand problem: Is it possible to determine the interior structure of Earth by controlling acoustic waves and measuring vibrations at the surface?
In seismic imaging one often tries to recover an unknown sound speed. However, in this presentation we consider the simpler case where the sound speed is constant (equal to one) and one attempts to recover an unknown potential at each point , where is a ball in .
Consider the free wave operator
[TABLE]
We assume that the medium is at rest at time and that we take measurements until time . If we prescribe the amplitude of the wave to be on , this leads to a solution of the wave equation
[TABLE]
Given any , there is a unique solution (see [Ev10, Theorem 7 in §7.2.3]). We assume that we can measure the normal derivative , where and is the outer unit normal to . Doing such measurements for many different functions , the ideal boundary measurements are encoded by the hyperbolic Dirichlet-to-Neumann map (DN map for short)
[TABLE]
The Gel’fand problem for this model amounts to recovering from the knowledge of the map . We will prove the following result due to [RS88].
Theorem 4.1** (Recovering the X-ray transform).**
Let and assume that . If
[TABLE]
then and satisfy
[TABLE]
whenever is a maximal line segment in with length .
It is natural that the region where one can recover information depends on . By finite propagation speed the map is unaffected if one changes outside the set 333If and solve (4.1) for potentials and with the same Dirichlet data , and if in , then solves where vanishes in and in . Moreover, on and . By finite speed of propagation . This proves that .
[TABLE]
For large enough, one can recover everything:
Corollary 4.2**.**
If , then implies .
Proof.
If , then by Theorem 4.1 one has
[TABLE]
for any maximal line segment in . Thus and have the same X-ray transform in . This transform is injective by Corollary 3.2 when . Tiling by two-planes gives injectivity when . Thus . ∎
Theorem 4.1 could be proved based on the following facts, see e.g. [SY18]:
The map is an FIO of order on . 2. 2.
The X-ray transform of can be read off from the symbol of (more precisely, from the principal symbol of ).
We will give an elementary proof that is based on testing against highly oscillatory boundary data (compare with (1.8)).
The first step is an integral identity.
Lemma 4.3** (Integral identity).**
Assume that . For any , one has
[TABLE]
where solves (4.1) with and , and solves an analogous problem with vanishing Cauchy data on :
[TABLE]
Proof.
We first compute the adjoint of the DN map: one has
[TABLE]
where with solving so that and on . To prove this, we let be the solution of (4.1) and integrate by parts:
[TABLE]
Now, if and are as stated, the computation above gives
[TABLE]
The result follows by subtracting these two identities. ∎
The second step is to construct special solutions to the wave equation that concentrate near curves where is a line segment. These curves are projections to the variables of null bicharacteristic curves for (see Example 2.8). Thus the following result is in line with Theorem 2.7 concerning propagation of singularities. The proof is based on a standard geometrical optics / WKB quasimode construction.
Proposition 4.4** (Concentrating solutions).**
Assume that , and let be a maximal line segment in with . For any there is a solution of in with on , such that for any one has
[TABLE]
Moreover, if , there is a solution of in with on , such that for any one has
[TABLE]
At this point it is easy to prove the main result:
Proof of Theorem 4.1.
Using the assumption and Lemma 4.3, we have
[TABLE]
for any solutions of in so that on , and on .
Let be a maximal unit speed line segment in with , and let be the solution constructed in Proposition 4.4 for the potential with on . Moreover, let be the solution constructed in the end of Proposition 4.4 for the potential with on . Taking the limit as in (4.5) and using (4.4) with , we obtain that
[TABLE]
Thus the integrals of and over maximal line segments of length in are the same. ∎
Proof of Proposition 4.4.
Let be a maximal unit speed line segment in with , and let be the unit speed line so that for . Write and , so that and . After a translation and rotation, we may assume that and .
We first construct an approximate solution for the operator , having the form
[TABLE]
where is a real phase function, and is an amplitude supported near the curve . Note that
[TABLE]
Using a similar expression for , we compute
[TABLE]
We would like to have . To this end, we first choose so that the term in (4.6) vanishes. This will be true if solves the eikonal equation
[TABLE]
There are many possible solutions, but we make the simple choice
[TABLE]
With this choice, (4.6) becomes
[TABLE]
where is the constant vector field
[TABLE]
It is convenient to consider new coordinates in , where
[TABLE]
Then corresponds to in the sense that
[TABLE]
where corresponds to in the new coordinates:
[TABLE]
We next look for the amplitude in the form
[TABLE]
Inserting this to (4.6) and equating like powers of , we get
[TABLE]
We would like the last expression to be . This will hold if and satisfy the transport equations
[TABLE]
Let be supported near [math], and choose
[TABLE]
We will later choose to depend on . Next we choose
[TABLE]
These functions satisfy (4.12), and they vanish unless is small (i.e. is close to ). Then (4.9) becomes
[TABLE]
where
[TABLE]
Using the Cauchy-Schwartz inequality, one can check that
[TABLE]
uniformly over . This concludes the construction of the approximate solution .
We next find an exact solution of (4.1) having the form
[TABLE]
where is a correction term. Note that for close to [math], is supported near and hence on . Note also that . Thus will solve (4.1) for if solves
[TABLE]
By the wellposedness of this problem [Ev10, Theorem 5 in §7.2.3], there is a unique solution with
[TABLE]
We now fix the choice of so that (4.3) will hold. Let satisfy near [math] and , and choose
[TABLE]
where
[TABLE]
With this choice
[TABLE]
It follows that
[TABLE]
Since , the integral in (4.3) has the form
[TABLE]
Using that is compactly supported in , we have
[TABLE]
by changing variables as in (4.8). Finally, changing to and to and letting (so ) yields
[TABLE]
by the normalization and the fact that . This proves (4.3).
It remains to prove (4.4). Since , we have on , and we may alternatively arrange that solves (4.13) with on instead of . We can do such a construction for the potential instead of . Since and are independent of , the same argument as above proves (4.4). ∎
5. Calderón problem: boundary determination
Electrical Impedance Tomography (EIT) is an imaging method with potential applications in medical imaging and nondestructive testing. The method is based on the following important inverse problem.
Calderón problem: Is it possible to determine the electrical conductivity of a medium by making voltage and current measurements on its boundary?
The treatment in this section follows [FSU].
Let us begin by recalling the mathematical model of EIT. The purpose is to determine the electrical conductivity at each point , where represents the body which is imaged (in practice ). We assume that is a bounded open set with boundary, and that is positive.
Under the assumption of no sources or sinks of current in , a voltage potential at the boundary induces a voltage potential in , which solves the Dirichlet problem for the conductivity equation,
[TABLE]
Since is positive, the equation is uniformly elliptic, and there is a unique solution for any boundary value . One can define the Dirichlet-to-Neumann map (DN map) as
[TABLE]
Here is the outer unit normal to and is the normal derivative of . Physically, is the current flowing through the boundary.
The Calderón problem (also called the inverse conductivity problem) is to determine the conductivity function from the knowledge of the map . That is, if the measured current is known for all boundary voltages , one would like to determine the conductivity .
We will prove the following theorem.
Theorem 5.1** (Boundary determination).**
Let be positive. If
[TABLE]
then the Taylor series of and coincide at any point of .
This result was proved in [KV84], and it in particular implies that any real-analytic conductivity is uniquely determined by the DN map. The argument extends to piecewise real-analytic conductivities. A different proof was given in [SU88], based on two facts:
The DN map is an elliptic DO of order on . 2. 2.
The Taylor series of at a boundary point can be read off from the symbol of computed in suitable coordinates. The symbol of can be computed by testing against highly oscillatory boundary data (compare with (1.8)).
Remark 5.2**.**
The above argument is based on studying the singularities of the integral kernel of the DN map, and it only determines the Taylor series of the conductivity at the boundary. The values of the conductivity in the interior are encoded in the part of the kernel, and different methods (based on complex geometrical optics solutions) are required for interior determination.
Let us start with a simple example:
Example 5.3** (DN map in half space is a DO).**
Let , so . We wish to compute the DN map for the Laplace equation (i.e. ) in . Consider
[TABLE]
Writing and taking Fourier transforms in gives
[TABLE]
Solving this ODE for fixed and choosing the solution that decays for gives
[TABLE]
We may now compute the DN map:
[TABLE]
Thus the DN map on the boundary is just corresponding to the Fourier multiplier . This shows that at least in this simple case, the DN map is an elliptic DO of order .
We will now prove Theorem 5.1 by an argument that avoids showing that the DN map is a DO, but is rather based on directly testing the DN map against oscillatory boundary data. The first step is a basic integral identity (sometimes called Alessandrini identity) for the DN map.
Lemma 5.4** (Integral identity).**
Let . If , then
[TABLE]
where solves in with .
Proof.
We first observe that the DN map is symmetric: if is positive and if solves in with , then an integration by parts shows that
[TABLE]
Thus
[TABLE]
The result follows by subtracting the above two identities. ∎
Next we show that if is a boundary point, there is an approximate solution of the conductivity equation that concentrates near , has highly oscillatory boundary data, and decays exponentially in the interior. As a simple example, the solution of
[TABLE]
that decays for is given by , which concentrates near and decays exponentially when if is large. Roughly, this means that the solution of a Laplace type equation with highly oscillatory boundary data concentrates near the boundary.
Proposition 5.5**.**
(Concentrating approximate solutions) Let be positive, let , let be a unit tangent vector to at , and let be supported near . Let also . For any there exists having the form
[TABLE]
such that
[TABLE]
and as
[TABLE]
Moreover, if is positive and is the corresponding approximate solution constructed for , then for any and one has
[TABLE]
for some .
We can now give the proof of the boundary determination result.
Proof of Theorem 5.1.
Using the assumption that together with the integral identity in Lemma 5.4, we have that
[TABLE]
whenever solves in .
Let , let be a unit tangent vector to at , and let satisfy near . We use Proposition 5.5 to construct functions
[TABLE]
so that
[TABLE]
We obtain exact solutions of by setting
[TABLE]
where the correction terms are the unique solutions of
[TABLE]
By standard energy estimates [Ev10, Section 6.2] and by (5.4), the solutions satisfy
[TABLE]
We now insert the solutions into (5.3). Using (5.5) and (5.4), it follows that
[TABLE]
as . Letting , the formula (5.2) yields
[TABLE]
In particular, .
We will prove by induction that
[TABLE]
The case was proved above (here we may vary slightly). We make the induction hypothesis that (5.7) holds for . Let be boundary normal coordinates so that corresponds to [math], and near corresponds to . The induction hypothesis states that
[TABLE]
Considering the Taylor expansion of with respect to gives that
[TABLE]
for some smooth function with . Inserting this formula in (5.6), we obtain that
[TABLE]
Now in boundary normal coordinates. Assuming that was chosen larger than , we may take the limit as and use (5.2) to obtain that
[TABLE]
This shows that for near [math], which concludes the induction. ∎
It remains to prove Proposition 5.5, which constructs approximate solutions (also called quasimodes) concentrating near a boundary point. This is a typical geometrical optics / WKB type construction for quasimodes with complex phase. The proof is elementary, although a bit long. The argument is simplified slightly by using the Borel summation lemma, which is used frequently in microlocal analysis in various different forms.
Lemma 5.6** (Borel summation, [Hö85, Theorem 1.2.6]).**
Let for . There exists such that
[TABLE]
Proof of Proposition 5.5.
We will first carry out the proof in the case where and is flat near [math], i.e. for some (the general case will be considered in the end of the proof). We also assume where .
We look for in the form
[TABLE]
Write . The principal symbol of is
[TABLE]
Since , we compute
[TABLE]
We want to choose and so that . Looking at the term in (5.9), we first choose so that
[TABLE]
We additionally want that and (this will imply that ). In fact, using (5.8) we can just choose
[TABLE]
and then in .
We next look for in the form
[TABLE]
Since , (5.9) implies that
[TABLE]
We will choose the functions so that
[TABLE]
We will additionally arrange that
[TABLE]
and that each is compactly supported so that
[TABLE]
for some fixed .
To find , we prescribe , successively and use the Borel summation lemma to construct with this Taylor series at . We first set . Writing , we observe that
[TABLE]
Thus, in order to have we must have
[TABLE]
We prescribe to have the above value (which depends on the already prescribed quantity ). Next we compute
[TABLE]
where depends on the already prescribed quantities and . We thus set
[TABLE]
which ensures that . Continuing in this way and using Borel summation, we obtain a function so that to infinite order at . The other equations in (5.16) are solved in a similar way, which gives the required functions . In the construction, we may arrange so that (5.19) and (5.20) are valid.
If and are chosen in the above way, then (5.11) implies that
[TABLE]
where each vanishes to infinite order at and is compactly supported in . Thus, for any there is so that in , and consequently
[TABLE]
Since in we have
[TABLE]
Choosing and computing the integrals over , we get that
[TABLE]
It is also easy to compute that
[TABLE]
Thus, choosing , we have proved all the claims except (5.2).
To show (5.2), we observe that
[TABLE]
Using a similar formula for (where is independent of the conductivity), we have
[TABLE]
Now and where , and similarly for . Hence
[TABLE]
We can change variables and use dominated convergence to take the limit as . The limit is
[TABLE]
where .
The proof is complete in the case when and is flat near [math]. In the general case, we choose boundary normal coordinates so that corresponds to [math] and near locally corresponds to . The equation in the new coordinates becomes an equation
[TABLE]
where is a smooth positive matrix only depending on the geometry of near . The construction of now proceeds in a similary way as above, except that the equation (5.10) for the phase function can only be solved to infinite order on instead of solving it globally in . ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[KN 65] J.J. Kohn, L. Nirenberg, An algebra of pseudo-differential operators , Comm. Pure Appl. Math. 18 (1965), 269–305.
- 2[Ev 10] L.C. Evans, Partial differential equations. 2nd edition, AMS, 2010.
- 3[FSU] J. Feldman, M. Salo, G. Uhlmann, The Calderón problem - an introduction to inverse problems. Book in progress (draft available on request).
- 4[Gu 75] V. Guillemin, Some remarks on integral geometry , Tech. report, MIT, 1975.
- 5[Hö71] L. Hörmander, Fourier integral operators. I, Acta Math. 127 (1971), 79–183.
- 6[Hö85] L. Hörmander, The analysis of linear partial differential operators, vols. I–IV. Springer-Verlag, Berlin Heidelberg, 1983–1985.
- 7[IM 19] J. Ilmavirta, F. Monard, Integral geometry on manifolds with boundary and applications , chapter in The Radon transform: the first 100 years and beyond (eds. R. Ramlay, O. Scherzer), de Gruyter, 2019.
- 8[KKL 01] A. Katchalov, Y. Kurylev, M. Lassas, Inverse boundary spectral problems. Monographs and Surveys in Pure and Applied Mathematics 123, Chapman Hall/CRC, 2001.
