Uniqueness, stability and global convergence for a discrete inverse elliptic Robin transmission problem
Bastian Harrach

B. Harrach
Institute for Mathematics, Goethe-University Frankfurt, Germany
Email: [email protected]
Abstract
We derive a simple criterion that ensures uniqueness, Lipschitz stability and global convergence of Newton’s method for the finite-dimensional zero-finding problem of a continuously differentiable, pointwise convex and monotonic function. Our criterion merely requires evaluating the directional derivative of the forward function at finitely many evaluation points and for finitely many directions.
We then demonstrate that this result can be used to prove uniqueness, stability and global convergence for an inverse coefficient problem with finitely many measurements. We consider the problem of determining an unknown inverse Robin transmission coefficient in an elliptic PDE. Using a relation to monotonicity and localized potentials techniques, we show that a piecewise-constant coefficient on an a-priori known partition with a-priori known bounds is uniquely determined by finitely many boundary measurements and that it can be uniquely and stably reconstructed by a globally convergent Newton iteration. We derive a constructive method to identify these boundary measurements, calculate the stability constant and give a numerical example.
Keywords:
Uniqueness, Lipschitz stability, global convergence, Robin transmission problem, Newton method, monotonicity method, localized potentials
MSC:
35R30, 65M32, 58C15
1 Introduction
New technologies for medical imaging, non-destructive testing, or geophysical exploration are often based on mathematical inverse coefficient problems, where the coefficient of a partial differential equation is to be reconstructed from (partial) knowledge of its solutions. A prominent example is the emerging technique of electrical impedance tomography (EIT), cf. henderson1978impedance ; barber1984applied ; wexler1985impedance ; newell1988electric ; metherall1996three ; cheney1999electrical ; borcea2002electrical ; borcea2003addendum ; lionheart2003eit ; holder2004electrical ; bayford2006bioimpedance ; uhlmann2009electrical ; martinsen2011bioimpedance ; seo2013electrical ; adler2015electrical , and the references therein for a broad overview. Inverse coefficient problems are usually highly non-linear and ill-posed, and uniqueness and stability questions, as well as the design and the theoretical study of reconstruction algorithms are extremely challenging topics in theoretical and applied research.
In practical applications, only finitely many measurements can be made, the unknown coefficient has to be parametrized with finitely many variables (e.g., by assuming piecewise-constantness on a given partition), and physical considerations typically limit the unknown coefficient to fall between certain a-priori known bounds. Thus, after shifting and rescaling, a practical inverse coefficient problem can be put in the form of finding the zero of a non-linear function ,
[TABLE]
It is of utmost importance to determine what measurements make this finite-dimensional inverse (or zero-finding) problem uniquely solvable, to evaluate the stability of the solution with respect to model and measurement errors, and to design convergent numerical reconstruction algorithms.
In this paper, we will study this problem under the assumption that is a pointwise monotonic and convex function, which often arises in elliptic inverse coefficient problems (cf. our remarks at the end of this introduction). We will derive a simple criterion that implies the existence of a zero, and the injectivity of in a certain neighborhood of . It also allows us to quantify the Lipschitz stability constant of the left inverse and, for , ensures global convergence of Newton’s method. The criterion is easy to check, as it merely requires evaluating the directional derivative for a finite number of evaluation points and finitely many directions .
We then show how our result can be applied to the inverse problem of determining a Robin transmission coefficient in an elliptic PDE from the associated Neumann-to-Dirichlet operator, a problem motivated by EIT-based corrosion detection harrach2019global . We assume that the Robin coefficient is piecewise constant on an a-priori known partition of the interface into parts, and that we a-priori know upper and lower bounds for the Robin coefficient’s values. We then show how to construct boundary measurements that uniquely determine the unknown Robin values, and for which a standard Newton method globally converges. We also quantify the stability of the solution with respect to errors, and numerically demonstrate our result in a simple setting.
Let us give some references to put our result in the context of existing literature. The arguably most famous elliptic inverse coefficient problem is the Calderón problem calderon1980inverse ; calderon2006inverse , that arises in EIT, cf. kohn1984determining ; kohn1985determining ; druskin1998uniqueness ; sylvester1987global ; nachman1996global ; astala2006calderon ; kenig2007calderon ; haberman2013uniqueness ; caro2016global ; krupchyk2016calderon for an incomplete list of seminal breakthroughs for the uniqueness question in an infinite-dimensional setting.
Several recent works have addressed uniqueness and Lipschitz stability questions for the problem of determining finitely many parameters (e.g., by assuming piecewise-constantness) from finitely or infinitely many measurements in inverse coefficient problems, cf., kazemi1993stability ; alessandrini1996determining ; imanuvilov1998lipschitz ; imanuvilov2001global ; cheng2003lipschitz ; alessandrini2005lipschitz ; bacchelli2006lipschitz ; bellassoued2006lipschitz ; klibanov2006lipschitz ; klibanov2006lipschitz_nonstandard ; bellassoued2007lipschitz ; sincich2007lipschitz ; yuan2007lipschitz ; yuan2009lipschitz ; beretta2011lipschitz ; beretta2013lipschitz ; melendez2013lipschitz ; alessandrini2017lipschitz ; beretta2017uniqueness ; beilina2017lipschitz ; alessandrini2018lipschitz ; alberti2019calderon ; ruland2019lipschitz ; harrach2019uniqueness ; harrach2019global ; alberti2019infinite ; harrach2020monotonicity . To the knowledge of the author, the result presented herein is the first to explicitly calculate those measurements that uniquely determine the unknown parameters, and, together with harrach2019global , it is the first result to explicitly calculate the Lipschitz constant for a given setting. Moreover, we obtain the unique existence of a solution also for noisy measurements, so that Lipschitz stability also yields an error estimate in the case of noise.
Reconstruction algorithms for inverse coefficient problems typically rely on Newton-type approaches or on globally minimizing a non-convex regularized data-fitting functional. Both approaches require an initial guess close to the unknown solution, so that most algorithms are only known to converge locally. For the sake of rigor, it should be noted that the difficulty in this context is not to construct globally convergent methods, but to construct computationally feasible globally convergent methods. To elaborate on this point, let us consider a minimization problem with a continuous functional over a finite-dimensional interval . A trivial optimization algorithm is to choose a countable dense subset , and to set ,
[TABLE]
Then, obviously, any accumulation point of is a global minimizer of . But this type of approach requires a completely infeasible number of function evaluations and is thus usually disregarded in practice. Note, however, that together with estimates on the convergence range of an iterative algorithm as in the recent preprint of Alberti and Santacesaria alberti2019infinite , and the progress of parallel computing power, this type of approach may become feasible at least for lower-dimensional problems.
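The trivial scheme described above can be sketched in a few lines. This is only an illustration under stated assumptions, not the author's code: the dyadic grid enumeration `dyadic_points`, the helper `dense_search`, the evaluation budget and the test functional `example` are all hypothetical choices made here.

```python
import itertools
import math


def dyadic_points(dim):
    """Enumerate a countable dense subset of [0, 1]^dim.

    Assumption for illustration: grids of mesh width 2^-level,
    visited for level = 0, 1, 2, ... (points may repeat across levels).
    """
    for level in itertools.count():
        steps = 2 ** level
        axis = [i / steps for i in range(steps + 1)]
        for point in itertools.product(axis, repeat=dim):
            yield point


def dense_search(f, dim, budget):
    """Keep the best of the first `budget` points of the dense sequence."""
    best_x, best_val = None, math.inf
    for z in itertools.islice(dyadic_points(dim), budget):
        val = f(z)
        if val < best_val:  # replace the current point only on strict improvement
            best_x, best_val = z, val
    return best_x, best_val


def example(x):
    """Hypothetical continuous functional with global minimizer (0.25, 0.75)."""
    return (x[0] - 0.25) ** 2 + (x[1] - 0.75) ** 2


# 4 + 9 + 25 = 38 evaluations cover refinement levels 0, 1 and 2 in two dimensions.
best_point, best_value = dense_search(example, dim=2, budget=38)
print(best_point, best_value)
```

Refinement level $L$ alone contributes $(2^L+1)^{\mathrm{dim}}$ grid points in this sketch, which illustrates why the number of function evaluations quickly becomes prohibitive as the dimension grows.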
To obtain (computationally feasible) globally convergent algorithms, quasi-reversibility and convexification ideas have been developed in the seminal work of Klibanov et al., cf., e.g. beilina2008globally ; beilina2012approximate ; klibanov2017convexification ; klibanov2019convexification . Alternatively, one can use very specific properties of the considered problem, cf., e.g., the global convergence result of Knudsen, Lassas, Mueller and Siltanen knudsen2009regularized for the d-bar method for EIT, or resort to only reconstructing the shape of an anomaly, cf. ikehata1999draw ; ikehata2000reconstruction ; ide2007probing ; harrach2010exact ; harrach2013monotonicity ; harrach2016enhancing ; harrach2018monotonicity for some globally convergent approaches. The theory developed in this work shows that, somewhat surprisingly, global convergence holds for the standard zero-finding Newton method when the right measurements are used, and this also implies fast quadratic convergence. On the other hand, so far, our theory does not allow utilizing more measurements than unknowns or explicitly adding regularization, which would be advantageous in practical applications. Moreover, the calculated measurements tend to become more and more oscillatory for higher-dimensional problems. Hence, so far, we can only expect our approach to be practically feasible for relatively few unknowns where discretization sufficiently regularizes the ill-posed problem.
On the methodological side, this work builds upon harrach2019global ; harrach2019uniqueness and stems from the theory of combining monotonicity estimates with localized potentials, cf. harrach2009uniqueness ; harrach2010exact ; harrach2012simultaneous ; arnold2013unique ; harrach2013monotonicity ; barth2017detecting ; harrach2017local ; brander2018monotonicity ; griesmaier2018monotonicity ; harrach2018localizing ; seo2019learning ; harrach2019fractional_I ; harrach2019helmholtz ; harrach2019dimension ; eberle2020lipschitz ; harrach2020monotonicity for related works, and tamburrino2002new ; harrach2015combining ; harrach2015resolution ; harrach2016enhancing ; maffucci2016novel ; tamburrino2016monotonicity ; garde2017convergence ; su2017monotonicity ; ventre2017design ; garde2018comparison ; harrach2018monotonicity ; zhou2018monotonicity ; candiani2019monotonicity ; garde2019regularized ; garde2020reconstruction for practical monotonicity-based reconstruction methods. In this work, the monotonicity and convexity of the forward function are based on the so-called monotonicity relation, which goes back to Ikehata, Kang, Seo, and Sheen ikehata1998size ; kang1997inverse . The existence of measurements that fulfill the extra criterion on the directional derivative evaluations follows from localized potentials arguments gebauer2008localized . Hence, it might be possible to extend the theory developed herein to other elliptic inverse coefficient problems where monotonicity and localized potentials results are also available. Note, however, that this extension is not obvious, since the localized potentials results for the Robin transmission problem considered herein are stronger than those known for other coefficient problems such as EIT.
Finally, it should be noted that the monotonicity and localized potentials techniques evolved from the factorization method kirsch1998characterization ; bruhl2000numerical ; gebauer2007factorization ; lechleiter2008factorization ; harrach2013recent , and that global convergence of Newton’s method for finite-dimensional zero-finding problems is a classical result for pointwise convex functions that are inverse monotonic (also called Collatz monotone collatz1952aufgaben ), cf., e.g., the book of Ortega and Rheinboldt (ortega1970iterative, Thm. 13.3.2). Such problems arise, e.g., in solving non-linear elliptic PDEs. Roughly speaking, one might be tempted to say that elliptic forward coefficient problems lead to inverse monotonic convex functions, and inverse elliptic coefficient problems lead to forward monotonic convex functions. Our extra criterion on the directional derivative evaluations allows us to write our forward monotonic function as an affine transformation of an inverse monotonic function in a certain region and, together with some technical arguments to ensure that the iterates stay in this region, this is the major key in proving our global convergence result.
The paper is organized as follows. In section 2, we prove uniqueness, stability and global convergence of the Newton method for continuously differentiable, pointwise convex and monotonic functions under a simple extra condition on the directional derivatives. In section 3, we apply this result to an inverse Robin coefficient problem, and show how to determine those measurements that uniquely and stably determine the unknown coefficient with a desired resolution via a globally convergent Newton iteration. We also give a numerical example in section 3. Throughout this paper, we take the somewhat lengthy, but hopefully reader-friendly approach of first presenting less technical intermediate results to motivate our approach.
2 Uniqueness, stability and global Newton convergence
We consider a continuously differentiable, pointwise convex and monotonic function
[TABLE]
where , , and is a convex open set. In this section, we will derive a simple criterion that implies injectivity of on a multidimensional interval. The criterion also allows us to estimate the Lipschitz stability constant of the left inverse and, for , ensures global convergence of Newton’s method.
Remark 1
Throughout this work, "$\leq$" is always understood pointwise for finite-dimensional vectors and matrices, and $x \not\leq y$ denotes the converse, i.e., that $x - y$ has at least one positive entry.
Monotonicity and convexity are understood with respect to this pointwise partial order, i.e., is monotonic if
[TABLE]
and is convex if
[TABLE]
We also say that a function is anti-monotone if is monotonic.
For continuously differentiable functions, it is easily shown that monotonicity is equivalent to
[TABLE]
and thus equivalent to . Convexity is known to be equivalent to
[TABLE]
cf., e.g., (ortega1970iterative, Thm. 13.3.2). All the proofs in this section use the monotonicity and convexity assumptions in the form (1) and (2).
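For reference, the standard textbook forms of these notions can be written out as follows. The symbols $F$, $U$, $x$, $y$, $d$ and $t$ are notation assumed here for illustration; the first two lines are the definitions with respect to the pointwise partial order, and the last two are the equivalent derivative characterizations for continuously differentiable functions on a convex open set.

```latex
% Assumed notation: F : U \subseteq \mathbb{R}^n \to \mathbb{R}^m continuously differentiable,
% U convex and open, "\le" understood entrywise.
\begin{align*}
\text{monotonicity:}\quad
  & x \le y \;\Longrightarrow\; F(x) \le F(y)
  && \text{for all } x, y \in U, \\
\text{convexity:}\quad
  & F\bigl((1-t)x + t y\bigr) \le (1-t)\,F(x) + t\,F(y)
  && \text{for all } x, y \in U,\ t \in [0,1], \\
\text{monotonicity } (C^1):\quad
  & F'(x)\,d \ge 0
  && \text{for all } x \in U,\ 0 \le d \in \mathbb{R}^n, \\
\text{convexity } (C^1):\quad
  & F(y) - F(x) \ge F'(x)\,(y - x)
  && \text{for all } x, y \in U.
\end{align*}
```

The third line is the same as saying that every entry of the Jacobian $F'(x)$ is non-negative.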
Throughout this work, we denote by the -th unit vector in , , and . denotes the -dimensional identity matrix, and is the matrix containing in all of its entries.
2.1 A simple criterion for uniqueness and Lipschitz stability
Before we state our result in its final form in subsection 2.3, we derive two weaker (and less technical) results that motivate our arguments and may be of independent interest. We first show a simple criterion that yields injectivity of and allows us to estimate the Lipschitz stability constant of its left inverse .
Theorem 2.1
Let , , be a continuously differentiable, pointwise convex and monotonic function on a convex open set containing . If, additionally,
[TABLE]
then the following holds:
(a) is injective on .
(b) is injective for all .
(c) With
[TABLE]
we have that for all
[TABLE]
where denotes the left inverse of .
To prove Theorem 2.1, we will first formulate an auxiliary lemma, which will also be used in our more technical results. Note that assumption (5) in the following lemma simply means that a row permutation of the non-negative matrix is strictly diagonally dominant in its first rows.
Lemma 1
Let , , be continuously differentiable, pointwise convex and monotonic on a convex open set , and let .
(a) If fulfills
[TABLE]
then is injective, and its left inverse fulfills .
(b) If, additionally, also fulfills (5), then
[TABLE]
Proof
(a) For all , at least one of the entries of must be either or , so that there exists with either
[TABLE]
We thus obtain from the monotonicity assumption (1) that either
[TABLE]
In both cases, it follows from (5) that
[TABLE]
This proves injectivity of and the bound on its left inverse.
(b) Likewise, for , either
[TABLE]
so that by monotonicity (1) and assumption (5), either
[TABLE]
By convexity (2), it then follows that
[TABLE]
We can now prove Theorem 2.1 with lemma 1.
Proof (of Theorem 2.1)
Let . Writing
[TABLE]
we have that
[TABLE]
Thus we deduce from (1) and (2) that for all
[TABLE]
With the definition of in (4), this shows that
[TABLE]
so that (a), (b) and (c) follow from lemma 1.
2.2 A simple criterion for global convergence of the Newton iteration
We will now show how to ensure that a convex monotonic function has a unique zero, and that the Newton method converges globally to this zero.
Theorem 2.2
Let , , be continuously differentiable, pointwise convex and monotonic on a convex open set . If , and
[TABLE]
with
[TABLE]
then the following holds:
(a) is injective on , is invertible for all , and for all
[TABLE]
where
[TABLE]
(b) If, additionally, , then there exists a unique
[TABLE]
and this is the only zero of in . The Newton iteration sequence
[TABLE]
is well defined (i.e., is invertible in each step) and converges to . Furthermore, for all , , and
[TABLE]
where . The rate of convergence of is superlinear. If is locally Lipschitz, then the rate of convergence is quadratic.
To prove Theorem 2.2 we will first show the following lemma.
Lemma 2
Under the assumptions and with the notations of Theorem 2.2, the following holds:
(a) For all ,
[TABLE]
(b) is injective on , is invertible for all , and, for all ,
[TABLE]
(c) For all , and all
[TABLE]
(d) is invertible, . For all , and
[TABLE]
and thus .
Proof
(a) Let . Using , we have that for all
[TABLE]
and thus it follows from monotonicity (1) and convexity (2) that
[TABLE]
which proves (a) using the definition of in (9).
(b) Since (a) implies a fortiori that for all
[TABLE]
the assertion (b) follows from lemma 1.
(c) Let , and . If there exists an index with , then , so that
[TABLE]
By contraposition, this shows that
[TABLE]
which also shows that .
(d) It is easily checked that
[TABLE]
which shows that is invertible and
Moreover, using (c) it follows that implies that for all ,
[TABLE]
so that implies . This also shows that
[TABLE]
and thus .
Note that by lemma 2(d), is a convex function with Collatz monotone derivative collatz1952aufgaben , i.e. . If the Newton iterates do not leave the region where convexity and Collatz monotonicity hold, then classical results on monotone Newton methods (cf., e.g., Ortega and Rheinboldt (ortega1970iterative, Thm. 13.3.4)) yield global Newton convergence for , and thus for , since the Newton method is invariant under linear transformations. The following lemma adapts some arguments from the classical result (ortega1970iterative, Thm. 13.3.4) to our situation.
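The classical monotone Newton behaviour referred to above can be observed on a small model problem. The following sketch is an illustration under stated assumptions and is not taken from the paper: the test function `G` (a tridiagonal M-matrix plus a componentwise exponential), the dimension `N`, the right-hand side `B` and the helper `solve` are hypothetical choices. Each component of `G` is convex and the Jacobian has an entrywise non-negative inverse, so after the first step the residuals should stay non-negative and the iterates should decrease monotonically.

```python
import math


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting (small dense systems)."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


N = 5
B = [1.0] * N  # hypothetical data, chosen so that the exact zero is x = 0


def G(x):
    """G_i(x) = 2 x_i - x_{i-1} - x_{i+1} + exp(x_i) - B_i (convex in each component)."""
    out = []
    for i in range(N):
        left = x[i - 1] if i > 0 else 0.0
        right = x[i + 1] if i < N - 1 else 0.0
        out.append(2.0 * x[i] - left - right + math.exp(x[i]) - B[i])
    return out


def dG(x):
    """Jacobian: tridiagonal M-matrix plus a positive diagonal, hence inverse >= 0 entrywise."""
    jac = [[0.0] * N for _ in range(N)]
    for i in range(N):
        jac[i][i] = 2.0 + math.exp(x[i])
        if i > 0:
            jac[i][i - 1] = -1.0
        if i < N - 1:
            jac[i][i + 1] = -1.0
    return jac


def newton(x0, steps=30):
    """Plain Newton iteration x_{k+1} = x_k - G'(x_k)^{-1} G(x_k); returns all iterates."""
    iterates = [list(x0)]
    for _ in range(steps):
        x = iterates[-1]
        step = solve(dG(x), G(x))
        iterates.append([xi - si for xi, si in zip(x, step)])
    return iterates


iterates = newton([-3.0] * N)
print(iterates[-1])
```

The monotone behaviour follows from the convexity inequality: the residual at the new iterate is bounded below by the linearization at the old one, which vanishes by construction of the Newton step, and a non-negative inverse Jacobian then turns a non-negative residual into a non-positive update.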
Lemma 3
Let be continuously differentiable and pointwise convex on a convex open set containing zero, and let . We assume that for some point , is invertible,
[TABLE]
Then for all
[TABLE]
fulfills . Moreover, if then .
Proof
The assumptions (11) and the convexity (2) yield that for all
[TABLE]
Moreover,
[TABLE]
so that is proven.
If , then we also obtain from the convexity assumption (2) that
[TABLE]
which shows that .
Proof (of Theorem 2.2)
Theorem 2.2(a) has been proven in lemma 2(b).
To prove (b), let with and . Then, by (a), is invertible, so that
[TABLE]
is well defined.
We will prove that . We argue by contradiction, and assume that this is not the case. Then, by continuity, there exists with and, by lemma 3,
[TABLE]
Convexity (2) then yields that
[TABLE]
and using lemma 2(c) this would imply that
[TABLE]
For all , we obtain from (12) and (13)
[TABLE]
Hence, , so that . Since this contradicts , we have proven that .
Using lemma 3 again, this shows that for all
[TABLE]
the next Newton iterate is well-defined and also fulfills
[TABLE]
Hence, for , the Newton algorithm produces a well-defined sequence for which is monotonically non-increasing and bounded. Hence, and thus also converge. We define
[TABLE]
Since is continuously differentiable and is invertible, it follows from the Newton iteration formula (10) that . Also, the monotone convergence of shows that
[TABLE]
Moreover, since this is the standard Newton iteration, the convergence speed is superlinear and the speed is quadratic if is Lipschitz continuous in a neighbourhood of .
It only remains to show that . For this, we use the convexity to obtain
[TABLE]
which then implies by lemma 2(c) that
[TABLE]
From this we obtain that
[TABLE]
which yields . Using (14) again, we then obtain
[TABLE]
which shows .
2.3 A result with tighter domain assumptions
Our results in subsections 2.1 and 2.2 require the considered function to be defined (and convex and monotonic) on a much larger set than . For some applications (such as the inverse coefficient problem in section 3), the following more technical variant of Theorem 2.2 is useful, as it allows us to treat the case where the domain of definition is an arbitrarily small neighbourhood of .
Theorem 2.3
Let and . Let , , be continuously differentiable, pointwise convex and monotonic on a convex open set . If , and
[TABLE]
where ,
[TABLE]
then the following holds on .
(a) is injective on . For all , is invertible and
[TABLE]
where
[TABLE]
(b) If, additionally, , then there exists a unique
[TABLE]
The Newton iteration sequence
[TABLE]
is well defined (i.e., is invertible in each step) and converges to . For all
[TABLE]
where . The rate of convergence of is superlinear. If is locally Lipschitz then the rate of convergence is quadratic.
To prove Theorem 2.3 we first prove a variant of lemma 2 with tighter domain assumptions.
Lemma 4
Under the assumptions and with the notations of Theorem 2.3, the following holds:
(a) For all , and ,
[TABLE]
(b) is injective on . For all , is invertible,
[TABLE]
(c) For all , and
[TABLE]
(d) is invertible, . For all , and
[TABLE]
and thus .
Proof
We use the same arguments as in lemma 2. To prove (a) let and . Then, by the definition of , there exists , so that
[TABLE]
It follows from the definition of and in (16) and (17) that
[TABLE]
We thus obtain
[TABLE]
which proves (a). Since this also implies a fortiori that
[TABLE]
(b) follows from lemma 1.
To show (c) by contraposition, let , , and assume that for some index , we have that . Then , so that
[TABLE]
By contraposition, this shows that
[TABLE]
and the latter also implies that .
For the proof of (d), it is easily checked that
[TABLE]
which shows the invertibility of and the asserted formula for . Moreover, for all , and ,
[TABLE]
Hence, using (c), for all , implies . As this also shows that
[TABLE]
we have .
Proof (of Theorem 2.3)
We proceed as in the proof of Theorem 2.2. Assertion (a) has already been proven in lemma 4(b).
To prove (b), let with and . Then, by (a), is invertible, so that
[TABLE]
is well defined.
We will prove that . We argue by contradiction, and assume that this is not the case. Then, by continuity, there exists with , so that, by lemma 3,
[TABLE]
Convexity (2) then yields that
[TABLE]
and using lemma 4(c) this would imply
[TABLE]
For all , we obtain from (20) and (21)
[TABLE]
and thus
[TABLE]
An elementary computation shows that
[TABLE]
where we used for the first inequality, and we used the assumption for the last inequality. Hence, for all ,
[TABLE]
This contradicts , and thus shows that .
Using lemma 3 again, this shows that for all
[TABLE]
the next Newton iterate is well-defined and also fulfills
[TABLE]
Hence, for , the Newton algorithm produces a well-defined sequence for which is monotonically non-increasing and bounded. Hence, and thus also converge. We define
[TABLE]
Since is continuously differentiable and is invertible, it follows from the Newton iteration formula that . Also, the monotone convergence of shows that
[TABLE]
Moreover, since this is the standard Newton iteration, the convergence speed is superlinear and the speed is quadratic if is Lipschitz continuous in a neighbourhood of .
3 Application to an inverse Robin transmission problem
We will now show how to use our result to obtain uniqueness, stability and global convergence results for an inverse coefficient problem with finitely many measurements. More precisely, we show how to choose finitely many measurements so that they uniquely determine the unknown coefficient function with a given resolution, Lipschitz stability holds, and Newton’s method globally converges.
3.1 The setting
We consider the inverse Robin transmission problem for the Laplace equation from harrach2019global , that is motivated by corrosion detection. Note that similar problems have also been studied for the Helmholtz equation under the name conductive boundary condition or transition boundary condition lyalinov1998transition ; angell1990resistive . We first introduce the idealized infinite-dimensional forward and inverse problem following harrach2019global and then study the case of finitely many measurements.
3.1.1 The infinite-dimensional forward and inverse problem
Let (), be a bounded domain with Lipschitz boundary and let be an open subset of , with , Lipschitz boundary and connected complement , cf. figure 1 in the numerical section for a sketch of the setting.
We assume that describes an electrically conductive imaging domain, with a-priori known conductivity that we set to for the ease of presentation. (Note that all of the following results remain valid if the conductivity in and in are a-priori known spatially dependent functions as long as they are sufficiently regular to allow unique continuation arguments). We assume that corrosion effects on the interface can be modelled with an unknown Robin transmission parameter , where denotes the subset of -functions with positive essential infima.
Applying an electrical current flux on the boundary then yields an electric potential solving the following Robin transmission problem with Neumann boundary values
[TABLE]
where is the unit normal vector to the interface or pointing outward of , resp., , and
[TABLE]
denote the jump of the Dirichlet, resp., Neumann trace values on , with the superscript ”” denoting that the trace is taken from and ”” denoting the trace taken from . In the following, we often denote the solution of (22)–(25) by to point out its dependence on the Robin transmission coefficient and the Neumann boundary data .
It is easily seen that this problem is equivalent to the variational formulation of finding such that
[TABLE]
and that (26) is uniquely solvable by the Lax-Milgram theorem. Hence, we can define the Neumann-to-Dirichlet map
[TABLE]
It is easy to show that is a compact self-adjoint linear operator.
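For orientation, the variational formulation and the Neumann-to-Dirichlet map can be written out as follows. This is a sketch under an assumption: it presumes that the transmission conditions prescribe a continuous Dirichlet trace across the interface and a jump of the Neumann trace equal to $\gamma u$, as is customary for Robin transmission problems of this type. The symbols $\Omega$, $\Gamma$, $\gamma$, $g$, $u$, $w$ and $\Lambda(\gamma)$ are assumed notation.

```latex
% Assumed notation: \Omega imaging domain, \Gamma interface, \gamma Robin transmission coefficient,
% g Neumann data on \partial\Omega, u = u_\gamma^g the resulting electric potential.
\begin{align*}
\int_\Omega \nabla u \cdot \nabla w \, dx + \int_\Gamma \gamma\, u\, w \, ds
  &= \int_{\partial\Omega} g\, w \, ds
  \quad \text{for all } w \in H^1(\Omega), \\
\Lambda(\gamma):\ L^2(\partial\Omega) \to L^2(\partial\Omega), \quad
  g &\mapsto u_\gamma^g\big|_{\partial\Omega}.
\end{align*}
```

Under this assumption, a coefficient $\gamma$ that is bounded below by a positive constant on $\Gamma$ makes the left-hand side a bounded and coercive bilinear form on $H^1(\Omega)$, which is what the Lax-Milgram theorem requires.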
One may regard as an idealized model (the so-called continuum model) of all electric current/voltage measurements that can be carried out on the outer boundary . Hence the infinite-dimensional inverse coefficient problem of determining a Robin transmission coefficient from boundary measurements can be formulated as the problem to
[TABLE]
We summarize some more properties of the infinite-dimensional forward mapping in the following lemma.
Lemma 5
(a) The non-linear mapping
[TABLE]
is Fréchet differentiable. Its derivative
[TABLE]
is given by the bilinear form
[TABLE]
for all , , and , where solves (26). is locally Lipschitz continuous and is compact and self-adjoint.
(b) For all and all ,
[TABLE]
(c) For all , , and ,
[TABLE]
Proof
Obviously, for all and , (27) defines a compact self-adjoint linear operator . Moreover, it follows from the monotonicity estimate in (harrach2019global, Lemma 4.1) that for all (that are sufficiently small so that )
[TABLE]
and thus
[TABLE]
This shows that is Fréchet differentiable for all , and that is its derivative. Since it is easily shown that depends locally Lipschitz continuously on , it also follows that is locally Lipschitz continuous. This proves (a).
(b) is shown in (harrach2019global, Lemma 4.1), and (c) follows from (27).
Note that lemma 5 shows that is a convex, anti-monotone function with respect to the pointwise partial order on , and the Loewner partial order in the space of compact self-adjoint operators on . These properties are the key to formulate the finite-dimensional inverse problem as a zero finding problem for a pointwise convex and monotonic forward function.
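A finite-dimensional analogue makes the convexity and anti-monotonicity easy to check numerically. The sketch below is an assumption-laden illustration, not the paper's discretization: the matrix `K` (a path-graph Laplacian standing in for a discretized Neumann problem), the diagonal `MASKS` (standing in for the pieces of the interface), the vector `G_VEC` (standing in for a boundary current) and all function names are hypothetical. The scalar `forward(gamma)` plays the role of one measurement, and `gradient(gamma)` mirrors the derivative formula with the quadratic term in the solution.

```python
def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting (small dense systems)."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


# Hypothetical model data (symmetric positive semi-definite K, diagonal pieces, one current).
K = [[1.0, -1.0, 0.0, 0.0],
     [-1.0, 2.0, -1.0, 0.0],
     [0.0, -1.0, 2.0, -1.0],
     [0.0, 0.0, -1.0, 1.0]]
MASKS = [[1.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 1.0]]
G_VEC = [1.0, 0.0, 0.0, -0.5]


def state(gamma):
    """Solve (K + sum_i gamma_i * diag(MASKS[i])) u = G_VEC for positive gamma."""
    a = [row[:] for row in K]
    for coeff, mask in zip(gamma, MASKS):
        for j, weight in enumerate(mask):
            a[j][j] += coeff * weight
    return solve(a, G_VEC)


def forward(gamma):
    """Scalar measurement G_VEC^T (K + sum_i gamma_i M_i)^{-1} G_VEC."""
    u = state(gamma)
    return sum(gj * uj for gj, uj in zip(G_VEC, u))


def gradient(gamma):
    """Partial derivatives -u^T M_i u, which are never positive."""
    u = state(gamma)
    return [-sum(w * uj * uj for w, uj in zip(mask, u)) for mask in MASKS]


print(forward([1.0, 2.0]), gradient([1.0, 2.0]))
```

Increasing any entry of `gamma` can only decrease `forward`, and the tangent inequality `forward(y) - forward(x) >= gradient(x) . (y - x)` expresses convexity. Both properties follow from the fact that the quadratic form of a matrix inverse is a supremum of functions that are affine in the matrix.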
3.1.2 The inverse problem with finitely many measurements
In practical applications, it is natural to assume that the unknown Robin transmission coefficient is a piecewise constant function, i.e., where and are the characteristic functions on pairwise disjoint subsets of a given partition . For the ease of notation, here and in the following, we identify a piecewise constant function with the vector . We also simply write for the constant function , and for the vector (and use analogously). Throughout this work we always assume that .
It is also natural to assume that one knows bounds on the unknown Robin transmission coefficient, so that for all . For the semi-discretized inverse problem of reconstructing the finite-dimensional coefficient vector from the (infinite-dimensional) measurements , the results in (harrach2019global, Thm. 2.1 and 2.2) show unique solvability and Lipschitz stability. Moreover, (harrach2019global, Thm. 5.2) shows how to explicitly calculate the Lipschitz constant for a given setting using arguments similar to (and inspiring) section 2 in this work.
We now go one step further and assume that we can only measure finitely many components of , i.e., that we can measure
[TABLE]
for a finite number of Neumann boundary data . Hence, the fully discretized inverse Robin transmission problem leads to the finite dimensional non-linear inverse problem to
[TABLE]
The following practically important questions are then to be answered: Given bounds and a partition of (i.e., a desired resolution), how many and which Neumann boundary functions , should be used, so that uniquely determines ? How good is the stability of the resulting inverse problem with regard to noisy measurements? How can one construct a globally convergent numerical algorithm to practically determine from ? And how good will the solution be in the case that the true Robin transmission function is not piecewise constant?
The following subsections show how these questions can be answered using the theory developed in section 2. For this, let us first observe that the symmetric choice leads to an inverse problem with a pointwise convex and monotonic forward function.
Lemma 6
Let , , and
[TABLE]
* is a pointwise convex and anti-monotone, continuously differentiable function with locally Lipschitz continuous derivative , where*
[TABLE]
Given a vector with , we define the rescaled function
[TABLE]
with . Then is a pointwise convex and monotonic, continuously differentiable function with locally Lipschitz continuous derivative, and for all . Also, .
Proof
This follows immediately from lemma 5.
3.2 Uniqueness, stability and global Newton convergence
We summarize the assumptions on the setting: Let (), be a bounded domain with Lipschitz boundary and let be an open subset of , with , Lipschitz boundary and connected complement . We assume that the true unknown Robin transmission coefficient is bounded by for a.e., with a-priori known bounds , and that , , is piecewise constant on an a-priori known partition into , , pairwise disjoint measurable subsets with positive measure.
We will show how to construct Neumann boundary functions , so that is uniquely determined by the measurements
[TABLE]
We also quantify the Lipschitz stability constant of this inverse problem, and show that the inverse problem can be numerically solved with a globally convergent Newton method. Our main tool is to reformulate the finite-dimensional inverse problem as a zero finding problem for the pointwise monotonic and convex function introduced in lemma 6. Then we use a relation to the concept of localized potentials gebauer2008localized to prove that we can choose the measurements in such a way that also fulfills the additional assumptions on the directional derivative of the forward function as required by our Newton convergence theory in section 2.
At this point, let us stress that our theory in section 2 allows us to find measurements that uniquely recover the unknown components of the Robin coefficient, but it also requires using exactly those measurements to ensure global convergence of Newton’s method. In practice, it would be highly desirable to utilize additional measurements for the sake of redundancy and error reduction. But our convergence theory in section 2 does not cover the case yet, and an extension to, e.g., Gauss-Newton or Levenberg-Marquardt methods seems far from trivial. Moreover, for some applications, it would be desirable to also treat the interior boundary as unknown. But it is not clear whether the interplay of parametrization, differentiability and localized potentials results that is required to fulfill the assumptions of section 2 can also be extended to this case. Hence, in all of the following we will only consider the case where the interior boundary is known and utilize exactly measurements.
3.2.1 Choosing the measurements (for specific bounds)
To demonstrate the key idea, we will first consider the specific (and rather restrictive) case where the bounds fulfill
[TABLE]
since this case can be treated by simply combining Theorem 2.2 with a known localized potentials result from harrach2019global . The case of general bounds is more technical and requires an extended result on simultaneously localized potentials. It will be treated in subsection 3.2.2.
Theorem 3.1
Given bounds that fulfill , we define the piecewise constant functions , , by setting
[TABLE]
- (a)
If, for all , fulfills
[TABLE]
then the finite-dimensional non-linear inverse problem of determining
[TABLE]
has a unique solution in , and depends Lipschitz continuously on . Moreover, the iterates of Newton’s method applied to the problem of determining from , with initial value , quadratically converge to the unique solution (see lemma 7 for more details on the properties of ).
- (b)
In any large enough finite-dimensional subspace of , one can find fulfilling (29) by the following construction:
Let be a sequence of vectors with dense linear span in . Let , , and let be the symmetric matrix with entries ()
[TABLE]
Then, for sufficiently large dimension , the matrix has a positive eigenvalue . For a corresponding normalized eigenvector , (29) is fulfilled by
[TABLE]
To prove Theorem 3.1, we formulate the consequences of our Newton convergence theory in section 2 in the following lemma.
Lemma 7
If then
[TABLE]
where , and . If, for all , fulfills (29), then the function
[TABLE]
has the following properties
- (a)
* is pointwise convex, anti-monotone, and continuously differentiable.*
- (b)
* is injective on , and its Jacobian is invertible for all .*
- (c)
With , we have that for all
[TABLE]
- (d)
For all , . Moreover, for all with there exists a unique with . The Newton iteration
[TABLE]
produces a sequence that converges quadratically to .
Proof
We define and as in lemma 6. Then,
[TABLE]
so that the interval inclusions follow from the anti-monotonicity of .
Lemma 6 yields assertion (a) and that is a continuously differentiable pointwise convex and monotonic function on . The assumption guarantees that contains . Moreover, using that
[TABLE]
it follows from lemma 6 and (29) that
[TABLE]
so that fulfills the assumptions of Theorem 2.2.
It follows that is injective on , and that for all , is invertible,
[TABLE]
This proves that fulfills (b) and (c) on .
The first assertion in (d) follows from the anti-monotonicity of . For the remaining assertions in (d) note that, by lemma 6, is also locally Lipschitz continuous, and . Hence, Theorem 2.2(b) yields that there exists a unique with , and thus a unique that solves .
Moreover, Theorem 2.2(b) yields that the Newton iteration applied to with initial value does not leave the interval and converges quadratically to the unique solution of . Since the Newton iteration is invariant under invertible linear transformations, this yields that the Newton iteration applied to with does not leave and converges quadratically to the unique solution of .
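The invariance of the Newton iteration used in the last step can be checked in one line. The following is a sketch in generic notation that is not taken from the text: $F$ stands for a differentiable function, $A$ and $B$ for invertible matrices, $c$ for a fixed vector, $y$ for the data, and $G(z) := B\,F(Az + c)$ for the transformed function.

```latex
% Generic symbols (F, G, A, B, c, y, x_k, z_k) are illustrative assumptions.
\begin{align*}
G'(z) &= B\,F'(Az + c)\,A, \\
z_{k+1} &= z_k - G'(z_k)^{-1}\bigl(G(z_k) - B y\bigr)
         = z_k - A^{-1} F'(A z_k + c)^{-1}\bigl(F(A z_k + c) - y\bigr).
\end{align*}
% Setting x_k := A z_k + c and applying A to both sides gives
\begin{align*}
x_{k+1} = x_k - F'(x_k)^{-1}\bigl(F(x_k) - y\bigr),
\end{align*}
% i.e. the Newton iterates for G(z) = By are the affine images of those for F(x) = y.
```

In particular, the iterates of one problem stay in an interval and converge quadratically exactly when the transformed iterates do so in the transformed interval.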
Proof (of Theorem 3.1)
- (a)
follows from lemma 7.
- (b)
Let . From the localized potentials result in (harrach2019global, Lemma 4.3), it follows that there exists with
[TABLE]
By density and continuity, for sufficiently large , there exists a function with
[TABLE]
Writing with , and , we thus have
[TABLE]
This shows that the symmetric matrix must have a positive eigenvalue when its dimension is sufficiently large. On the other hand, every normalized eigenvector corresponding to a positive eigenvalue fulfills
[TABLE]
with , so that (b) is proven.
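The construction in Theorem 3.1(b) reduces to a symmetric eigenvalue problem once the matrix has been assembled. The following is a minimal sketch of that linear algebra step only. It assumes that the matrix for dimension m is the leading m-by-m block of the matrix for a larger dimension (plausible when the entries depend only on pairs of basis functions, but an assumption here), and it uses a made-up matrix `A_demo` in place of entries that would come from PDE solutions. The function names are hypothetical.

```python
import numpy as np

def largest_eigenpair(A):
    """Largest eigenvalue of a symmetric matrix and a unit-norm eigenvector."""
    A = np.asarray(A, dtype=float)
    if not np.allclose(A, A.T):
        raise ValueError("matrix must be symmetric")
    eigvals, eigvecs = np.linalg.eigh(A)  # eigenvalues in ascending order
    v = eigvecs[:, -1]
    return eigvals[-1], v / np.linalg.norm(v)

def first_dimension_with_positive_eigenvalue(A_full):
    """Smallest m whose leading m x m block has a positive eigenvalue, else None."""
    for m in range(1, A_full.shape[0] + 1):
        lam, v = largest_eigenpair(A_full[:m, :m])
        if lam > 0:
            return m, lam, v
    return None

# Made-up symmetric matrix standing in for the PDE-based entries.
A_demo = np.array([[-2.0, 0.5, 0.0],
                   [0.5, -1.0, 1.5],
                   [0.0, 1.5, -1.0]])
m_found, lam_found, v_found = first_dimension_with_positive_eigenvalue(A_demo)
```

The returned vector `v_found` plays the role of the coefficient vector of the boundary function with respect to the chosen basis; whether that function fulfills the required inequality still has to be verified with the actual forward problem.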
3.2.2 Choosing the measurements (for general bounds)
We now show how to treat the case of general bounds .
Theorem 3.2
Given , choose , and set , and . Let denote the piecewise constant function
[TABLE]
- (a)
If , , fulfills
[TABLE]
for all , then the finite-dimensional non-linear inverse problem of determining
[TABLE]
has a unique solution in , and depends Lipschitz continuously on . Moreover, the iterates of Newton’s method applied to the problem of determining from , with initial value , quadratically converge to the unique solution (see lemma 8 for more details on the properties of ).
- (b)
In any large enough finite dimensional subspace of , one can find fulfilling (34) by the following construction:
Let be a sequence of vectors with dense linear span in . Let , , and be a normalized eigenvector corresponding to a largest eigenvalue of the symmetric matrix
[TABLE]
where , and for ,
[TABLE]
Then, for sufficiently large dimension , (34) is fulfilled by
[TABLE]
As in subsection 3.2.1, we prove Theorem 3.2(a) by applying our Newton convergence theory from section 2 in the following lemma.
Lemma 8
We have that
[TABLE]
If, for all , fulfills (34) for all , then the function
[TABLE]
has the following properties:
- (a)
* is pointwise convex, anti-monotone, and continuously differentiable.*
- (b)
* is injective on , and its Jacobian is invertible for all .*
- (c)
For all ,
[TABLE]
where .
- (d)
For all , . Moreover, for all with there exists a unique with . The Newton iteration
[TABLE]
produces a sequence that converges quadratically to .
Proof
We define and as in lemma 6. Then and
[TABLE]
Lemma 6 yields assertion (a) and that is a continuously differentiable pointwise convex and monotonic function on , with locally Lipschitz continuous , and . Also, contains since .
Moreover, with defined by (16) and (17), we have that
[TABLE]
It thus follows from lemma 6 and (34) that
[TABLE]
so that fulfills the assumptions of Theorem 2.3 which then yields the assertions (b)–(d).
To prove Theorem 3.2(b), we need to ensure that there exist Neumann data so that the corresponding solutions are much larger on than on , and this property has to hold for several Robin transmission coefficients , , simultaneously.
Note that for fixed the Robin transmission coefficients only differ on . Hence, the following lemma will allow us to estimate on by .
Lemma 9
Let with on , where is a measurable subset of with positive measure. Then, for all , the corresponding solutions of (22)–(25) with , and , respectively, fulfill
[TABLE]
Proof
We proceed analogously to (harrach2019fractional_I, Lemma 3.6). It follows from the variational formulation (26) that
[TABLE]
Hence, we obtain
[TABLE]
and the assertion follows.
The next lemma will allow us to construct for which is large on (and thus by lemma 9 all are large on ), and at the same time all are small on .
Lemma 10
Let be a measurable subset of with positive measure, , and .
Then, for all , there exists , so that the corresponding solutions of (22)–(25) fulfill
[TABLE]
Proof
The existence of simultaneously localized potentials for the fractional Schrödinger equation has recently been shown in (harrach2020monotonicity, Theorem 3.11), and we proceed similarly in this proof. Following the original localized potentials approach in gebauer2008localized , we start by reformulating the assertion as operator range (non-)inclusions, by introducing the operators
[TABLE]
where , solves
[TABLE]
and solves
[TABLE]
It is easily shown (see, e.g., the proof of (harrach2019global, Theorem 3.1)) that the adjoints of these operators are given by
[TABLE]
where solve (22)–(25) with Neumann boundary data and Robin transmission coefficients , respectively.
By a simple normalization argument, the assertion is now equivalent to showing that
[TABLE]
Using a functional analytic relation between operator ranges and the norms of their adjoints (cf. (gebauer2008localized, Lemma 2.5), (fruhauf2007detecting, Cor. 3.5)), the property (37) (and thus the assertion) is proven if we can show that
[TABLE]
We prove (38) by contradiction, and assume that
[TABLE]
Then, for every , there exist , so that
[TABLE]
Let be the associated solutions from the definition of (with ), and set . Then , and , so that by unique continuation in . But this also yields that , and from this we obtain that in , so that in all of .
Hence, using (35) and (36), we obtain for all ,
[TABLE]
and this shows that
[TABLE]
However, since this holds for all , this would imply that
[TABLE]
where
[TABLE]
are the compact trace operator and the continuous multiplication operator by . Hence, the closed infinite-dimensional space would be the range of a compact operator, which is not possible, cf., e.g., (rudin1991functional, Thm. 4.18). This contradiction shows that (38) must hold, and thus the assertion is proven.
Proof (of Theorem 3.2)
- (a)
follows from lemma 8.
- (b)
Let . As in the proof of Theorem 3.1(b), it follows from the simultaneously localized potentials result in lemma 10, and a density argument, that will have a positive eigenvalue if the dimension is sufficiently large. Hence, for sufficiently large , an eigenvector corresponding to a largest eigenvalue of will fulfill (with )
[TABLE]
Using that for all
[TABLE]
we have that
[TABLE]
and thus it follows from lemma 9
[TABLE]
Hence, (34) is fulfilled.
Remark 2
Regarding the formulation of Theorem 3.2(b), note that we actually prove that the matrix has a positive eigenvalue if the dimension is sufficiently large, and that an eigenvector corresponding to a positive eigenvalue leads to a boundary current that fulfills (34). But the estimates that we use in the proof of Theorem 3.2(b) are far from being sharp, so that it seems worth checking (34) already for eigenvectors to a largest eigenvalue that is not yet positive.
3.3 Numerical results
We test our results on the simple example setting shown in figure 1. is the two-dimensional unit circle, and is a square with corner coordinates , , and . The boundary is decomposed into parts denoting (in this order) the right, top, left, and bottom side of the square. We assume that the unknown true Robin transmission coefficient is a-priori known to be bounded by and and that it is known to be piecewise-constant with respect to the partition of , i.e.,
[TABLE]
Recall that, for the ease of notation, we identify a piecewise-constant function with the vector , and simply write for the constant function , and for the vector (and use analogously).
3.3.1 Choosing the measurements
We first apply Theorem 3.2 to construct Neumann boundary functions so that the four measurements
[TABLE]
and the Newton method applied to globally converges.
To implement Theorem 3.2, we choose , which yields , and as the standard trigonometric polynomial basis functions
[TABLE]
with denoting the angle of a point on the unit circle . For each , we then calculate the matrix starting with , and increase the dimension , until an eigenvector corresponding to a largest eigenvalue of this matrix has the property that
[TABLE]
fulfills (34) for all . To calculate the entries of , and for checking (34), the required solutions of the Robin transmission problem (22)–(25) with Neumann boundary functions , and Robin transmission coefficients , as defined in (32), were obtained using the commercial finite element software Comsol.
For our setting we had to increase the dimension up to at most , so that all are trigonometric polynomials of order less than or equal to . Figure 2 shows the boundary functions plotted with respect to the boundary angle on the unit circle .
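For illustration, a boundary function of this kind can be evaluated from an eigenvector as a linear combination of trigonometric basis functions. The sketch below is not the author's implementation. It assumes the ordering sin(φ), cos(φ), sin(2φ), cos(2φ), ... and the normalization factor 1/sqrt(π), both of which are assumptions about the basis, and the coefficient vector is made up. The function names `trig_basis` and `boundary_function` are hypothetical.

```python
import numpy as np

def trig_basis(m, phi):
    """First m functions of the assumed basis sin(phi), cos(phi), sin(2 phi), ...

    Each function is scaled by 1/sqrt(pi), so the basis is orthonormal in
    L^2 of the unit circle. Returns an array of shape (m, len(phi)).
    """
    phi = np.asarray(phi, dtype=float)
    rows = []
    for k in range(m):
        freq = k // 2 + 1                      # frequencies 1, 1, 2, 2, 3, 3, ...
        fn = np.sin if k % 2 == 0 else np.cos  # alternate sine and cosine
        rows.append(fn(freq * phi) / np.sqrt(np.pi))
    return np.array(rows)

def boundary_function(v, phi):
    """Evaluate g(phi) = sum_k v[k] * basis_k(phi) for a coefficient vector v."""
    v = np.asarray(v, dtype=float)
    return v @ trig_basis(len(v), phi)

# Uniform periodic grid; this quadrature is exact for low-order trigonometric polynomials.
n_quad = 256
phi_grid = 2.0 * np.pi * np.arange(n_quad) / n_quad
weight = 2.0 * np.pi / n_quad

basis_vals = trig_basis(6, phi_grid)
gram = (basis_vals @ basis_vals.T) * weight   # should be the identity matrix

g_vals = boundary_function([0.6, 0.0, 0.8], phi_grid)  # made-up unit coefficient vector
g_norm_sq = np.sum(g_vals**2) * weight
```

With an orthonormal basis, a normalized eigenvector yields a boundary function of unit norm, which is what the last two lines check numerically.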
From checking (34), we also obtain the Lipschitz stability constant for as described in lemma 8(c). For our setting we obtain the stability estimate
[TABLE]
where is the slightly enlarged interval from lemma 8.
Note that here and in the following we consider the measurement error relative to as this is the width of the measuring range .
The property (34) can be interpreted in the sense that the boundary current generates an electric potential for which the corresponding solution is much larger on than on the remaining boundary , and that this simultaneously holds for several (but finitely many) Robin transmission coefficients . To illustrate this localization property, figure 3 shows (in logarithmic scale) for and .
Let us make a comment on improving the computation time. Note that the properties of only depend on whether the used Neumann functions have the desired property (34), and that our rigorous approach of constructing is computationally more expensive than checking whether some given fulfills (34). In fact, for fixed , the construction of the matrix requires solving the PDE (22)–(25) for all combinations of different Robin transmission coefficients and different Neumann boundary values. On the other hand, checking whether a given Neumann boundary function has the desired property (34) only requires solving the PDE (22)–(25) for different Robin transmission coefficients and the single Neumann boundary function . Moreover, as long as does not fulfill (34), the checking might require very few PDE solutions if (34) already fails to hold for a small .
Hence, one might try computationally cheaper heuristic approaches to construct that satisfy (34). In our experiments, we successfully used the ad-hoc approximation
[TABLE]
(which only requires PDE solutions), and always found that increasing the dimension led to Neumann boundary functions fulfilling (34) for all . Moreover, in our experiments, we found the functions constructed with this faster heuristic approach virtually identical to those constructed with the exact matrix from Theorem 3.2.
3.3.2 Global convergence of Newton’s method
We numerically study the theoretically predicted global convergence of the standard Newton method when applied to the measurements constructed in the last subsection. We slightly change the definition of and define the measurements relative to the known lower bound
[TABLE]
and numerically evaluate using that, for all , and ,
[TABLE]
which immediately follows from the variational formulation of the Robin transmission problem (26). Note that this approach is numerically more stable than calculating and separately as it avoids loss of significance effects.
We choose the true coefficient value as , and first test the reconstruction for noiseless data . Starting with the lower bound , we implement the standard Newton method
[TABLE]
where the -th entry of the Jacobian matrix is given by , cf. lemma 6.
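The structure of this iteration can be sketched independently of the PDE solver. The example below is a toy stand-in, not the forward map of this paper: `F_toy` is a made-up pointwise convex and anti-monotone function on positive vectors, `J_toy` is its Jacobian, and the lower bound 1 and the true coefficient vector are arbitrary illustrative values. In an actual implementation, `F` and `J` would be evaluated by solving the Robin transmission problem.

```python
import numpy as np

def newton(F, J, y, x0, tol=1e-12, max_iter=50):
    """Standard Newton iteration for F(x) = y, started at x0.

    Returns the final iterate and the list of all iterates.
    """
    x = np.array(x0, dtype=float)
    history = [x.copy()]
    for _ in range(max_iter):
        step = np.linalg.solve(J(x), F(x) - y)  # solve J(x) * step = F(x) - y
        x = x - step
        history.append(x.copy())
        if np.linalg.norm(step, ord=np.inf) < tol:
            break
    return x, history

# Toy forward function (assumption): F(x) = M @ (1/x) with nonnegative M,
# which is componentwise convex and decreasing for x > 0.
M = np.eye(4) + 0.1 * np.ones((4, 4))

def F_toy(x):
    return M @ (1.0 / x)

def J_toy(x):
    # Entry (i, j) is -M[i, j] / x[j]**2; broadcasting divides column j by x[j]**2.
    return -M / x**2

x_true = np.array([1.2, 1.8, 1.5, 1.1])  # made-up coefficient inside [1, 2]
y_data = F_toy(x_true)
x_rec, iterates = newton(F_toy, J_toy, y_data, x0=np.ones(4))  # start at lower bound
errors = [np.max(np.abs(x - x_true)) for x in iterates]
```

For this toy problem the errors roughly square from one iteration to the next until machine precision is reached, which mirrors the quadratic convergence discussed in the text. Convergence from the lower bound is a property of the toy function chosen here and does not by itself verify the assumptions of Theorem 3.2.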
We repeat this for noisy data with relative noise level , that we obtain by adding a vector with random entries to , so that . Note that , so that this chooses the norm level relative to the measurement range. For the noiseless case we committed the so-called inverse crime of using the same forward solver (i.e., the same finite element mesh) for simulating the data and for evaluating and in the Newton iteration. For the noisy cases , we used a different mesh for the forward and inverse solvers.
Figure 4 shows the error of the first Newton iterations for the case , , , and , and demonstrates the theoretically predicted quadratic convergence properties. At this point, let us stress that also for noisy data , Lemma 8 yields that there exists a unique solution of
[TABLE]
and that the standard Newton method converges to this solution , as long as lies within the bounds (which is easily guaranteed by capping or flooring the values in ). Moreover, the obtained solution will satisfy the error estimate
[TABLE]
due to the stability estimate (39) obtained in the last subsection.
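The noise model and the capping step described above can be sketched as follows. This is a minimal illustration under stated assumptions: the Euclidean norm is used to fix the noise size, the random entries are drawn uniformly from [-1, 1], and the data vector and the range endpoints `y_low` and `y_high` are made-up values rather than quantities computed from the Robin transmission problem. The function names are hypothetical.

```python
import numpy as np

def add_relative_noise(y, delta, scale, rng):
    """Return y + e, where e is random and rescaled so that ||e||_2 = delta * scale."""
    y = np.asarray(y, dtype=float)
    e = rng.uniform(-1.0, 1.0, size=y.shape)
    e *= delta * scale / np.linalg.norm(e)
    return y + e

def cap_to_range(y_noisy, y_low, y_high):
    """Cap or floor each entry so that the data lie componentwise in [y_low, y_high]."""
    return np.clip(y_noisy, y_low, y_high)

rng = np.random.default_rng(0)
y_exact = np.array([0.30, 0.55, 0.10, 0.95])  # made-up noiseless data
y_low = np.zeros(4)                           # made-up lower end of the data range
y_high = np.ones(4)                           # made-up upper end of the data range
scale = np.linalg.norm(y_high - y_low)        # width of the measurement range
delta = 0.1                                   # relative noise level

y_noisy = add_relative_noise(y_exact, delta, scale, rng)
y_capped = cap_to_range(y_noisy, y_low, y_high)
```

Capping never moves an entry further away from exact data that already lie in the range, so it does not increase the componentwise data error.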
We finish this subsection with an example where the true Robin transmission coefficient is not piecewise constant but within the a-priori known bounds
[TABLE]
Then will still satisfy , so that there exists a unique solution of , i.e., there exists a unique piecewise constant Robin transmission coefficient that leads to the same measured data as the true non-piecewise-constant coefficient function. The Newton iteration applied to (or a noisy version ) will globally converge to this piecewise constant solution (or an approximation ), see figure 5 for a numerical example.
3.3.3 Effect of interval width and number of unknowns
Our result in Theorem 3.2 holds for any a-priori known bounds and any number of unknowns . Thus, in theory, we can treat arbitrarily large intervals and arbitrarily fine resolutions of . However, numerically, the constructed trigonometric polynomials will quickly become more and more oscillatory, and the calculated Lipschitz constants will quickly increase.
To demonstrate the effect of the interval width, we proceed as in subsection 3.3.1 to calculate four boundary currents that uniquely determine and yield global Newton convergence for , and . Table 1 shows the dimension of the trigonometric polynomial subspace of that contains and the obtained Lipschitz constant for the inverse problem of determining from the corresponding measurements.
To demonstrate the effect of the number of unknowns, we then replace the square by regular polygons with , , , and sides keeping the polygon center and circumradius the same as in the square () case. We assume that is piecewise constant with respect to the polygon sides. As in subsection 3.3.1 we then calculate boundary currents that uniquely determine and yield global Newton convergence. The required dimension of the trigonometric polynomial subspace of and the obtained Lipschitz constant are shown in table 2.
In both situations, the boundary currents quickly become highly oscillatory, and the calculated stability constant worsens. Hence, at the current state, our approach will only be feasible for moderate contrasts and relatively few unknowns as stated in the introduction. It should be noted, however, that our criterion (34) in Theorem 3.2 is sufficient but possibly not necessary for uniqueness, Lipschitz stability and global Newton convergence. The constructed boundary currents and the calculated Lipschitz constants may be far from optimal. Since our result is (to the knowledge of the author) the very first on uniqueness, global convergence and explicit Lipschitz stability constants for a discretized inverse coefficient problem, there may well be room for improvement and significantly sharper estimates that could in practice yield fewer oscillations and better stability constants.
4 Conclusions
We have derived a method to determine which (finitely many) measurements uniquely determine the unknown coefficient in an inverse coefficient problem with a given resolution, and proved global convergence of Newton’s method for the resulting discretized non-linear inverse problem. Our method also allows one to explicitly calculate the Lipschitz stability constant, and yields an error estimate for noisy data. To the knowledge of the author, these are the first such results for discretized inverse coefficient problems.
Our method is based on an extension of classical global Newton convergence theory from convex inverse-monotonic to convex (forward-)monotonic functions that arise in elliptic inverse coefficient problems. The extension required an extra assumption on the directional derivatives of the considered function that we were able to fulfill by choosing the right measurements.
Our proofs mainly utilized monotonicity ideas and localized potential techniques that are also known for several other elliptic inverse coefficient problems. So the ideas in this work might be applicable to other applications as well. A particularly interesting extension would be the case of EIT where it has recently been shown harrach2019uniqueness that an unknown conductivity distribution with a given resolution is uniquely determined by voltage-current-measurements on sufficiently many electrodes, but the number of required electrodes is not known. The main difficulty of such an extension is that localized potentials in EIT cannot concentrate on each domain part separately as in the simpler Robin transmission problem considered in this work. Roughly speaking, a localized potential in EIT with high energy in some region will also have a high energy on its way from the boundary electrodes to . This behavior will make the application of our herein presented ideas more challenging.
Acknowledgments
The author would like to thank Professor Michael Klibanov for his inspiring work on global convergence, and Professor Frank Natterer for pointing out possible relations between the monotonicity method and Collatz monotone functions.
- (1) Adler, A., Gaburro, R., Lionheart, W.: Electrical impedance tomography. Handbook of Mathematical Methods in Imaging pp. 701–762 (2015)
- (2) Alberti, G.S., Santacesaria, M.: Calderón’s inverse problem with a finite number of measurements. In: Forum of Mathematics, Sigma, vol. 7. Cambridge University Press (2019)
- (3) Alberti, G.S., Santacesaria, M.: Infinite-dimensional inverse problems with finite measurements. arXiv preprint arXiv:1906.10028 (2019)
- (4) Alessandrini, G., Beretta, E., Vessella, S.: Determining linear cracks by boundary measurements: Lipschitz stability. SIAM Journal on Mathematical Analysis 27 (2), 361–375 (1996)
- (5) Alessandrini, G., de Hoop, M.V., Gaburro, R., Sincich, E.: Lipschitz stability for a piecewise linear Schrödinger potential from local Cauchy data. Asymptotic Analysis 108 (3), 115–149 (2018)
- (6) Alessandrini, G., de Hoop, M.V., Gaburro, R., Sincich, E.: Lipschitz stability for the electrostatic inverse boundary value problem with piecewise linear conductivities. Journal de Mathématiques Pures et Appliquées 107 (5), 638–664 (2017)
- (7) Alessandrini, G., Vessella, S.: Lipschitz stability for the inverse conductivity problem. Advances in Applied Mathematics 35 (2), 207–241 (2005)
- (8) Angell, T., Kleinman, R., Hettlich, F.: The resistive and conductive problems for the exterior Helmholtz equation. SIAM Journal on Applied Mathematics 50 (6), 1607–1622 (1990)
