Analysis of the error in constitutive equation approach for time-harmonic elasticity imaging
Wilkins Aquino, Marc Bonnet (POEMS)

TL;DR
This paper develops a theoretical foundation for the MECE approach in time-harmonic elasticity imaging, explaining its robustness and properties, especially under incomplete boundary conditions, with implications for practical inverse problems.
Contribution
It proves the existence and uniqueness of solutions for the coupled system in MECE formulations, even with incomplete boundary data, and links MECE to classical least squares and ECE methods.
Findings
Unique and stable solutions at any frequency with sufficient data
Applicability to partial interior data in elastography
Convergence and differentiability of finite element discretizations
Abstract
We consider the identification of heterogeneous linear elastic moduli in the context of time-harmonic elastodynamics. This inverse problem is formulated as the minimization of the modified error in constitutive equation (MECE), an energy-based cost functional defined as an weighted additive combination of the error in constitutive equation (ECE) , expressed using an energy seminorm, and a quadratic error term incorporating the kinematical measurements. MECE-based identification are known from existing computational evidence to enjoy attractive properties such as improved convexity, robustness to resonant frequencies, and tolerance to incompletely specified boundary conditions (BCs). The main goal of this work is to develop theoretical foundations, in a continuous setting, allowing to explain and justify some of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUltrasonics and Acoustic Wave Propagation · Seismic Imaging and Inversion Techniques · Numerical methods in inverse problems
Analysis of the error in constitutive equation approach for time-harmonic elasticity imaging
Wilkins Aquino
Dept. of Mech. Eng. and Mater. Sci., Duke University, Durham, USA
and
Marc Bonnet
POEMS (CNRS, INRIA, ENSTA), ENSTA, 91120 Palaiseau, France. [email protected]
Abstract.
We consider the identification of heterogeneous linear elastic moduli in the context of time-harmonic elastodynamics. This inverse problem is formulated as the minimization of the modified error in constitutive equation (MECE), an energy-based cost functional defined as an weighted additive combination of the error in constitutive equation (ECE) , expressed using an energy seminorm, and a quadratic error term incorporating the kinematical measurements. MECE-based identification are known from existing computational evidence to enjoy attractive properties such as improved convexity, robustness to resonant frequencies, and tolerance to incompletely specified boundary conditions (BCs). The main goal of this work is to develop theoretical foundations, in a continuous setting, allowing to explain and justify some of the aforementioned beneficial properties, in particular addressing the general case where BCs may be underspecified. A specific feature of MECE formulations is that forward and adjoint solutions are governed by a fully coupled system, whose mathematical properties play a fundamental role in the qualitative and computational aspects of MECE minimization. We prove that this system has a unique and stable solution at any frequency, provided data is abundant enough (in a sense made precise therein) to at least compensate for any missing information on BCs. As a result, our formulation leads in such situations to a well-defined solution even though the relevant forward problem is not a priori clearly defined. This result has practical implications such as applicability of MECE to partial interior data (with important practical applications including ultrasound elastography), convergence of finite element discretizations and differentiability of the reduced MECE functional. In addition, we establish that usual least squares and pure ECE formulations are limiting cases of MECE formulations for small and large values of , respectively. For the latter case, which corresponds to exact enforcement of kinematic data, we furthermore show that the reduced MECE Hessian is asymptotically positive for any parameter perturbation supported on the measurement region, thereby corroborating existing computational evidence on convexity improvement brought by MECE functionals. Finally, numerical studies that support and illustrate our theoretical findings, including a parameter reconstruction example using interior data, are presented.
1. Introduction
Energy-based objective functionals are strong alternatives to conventional least square methods for various parameter identification problems. Such functionals, often called error in constitutive equation (ECE) functionals in the area of solid mechanics, were initially introduced in [22] for error estimation in the linear elastic FEM and in e.g. [20, 19] for electrical impedance tomography. ECE functionals have since been successfully used in various mechanical parameter identification problems under linear static [15], nonlinear quasistatic [24], time-harmonic [23, 21, 5, 4] and, more recently, transient conditions [1, 14, 26, 8]. Mathematical and numerical issues are also discussed in e.g. [10, 17].
The main idea behind ECE approaches is to relax the constitutive equations connecting fluxes (e.g. stresses) and gradients of state variables (e.g strains). To this end, stresses (assumed to satisfy dynamic equilibrium) and strains (assumed to satisfy kinematical constraints) are treated as independent quantities. ECE functionals then measure residuals in constitutive equations evaluated on given stresses and strains, thereby assuming a physical meaning directly connected to constitutive parameter identification. In their original form, ECE functionals assume the measured displacements or strains to be strictly enforced as part of the kinematical admissibility constraints, but this is often undesirable as real data is usually polluted with noise. For this reason, ECE-based identification is nowadays rather formulated by means of so-called modified error in constitutive equation (MECE) functionals [23], where reliable and unreliable informations are treated differently. Equilibrium equations, initial conditions and known boundary conditions (BCs) are deemed reliable and enforced strictly (e.g. using Lagrange multipliers). By contrast, measured data, constitutive properties and (when applicable) imperfectly known BCs are deemed unreliable and incorporated as constitutive or observation residuals using an additive combination of ECE and least-squares components and , with a positive weight parameter.
Optimization problems that arise from MECE-based identification have been observed to display very attractive properties such as improved convexity, robustness to resonant frequencies, and tolerance to partially or completely unknown BCs. For instance, the MECE objective for transient elastodynamics was observed in [14] to be convex over a wide region of the parameter space. Convexity improvement relative to least-squares functionals was also reported in [9] in the context of linear elastostatics. Moreover, for time-harmonic conditions, the FEM-discretized coupled stationarity problem was shown in [4] to remain uniquely solvable at resonant prescribed frequencies. In addition, as shown in [12, 8], ECE approaches can naturally accommodate configurations with partially or completely unknown BCs, a feature also used in [6] for parameter identification using elastostatic interior data (i.e. unknown BCs). This makes such formulations perfectly suited to situations where interior data is abundant over a subset of the medium being probed. Important practical applications include elasticity imaging [16] where interior displacements are tracked inside soft tissue using ultrasound, but information about region boundaries is difficult to ascertain.
These advantages of MECE formulations over conventional least squares counterparts are backed by abundant numerical and experimental evidence, but to our best knowledge lack theoretical support in infinite-dimensional settings . Some of the existing analyses address discretized MECE formulations [4, 8], while [17] studied existence of solutions and convexity of a continuous ECE formulation using complete internal data. Accordingly, the main goal of this work is to develop theoretical foundations allowing to explain and justify some of the aforementioned beneficial properties of MECE formulations. We focus on elasticity imaging under time-harmonic conditions and adopt a Hilbert space setting. Moreover, our analysis addresses the general case where BCs may be underdetermined.
A specific feature of MECE formulations is that first-order optimality conditions lead to fully coupled forward and adjoint problems, rather than unidirectionally coupled problems arising in conventional least squares approaches. The mathematical properties of this coupled system play a fundamental role in the qualitative (e.g. existence of solutions, continuity and differentiability of the objective with respect to parameters) and numerical aspects of MECE-based imaging. In particular, our formulation leads to a well-defined solution in the underspecified BC case, for which a relevant forward problem is not a priori clearly defined. Hence, establishing the well-posedness of the coupled system is our first main goal. Treating this system as a perturbed mixed problem, we prove that it has a unique and stable solution at any frequency, subject to conditions that ensure that the available data more than compensates insufficient information on BCs. This result has important practical implications such as applicability of MECE to partial interior data, convergence of finite element discretizations and differentiability of the reduced MECE objective functional. Secondly, establish that least squares and pure ECE formulations are limiting forms for and , respectively, of MECE formulations. Thirdly, for the latter case (corresponding to exact enforcement of kinematic data), we show that the reduced MECE hessian becomes positive for any parameter perturbation supported on the measurement region. The latter result also has strong practical implications as convexity translates into robustness with respect to the initial guess in gradient-based optimization.
This work is organized as follows. In Section 2, we define the elasticity imaging problem and set notations, introduce the relevant MECE functional and state the relevant first-order stationarity conditions. Section 3 then addresses the well-posedness of the coupled stationarity problem, allowing for underdetermined BCs. Then, the limiting forms of the stationarity solution as or are established in Section 4. The reduced MECE Hessian, and in particular its asymptotic convexity in the limit, is studied in Sec. 5. Then, numerical studies that support and illustrate our theoretical findings, including a parameter reconstruction example using interior data, are presented in Section 6. Finally, most of the proofs are deferred to Section 7.
2. Problem setting
Let a solid elastic body occupy a bounded and connected domain with boundary . The time-harmonic motion of this body is governed by (i) the balance equations
[TABLE]
where is the displacement field, represents the specified angular frequency, denotes the known mass density, is a given body force density, represents the stress tensor, and are the given surface force density (traction) and its support, respectively, and is the outward unit vector normal to ; (ii) the kinematic compatibility equations
[TABLE]
where denotes the linearized strain tensor associated with and is the portion of the boundary where the displacement is prescribed; and (iii) the (linear elastic) constitutive relation
[TABLE]
where is the fourth-order elasticity tensor field. For simplicity, the kinematical boundary condition specified in (2) is homogeneous; the case of a non-homogeneous boundary condition can be treated with minor modifications, as can loading arrangements other than those appearing in (1).
Here, the boundary subsets and are only required to not overlap (i.e., ) and may cover only partially (i.e., ). In other words, we may have , where . In such event, equations (1)-(3) admit multiple solutions, whereas a unique solution exists (except for a countable set of eigenfrequencies ) when . This non-standard boundary condition setting is adopted here to model experimental situations where full-field interior data (to be introduced thereafter) is available while boundary conditions are underspecified.
Measurements
In addition to the fundamental equations (1)-(3), we assume availability for the prescribed frequency of measured time-harmonic displacements in (interior data), which is for example usual in elastography applications [16]. The forthcoming analyses can be adapted to accommodate measured displacements on a portion of .
Inverse problem
We address the inverse problem of reconstructing the elasticity tensor field such that (i) the governing equations of motion (1)-(3) are satisfied, and (ii) is consistent with the measurement , under prescribed time-harmonic conditions. Equality between the data and its model counterpart , implicit in requirement (ii), should hold for error-free model and measurements but will be relaxed in the upcoming formulation to allow for expected uncertainties.
2.1. Weak formulation of motion
Let \big{(}\hskip 1.00006pt\bm{a},\bm{b}\hskip 1.00006pt\big{)} denote the scalar product of second-order tensor fields :
[TABLE]
where the overline means complex conjugation. Repeated indices imply summation wherever indicial notation is used. The scalar product of vector or scalar fields follows the same notational style with suitable adjustments; so does the scalar product of fields defined on a surface , denoted as \big{(}\hskip 1.00006pt\bm{a},\bm{b}\hskip 1.00006pt\big{)}_{\Gamma}. The weak formulation of the balance equations (1) then reads
[TABLE]
where the continuous linear functional embodies all given excitations; for example \big{\langle}\hskip 1.00006pt\bm{f},\widetilde{\bm{w}}{}\hskip 1.00006pt\big{\rangle}_{\mathcal{V}^{\prime},\mathcal{V}}:=\big{(}\hskip 1.00006pt\bm{b},\bm{w}\hskip 1.00006pt\big{)}+\big{(}\hskip 1.00006pt\bm{g},\bm{w}\hskip 1.00006pt\big{)}_{\Gamma_{\text{N}}} for the given force densities appearing in (1). In (5) and thereafter, denotes the duality pairing between a Hilbert space an its dual (i.e. \big{\langle}\hskip 1.00006pt\bm{f},\widetilde{\bm{w}}{}\hskip 1.00006pt\big{\rangle}_{X^{\prime},X} evaluates the linear functional at ). In addition, let the spaces, , and of kinematically admissible displacements, dynamically admissible stresses and admissible elasticity tensor fields, respectively, be defined as
[TABLE]
(where denotes the finite-dimensional vector space of fourth-order tensors with major and minor symmetries, i.e. , and is some positive constant). Finally, the mass density field must be bounded below by a positive constant.
2.2. MECE optimization problem
We follow the inversion approach initiated in [4, 27], whereby the foregoing inverse problem is formulated as an optimization problem in which the unknown elasticity tensor field is estimated by minimizing an objective function that additively combines two error terms: (i) an error in constitutive equation (ECE) functional [22] defined by
[TABLE]
that measures (in units of energy) the discrepancy in the constitutive equation (3), and (ii) a quadratic error term , where is a positive bilinear form, that quantifies the mismatch between the predicted (or model) displacements and the measured ones. This objective function, hereafter called the modified ECE (MECE) functional, is defined by
[TABLE]
where is a weight parameter. For a given triple (\bm{u},\bm{\sigma},\text{\boldmath\mathcal{C}})\in\mathcal{U}\times\mathcal{S}(\bm{u})\times\mathcal{Q} (with these sets as defined in (6a,6b,6c)), \Lambda_{\kappa}(\bm{u},\bm{\sigma},\text{\boldmath\mathcal{C}}) defines a quantitative measure of the consistency of these variables with (i) the constitutive equation, and (ii) the available measurements . Accordingly, the elasticity imaging inverse problem is formulated as the PDE-constrained optimization problem
[TABLE]
2.3. Stationarity conditions
We now collect the first-order stationarity conditions for the minimization problem (9). To this end, let the Lagrangian be defined as
[TABLE]
where (i) plays the role of the Lagrange multiplier, (ii) the constraint is the dynamic balance equation (5), and (iii) the term \big{(}\hskip 1.00006pt\bm{\sigma}\!\cdot\!\bm{n},\bm{w}\hskip 1.00006pt\big{)}_{\Gamma\setminus\Gamma_{\text{N}}} is crucial for the case in which (i.e., boundary conditions are not prescribed over the entire boundary). The first-order optimality conditions for the minimization problem (9) are then given, in terms of first-order Gâteaux derivatives of , by
[TABLE]
where the test functions are constrained, consistently with the assumption , whereas are for now unconstrained. We begin by exploiting conditions (11a) and (11b). Condition (11a) yields
[TABLE]
while condition (11b) simply restates the balance constraint (5). Using first the subspace of containing all with vanishing trace on , equation (12) provides
[TABLE]
The second term in (12) then enforces the essential condition
[TABLE]
the stress-type unknown in the fourth term of (5) being the associated Lagrange multiplier, see e.g. [3]. We elect in this work to treat condition (14) via elimination, and hence seek in the function space defined by
[TABLE]
For the same reason, we now restrict equation (5) to test functions , thereby also eliminating the Lagrange multiplier , and obtain, after using (13):
[TABLE]
We note that ; more precisely, (with strict inclusion) when (insufficient boundary data), while when (sufficient boundary data).
Then, moving to the third condition (11c), we find
[TABLE]
so that, on substituting (13) and rearranging, (11c) yields
[TABLE]
Finally, condition (11d) yields the following equation, which is nonlinear in :
[TABLE]
Concluding, the first-order stationarity conditions for the minimization problem (9) consist of the coupled weak equations (16), (18), (19) governing (\bm{u},\bm{w},\text{\boldmath\mathcal{C}}), with then given explicitly by (13).
2.4. Reduced optimization problem
In a full-space approach, the stationarity system (13,16,18,19) is solved (iteratively) as a whole, as done e.g. in [13]. Alternatively, a reduced-space approach can be formulated from observing that equations (16,18) define for given a linear (coupled) problem for . Then, letting solve equations (16,18) for a given \text{\boldmath\mathcal{C}}\in\mathcal{Q}, reduced versions of the ECE and data-misfit components of that depend only on can be defined by
[TABLE]
where we have used \bm{\sigma}=\text{\boldmath\mathcal{C}}\!:\![\bm{u}+\bm{w}], see (13). Then, problem (9) can be recast in reduced minimization form as
[TABLE]
We observe that the functional \Lambda_{\kappa}(\bm{u},\bm{\sigma},\text{\boldmath\mathcal{C}}) is, for any fixed \text{\boldmath\mathcal{C}}\in\mathcal{Q}, differentiable and convex in due to the requisite ellipticity of (see (6c)). Problem (16,18) is hence equivalent to solving the partial minimization of \Lambda_{\kappa}(\bm{u},\bm{\sigma},\text{\boldmath\mathcal{C}}) with given, and we have
[TABLE]
For that reason, we will henceforth refer to problem (16,18) as the stationarity system.
In this work, we adopt and study this reduced-space approach, which thanks to the characterization (22) can in fact be shown to be equivalent to the full-space approach in the following sense:
Lemma 2.1**.**
Problems (21) and (9) are equivalent: a solution to (21) also solves (9) and vice versa.
Proof 2.2**.**
First, let \text{\boldmath\mathcal{C}}^{\star} solve the reduced problem (21), and solve (16-18) for \text{\boldmath\mathcal{C}}=\text{\boldmath\mathcal{C}}^{\star}. Then:
[TABLE]
for any (\bm{u},\bm{w},\text{\boldmath\mathcal{C}})\in\mathcal{U}\times\mathcal{W}\times\mathcal{Q}, where the first inequality stems from (21) and the second from (22). Hence, (\bm{u}^{\star},\bm{w}^{\star},\text{\boldmath\mathcal{C}}^{\star}) solves (9).
Conversely, let (\bm{u}^{\sharp},\bm{w}^{\sharp},\text{\boldmath\mathcal{C}}^{\sharp}) solve problem (9). This triple then verifies (16-18) with \text{\boldmath\mathcal{C}}=\text{\boldmath\mathcal{C}}^{\sharp}, i.e. has the form (\bm{u}_{\mathcal{C}^{\sharp}},\bm{w}_{\mathcal{C}^{\sharp}},\text{\boldmath\mathcal{C}}^{\sharp}), so that \Lambda_{\kappa}(\bm{u}^{\sharp},\bm{w}^{\sharp},\text{\boldmath\mathcal{C}}^{\sharp})=\tilde{\Lambda}{}_{\kappa}(\text{\boldmath\mathcal{C}}^{\sharp}). Since in addition problem (21) consists in minimizing over a subset of , \text{\boldmath\mathcal{C}}^{\sharp} is a minimizer of . The proof is complete.
2.5. Coupled stationarity system: a key component of MECE-based imaging
The stationarity system (16,18) plays for several reasons a fundamental role in MECE-based imaging:
- (a)
The definition (21) of a reduced-space approach needs well-posedness of the stationarity system; 2. (b)
The present MECE framework aims at treating situations with underspecified BCs, for which a relevant forward problem is not a priori clearly defined, in contrast with usual PDE-constrained inversion methods. Instead, the field acts as a forward solution, provided problem (16,18) is well-posed. It is in particular important to determine conditions on the interior data ensuring that it compensates the insufficient BC information and make problem (16,18) well-posed. 3. (c)
In cases involving underspecified BCs, the relevant forward problem is not a priori clearly defined. Instead, the stationarity system (16,18) acts as a combination of the forward and adjoint problems. The latter are coupled in the present framework, while they are usually uncoupled in inverse problems solved using standard minimization [13, 25, 18] (see Sec. 5.5 for further elaboration). 4. (d)
Well-posedness of the stationarity system implies that the solution mapping \text{\boldmath\mathcal{C}}\mapsto(\bm{u},\bm{w}) is Fréchet differentiable (by virtue of the implicit function theorem, e.g. [11, Thm. 7.13-1], whose applicability can then readily be verified). Hence, \tilde{\Lambda}{}_{\kappa}(\text{\boldmath\mathcal{C}}) is also differentiable in that case, since \Lambda_{\kappa}(\bm{u},\bm{\sigma},\text{\boldmath\mathcal{C}}) is, upon expressing with (13), quadratic in and affine in . 5. (e)
In turn, continuity of \tilde{\Lambda}{}_{\kappa}(\text{\boldmath\mathcal{C}}) is one of two key ingredients needed to establish existence of solutions to Problem (21). The other key ingredient would be establishing the existence of minimizing sequences in that contain convergent subsequences; it is outside of the scope of this work. Interested readers are referred to [17] for an in-depth study of existence of solutions in problems involving functionals similar to those studied herein.
The above considerations show the importance of establishing the well-posednedess of the stationarity system (16,18); this is the goal of the next section.
3. Analysis of the stationarity problem
In preparation for the forthcoming analysis, we rewrite the stationarity problems (18-16) in the form
[TABLE]
where is the elastic stiffness bilinear form given for any by \mathcal{A}(\bm{v},\widetilde{\bm{v}}{},\text{\boldmath\mathcal{C}}):=\big{(}\hskip 1.00006pt\bm{\varepsilon}[\bm{v}],\text{\boldmath\mathcal{C}}\!:\!\bm{\varepsilon}[\widetilde{\bm{v}}{}]\hskip 1.00006pt\big{)}, is the dynamic stiffness bilinear form, the linear functional defined by \big{\langle}\hskip 1.00006pt\bm{d},\widetilde{\bm{u}}{}\hskip 1.00006pt\big{\rangle}_{\mathcal{U}^{\prime},\mathcal{U}}=\mathcal{D}(\bm{u}^{\text{\scriptsize m}},\widetilde{\bm{u}}{}) incorporates the kinematic data, the linear functional synthesizes all given applied loads, and the function spaces are respectively defined by (6a) and (15). Moreover, we endow and with the inner product and norm defined by
[TABLE]
The norm is equivalent (for fixed ) to the standard norm (if , the alternative definition is also suitable). The bilinear forms , and are assumed to be continuous for this norm, i.e. there exist constants , and such that
[TABLE]
for all and . In what follows, we will denote by the linear operators implicitly defined by the bilinear forms , e.g.:
[TABLE]
Moreover, will denote the transposed operator associated to , defined by
[TABLE]
and will denotes the null space of a linear operator .
3.1. Well-posedness of the coupled problem
We regard problem (24) as a perturbed mixed problem [7, Sec. 4.3]. It can in fact be given the form
[TABLE]
where is the Hilbert space equipped with the inner product defined by (the associated norm being given by ) and with the (continuous, symmetric) bilinear form and the (continuous) linear form defined by
[TABLE]
Remark 3.1**.**
The quadratic form with is not sign-definite: for , and for any satisfying (24a) with .
Let be the (bounded) linear operator associated to the bilinear form . The above definition of implies . The variational problem (24) can be shown to be well-posed by checking the applicability of the closed range theorem (see e.g. [11, Thm. 5.11-6]), which here has the form:
Lemma 3.2**.**
Let be a Hilbert space and a bounded linear operator such that . is invertible with bounded inverse if and only if there is a constant such that for any , in which case .
The null spaces and of and play an important role for studying the well-posedness of problem (24), so we now characterize them. To this end, let denote the subspace of such that any solves the homogeneous problem
[TABLE]
As is well-known, unless belongs to the countable set of eigenvalues for problem (27), in which case is finite-dimensional.
An element of is characterized by for all . On applying the first Green identity in , the strong form of this variational problem is found to be
[TABLE]
in strong form. Then, two cases arise:
Case (i). If (i.e. ), problems (28a) and (28b) are identical and coincide with problem (27); therefore .
Case (ii). (i.e. ), the situation is completely different as the boundary conditions are undetermined in (28a) and overdetermined in (28b). In the latter case, homogeneous Dirichlet and Neumann data is simultaneously imposed on , therefore problem (28b) has only the trivial solution by virtue of the unique continuation principle [2, Corollary], i.e. . By contrast, now includes forced responses for any excitation applied on and eigenfunctions when is an eigenvalue for any kind of homogeneous data on (see problem (28a)), and is thus infinite-dimensional. We do not attempt to characterize more precisely, since in this case we will only use the fact that .
The following property of the bilinear form , whose proof is given in Sec. 7.1, is crucial for establishing the well-posedness of the stationarity problem (24):
Lemma 3.3**.**
There exists such that the bilinear form introduced in problem (24) satisfies the inf-sup condition
[TABLE]
Moreover, let be the eigenvalues associated with the homogeneous problem (27). Then, there exists such that the inf-sup constant satisfies
[TABLE]
If either or , the above inequality holds with .
In addition, the following assumptions are made on the bilinear forms and :
Assumption 1**.**
The measurement bilinear form is coercive on : there exists such that for all .
Assumption 2**.**
The experimental conditions are such that . The stiffness bilinear form is therefore coercive on .
Remark 3.4**.**
Let be the null space of . Assumption 1 implies that . If it is not verified, there exists . Then, solves (24) with , i.e. uniqueness fails for the original stationarity problem. Assumption 1 is therefore necessary for the well-posedness of problem (24), and will be seen to be also sufficient. It means that any nontrivial elastodynamic state satisfying the (possibly incomplete) homogeneous boundary conditions on must register on the measurement apparatus.
For the case of incomplete BCs (for which is infinite-dimensional), assumption 1 is a stringent requirement, as it cannot be met with a norm on measurement residuals (since would then be compact for the norm). In this case, may for example be defined so as to be equivalent to the norm. By contrast, for the complete BC case (for which is at most finite-dimensional), coercivity on can be achieved with defined in terms of a norm.
Assumption 2 excludes the case , i.e. pure Neumann boundary conditions. The resulting coercivity of over the whole will make the forthcoming analyses simpler and is not detrimental in practice: under usual experimental conditions, the sample will not undergo known surface forces on the whole boundary while being completely “unsupported.”
We are now in a position to establish the well-posedness of the stationarity problem (24). To do so, still following [7], we use the decomposition and split any and according to
[TABLE]
to cater for being potentially non-trivial, noting that . We therefore have
[TABLE]
Using these definitions and notations, we obtain the following main result, whose proof is given in Section 7.2:
Theorem 3.5**.**
Let Assumptions 1 and 2 hold. Then, for every and and for any , problem (24) has a unique solution , that moreover satisfies
[TABLE]
where the constant depends only on , the stability constants and the continuity constants . More precisely, with reference to the form (36a,b) of problem (24), the following estimates hold (with all dependences on in the continuity constants made explicit):
[TABLE]
where the non-dimensional constants are defined by , , and and the constant is given by 2Q:=\kappa\alpha^{-1}\big{(}\hskip 1.00006ptq_{d}r_{a}+\sqrt{4\alpha(\kappa\delta)^{-1}+q^{2}_{c}r^{2}_{a}}\hskip 1.00006pt\big{)}.
Remark 3.6**.**
Consider finite-dimensional subspaces and (e.g. from finite element discretizations), whose dimension depends on a discretization parameter . If the conditions of Theorem 3.5 hold uniformly with (i.e. for all discretization levels), the discrete systems arising from (24) are well-posed. Moreover, if the approximability property holds (i.e. elements of can be approximated arbitrarily closely by elements of for some small enough ), then the sequence of solutions of the discrete systems converges (as ) to the solution of (24) in the given norm [7].
3.2. Supplementary assumption for identification feasibility
Assumptions 1 and 2 ensure the well-posedness of the stationarity problem (24). Additional requirements are however needed to ensure that the data also provides useful information towards the original elastic imaging inverse problem. To see this, consider the case where , for which the experimental data is just sufficient to satisfy Assumption 1. In that case, the stationarity problem (24) reads
[TABLE]
having written equation (24b) separately for and . Equation (c) determines solely from the measurements (so depends neither on nor on the assumed elastic properties ), while equations (a), (b) show that do not depend on the measurements. The case is therefore a limiting situation where the measurement carries no information on . We therefore introduce the following additional assumption on the measurement configuration:
Assumption 3**.**
* is a proper subspace of , i.e. has a nontrivial orthogonal complement in .*
4. Stationarity solution asymptotics
To gain insight into the effect of the adjustable weight parameter on the minimization problem (9), we now seek the limiting forms of the solution of the stationarity problem (24) in the limiting situations and . Consequences of the results of this section, in particular regarding sign properties of the Hessian of the MECE reduced functional, are discussed in Sections 5.4 and 5.5.
We start by recasting problem (24) using the splitting (30) and operator notation, to obtain
[TABLE]
The above block form (36a,b) of the stationarity problem (24) is such that its first row is equation (24a) whereas the remaining two rows are equation (24b) with and , in that order. The operators are the , , and restrictions of , respectively (these restrictions being defined by in terms of the extension operators , and the orthogonal projectors , ). The zero blocks in (36b) account for being the null space of the operator .
4.1. Small- expansion
We begin by deriving the leading expansion of the stationarity solution about , assuming a priori the expansion to have the form
[TABLE]
(i.e. to follow the format ), inserting the ansatz (37) into problem (36a), (36b) and writing the resulting equations. The equations are readily obtained as:
[TABLE]
Proposition 1**.**
The stationarity solution admits the small- expansion , in the sense of the norm. The components of solve the well-posed problems (38a) and (38b). Moreover, we have in the “usual” case where .
Proof 4.1**.**
Define the truncation error . On setting in (36a) and using equations (38a)-(38c), the governing system for is
[TABLE]
and therefore has the form (36a,b) with and . Theorem 3.5 hence applies to problem (39), yielding the bounds
[TABLE]
Consequently there exists and a constant such that we have for all . Finally, if , the second equation of (38a) implies . The proof is complete.
4.2. Large- expansion
We now seek an expansion of about , assumed to have the form . Using this ansatz into the block form (36a,b) of problem (24), the arising leading-order equation is
[TABLE]
The chosen ansatz requires that this equation be well-posed. Assumption 1 on , which postulates only the coercivity of , is insufficient in this respect and must be replaced by a stronger requirement:
Assumption 4**.**
* is coercive on (with ): there exists such that for all .*
This also makes the splitting (30) unsuitable for studying the large- limiting case. We instead split according to
[TABLE]
which implies . The operator form (36a) of the coupled problem now uses the definitions
[TABLE]
based on the splitting (41), instead of (36a). Its solution satisfies (as shown in Sec. 7.3) the estimates
[TABLE]
with and . Problem (36a), (42) therefore has a unique solution that depends continuously on the data (as already known from Theorem 3.5). Moreover, and importantly, all continuity constants in estimates (43) are bounded in the limit . Choosing , we obtain the existence of a constant , independent on , such that for all .
We now proceed by assuming the expansion of the stationarity solution about to have, in terms of the new splitting (41), the form
[TABLE]
Inserting this ansatz into (42) results in equations. The equation is
[TABLE]
it is satisfied by since the definition of in (24) and decomposition (41) imply that . Then, the equations yield the two uncoupled well-posed systems
[TABLE]
Proposition 2**.**
The stationarity solution admits the large- expansion , in the sense of the norm, with the components of solving the well-posed problems (46a,b).
Proof 4.2**.**
We examine the truncation error , defined as for Prop. 1. The governing system for still has the format (39), its right-hand side being now given, from using (45) and (46a,b), by
[TABLE]
The problem (36a), (47) for has the form (36a), (42), with and . Its solution therefore satisfies estimates (43), which for this particular right-hand side give
[TABLE]
Consequently, there exists such that for all .
5. Derivatives of the reduced MECE functional
In this section, we derive general formulas for the gradient and Hessian of \tilde{\Lambda}{}_{\kappa}(\text{\boldmath\mathcal{C}}) at any \text{\boldmath\mathcal{C}}\in\mathcal{Q}. Then, taking advantage of the results of Sec. 4, we show that the Hessian of \tilde{\Lambda}{}_{\kappa}(\text{\boldmath\mathcal{C}}) is asymptotically positive in the large- case but sign-indefinite in the small- case connected to least-squares mininimization.
5.1. First-order derivative of
Recall, from remark (d) of Section 2.5, that the reduced objective is continuously differentiable with respect to . The first-order derivative is a priori given by
[TABLE]
where denotes a total derivative w.r.t. and the prime symbol is used as a shorthand notation for a derivative w.r.t. in a given direction \hat{\text{\boldmath\mathcal{C}}} (e.g. \tilde{\Lambda}{}_{\kappa}^{\prime}(\text{\boldmath\mathcal{C}})=\tilde{\Lambda}{}_{\kappa}^{\prime}(\text{\boldmath\mathcal{C}})[\hat{\text{\boldmath\mathcal{C}}}]). The last two equalities follow from definition (21) of implying verification of the stationarity equations (11a-c), and in particular of the balance constraint, for any . An explicit expression of \tilde{\Lambda}{}_{\kappa}^{\prime}(\text{\boldmath\mathcal{C}}) is then obtained as
[TABLE]
In particular, any \text{\boldmath\mathcal{C}}^{\star} solving the reduced minimization problem (21) must satisfy the first-order optimality condition
[TABLE]
5.2. First-order derivative of the stationarity solution
The second-order derivative of will involve (from applying the total derivative operator to (50)) the first-order derivative of the stationarity solution (see remark (d) of Section 2.5). Upon differentiating w.r.t. the stationarity problem (24), the stationarity solution derivative solves the variational problem
[TABLE]
which is well posed since its governing operator is identical to that of the stationarity problem (24).
5.3. Second-order derivative of
For convenience, we focus in the sequel on the quadratic form defined by \hat{\text{\boldmath\mathcal{C}}}\mapsto\tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}})[\hat{\text{\boldmath\mathcal{C}}},\hat{\text{\boldmath\mathcal{C}}}], denoted \tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}}) for short, the corresponding bilinear mapping being then given by the usual polarization identity 4\tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}})[\hat{\text{\boldmath\mathcal{C}}}_{1},\hat{\text{\boldmath\mathcal{C}}}_{2}]=\tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}})[\hat{\text{\boldmath\mathcal{C}}}_{1}+\hat{\text{\boldmath\mathcal{C}}}_{2},\hat{\text{\boldmath\mathcal{C}}}_{1}+\hat{\text{\boldmath\mathcal{C}}}_{2}]-\tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}})[\hat{\text{\boldmath\mathcal{C}}}_{1}-\hat{\text{\boldmath\mathcal{C}}}_{2},\hat{\text{\boldmath\mathcal{C}}}_{1}-\hat{\text{\boldmath\mathcal{C}}}_{2}]. The second-order derivative \tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}}) can be derived by differentiating (50) (e.g. [11, Thm. 7.8-2]) and rearranging terms, to obtain
[TABLE]
This expression of \tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}}) can be recast in a convenient alternative form as follows. Setting in (51a) and in (51b), we find the identity
[TABLE]
so that (52) takes the symmetric form
[TABLE]
Moreover, \tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}}) can alternatively be given a sign-revealing form. To this aim, invoking again the splitting (41), the derivative problem (51) can be written, in operator form, as
[TABLE]
where e.g. is the operator associated with \mathcal{A}(\cdot,\cdot,\hat{\text{\boldmath\mathcal{C}}}). Using this in (54) produces the following result (whose proof is given in Sec. 7.4), which gives \tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}}) as an algebraic sum of positive quadratic forms:
Proposition 3**.**
Let solve the stationarity problem (36a), (42) and solve the derivative problem (55). The second-order derivative of the reduced MECE functional is given by
[TABLE]
with the (symmetric, positive, coercive) operator defined by in terms of the operators appearing in problem (55).
5.4. Reduced MECE functional: large- limiting case
Regarding the leading-order term of the stationarity solution expansion (44), we observe that equation (46a) for coincides with the stationarity problem for the minimization of the pure ECE functional (7) with the kinematic measurement enforced strictly, the latter being achieved by the solution of the remaining equation (45). Moreover, using expansion (44) in \tilde{\mathcal{E}}(\text{\boldmath\mathcal{C}}),\tilde{\mathcal{D}}(\text{\boldmath\mathcal{C}}) defined by (21), we find
[TABLE]
Consistently with the fact that the data is enforced exactly in the limit, we therefore observe that the value of the reduced objective \tilde{\Lambda}{}_{\kappa}(\text{\boldmath\mathcal{C}}) is for large dominated by its ECE component.
Moreover, exploiting the large- expansion of the stationarity solution in the reduced objective \tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}}) as given by Proposition 3 for situations where elastic moduli are kept fixed outside the measurement region, \tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}}) is found to be asymptotically convex in the large- limit:
Theorem 5.1**.**
Assume that \text{supp}(\hat{\text{\boldmath\mathcal{C}}})\subset\Omega^{\text{\scriptsize m}} (i.e. that the perturbation \hat{\text{\boldmath\mathcal{C}}} vanishes outside the measurement region), so that . Then, the second-order derivative \tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}}) admits the large- expansion
[TABLE]
with its leading term \tilde{\Lambda}{}_{\kappa}^{\prime\prime}{}^{(0)}(\text{\boldmath\mathcal{C}}) given by
[TABLE]
In the above expression, the operator is defined by (so , i.e. is a projector, and ), while is the (leading) zeroth-order term of the large- expansion of , see Prop. 2. The above leading-order coefficient \tilde{\Lambda}{}_{\kappa}^{\prime\prime}{}^{(0)}(\text{\boldmath\mathcal{C}}) is therefore positive. Moreover, the subsequent coefficient \tilde{\Lambda}{}_{\kappa}^{\prime\prime}{}^{(1)}(\text{\boldmath\mathcal{C}}) is sign-indefinite.
Proof 5.2**.**
See Section 7.5.
5.5. Reduced MECE functional: small- limiting case
We restrict this discussion to the “usual” case where , for which we have (see Prop. 1). Using this and expansion (37) of the stationarity solution in (21) and (50), the reduced MECE functional \tilde{\Lambda}{}_{\kappa}(\text{\boldmath\mathcal{C}}) and its derivative \tilde{\Lambda}{}_{\kappa}^{\prime}(\text{\boldmath\mathcal{C}}) have the expansions
[TABLE]
Consistently with the constitutive equation is enforced exactly in the limit, we see that the value (60a) of the reduced objective \tilde{\Lambda}{}_{\kappa}(\text{\boldmath\mathcal{C}}) is for small dominated by its data-misfit component .
Moreover, equations (38a,c) give and . If in addition (i.e. boundary conditions are well-posed and is not an eigenvalue for problems (27) and (28a)), we have that (i) , (ii) is the forward solution, (iii) is the adjoint solution for the objective function \tilde{\mathcal{D}}(\text{\boldmath\mathcal{C}}), (iv) the leading contribution to \kappa^{-1}\tilde{\Lambda}{}_{\kappa}(\text{\boldmath\mathcal{C}}) as is the reduced quadratic misfit functional \tilde{\mathcal{D}}(\text{\boldmath\mathcal{C}}):=\mathcal{D}(\bm{u}^{(0)}-\bm{u}^{\text{\scriptsize m}},\bm{u}^{(0)}-\bm{u}^{\text{\scriptsize m}}) commonly used for solving PDE-constrained inverse problems, and (v) the leading term in equation (60b) coincides with the known expression for \tilde{\mathcal{D}}^{\prime}(\text{\boldmath\mathcal{C}}). Consequently, the MECE-based inversion in reduced form becomes the minimization of the non-regularized least-squares misfit in the limit . Similar remarks apply when , the (now nontrivial) field being such that the leading-order field minimizes for given .
Then, the stationarity solution expansion (37) yields expansions and for the solution derivatives (since , which implies ). Consequently, (54) provides
[TABLE]
whose leading term is a priori sign-indefinite (since it can be recast as an algebraic sum of positive quadratic expressions), by contrast with the corresponding result of Theorem 5.1 for the large- case.
6. Numerical results
In this section, we present numerical studies that support our theoretical findings. We first show (Sec. 6.1) that the inf-sup constant remains strictly positive over a wide range of frequencies and displays convergent behavior upon mesh discretization. We then show (Sec. 6.2) that finite element discretizations are convergent as long as the two key necessary conditions are met. The next example (Sec. 6.3) illustrates how the reduced objective becomes convex as increases. Finally, we show in Sec. 6.4 a parameter reconstruction example using interior data, which demonstrates the capability of the method to accommodate underspecified boundary conditions.
6.1. Stability of coupled system: 1D example
In this example, we study the behavior of the inf-sup constant defined in (29) for the operator corresponding to longitudinal vibrations at frequency of a one-dimensional bar fixed at one end, and whose length, mass density, and Young’s modulus are taken as one. The boundary condition at the other end is unspecified. The relevant spaces are \mathcal{W}:=\big{\{}\hskip 1.00006ptw\in H^{1}(\Omega),\;w(0)=w(1)=0\hskip 1.00006pt\big{\}} and \mathcal{U}:=\big{\{}\hskip 1.00006ptu\in H^{1}(\Omega),\;u(0)=0\hskip 1.00006pt\big{\}}, and we have . The bar is discretized using linear finite elements.
The inf-sup constant was computed for frequencies in the Hz range, in increments of Hz, the finite element mesh was refined until there was negligible change in over the latter frequency range. The numerical evaluation of uses a discretized version of (65). The requisite approximation of the Riesz representative of is obtained by setting and , where is a matrix of finite element shape functions for a given discretization with characteristic mesh size . We then take to be the positive-definite matrix such as \bm{d}^{T}\mathbf{S}_{h}\bm{d}:=\big{(}\hskip 1.00006pt\widehat{\bm{u}}_{h},\widehat{\bm{u}}_{h}\hskip 1.00006pt\big{)}_{\omega} and the matrix associated with the bilinear form after discretization with finite elements. Then, the nodal values associated with can be computed as . The inf-sup constant is obtained from
[TABLE]
(the first inequality following from the fact that ), i.e. by finding the smallest eigenvalue of the generalized eigenvalue problem . If that eigenvalue is positive, so is .
Figure 1(a) shows that indeed remains positive for all frequencies investigated in this example. Thus, if the measured data satisfies Assumption 1, the coupled system corresponding to this 1D bar is well-posed in the considered frequency range, in spite of the boundary conditions being underspecified, as expected per Theorem 3.5. Figure 1(b) then shows how changes with element size for a frequency of Hz, and clearly demonstrates the expected convergence of as the mesh is refined.
6.2. Well-posedness and convergence of coupled system
In this example, we demonstrate the implications of Assumption 1 (i.e. ), which is required for problem (24) to be well-posed, by showing its incidence on the convergence behavior of finite element approximations. We consider again a 1D bar, this time with a Dirichlet condition at both ends, so that we now have \mathcal{W}=\mathcal{U}=\big{\{}\hskip 1.00006ptu\in H^{1}(\Omega),\;u(0)=u(1)=0\hskip 1.00006pt\big{\}}. We use an assumed solution and , which corresponds to an excitation in problem (24). The observation operator (as introduced in (36b)) is of the form , i.e. involves pointwise measurements at locations to be specified (pointwise values of elements of being well-defined in this 1D setting).
We study in this example whether finite element discretizations of problem (24) converge and if so at what rate, given that the exact assumed solution is in . We define (piecewise-linear, conforming) finite element approximations and , with being an interpolant of , and consider two versions and of , such that the measurement locations are taken as for while uses randomly generated measurement abscissae. Consequently, we have for but for . The case thus makes non-trivial, in conflict with Assumption 1, whereas if is used. We thus expect the coupled system (24) to be ill-posed in the former case, but well-posed in the latter case.
Figure 2 shows the relative error between the manufactured solution of the coupled system (24) and its finite element approximation , defined as
[TABLE]
as a function of the element size and for either measurement set. The expected convergence of to is indeed observed, with the expected rate, if . By contrast, using leads to rapid divergence of as the mesh is refined.
6.3. Convexity of the reduced MECE functional
We now show an example that demonstrates the convexification of the reduced objective (21) as increases, predicted by part 2 of Theorem 5.1. We consider again a 1D bar problem, this time of the form
[TABLE]
with the inhomogeneous Young modulus taken as and the excitation frequency set to Hz. The bar is discretized using linear finite elements. Displacements are assumed to be measured at all nodes. The reduced objective was computed for and several values of , by solving the coupled system (24) for each combination and then computing (21).
Figures 3(a)-3(d) illustrate how the reduced objective changes with , and in particular show clearly that this function is progressively smoother as increases and becomes convex for large (Figure 3(d)). Furthermore, for very small values of (Figure 3(a)) the objective resembles that of a least squares functional, as predicted by the asymptotic analysis of Section 5.5.
These results have strong practical implications. The least-squares objective function (small- limiting case) has many local minima, making the inversion results strongly dependent on the initial guess. On the other hand, the MECE objective for intermediate and large is much smoother and convex, which translates into robustness with respect to the initial guess.
It is important to mention that the value of used for a given problem has to be set according to the level of noise in the data. Hence, although our analysis indicates that choosing as large as possible is beneficial for convexity, doing so without regard to noise level would likely produce poor reconstructions. This results from the fact that, as increases, becomes closer to the measured data and may over-fit the noise. See [27] for some discussion on how to choose according to the Morozov discrepancy principle and a heuristic approach termed error balance.
6.4. 2D identification example
In this last example, we demonstrate that we can successfully reconstruct material properties with MECE when boundary conditions are completely unknown but full field measurements are available in part of the domain. We consider a square elastic domain with an unknown circular inclusion, under 2D plane-strain conditions; see Figure 4(a). The synthetic experiment consists in loading the body with a uniform time-harmonic pressure of kPa applied on the top side at a frequency of Hz, the bottom side being kept fixed, and with the bulk and shear moduli set to (background) and (inclusion). The mass density is uniform, with . Both components of the displacement are assumed to be measured over a dense grid of points in the subdomain (delineated with a dashed line in Figure 4(a)). Corresponding synthetic measurements were generated with a fine finite element mesh and interpolated onto a coarser, regular, reconstruction mesh made of square 4-noded elements. Each element of that mesh supports two material unknowns . The boundary conditions on all four sides of are taken as unknown.
The reconstruction results shown in Figure 4(b) demonstrate that material properties can be imaged accurately even without the knowledge of boundary conditions; in particular, the recovered values of and are close on average to their target values. Interestingly, the reconstructed inclusion displays clear and sharp edges despite the fact that no additional regularization is used for the fields .
7. Proofs
7.1. Proof of Lemma 3.3
Let solve the variational problem \big{(}\hskip 1.00006pt\widetilde{\bm{u}}{},\widehat{\bm{u}}\hskip 1.00006pt\big{)}_{\omega}=\mathcal{B}(\widetilde{\bm{u}}{},\bm{w}) for all (i.e. is the -dependent Riesz representant of the linear functional ). We then have
[TABLE]
Let and denote the countable sets of eigenfunctions and eigenvalues associated with problem (27), i.e. such that each verifies for all (we assume for definiteness the normalization \big{(}\hskip 1.00006pt\rho\bm{\psi}_{n},\bm{\psi}_{n}\hskip 1.00006pt\big{)}=1 and the ordering ); then is a Hilbert basis of . Let , so that \mathcal{Z}=\text{span}\big{(}\hskip 1.00006pt\bm{\psi}_{n,n\in I}\hskip 1.00006pt\big{)}, noting that is finite-dimensional. With this convention, , i.e. , if is not an eigenvalue for problem (27). Expanding the fields and on the Hilbert basis (noting for the latter that ) as
[TABLE]
the weak problem linking to then implies (using as test functions)
[TABLE]
We therefore obtain
[TABLE]
To establish the existence of an inf-sup constant such that (29) holds, we separately examine two cases: (i) and (ii) (recall that ).
Case (i): .
This case corresponds to , and we have . Let be the orthogonal projection on , so that
[TABLE]
Using (66), we have
[TABLE]
Moreover, since . Therefore
[TABLE]
Since does not depend on , the inf-sup condition (29) holds true with the above value of .
Case (ii): .
In this case, , so that the infimum in (29) is taken over . If , then and we again have ; consequently, the argument of case (i) still applies, equations (67) and (68) remain valid (with infimums taken over all integers), and the inf-sup condition (29) again holds with as given by (68).
On the other hand, a distinct approach is needed for the case because no longer holds for any . Instead, we will show below that
[TABLE]
implying
[TABLE]
The lemma results from (70), since that value of does not depend on , provided (69) holds.
Proof of (69).
We first note that : requires on , whereas implies and on , and the unique continuation principle implies that only can fulfill all requirements.
The proof then proceeds by contradiction. Assume that (69) is false. In that case, we have
[TABLE]
Choose a sequence such that , , and for each choose such that . being linear, may be assumed for each without detriment. Then, being a projection, : each belongs to the unit ball of . As , (i) is compact, so the sequence contains a convergent subsequence (still denoted ), whose limit is denoted , and (ii) is closed, so . On the other hand, we have
[TABLE]
implying that as . Since and the Dirichlet trace on is continuous, ; moreover we have that (as the limit of a sequence of elements of unit norm), and hence . Summarizing, we simultaneously have and , which leads to a contradiction because . Concluding, (69) is true.
7.2. Proof of Theorem 3.5
The proof methods follows that of [7, Thm. 4.3.1]. We first observe that setting and in (24) and adding the resulting equalities yields \mathcal{A}(\bm{w},\bm{w})+\kappa\mathcal{D}(\bm{u},\bm{u})=\kappa\big{\langle}\hskip 1.00006pt\bm{d},\bm{u}\hskip 1.00006pt\big{\rangle}_{\mathcal{U}^{\prime},\mathcal{U}}+\big{\langle}\hskip 1.00006pt\bm{f},\bm{w}\hskip 1.00006pt\big{\rangle}_{\mathcal{W}^{\prime},\mathcal{W}} which, by virtue of the assumed coercivity of on , implies the inequality
[TABLE]
Then, equation (24b) with gives
[TABLE]
which implies the inequality \delta\|\bm{u}_{0}\|^{2}\leq\|\bm{d}_{0}\|\big{(}\hskip 1.00006pt\|\bm{u}_{0}\|+d\,\|\bm{u}_{1}\|\hskip 1.00006pt\big{)}, where the left-hand side results from being coercive on by Assumption 1, and the right-hand side from assumed continuity of and . This results in
[TABLE]
Finally, using equation (24a) in operator form (i.e. , since ), we have
[TABLE]
which, using the inf-sup condition (29) (which implies that for any ), yields the inequality
[TABLE]
We now exploit inequalities (71), (72) and (74), considering separately each of the three possible cases where only one of is nonzero. Considering first the case , inequalities (71), (72) and (74) readily provide the estimates
[TABLE]
with as defined in the statement of the proposition.
Consider next the case . Inequalities (71), (72) and (74) then become
[TABLE]
(with as defined in the statement of the proposition), from which we obtain the estimates
[TABLE]
Finally, for the case , inequalities (71), (72) and (74) provide
[TABLE]
Concatenating the above three bounds gives the inequality
[TABLE]
Being of the form with , it holds for all with the positive root of , i.e.:
[TABLE]
The remaining sought estimates are then
[TABLE]
The stationarity problem (24) being linear, the estimates for for general right-hand sides and follow from the triangle inequality.
Finally, since , we have , implying , and . The obtained estimates of , and therefore collectively imply that there exists a constant such that for any . Well-posedness of problem (24) follows by lemma 3.2 (with ).
7.3. Proof of estimates (43)
Let solve the problem
[TABLE]
which is well-posed ( being coercive on by Assumption 4); moreover, obeys the estimate . Introducing the new unknown , the coupled problem (24) becomes
[TABLE]
with (since ). Setting and in the above system and adding the resulting equalities yields \mathcal{A}(\bm{w},\bm{w})+\kappa\mathcal{D}(\bm{y},\bm{y})=\big{\langle}\hskip 1.00006pt\bm{h},\bm{w}\hskip 1.00006pt\big{\rangle}_{\mathcal{W}^{\prime},\mathcal{W}}, which (by coercivity of on and applying the triangle inequality to the right-hand side) implies the inequality
[TABLE]
Then, equation (82b) with gives (in operator form) , and hence, using (83), implies
[TABLE]
Finally, using equation (82a) in operator form (i.e. ), we have
[TABLE]
which, using the inf-sup condition (29) (which provides for any since ), yields the inequality
[TABLE]
Estimates (43) finally stem from recalling that .
7.4. Proof of Proposition 3
First, recasting (54) using operator notation, we have
[TABLE]
We then observe that solving the derivative problem (55) gives
[TABLE]
with the operator as given in the proposition statement by and having set , , . Using (88) and noting that , we obtain the identities
[TABLE]
with the symmetric, positive operator given by . Upon substitution into (87), the above formulas provide
[TABLE]
Since (88) implies , the above expression of yields the claimed formula.
7.5. Proof of Theorem 5.1
To begin, the expression (88) of gives
[TABLE]
with the operator defined by . It is easy to see that verifies (i.e. is a projection) and .
For the case where , we have , and therefore . The expression (93) of \tilde{\Lambda}{}_{\kappa}^{\prime\prime}(\text{\boldmath\mathcal{C}}) then becomes (with the above definition of )
[TABLE]
Next, applying the Sherman-Morrison-Woodbury formula to gives
[TABLE]
wherein , , . Using these identities and performing straightforward algebra, we find
[TABLE]
with the (projection) operator as defined in the theorem statement. We also note that , so that
[TABLE]
We now substitute the above expansion into (95) and set (noting that ) and , to obtain
[TABLE]
with
[TABLE]
Concluding, the above value of is that claimed in the theorem statement, and is clearly positive, whereas is obtained as an algebraic sum of positive quadratic expressions (so is a priori sign-indefinite). The proof of Theorem 5.1 is complete.
8. Conclusions
In this work we studied some of the most salient mathematical properties of the Modified Error in Constitutive Equations (MECE) approach for inverse problems in the context of frequency-domain elastodynamics. In particular, we proved (under conditions on the available interior data) well-posedness of the coupled system that arises as part of the first order optimality conditions. The coupled problem remains well posed even when boundary conditions are partially or completely unknown and at resonant frequencies. The latter findings have strong practical implications in inverse problems where interior data is abundant and boundary conditions are difficult to ascertain. We have exploited this benefit in our recent work in elastography [16].
We also showed that the reduced MECE functional becomes convex in the limit where the weight given to the data misfit component goes to infinity. Moreover, convexification of the reduced objective occurs continuously as the parameter increases, as demonstrated in the numerical examples. This characteristic of the MECE functional also has strong practical implications as solutions to the inverse problem are less sensitive to initial guesses. This fact has also been exploited in our work on elasticity and viscoelasticity imaging [12, 16]. Future work includes exploring the possibility of devising a unified MECE formulation that can be applied to a wide range of physics such as nonlinear elasticity, electromagnetism, plasticity, fluid dynamics, etc.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Allix, O., Feissel, P., Nguyen, H. M. Identification strategy in the presence of corrupted measurements. Eng. Comp. , 22 , 487–504 (2005).
- 2[2] Ang, D. D., Ikehata, M., Trong, D. D., Yamamoto, M. Unique continuation for a stationary isotropic Lamé system with variable coefficients. Commun. Part. Diff. Eq. , 23 , 599–617 (1998).
- 3[3] Babuška, Ivo . The finite element method with Lagrangian multipliers. Numerische Mathematik , 20 , 179–192 (1973).
- 4[4] Banerjee, B., Walsh, T. F., Aquino, W., Bonnet, M. Large scale parameter estimation problems in frequency-domain elastodynamics using an error in constitutive equation functional. Comp. Meth. Appl. Mech. Eng. , 253 , 60–72 (2013).
- 5[5] Barthe, D., Deraemaeker, A., Ladevèze, P., Le Loch, S. Validation and updating of industrial models based on the constitutive relation error. AIAA Journal , 42 , 1427–1434 (2004).
- 6[6] Ben Azzouna, M., Feissel, P., Villon, P. Robust identification of elastic properties using the Modified Constitutive Relation Error. Comp. Meth. Appl. Mech. Eng. , 295 , 196–218 (2015).
- 7[7] Boffi, D., Brezzi, F., Fortin, M. Mixed finite element methods and applications . Springer-Verlag (2013).
- 8[8] Bonnet, M., Aquino, W. Three-dimensional transient elastodynamic inversion using an error in constitutive relation functional. Inverse Probl. , 31 , 035010 (2015).
