Galerkin approximation of linear problems in Banach and Hilbert spaces

Wolfgang Arendt; Isabelle Chalendar (LAMA); Robert Eymard (LAMA)

arXiv:1908.03326·math.NA·July 13, 2020

Galerkin approximation of linear problems in Banach and Hilbert spaces

Wolfgang Arendt, Isabelle Chalendar (LAMA), Robert Eymard (LAMA)

PDF

TL;DR

This paper analyzes the convergence of Galerkin methods for linear problems in Banach and Hilbert spaces, establishing necessary and sufficient conditions and characterizing forms that guarantee convergence.

Contribution

It provides a comprehensive characterization of Galerkin approximation convergence and identifies the forms that ensure universal convergence in Hilbert spaces.

Findings

01

Necessary and sufficient condition for Galerkin convergence

02

Characterization of forms with universal Galerkin property

03

Optimal a priori estimates for coercive forms

Abstract

In this paper we study the conforming Galerkin approximation of the problem: find u $\in$ U such that a(u, v) = <L, v> for all v $\in$ V, where U and V are Hilbert or Banach spaces, a is a continuous bilinear or sesquilinear form and L $\in$ V' a given data. The approximate solution is sought in a finite dimensional subspace of U, and test functions are taken in a finite dimensional subspace of V. We provide a necessary and sufficient condition on the form a for convergence of the Galerkin approximation, which is also equivalent to convergence of the Galerkin approximation for the adjoint problem. We also characterize the fact that U has a finite dimensional Schauder decomposition in terms of properties related to the Galerkin approximation. In the case of Hilbert spaces, we prove that the only bilinear or sesquilinear forms for which any Galerkin approximation converges (this property…

Equations382

Find u \in U such that a (u, v) = ⟨ L, v ⟩, for all v \in V,

Find u \in U such that a (u, v) = ⟨ L, v ⟩, for all v \in V,

Find u_{n} \in U_{n} such that a (u_{n}, χ) = ⟨ L, χ ⟩, for all χ \in V_{n} .

Find u_{n} \in U_{n} such that a (u_{n}, χ) = ⟨ L, χ ⟩, for all χ \in V_{n} .

dist (v, V_{n}) \to 0 \mbox a s n \to \infty

dist (v, V_{n}) \to 0 \mbox a s n \to \infty

∣ a (u, v) ∣ \leq M ∥ u ∥_{U} ∥ v ∥_{V} \mbox f or a l l u \in U, v \in V

∣ a (u, v) ∣ \leq M ∥ u ∥_{U} ∥ v ∥_{V} \mbox f or a l l u \in U, v \in V

0 \neq = dim U_{n} = dim V_{n} \mbox f or a l l n \in N^{*} .

0 \neq = dim U_{n} = dim V_{n} \mbox f or a l l n \in N^{*} .

find u \in U such that a (u, v) = ⟨ L, v ⟩, for all v \in V .

find u \in U such that a (u, v) = ⟨ L, v ⟩, for all v \in V .

find u_{n} \in U_{n} such that a (u_{n}, χ) = ⟨ L, χ ⟩, for all χ \in V_{n} .

find u_{n} \in U_{n} such that a (u_{n}, χ) = ⟨ L, χ ⟩, for all χ \in V_{n} .

\hbox{for all }u\in{\mathcal{U}}_{n},\big{(}a(u,\chi)=0\mbox{ for all }\chi\in{\mathcal{V}}_{n}\big{)}\Rightarrow u=0,

\hbox{for all }u\in{\mathcal{U}}_{n},\big{(}a(u,\chi)=0\mbox{ for all }\chi\in{\mathcal{V}}_{n}\big{)}\Rightarrow u=0,

⟨ A u, v ⟩ = a (u, v) (u \in U, v \in V) .

⟨ A u, v ⟩ = a (u, v) (u \in U, v \in V) .

∥ A u ∥_{V^{'}} \geq β ∥ u ∥_{U} \mbox f or a l l u \in U .

∥ A u ∥_{V^{'}} \geq β ∥ u ∥_{U} \mbox f or a l l u \in U .

∥ v ∥_{V} \leq 1 sup ∣ a (u, v) ∣ \geq β ∥ u ∥_{U} \mbox f or a l l u \in U .

∥ v ∥_{V} \leq 1 sup ∣ a (u, v) ∣ \geq β ∥ u ∥_{U} \mbox f or a l l u \in U .

\mbox{ for all }v\in{\mathcal{V}},\big{(}a(u,v)=0\mbox{ for all }u\in{\mathcal{U}}\big{)}\Rightarrow v=0.

\mbox{ for all }v\in{\mathcal{V}},\big{(}a(u,v)=0\mbox{ for all }u\in{\mathcal{U}}\big{)}\Rightarrow v=0.

(B N B) \exists β > 0; \forall n \in N^{*}, \forall u \in U_{n}, v \in V_{n}, ∥ v ∥_{V} = 1 sup ∣ a (u, v) ∣ \geq β ∥ u ∥_{U} .

(B N B) \exists β > 0; \forall n \in N^{*}, \forall u \in U_{n}, v \in V_{n}, ∥ v ∥_{V} = 1 sup ∣ a (u, v) ∣ \geq β ∥ u ∥_{U} .

\exists β > 0; \forall n \in N^{*}, u \in U_{n}, ∥ u ∣_{V} = 1 in f v \in V_{n}, ∥ v ∥_{V} = 1 sup ∣ a (u, v) ∣ \geq β .

\exists β > 0; \forall n \in N^{*}, u \in U_{n}, ∥ u ∣_{V} = 1 in f v \in V_{n}, ∥ v ∥_{V} = 1 sup ∣ a (u, v) ∣ \geq β .

∥ u - u_{n} ∥_{U} \leq γ dist (u, U_{n}),

∥ u - u_{n} ∥_{U} \leq γ dist (u, U_{n}),

a^{*} (v, u) = \overline{a (u, v)} (u \in U, v \in V) .

a^{*} (v, u) = \overline{a (u, v)} (u \in U, v \in V) .

(B N B^{*}) \exists β^{*} > 0; \forall n \in N^{*}, u \in U_{n}, ∥ u ∥_{U} = 1 sup ∣ a^{*} (u, v) ∣ \geq β^{*} ∥ v ∥_{V} (v \in V_{n}) .

(B N B^{*}) \exists β^{*} > 0; \forall n \in N^{*}, u \in U_{n}, ∥ u ∥_{U} = 1 sup ∣ a^{*} (u, v) ∣ \geq β^{*} ∥ v ∥_{V} (v \in V_{n}) .

v \in V_{n}, ∥ v ∥_{V} = 1 sup ∣ a (u, v) ∣ \geq β ∥ u ∥_{U} (u \in U_{n}) .

v \in V_{n}, ∥ v ∥_{V} = 1 sup ∣ a (u, v) ∣ \geq β ∥ u ∥_{U} (u \in U_{n}) .

γ = 1 + \frac{M}{β} .

γ = 1 + \frac{M}{β} .

∥ u_{n} ∥_{U} \leq \frac{1}{β} v \in V_{n}, ∥ v ∥_{V} \leq 1 sup ∣ ⟨ L, v ⟩ ∣ \leq \frac{1}{β} ∥ L ∥_{V^{'}} .

∥ u_{n} ∥_{U} \leq \frac{1}{β} v \in V_{n}, ∥ v ∥_{V} \leq 1 sup ∣ ⟨ L, v ⟩ ∣ \leq \frac{1}{β} ∥ L ∥_{V^{'}} .

a (u, v) = k \to \infty lim a (u_{n_{k}}, v_{k}) = k \to \infty lim ⟨ L, v_{k} ⟩ = ⟨ L, v ⟩ .

a (u, v) = k \to \infty lim a (u_{n_{k}}, v_{k}) = k \to \infty lim ⟨ L, v_{k} ⟩ = ⟨ L, v ⟩ .

a (u, χ) = ⟨ L, χ ⟩ = a (u_{n}, χ) \mbox f or a l l χ \in V_{n} .

a (u, χ) = ⟨ L, χ ⟩ = a (u_{n}, χ) \mbox f or a l l χ \in V_{n} .

∥ u - u_{n} ∥_{U}

∥ u - u_{n} ∥_{U}

a (Q_{n} w, χ) = a (w, χ) (χ \in V_{n}) .

a (Q_{n} w, χ) = a (w, χ) (χ \in V_{n}) .

β ∥ Q_{n} w ∥_{U}

β ∥ Q_{n} w ∥_{U}

u - u_{n} = (Id - Q_{n}) u = (Id - Q_{n}) (u - χ) .

u - u_{n} = (Id - Q_{n}) u = (Id - Q_{n}) (u - χ) .

∥ u - u_{n} ∥_{U} \leq ∥ Id - Q_{n} ∥∥ u - χ ∥_{U} = ∥ Q_{n} ∥∥ u - χ ∥_{U} \leq \frac{M}{β} ∥ u - χ ∥_{U} .

∥ u - u_{n} ∥_{U} \leq ∥ Id - Q_{n} ∥∥ u - χ ∥_{U} = ∥ Q_{n} ∥∥ u - χ ∥_{U} \leq \frac{M}{β} ∥ u - χ ∥_{U} .

∥ u - u_{n} ∥_{U} \leq \frac{M}{β} dist (u, U_{n}) .

∥ u - u_{n} ∥_{U} \leq \frac{M}{β} dist (u, U_{n}) .

n \in N^{*} sup ∥ u_{n} ∥_{U} < \infty

n \in N^{*} sup ∥ u_{n} ∥_{U} < \infty

∥ v ∥_{V_{n}} := u \in U_{n}, ∥ u ∥_{U} = 1 sup ∣ a (u, v) ∣

∥ v ∥_{V_{n}} := u \in U_{n}, ∥ u ∥_{U} = 1 sup ∣ a (u, v) ∣

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

\definecolor

labelkeyrgb0.6,0,1

\definecolorvioletrgb0.580,0.,0.827

Galerkin approximation of linear problems

in Banach and Hilbert spaces

W. Arendt

Wolfgang Arendt, Institute of Applied Analysis, University of Ulm. Helmholtzstr. 18, D-89069 Ulm (Germany)

[email protected]

,

I. Chalendar

Isabelle Chalendar, Université Paris-Est, LAMA, (UMR 8050), UPEM, UPEC, CNRS, F-77454, Marne-la-Vallée (France)

[email protected]

and

R. Eymard

Robert Eymard, Université Paris-Est, LAMA, (UMR 8050), UPEM, UPEC, CNRS, F-77454, Marne-la-Vallée (France)

[email protected]

Abstract.

In this paper we study the conforming Galerkin approximation of the problem: find $u\in{\mathcal{U}}$ such that $a(u,v)=\langle L,v\rangle$ for all $v\in{\mathcal{V}}$ , where ${\mathcal{U}}$ and ${\mathcal{V}}$ are Hilbert or Banach spaces, $a$ is a continuous bilinear or sesquilinear form and $L\in{\mathcal{V}}^{\prime}$ a given data. The approximate solution is sought in a finite dimensional subspace of ${\mathcal{U}}$ , and test functions are taken in a finite dimensional subspace of ${\mathcal{V}}$ . We provide a necessary and sufficient condition on the form $a$ for convergence of the Galerkin approximation, which is also equivalent to convergence of the Galerkin approximation for the adjoint problem. We also characterize the fact that ${\mathcal{U}}$ has a finite dimensional Schauder decomposition in terms of properties related to the Galerkin approximation. In the case of Hilbert spaces, we prove that the only bilinear or sesquilinear forms for which any Galerkin approximation converges (this property is called the universal Galerkin property) are the essentially coercive forms. In this case, a generalization of the Aubin-Nitsche Theorem leads to optimal a priori estimates in terms of regularity properties of the right-hand side $L$ , as shown by several applications. Finally, a section entitled ”Supplement” provides some consequences of our results for the approximation of saddle point problems.

Key words and phrases:

Galerkin approximation, sesquilinear coercive forms, approximation properties in Banach spaces, essential coercivity, universal Galerkin convergence

2010 Mathematics Subject Classification:

65N30,47A07,47A52,46B20

1. Introduction

Due to its practical importance, the approximation of elliptic problems in Banach or Hilbert spaces has been the object of numerous works. In Hilbert spaces, a crucial result is the simultaneous use of the Lax-Milgram theorem and of Céa’s Lemma to conclude the convergence of conforming Galerkin methods in the case that the elliptic problem is resulting from a coercive bilinear or sesquilinear form.

But the coercivity property is lost in many practical situations: for example, consider the Laplace operator perturbed by a convection term or a reaction term (see the example in Section 7.2), and the approximation of non-coercive forms must be studied as well. For particular bilinear or sesquilinear forms, the Fredholm alternative provides an existence result in the case where the problem is well-posed in the Hadamard sense. Such results have been extended by Banach, Nečas, Babuška and Brezzi in the case of bilinear forms on Banach spaces. The conforming approximation of such problems enters into the framework of the so-called Petrov–Galerkin methods, for which sufficient conditions for the convergence are classical (see for example the references [2, 8, 12, 31] which also include the case of non-conforming approximations).

Nevertheless, these sufficient conditions do not guarantee that for a given problem, there exists a converging Galerkin approximation. Moreover, they do not answer the following question, which is important in practice: under which conditions does the Galerkin approximation exist and converge to the solution of the continuous problem for any sufficiently fine approximation (for example, letting the degree of an approximating polynomial or the number of modes in a Fourier approximation be high enough, or letting the size of the mesh for a finite element method be small enough, and, in the case of Hilbert spaces, using the Galerkin method and not the Petrov–Galerkin method)?

The aim of this paper is precisely to address such questions for not necessarily coercive bilinear or sesquilinear forms defined on some Banach or Hilbert spaces (we treat the real and complex cases simultaneously). We shall restrict this study to conforming approximations, in the sense that the approximation will be sought in subspaces of the underlying space, using the continuous bilinear or sesquilinear form.

In the first part we consider the Banach space framework. Given a continuous bilinear form $a:{\mathcal{U}}\times{\mathcal{V}}\to{\mathbb{R}}$ where ${\mathcal{U}}$ and ${\mathcal{V}}$ are reflexive, separable Banach spaces, one is interested in the existence and the convergence of the Galerkin approximation to $u$ , where $u$ is the solution of the following problem:

[TABLE]

where $L\in{\mathcal{V}}^{\prime}$ is given (the existence and uniqueness of $u$ are obtained under the Banach-Nečas-Babuška conditions, see for example [12, Theorem 2.6]). For approximating sequences $({\mathcal{U}}_{n})_{n\in{\mathbb{N}}^{*}}$ , $({\mathcal{V}}_{n})_{n\in{\mathbb{N}}^{*}}$ (see Section 2 for the definition), the Galerkin approximation of (1.1) is given by the sequence $(u_{n})_{n\in{\mathbb{N}}^{*}}$ such that, for any $n\in{\mathbb{N}}^{*}$ , $u_{n}$ is the solution of the following finite dimensional linear problem:

[TABLE]

It is known that, if $\dim{\mathcal{U}}_{n}=\dim{\mathcal{V}}_{n}$ , the uniform Banach-Nečas-Babuška condition (BNB) given in Section 2 is sufficient for these existence and convergence properties (see for example [12, Theorem 2.24]). We show here that this condition is also necessary and, surprisingly, that the convergence of the Galerkin approximation of (1.1) is equivalent to that of the Galerkin approximation of the dual problem.

These two results seem to be new and are presented in Section 2.

In Section 3, we ask the following: given a form $a$ such that (1.1) is well-posed, do there always exist approximating sequences in ${\mathcal{U}}$ and ${\mathcal{V}}$ such that the Galerkin approximation converges? Surprisingly, the answer is negative (even though the spaces ${\mathcal{U}}$ and ${\mathcal{V}}$ are supposed to be reflexive and separable). In fact, such approximating sequences exist if and only if the Banach space ${\mathcal{U}}$ has a finite dimensional Schauder decomposition, a property which is strictly more general than having a Schauder basis.

In the remainder of the paper, merely Hilbert spaces are considered and moreover we assume that ${\mathcal{U}}={\mathcal{V}}$ and ${\mathcal{U}}_{n}={\mathcal{V}}_{n}$ for all $n\in{\mathbb{N}}^{*}$ . Given is a continuous bilinear form $a:{\mathcal{V}}\times{\mathcal{V}}\to{\mathbb{R}}$ , where ${\mathcal{V}}$ is a separable Hilbert space. Assuming that (1.1) is well-posed, we show that the convergence of the Galerkin approximation for all approximating sequences in ${\mathcal{V}}$ (which we call here the universal Galerkin property) is equivalent to $a$ being essentially coercive, which means that a compact perturbation of $a$ is coercive. This notion of essential coercivity can also be characterized by a certain weak-strong inverse continuity of $a$ , which, in fact, we take as definition of essential coercivity (Definition 4.2).

We then derive improved a priori error estimates by generalizing the Aubin–Nitsche argument to non-symmetric forms and also allowing the given right hand side $L$ of (1.1) to belong to arbitrary interpolation spaces in between ${\mathcal{V}}$ and ${\mathcal{V}}^{\prime}$ . These generalizations are applied to two cases: the approximation of selfadjoint positive operators with compact resolvent (in this case, it is seen that our a priori error estimate is optimal, with the fastest speed of convergence for $L$ in ${\mathcal{V}}$ , the slowest for $L\in{\mathcal{V}}^{\prime}$ ) and the finite element approximation of a non-selfadjoint elliptic differential operator, including convection and reaction terms which is indeed essentially coercive.

We finally give some further historical remarks in Section 8, where we consider saddle point problems. As a consequence of our results, we show that Brezzi’s conditions, implying the convergence of mixed approximations (which are the Galerkin ones in the case of saddle point problems), are also necessary for this convergence.

To avoid any ambiguity, in the sequel, we let ${\mathbb{N}}=\{0,1,2,\cdots\}$ and ${\mathbb{N}}^{*}={\mathbb{N}}\setminus\{0\}$ .

The paper is organized as follows:

1 Introduction
2 Petrov–Galerkin approximation
3 Existence of a converging Galerkin approximation
4 Essentially coercive forms
5 Characterization of the universal Galerkin property
6 The Aubin-Nitsche trick revisited
7 Applications
7.1 Selfadjoint positive operators with compact resolvent
7.2 Finite elements for the Poisson problem
8 Supplement: saddle point problems

2. Petrov–Galerkin approximation

In this section we give a characterization of the convergence of Petrov–Galerkin methods, that, for short, we call Galerkin convergence. A basic definition is the following.

Definition 2.1 (Approximating sequences of Banach spaces).

Let ${\mathcal{V}}$ be a separable Banach space. An approximating sequence of ${\mathcal{V}}$ is a sequence $({\mathcal{V}}_{n})_{n\in{\mathbb{N}}^{*}}$ of finite dimensional subspaces of ${\mathcal{V}}$ such that

[TABLE]

for all $v\in{\mathcal{V}}$ , where $\mathop{\rm dist}\nolimits(u,{\mathcal{V}}_{n}):=\inf\{\|u-\chi\|:\chi\in{\mathcal{V}}_{n}\}$ .

Now let ${\mathcal{U}}$ and ${\mathcal{V}}$ be two separable, reflexive Banach spaces over ${\mathbb{K}}={\mathbb{R}}$ or ${\mathbb{C}}$ and $a:{\mathcal{U}}\times{\mathcal{V}}\to{\mathbb{K}}$ be a continuous sesquilinear form such that

[TABLE]

where $M>0$ is a constant. We assume that ${\mathcal{U}}$ and ${\mathcal{V}}$ are infinite dimensional and that $({\mathcal{U}}_{n})_{n\in{\mathbb{N}}^{*}}$ and $({\mathcal{V}}_{n})_{n\in{\mathbb{N}}^{*}}$ are approximating sequences of ${\mathcal{U}}$ and ${\mathcal{V}}$ respectively. We also assume throughout that

[TABLE]

Given $L\in{\mathcal{V}}^{\prime}$ we search a solution $u$ of the problem:

[TABLE]

Moreover we want to approximate such a solution by $u_{n}$ , the solution of the problem:

[TABLE]

Note that, given $n\in{\mathbb{N}}^{*}$ , there exists a unique $u_{n}\in{\mathcal{U}}_{n}$ satisfying (2.2) if and only if

[TABLE]

since, by assumption, ${\mathcal{U}}_{n}$ and ${\mathcal{V}}_{n}$ have the same finite dimension.

Let us briefly recall the origin of the Banach-Nečas-Babuška conditions for the well-posedness of (2.1) as stated for example in [12, 31, 2] (equivalent conditions are proposed in [8] in the case of Hilbert spaces). Let us consider the associated operator ${\mathcal{A}}:{\mathcal{U}}\to{\mathcal{V}}^{\prime}$ defined by

[TABLE]

Then ${\mathcal{A}}$ is linear, bounded with $\|{\mathcal{A}}\|\leq M$ . By the Inverse Mapping Theorem, ${\mathcal{A}}$ has closed range and is injective if and only if there exists $\beta>0$ such that

[TABLE]

By the definition of the norm of ${\mathcal{V}}^{\prime}$ , this can be reformulated by

[TABLE]

Recall that ${\mathcal{A}}$ is invertible if and only if ${\mathcal{A}}$ is injective and has a closed and dense range. By the Hahn-Banach theorem, ${\mathcal{A}}$ has dense range if and only if no non-zero continuous functional on ${\mathcal{V}}^{\prime}$ annihilates the range of ${\mathcal{A}}$ . By reflexivity, this is equivalent to the following uniqueness property:

[TABLE]

Thus (2.1) is well-posed (i.e. for all $L\in{\mathcal{V}}^{\prime}$ there exists a unique $u\in{\mathcal{U}}$ satisfying (2.1)) if and only if (2.5) and (2.6) are satisfied. In fact, Hadamard’s definition of well-posedness also requires continuity of the inverse operator, which here automatically follows from bijectivity by the Inverse Mapping Theorem.

In order to obtain a result of convergence of the approximate solutions we consider the following uniform Banach-Nečas-Babuška condition (called Ladyzenskaia-Babuška-Brezzi condition in the framework of the mixed formulations, i.e. approximation of saddle point problems, see also Section 8), which is the estimate (2.5) for $a_{|{\mathcal{U}}_{n}\times{\mathcal{V}}_{n}}$ uniformly in $n\in{\mathbb{N}}^{*}$ , namely

[TABLE]

Remark 2.2.

Condition (BNB) is also called the inf–sup condition since by the Hahn-Banach Theorem it can be reformulated as

[TABLE]

*More precisely, this is the uniform or *discrete BNB-condition which is used for approximation whereas (2.5) is the continuous BNB-condition which expresses well-posedness of the problem and can also be expressed by an inf-sup-condition (see for example [15, Lemma 6.95 and Lemma 6.110]). The use of (LBB) relates this inequality to the work of Ladyzhenkaya [18] who, after a previous contribution due to Babuska [3], used it to prove well-posedness. Brezzi [4] introduced the analogue of the uniform BNB-condition for the treatment of saddle point problems (see Section 8 for more details).

Usually, in the numerical analysis community, one uses the name “inf-sup” condition (or LBB condition) only in the context of saddle point problems (see condition (8.1). $(iii)$ in Section 8). We keep the name “(BNB) condition”, following the monograph [12].

We recall that (BNB) implies that the approximate solutions converge to the solution if the problem is well-posed (see for example [12, 31, 2]). Here we will show that (BNB) is actually equivalent to Galerkin-convergence, and surprisingly also to Galerkin-convergence for the dual problem.

Definition 2.3 (Convergence of Galerkin approximation).

We say that the Galerkin-approximation converges if (2.1) as well as (2.2) are well-posed for all $n\in{\mathbb{N}}^{*}$ and $L\in{\mathcal{V}}^{\prime}$ and if, in addition, there exists a constant $\gamma>0$ independent of $n$ and $L$ such that,

[TABLE]

where $u$ is the solution of (2.1) and $u_{n}$ the solution of (2.2) for $n\in{\mathbb{N}}^{*}$ and $L\in{\mathcal{V}}^{\prime}$ . In particular, $\lim_{n\to\infty}u_{n}=u$ in ${\mathcal{U}}$ .

We may also consider the dual problem of (2.1) where $a$ is replaced by the adjoint form $a^{*}:{\mathcal{V}}\times{\mathcal{U}}\to{\mathbb{K}}$ given by

[TABLE]

If in Definition 2.3 the form $a$ is replaced by $a^{*}$ , then we say that the dual Galerkin approximation converges. Similarly we note the following dual uniform Banach-Nečas-Babuška condition

[TABLE]

Then the following theorem holds.

Theorem 2.4.

The following assertions are equivalent:

(i)

the Galerkin approximation converges; 2. (ii)

$(BNB)$ * holds;* 3. (iii)

$(BNB^{*})$ * holds;* 4. (iv)

the dual Galerkin approximation converges.

It is surprising that $(BNB)$ and $(BNB^{*})$ are equivalent even though the corresponding condition (2.5) is obviously not equivalent to its dual form. In fact, it can well happen that ${\mathcal{A}}$ is injective and has closed range (so that there exists $\beta>0$ satisfying (2.5)) but the range of ${\mathcal{A}}$ is a proper subspace of ${\mathcal{V}}^{\prime}$ so that there exists $v\in{\mathcal{V}}$ such that $v\neq 0$ and $a(u,v)=0$ for all $u\in{\mathcal{U}}$ ; in particular the dual form of (2.5) does not hold for any $\beta^{*}>0$ .

We will give the proof of Theorem 2.4 in several steps which give partly even stronger results. At first we show that $(ii)$ implies $(i)$ , where $\gamma$ can even be expressed in terms of $\beta$ and $M$ . Although the proof of this result is classical (see for example [31, 12]), we provide it for the convenience of the reader, but also to establish the well-posedness of (2.1) which we did not assume. This will be important for the proof of Theorem 2.4 and for the main result in Section 5.

Proposition 2.5.

Let $\beta>0$ . Assume that for all $n\in{\mathbb{N}}^{*}$ ,

[TABLE]

Then the Galerkin-approximation converges and (2.7) holds with

[TABLE]

Proof.

Let $L\in{\mathcal{V}}^{\prime}$ . Note that $(\ref{eq:bnb})$ implies (2.3). Thus, for each $n\in{\mathbb{N}}^{*}$ there exists a unique solution $u_{n}$ of (2.2). By $(\ref{eq:bnb})$ ,

[TABLE]

Since ${\mathcal{U}}$ is reflexive, we find $u\in{\mathcal{U}}$ such that a subsequence of $(u_{n})_{n}$ , say, $(u_{n_{k}})_{k}$ , converges weakly to $u$ . Let $v\in{\mathcal{V}}$ . By assumption we find $v_{k}\in{\mathcal{V}}_{n_{k}}$ such that $\lim_{k\to\infty}\|v-v_{n_{k}}\|_{\mathcal{V}}=0$ . It follows that

[TABLE]

Thus we find a solution $u$ of (2.1). But so far we do not know its uniqueness. This will be a consequence of $(\ref{eq:3.7})$ which we prove now. Indeed, observe that

[TABLE]

It follows that $a(u,\chi)=a(u_{n},\chi)\mbox{ for all }\chi\in{\mathcal{V}}_{n}$ (Galerkin orthogonality). Using this, for all $w\in{\mathcal{U}}_{n}$ ,

[TABLE]

Taking the infimum over all $w\in{\mathcal{U}}_{n}$ we obtain (2.7). In particular $\lim_{n\to\infty}\|u-u_{n}\|_{\mathcal{U}}=0$ which shows uniqueness. ∎

The following result is due to Xu and Zikatanov [31, Theorem 2] (see also [2, Satz 9.41]). We nevertheless provide its proof for the sake of completeness.

Proposition 2.6.

Assume that ${\mathcal{U}}$ is a Hilbert space and that $\beta>0$ is such that (2.8) holds. Then the Galerkin-approximation converges and (2.7) holds with $\gamma=\frac{M}{\beta}$ .

Proof.

Note that (2.8) implies (2.3). Consequently for each $w\in{\mathcal{U}}$ there exists a unique $Q_{n}w\in{\mathcal{U}}_{n}$ such that

[TABLE]

Then $Q_{n}$ is a projection from ${\mathcal{U}}$ onto ${\mathcal{U}}_{n}$ , which is calle the Ritz projection. Moreover,

[TABLE]

Thus $\|Q_{n}\|\leq\frac{M}{\beta}$ .

Since $U_{n}\neq 0$ and $U\neq 0$ , one has $Q_{n}\neq 0,\mathop{\rm Id}\nolimits$ . It follows from a result due to Kato [17, Lemma 4] that $\|Q_{n}\|=\|{\rm Id}-Q_{n}\|$ .

Now let $L\in{\mathcal{V}}^{\prime}$ and $u$ the solution of (2.3), $u_{n}$ the solution of (2.2). Then for any $\chi\in{\mathcal{U}}_{n}$ ,

[TABLE]

Hence

[TABLE]

This implies that

[TABLE]

∎

Remark 2.7.

Also in certain Banach spaces an improvement of the constant $1+\frac{M}{\beta}$ is possible, see Stern [29].

Next we show that even a weaker assumption than the convergence of the Galerkin-approximation implies $(BNB^{*})$ .

Proposition 2.8.

Assume (2.3) for all $n\in{\mathbb{N}}^{*}$ and that

[TABLE]

whenever $L\in{\mathcal{V}}^{\prime}$ and $u_{n}$ is the solution of (2.2). Then $(BNB^{*})$ holds.

Proof.

Since the spaces ${\mathcal{V}}_{n}$ and ${\mathcal{U}}_{n}$ have the same finite dimension, our assumption (2.3) implies also dual uniqueness, i.e. $a(\chi,v)=0$ for all $\chi\in{\mathcal{U}}_{n}$ implies $v=0$ whenever $v\in{\mathcal{V}}_{n}$ , and this for all $n\in{\mathbb{N}}^{*}$ . Thus

[TABLE]

defines a norm on ${\mathcal{V}}_{n}$ . Moreover,

[TABLE]

We show that the set

[TABLE]

is bounded. For that purpose, let $L\in{\mathcal{V}}^{\prime}$ . By assumption there exist $c>0$ and $u_{n}\in{\mathcal{U}}_{n}$ such that

[TABLE]

and $\|u_{n}\|_{\mathcal{U}}\leq c$ for all $n\in{\mathbb{N}}^{*}$ . Now, for $\frac{v}{\|v\|_{{\mathcal{V}}_{n}}}\in{\mathcal{B}}$ ,

[TABLE]

This shows that $\mathcal{B}$ is weakly bounded and thus, owing to the Banach–Steinhaus theorem, norm-bounded. Therefore there exists $\beta^{*}>0$ such that $\|v\|_{\mathcal{V}}\leq\frac{1}{\beta^{*}}\|v\|_{{\mathcal{V}}_{n}}$ , i.e.

[TABLE]

This is $(BNB^{*})$ . ∎

Proof of Theorem 2.4.

$(ii)\Rightarrow(i)$ and $(iii)\Rightarrow(iv)$ via Proposition 2.5, whereas $(i)\Rightarrow(iii)$ and $(iv)\Rightarrow(ii)$ follows from Proposition 2.8.

∎

Remark: The hypothesis on ${\mathcal{U}}$ and ${\mathcal{V}}$ to be reflexive is not needed in Proposition 2.5.

Finally we mention that the best lower bounds $\beta$ for $(BNB)$ and $\beta^{*}$ for $(BNB^{*})$ are the same if ${\mathcal{U}}$ and ${\mathcal{V}}$ are Hilbert spaces.

Proposition 2.9.

Assuming that ${\mathcal{U}}$ and ${\mathcal{V}}$ are Hilbert spaces, let $\beta>0$ . Then the two conditions $(\ref{eq:11})$ and $(\ref{eq:12})$ are equivalent:

[TABLE]

Proof.

Let $n\in{\mathbb{N}}^{*}$ and $A_{n}:{\mathcal{U}}_{n}\to{\mathcal{V}}_{n}$ be given by

[TABLE]

Then

[TABLE]

where $A_{n}^{*}$ is the adjoint of $A$ . Moreover, since $A_{n}$ is invertible,

[TABLE]

for all $u\in{\mathcal{U}}_{n}$ if and only if $\|A_{n}^{-1}\|\leq\frac{1}{\beta}$ . Since $(A_{n}^{*})^{-1}=(A_{n}^{-1})^{*}$ , it follows that $\|(A_{n}^{*})^{-1}\|=\|(A_{n}^{-1})^{*}\|=\|A_{n}^{-1}\|\leq\frac{1}{\beta}$ and hence

[TABLE]

∎

W. V. Petryshyn, namely in Theorem 2 and 3 of [22], considers approximation of an operator equation by finite dimensional problems and characterizes strong convergence. However, besides in very special situations, it sems not possible to deduce from this convergence of a Galerkin approximation, formulated in terms of sesquilinear forms. Further results for operator equations and their approximation can be found in the monograph [25, p. 26 ff].

3.

Existence of a converging Galerkin approximation

In this section, we again let ${\mathcal{U}}$ and ${\mathcal{V}}$ be separable reflexive real Banach spaces and let $a:{\mathcal{U}}\times{\mathcal{V}}\to{\mathbb{R}}$ be a continuous sesquilinear form such that the problem (2.1) is well-posed; i.e. for all $L\in{\mathcal{V}}^{\prime}$ there exists a unique $u\in{\mathcal{U}}$ satisfying (2.1). Since ${\mathcal{U}}$ and ${\mathcal{V}}$ are separable, there always exist approximating sequences $({\mathcal{U}}_{n})_{n\in{\mathbb{N}}^{*}}$ of ${\mathcal{U}}$ and $({\mathcal{V}}_{n})_{n\in{\mathbb{N}}^{*}}$ of ${\mathcal{V}}$ . Our question is whether there is a choice of these sequences which is adapted to the problem (2.1); i.e. such that the associated Galerkin approximation converges. We will show that the answer is related to the approximation property. In fact, different versions of this property play a role; we recall them in the next definition.

Definition 3.1 (Approximation property and Schauder decomposition).

Let ${\mathcal{X}}$ be a separable Banach space.

a)

The space ${\mathcal{X}}$ has the approximation property (AP) if, for every compact subset $K$ of ${\mathcal{X}}$ and every $\varepsilon>0$ , there exists a finite rank operator $R\in{\mathcal{L}}({\mathcal{X}})$ such that

[TABLE]

b)

The space ${\mathcal{X}}$ has the bounded approximation property (BAP) if there exists a sequence $(P_{n})_{n\in{\mathbb{N}}^{*}}$ of finite rank operators in ${\mathcal{X}}$ such that

[TABLE]

c)

*The space ${\mathcal{X}}$ has the bounded projection approximation property (BPAP) if each $P_{n}$ in b) can be chosen as a projection (i.e. such that $P_{n}^{2}=P_{n}$ ). *

d)

The space ${\mathcal{X}}$ possesses a finite dimensional decomposition if one finds $(P_{n})_{n\in{\mathbb{N}}^{*}}$ as in c) with the additional property

[TABLE]

e)

The space ${\mathcal{X}}$ has a Schauder basis if d) holds with

[TABLE]

It is known that (BAP) is equivalent to (AP) if ${\mathcal{X}}$ is reflexive. The first counterexample of a Banach space without (AP) has been given by Enflo [11]. He constructed a space which is even separable and reflexive.

Obviously the properties a)–e) have decreasing generality. It was Read [26] who showed that (BAP) does not imply (BPAP), even if reflexive and separable spaces are considered. Szarek [30] constructed a reflexive, separable Banach space having a finite dimensional Schauder decompositon but not a Schauder basis. Finally, it seems to be unknown whether (BPAP) implies the existence of a finite dimensional Schauder decomposition (see [24, Sec. 5.7.4.6] and [7, Problem 6.2]). However, if ${\mathcal{X}}$ is reflexive and separable, then these two properties are equivalent by [7, Theorem 6.4 (3)]).

Concerning the notion of finite dimensional Schauder decomposition, there is an equivalent formulation, namely the existence of finite dimensional subspaces ${\mathcal{X}}_{n}$ of ${\mathcal{X}}$ such that for each $x\in{\mathcal{X}}$ there exist unique $x_{n}\in{\mathcal{X}}_{n}$ such that $x=\sum_{n\in{\mathbb{N}}^{*}}x_{n}$ This explains the name. We refer to [20, Chapter I] , [7] for more information and to [24, Sec. 5.7.4] for the history of the approximation property. In the following theorem, by the hypothesis of well-posedness, the two Banach spaces ${\mathcal{U}}$ and ${\mathcal{V}}$ are isomorphic. For this reason they have the same Banach space properties.

Theorem 3.2.

Let ${\mathcal{U}}$ and ${\mathcal{V}}$ be separable reflexive Banach spaces and let $a:{\mathcal{U}}\times{\mathcal{V}}\to{\mathbb{K}}$ be a continuous sesquilinear form such that (2.1) is well-posed. Then the following assertions are equivalent.

(i)

There exist approximating sequences $({\mathcal{U}}_{n})_{n\in{\mathbb{N}}^{*}}$ of ${\mathcal{U}}$ and $({\mathcal{V}}_{n})_{n\in{\mathbb{N}}^{*}}$ of ${\mathcal{V}}$ such that the associated Galerkin approximation converges.

(ii)

The space ${\mathcal{U}}$ has the (BPAP).

(iii)

The space ${\mathcal{U}}$ has a finite dimensional Schauder decomposition.

Here convergence of the associated Galerkin approximation is understood in the sense of Definition 2.3.

Proof of Theorem 3.2.

$(i)\Rightarrow(ii)$ Let $u\in{\mathcal{V}}$ . Then $\langle L,v\rangle:=a(u,v)$ defines an element $L\in{\mathcal{V}}^{\prime}$ . By Definition 2.3, for each $n\in{\mathbb{N}}^{*}$ , there exists a unique $P_{n}u\in{\mathcal{V}}_{n}$ such that

[TABLE]

Moreover, $\|P_{n}u-u\|\leq\gamma\mathop{\rm dist}\nolimits({\mathcal{U}}_{n},u)$ for all $n\in{\mathbb{N}}^{*}$ and some $\gamma>0$ . In particular, $\lim_{n\to\infty}P_{n}u=u$ . It follows from the definition that $P_{n}^{2}=P_{n}$ . Since $P_{n}{\mathcal{U}}\subset{\mathcal{U}}_{n}$ , each $P_{n}$ has finite rank. We have shown that the space ${\mathcal{U}}$ has the (BPAP).

$(ii)\Rightarrow(iii)$ See [7, Theorem 6.4 (3)].

$(iii)\Rightarrow(i)$ Let ${\mathcal{A}}:{\mathcal{U}}\to{\mathcal{V}}^{\prime}$ be the operator defined by $\langle{\mathcal{A}}u,v\rangle=a(u,v)$ . Then ${\mathcal{A}}$ is invertible. By hypothesis there exist finite rank projections $(P_{n})_{n\in{\mathbb{N}}^{*}}$ such that $\lim_{n\to\infty}P_{n}u=u$ for all $u\in{\mathcal{U}}$ . Let $L\in{\mathcal{V}}^{\prime}$ , $u:={\mathcal{A}}^{-1}L$ be the solution of (2.1). Then

[TABLE]

We show that $u_{n}$ is obtained as a Galerkin approximation. In fact, fix $n\in{\mathbb{N}}^{*}$ . There exist $b_{1},\cdots,b_{m}\in{\mathcal{U}}$ , $\varphi_{1},\cdots,\varphi_{m}\in{\mathcal{U}}^{\prime}$ such that $\langle\varphi_{i},b_{j}\rangle=\delta_{i,j}$ and

[TABLE]

for all $x\in{\mathcal{U}}$ . Since ${\mathcal{V}}$ is reflexive there exist $v_{k}\in{\mathcal{V}}$ such that

[TABLE]

for all $g\in{\mathcal{V}}^{\prime}$ and $k=1,\cdots,m$ . Define ${\mathcal{V}}_{n}=\mathop{\rm Span}\nolimits\{v_{1},\cdots,v_{m}\}$ and ${\mathcal{U}}_{n}=\mathop{\rm Span}\nolimits\{b_{1},\cdots,b_{m}\}$ . Now consider the given $L\in{\mathcal{V}}^{\prime}$ . Let $w=\sum_{k=1}^{m}\lambda_{k}b_{k}\in{\mathcal{U}}_{n}$ . Then

[TABLE]

if and only if

[TABLE]

By (3.4),

[TABLE]

Therefore $w=\sum_{k=1}^{m}\langle L,v_{k}\rangle b_{k}$ is the unique solution of (3.5). Again, by (3.4),

[TABLE]

and it follows from (3.2) that $\lim_{n\to\infty}u_{n}=u$ . This also implies that $\mathop{\rm dist}\nolimits({\mathcal{U}}_{n},u)\to 0$ as $n\to\infty$ . Thus the sequence $({\mathcal{U}}_{n})_{n\in{\mathbb{N}}^{*}}$ is approximating.

It remains to show that the sequence $({\mathcal{V}}_{n})_{n\in{\mathbb{N}}^{*}}$ is approximating in ${\mathcal{V}}$ . For this we need the the additional property (3.1). Consider the adjoint $P_{n}^{\prime}\in{\mathcal{L}}({\mathcal{U}}^{\prime})$ of $P_{n}$ . Then $P_{n}^{\prime}\varphi$ weakly converges to $\varphi$ as $n\to\infty$ for all $\varphi\in{\mathcal{U}}^{\prime}$ . Thus

[TABLE]

is weakly dense in ${\mathcal{U}}^{\prime}$ . But, because of (3.1), ${\mathcal{W}}$ is a subspace of ${\mathcal{U}}^{\prime}$ . Thus, by Mazur’s Theorem, ${\mathcal{W}}$ is dense in ${\mathcal{U}}^{\prime}$ . If $\psi\in{\mathcal{W}}$ , then there exist $m\in{\mathbb{N}}^{*},\varphi\in{\mathcal{U}}^{\prime}$ such that $\psi=P_{m}^{\prime}\varphi$ . Thus

[TABLE]

for all $n\in{\mathbb{N}}^{*}$ by (3.1), and then $\lim_{n\to\infty}P_{n}^{\prime}\psi=\psi$ for all $\psi\in{\mathcal{W}}$ . Since $\sup_{n\in{\mathbb{N}}^{*}}\|P_{n}^{\prime}\|<\infty$ , it follows that $\lim_{n\to\infty}P_{n}^{\prime}\varphi=\varphi$ for all $\varphi\in{\mathcal{U}}^{\prime}$ . This implies that the sequence $(P_{n}^{\prime}{\mathcal{U}}^{\prime})_{n\in{\mathbb{N}}^{*}}$ is approximating in ${\mathcal{U}}^{\prime}$ . It follows from (3.4) that ${\mathcal{V}}_{n}\supset({\mathcal{A}}^{-1})^{\prime}P_{n}^{\prime}{\mathcal{U}}^{\prime}$ . In fact, fix $n$ and consider $P_{n}$ as in (3.3). Then (3.4) says that $v_{k}=({\mathcal{A}}^{-1})^{\prime}\varphi_{k}$ . Since $(P_{n}^{\prime}{\mathcal{U}}^{\prime})_{n\in{\mathbb{N}}^{*}}$ is an approximating sequence in ${\mathcal{U}}^{\prime}$ and $({\mathcal{A}}^{-1})^{\prime}$ is an isomorphism from ${\mathcal{U}}^{\prime}$ to ${\mathcal{V}}$ , it follows that $({\mathcal{V}}_{n})_{n\in{\mathbb{N}}^{*}}$ is an approximating sequence in ${\mathcal{V}}$ . ∎

4. Essentially coercive forms

Let ${\mathcal{V}}$ be a separable Hilbert space over ${\mathbb{K}}={\mathbb{C}}$ or ${\mathbb{R}}$ and $a:{\mathcal{V}}\times{\mathcal{V}}\to{\mathbb{K}}$ be a sesquilinear form satisfying

[TABLE]

for some $M>0$ . Then we may associate with $a$ the operator ${\mathcal{A}}\in{\mathcal{L}}({\mathcal{V}},{\mathcal{V}}^{\prime})$ defined by

[TABLE]

If $a$ is coercive, i.e. if

[TABLE]

for some $\alpha>0$ , then ${\mathcal{A}}$ is invertible. This consequence is the well-known Lax-Milgram lemma.

Remark 4.1.

The notion of coercivity is not uniform in the literature. Ours is the natural hypothesis of the Lax-Milgram Lemma and is conform with the Wikipedia entry ”Babuska-Lax-MilgramTheorem”. In non-linear analysis there is a wide agreement on this notion: In the real case, a possibly non-linear operator ${\mathcal{A}}\in{\mathcal{L}}({\mathcal{V}},{\mathcal{V}}^{\prime})$ is called coercive if there exists a function $\eta:{\mathbb{R}}\to{\mathbb{R}}$ such that $\eta(t)\to\infty$ as $t\to\infty$ and $\langle{\mathcal{A}}v,v\rangle\geq\eta(\|u\|_{\mathcal{V}})\|v\|_{\mathcal{V}}$ for all $v\in{\mathcal{V}}$ . If ${\mathcal{A}}$ is linear this is equivalent to the existence of $\alpha>0$ such that

[TABLE]

i.e. our condition without the absolute value. This is a ”forcing condition” which justifies the name coercive. Other authors prefer the word ${\mathcal{V}}-$ ellipticity, see e.g. [15], [21]. We use elliptic for shifted coercivity in [1], see also the remark at the end of this section.

Our aim is to find weaker assumptions than coercivity which help to decide whether the operator ${\mathcal{A}}$ is invertible.

Note that $a$ is coercive if and only if

[TABLE]

We weaken this property in the following way.

Definition 4.2 (Essential coercivity).

The continuous sesquilinear form $a$ (or the operator ${\mathcal{A}}$ ) is called essentially coercive if for each sequence $(u_{n})_{n\in{\mathbb{N}}^{*}}$ in ${\mathcal{V}}$ weakly converging to [math] and such that $\lim_{n\to\infty}a(u_{n},u_{n})=0$ , one has $\lim_{n\to\infty}\|u_{n}\|_{{\mathcal{V}}}=0$ .

The following is a characterization of this new property.

Theorem 4.3.

The following assertions are equivalent:

(i)

the form $a$ is essentially coercive; 2. (ii)

there exist an orthogonal projection $P\in{\mathcal{L}}({\mathcal{V}})$ of finite rank and $\alpha>0$ such that

[TABLE] 3. (iii)

there exist a Hilbert space ${\mathcal{H}}$ , a compact operator $J:{\mathcal{V}}\to{\mathcal{H}}$ and $\alpha>0$ such that

[TABLE] 4. (iv)

there exist a compact operator ${\mathcal{K}}\in{\mathcal{L}}({\mathcal{V}},{\mathcal{V}}^{\prime})$ and $\alpha>0$ such that

[TABLE]

Proof.

$(i)\Rightarrow(ii)$ : Let $(e_{n})_{n\in{\mathbb{N}}^{*}}$ be an orthonormal basis of ${\mathcal{V}}$ and consider the orthogonal projections $P_{n}$ given by

[TABLE]

Assume that (ii) is false for every $P_{n}$ . Then there exists a sequence $(u_{n})_{n\in{\mathbb{N}}^{*}}\subset{\mathcal{V}}$ such that $\|u_{n}\|_{\mathcal{V}}=1$ and

[TABLE]

Note that, since $\mathop{\rm Id}\nolimits-P_{n}$ is a self-adjoint operator,

[TABLE]

with $\lim_{n\to\infty}\|(\mathop{\rm Id}\nolimits-P_{n})v\|_{\mathcal{V}}=0$ for all $v\in{\mathcal{V}}$ . This implies that $(\mathop{\rm Id}\nolimits-P_{n})u_{n}$ converges weakly to [math]. Since $\lim_{n\to\infty}\|P_{n}u_{n}\|_{\mathcal{V}}=0$ , it follows that $u_{n}$ converges weakly to [math]. Moreover $\lim_{n\to\infty}|a(u_{n},u_{n})|\leq\lim_{n\to\infty}\frac{1}{n}=0$ . Therefore $a$ is not essentially coercive.

$(ii)\Rightarrow(iii)$ : Choose ${\mathcal{H}}={\mathcal{V}}$ and $J=P$ .

$(iii)\Rightarrow(iv)$ : There exists a unique operator $J^{*}:{\mathcal{H}}\to{\mathcal{V}}^{\prime}$ such that

[TABLE]

for all $v\in{\mathcal{V}}$ . Choose ${\mathcal{K}}=J^{*}J$ .

$(iv)\Rightarrow(i)$ : Let $(u_{n})_{n\in{\mathbb{N}}^{*}}\subset{\mathcal{V}}$ that tends weakly to [math] and such that $a(u_{n},u_{n})=\langle{\mathcal{A}}u_{n},u_{n}\rangle$ tends to [math] as $n\to\infty$ . Since ${\mathcal{K}}$ is compact, $\|{\mathcal{K}}u_{n}\|_{\mathcal{V}}\to 0$ as $n\to\infty$ . Hence $|\langle{\mathcal{K}}u_{n},u_{n}\rangle_{\mathcal{V}}|\to 0$ as $n\to\infty$ . By assumption there exists $\beta>0$ such that

[TABLE]

It follows that $\|u_{n}\|_{\mathcal{V}}\to 0$ as $n\to\infty$ . ∎

Next we want to justify the notion ”essentially coercive”. We recall that by the Toeplitz–Hausdorff theorem [14], the numerical range of $a$ ,

[TABLE]

is a convex set. Hence also $\overline{W(a)}$ is convex. For $\alpha>0$ ,

[TABLE]

if and only if

[TABLE]

where $D_{\alpha}=(-\alpha,\alpha)$ in the real case and $D_{\alpha}=\{w\in{\mathbb{C}}:|w|<\alpha\}$ if ${\mathbb{K}}={\mathbb{C}}$ . This observation leads to the following more precise description of coercivity.

Lemma 4.4.

The form $a$ is coercive if and only if there exist $\alpha>0$ and $\lambda\in{\mathbb{K}}$ with $|\lambda|=1$ such that

[TABLE]

Proof.

We give the proof for ${\mathbb{K}}={\mathbb{C}}$ . Assume that $a$ is coercive. There exists a maximal $\alpha>0$ such that $\overline{W(a)}\cap D_{a}=\emptyset$ . Then there exists $z_{0}\in\overline{W(a)}$ of modulus $\alpha$ ; i.e. $z_{0}=e^{i\theta}\alpha$ for some $\theta\in{\mathbb{R}}$ . The set $C:=e^{-i\theta}\overline{W(a)}$ is convex and closed. Moreover $\alpha\in C$ and $D_{\alpha}\cap C=\emptyset$ . This implies that $\mathop{Re}(z)\geq\alpha$ for all $z\in C$ . Indeed, let $z\in C$ such that $\mathop{Re}(z)<\alpha$ . Then the segment $[\alpha,z]$ has a non-empty intersection with $D_{\alpha}$ . Since $C$ is convex it follows that $z\not\in C$ .

Conversely, clearly, if there exists $\alpha>0$ such that $\mathop{Re}(\lambda z)\geq\alpha$ for all $z\in W(a)$ , then $a$ is coercive. ∎

Theorem 4.5.

Let ${\mathcal{A}}\in{\mathcal{L}}({\mathcal{V}},{\mathcal{V}}^{\prime})$ . The following assertions are equivalent:

(i)

the operator ${\mathcal{A}}$ is essentially coercive; 2. (ii)

there exists a finite rank operator ${\mathcal{K}}:{\mathcal{V}}\to{\mathcal{V}}^{\prime}$ such that ${\mathcal{A}}+{\mathcal{K}}$ is coercive; 3. (iii)

there exists a compact operator ${\mathcal{K}}:{\mathcal{V}}\to{\mathcal{V}}^{\prime}$ such that ${\mathcal{A}}+{\mathcal{K}}$ is coercive.

Proof.

$(i)\Rightarrow(ii)$ : Choose the orthogonal finite rank projection $P$ on ${\mathcal{V}}$ and $\alpha>0$ as in Theorem 4.3 (ii). Let ${\mathcal{V}}_{1}=\ker V$ and ${\mathcal{V}}_{2}=\mathop{range}P$ . Then $\dim{\mathcal{V}}_{2}<\infty$ and $|a(u,u)|\geq\alpha\|u\|^{2}_{\mathcal{V}}$ for all $u\in{\mathcal{V}}_{1}$ . Let $j:{\mathcal{V}}\to{\mathcal{V}}^{\prime}$ be the Riesz isomorphism given by

[TABLE]

Let $A=j^{-1}\circ{\mathcal{A}}\in{\mathcal{L}}({\mathcal{V}})$ . Then $a(u,v)=\langle Au,v\rangle_{{\mathcal{V}}}$ for all $u,v\in{\mathcal{V}}$ . Moreover $A$ has a matrix decomposition

[TABLE]

according to the decomposition ${\mathcal{V}}={\mathcal{V}}_{1}\oplus{\mathcal{V}}_{2}$ of ${\mathcal{V}}$ . Since $P$ is orthogonal, $A_{11}$ is coercive. Thus, by Lemma 4.4, there exists $z_{0}\in{\mathbb{C}}$ such that $|z_{0}|=1$ and

[TABLE]

for all $u\in{\mathcal{V}}_{1}$ . Since $\dim{\mathcal{V}}_{2}<\infty$ , there exists a finite rank operator $K_{1}\in{\mathcal{L}}({\mathcal{V}})$ such that

[TABLE]

Choose a further finite rank perturbation $K_{2}$ such that

[TABLE]

Since $P$ is orthogonal, for $Q=\mathop{\rm Id}\nolimits-P$ , we get

[TABLE]

Hence

[TABLE]

Now let ${\mathcal{K}}=j\circ(K_{1}+K_{2})$ . Then ${\mathcal{A}}+{\mathcal{K}}$ is coercive.

$(ii)\Rightarrow(iii)$ is obvious.

$(iii)\Rightarrow(i)$ : Condition $(iii)$ implies clearly Condition $(iv)$ of Theorem 4.3; thus the claim $(i)$ follows from that theorem. ∎

Corollary 4.6.

Let $a$ be a continuous essentially coercive sesquilinear form. The following assertions are equivalent:

(i)

for all $L\in{\mathcal{V}}^{\prime}$ there exists a unique $u\in{\mathcal{V}}$ such that

[TABLE] 2. (ii)

$a(u,v)=0$ * for all $v\in{\mathcal{V}}$ implies that $u=0$ (uniqueness);* 3. (iii)

for all $L\in{\mathcal{V}}^{\prime}$ there exists $u\in{\mathcal{V}}$ such that $a(u,v)=\langle L,v\rangle$ for all $v\in{\mathcal{V}}$ (existence).

Proof.

The assertion (i) means that ${\mathcal{A}}$ is invertible, the assertion (ii) means that ${\mathcal{A}}$ is injective and the assertion (iii) means that ${\mathcal{A}}$ is surjective. By Theorem 4.5, there exists a compact operator ${\mathcal{K}}\in{\mathcal{L}}({\mathcal{V}},{\mathcal{V}}^{\prime})$ such that ${\mathcal{A}}+{\mathcal{K}}=:{\mathcal{B}}$ is invertible.

$(ii)\Rightarrow(i)$ : Assume that ${\mathcal{A}}$ is injective. Write

[TABLE]

Then also $(\mathop{\rm Id}\nolimits-{\mathcal{B}}^{-1}{\mathcal{K}})$ is injective. Since ${\mathcal{B}}^{-1}{\mathcal{K}}$ is compact, it follows from the classical Fredholm alternative that $(\mathop{\rm Id}\nolimits-{\mathcal{B}}^{-1}{\mathcal{K}})$ is invertible. Consequently also ${\mathcal{A}}$ is invertible.

$(iii)\Rightarrow(i)$ : If ${\mathcal{A}}$ is surjective, write ${\mathcal{A}}=(\mathop{\rm Id}\nolimits-{\mathcal{K}}{\mathcal{B}}^{-1}){\mathcal{B}}$ to conclude that $(\mathop{\rm Id}\nolimits-{\mathcal{K}}{\mathcal{B}}^{-1})$ is surjective. Again we deduce that $(\mathop{\rm Id}\nolimits-{\mathcal{K}}{\mathcal{B}}^{-1})$ is invertible and so is ${\mathcal{A}}$ . ∎

Remark 4.7.

In the previous corollary we deduced from Theorem 4.5 the Fredholm alternative. This conclusion is well-known, if a compact perturbation is given, see for example [32, Theorem 22.D], or [15, Lemma 6.108]. Our point is that a priori it is not at all clear that the topological condition defining essential coercivity implies that the form is a compact perturbation of a coercive form. This is what Theorem 4.5 shows. Note that, in [23, p229], our notion of essential coercivity is attributed, under the name “condition (S)”, to Felix Browder [6] if we identify the operator with a form.

Moreover, we deduce from Theorem 4.5 the following properties of essential coercivity.

Corollary 4.8.

(a)

The set of all essentially coercive operators on ${\mathcal{V}}$ is open in ${\mathcal{L}}({\mathcal{V}},{\mathcal{V}}^{\prime})$ . 2. (b)

If ${\mathcal{A}}\in{\mathcal{L}}({\mathcal{V}},{\mathcal{V}}^{\prime})$ is essentially coercive and ${\mathcal{K}}\in{\mathcal{L}}({\mathcal{V}},{\mathcal{V}}^{\prime})$ is compact, then ${\mathcal{A}}+{\mathcal{K}}$ is essentially coercive. 3. (c)

If ${\mathcal{A}}\in{\mathcal{L}}({\mathcal{V}},{\mathcal{V}}^{\prime})$ is essentially coercive, then ${\mathcal{A}}$ is a Fredholm operator of index [math].

The following example shows that the invertibility of ${\mathcal{A}}$ does not imply the essential coercivity of $a$ .

Example 4.9.

Let ${\mathcal{V}}=\ell^{2}({\mathbb{N}}^{*})$ , ${\mathbb{K}}={\mathbb{R}}$ and

[TABLE]

Let $j$ be the Riesz isomorphism introduced in the proof of Theorem 4.5. Then $A:=j^{-1}\circ{\mathcal{A}}$ is a diagonal operator with merely $1$ and $-1$ in the diagonal. Thus $A$ and obviously ${\mathcal{A}}$ are clearly invertible. Let $f_{n}=(0,\cdots,1,1,0,\cdots)$ where the $1$ is a coordinate for $k=2n$ and $k=2n+1$ . Then $\|f_{n}\|=\sqrt{2}$ and $(f_{n})_{n}$ tends weakly to [math] as $n\to\infty$ . Moreover $a(f_{n},f_{n})=0$ for all $n$ , which shows that $a$ is not essentially coercive.

Remark 4.10.

Let ${\mathbb{K}}={\mathbb{C}}$ . In [1] a continuous sesquilinear form $a$ is called compactly elliptic if there exists a compact operator $J:{\mathcal{V}}\to{\mathcal{H}}$ , where ${\mathcal{H}}$ is some Hilbert space and there exists $\alpha>0$ such that

[TABLE]

In view of Theorem 4.3, each compactly elliptic form is essentially coercive. In fact the following holds: the form $a$ is essentially coercive if and only if there exists $\lambda\in{\mathbb{C}}\setminus\{0\}$ such that $\lambda a$ is compactly elliptic.

Proof.

If $\lambda a$ is compactly elliptic, then $\lambda a$ is essentially coercive and hence also $a$ is essentially coercive. Conversely, let $a$ be essentially coercive. By Theorem 4.5, there exists a compact operator ${\mathcal{K}}:{\mathcal{V}}\to{\mathcal{V}}^{\prime}$ such that the form $b$ defined by

[TABLE]

is coercive. By Lemma 4.4 there exist $\lambda\in{\mathbb{C}}$ of modulus one and $\alpha>0$ such that $\mathop{Re}(\lambda b(u,u))\geq\alpha\|u\|^{2}_{{\mathcal{V}}}$ for all $u\in{\mathcal{V}}$ . Now let $j:{\mathcal{V}}\to{\mathcal{V}}^{\prime}$ be the Riesz isomorphism. Then $J:=j^{-1}\circ{\mathcal{K}}:{\mathcal{V}}\to{\mathcal{V}}$ is compact. Choosing ${\mathcal{H}}={\mathcal{V}}$ we see that $\lambda b$ is compactly elliptic. It follows from [1, Proposition 4.4 (b)] that $\lambda a$ is compactly elliptic. ∎

5. Characterization of the universal Galerkin property

In this section we want to characterize those forms on a Hilbert space for which every Galerkin approximation converges, whatever be the choice of the approximating sequence.

Let ${\mathcal{V}}$ be a separable, infinite dimensional separable Hilbert space over ${\mathbb{K}}={\mathbb{R}}$ or ${\mathbb{C}}$ , and let $a:{\mathcal{V}}\times{\mathcal{V}}\to{\mathbb{K}}$ be a continuous sesquilinear form. Given $L\in{\mathcal{V}}^{\prime}$ we again consider solutions of the problem:

[TABLE]

We say that the form $a$ satisfies uniqueness if for $u\in{\mathcal{V}}$ ,

[TABLE]

We say that (5.1) is well-posed if for all $L\in{\mathcal{V}}^{\prime}$ there exists a unique solution $u\in{\mathcal{V}}$ .

Definition 5.1 (Universal Galerkin property).

The sesquilinear and continuous form $a$ has the universal Galerkin property if (5.1) is well-posed and the following holds. Let $({\mathcal{V}}_{n})_{n\in{\mathbb{N}}^{*}}$ be an arbitrary approximating sequence of ${\mathcal{V}}$ . Then there exist $n_{0}\in{\mathbb{N}}^{*}$ and $\gamma>0$ such that for each $L\in{\mathcal{V}}^{\prime}$ and each $n\geq n_{0}$ , there exists a unique $u_{n}\in{\mathcal{V}}_{n}$ solving

[TABLE]

and

[TABLE]

where $u$ is the solution of (5.1).

As recalled in the introduction and in the preceding section, the Lax-Milgram Theorem and Céa’s Lemma imply the universal Galerkin property if $a$ is coercive. We now show that the weaker notion of essential coercivity also provides a sufficient condition for ensuring the universal Galerkin property, and moreover that it is necessary.

Theorem 5.2.

The following assertions are equivalent.

(i)

The form $a$ is essentially coercive and satisfies uniqueness. 2. (ii)

The form $a$ has the universal Galerkin property.

Proof.

$(i)\Rightarrow(ii)$ : let $({\mathcal{V}}_{n})_{n\in{\mathbb{N}}^{*}}$ be an approximating sequence in ${\mathcal{V}}$ . By Theorem 2.4 it suffices to show that there exist $\beta>0$ and $n_{0}\in{\mathbb{N}}^{*}$ such that

[TABLE]

Assume that (5.2) is false. We then find a subsequence $(n_{k})_{k\in{\mathbb{N}}^{*}}$ and $u_{n_{k}}\in{\mathcal{V}}_{n_{k}}$ such that $\|u_{n_{k}}\|_{\mathcal{V}}=1$ and

[TABLE]

We may assume that $(u_{n_{k}})_{k}$ converges weakly to $u$ taking a further subsequence otherwise. Let $v\in{\mathcal{V}}$ . Then there exist $v_{k}\in{\mathcal{V}}_{n_{k}}$ such that $\lim_{k\to\infty}\|v-v_{k}\|_{{\mathcal{V}}}=0$ . Thus

[TABLE]

It follows from the uniqueness assumption that $u=0$ . Thus $(u_{n_{k}})_{k}$ converges weakly to [math], $\lim_{k\to\infty}a(u_{n_{k}},u_{n_{k}})=0$ , but $\|u_{n_{k}}\|_{\mathcal{V}}=1$ for all $k$ . Therefore the form $a$ is not essentially coercive.

$(ii)\Rightarrow(i)$ : the uniqueness condition is part of $(ii)$ . It remains to show that $a$ is essentially coercive. Let $(e_{n})_{n\in{\mathbb{N}}^{*}}$ be an orthonormal basis of ${\mathcal{V}}$ and ${\mathcal{V}}_{n}:=\mathop{\rm Span}\nolimits\{e_{1},\cdots,e_{n}\}$ . By our assumption, there exist $2\leq n_{0}\in{\mathbb{N}}^{*}$ and for all $n\geq n_{0}$ an operator $Q_{n}:{\mathcal{V}}\to{\mathcal{V}}_{n}$ such that

[TABLE]

Denote by $P_{n}:{\mathcal{V}}\to{\mathcal{V}}_{n}$ the orthogonal projection. Define the operator

[TABLE]

by

[TABLE]

Now assume that $a$ is not essentially coercive. Then it follows from Theorem 4.3 that for all $n\geq n_{0}$ we find $u_{n}\in{\mathcal{V}}$ such that $\|u_{n}\|_{\mathcal{V}}=1$ and

[TABLE]

In particular $\|P_{n}u_{n}\|_{\mathcal{V}}<\frac{1}{(n+2)^{2}}$ . This implies that $u_{n}\not\in{{\mathcal{V}}_{n}}$ . Let $\tilde{{\mathcal{V}}}_{n}=\mathop{\rm Span}\nolimits\{{\mathcal{V}}_{n}\cup\{u_{n}\}\}$ . Then $({\mathcal{V}}_{n})_{n\geq n_{0}}$ and $(\tilde{{\mathcal{V}}}_{n})_{n\geq n_{0}}$ are both approximating sequences. Let $n\geq n_{0}$ and let $v\in\tilde{{\mathcal{V}}}_{n}$ be arbitrary with unit norm. There exist a unique $w_{1}\in{\mathcal{V}}_{n}$ and $\lambda\in{\mathbb{K}}$ such that

[TABLE]

where $w:=w_{1}+\lambda P_{n}u_{n}\in{\mathcal{V}}_{n}$ . Thus

[TABLE]

Consequently $\|w\|^{2}_{\mathcal{V}}\leq 1$ and, since $\|P_{n}u_{n}\|_{{\mathcal{V}}}<\frac{1}{2}$ , it follows that

[TABLE]

which implies that $|\lambda|^{2}\leq 4$ , i.e. $|\lambda|\leq 2$ .

Observe that the definition of $Q_{n}$ implies that $a(u_{n}-Q_{n}u_{n},w)=0$ . Hence

[TABLE]

Consequently

[TABLE]

Thus $(BNB)$ is violated for the approximating sequence $(\tilde{{\mathcal{V}}}_{n})_{n\geq n_{0}}$ . But then $(ii)$ does not hold by Theorem 2.4, which shows that the assumption that $a$ is not essentially coercive is false.

∎

It is obvious that a form $a$ is essentially coercive if and only if its adjoint $a^{*}$ is essentially coercive. However, a surprising consequence of Theorem 5.2 is that, for an essentially coercive form, uniqueness for the form and uniqueness for its adjoint are equivalent, as the following corollary shows.

Corollary 5.3.

Let ${\mathcal{V}}$ be a separable Hilbert space on ${\mathbb{K}}$ and $a:{\mathcal{V}}\times{\mathcal{V}}\to{\mathbb{K}}$ be a continuous essentially coercive form. The following assertions are equivalent:

(i)

for all $u\in{\mathcal{V}}$ , $a(u,v)=0$ for all $v\in{\mathcal{V}}$ implies $u=0$ ; 2. (ii)

for all $v\in{\mathcal{V}}$ , $a(u,v)=0$ for all $u\in{\mathcal{V}}$ implies $v=0$ ; 3. (iii)

for all $L\in{\mathcal{V}}^{\prime}$ there exists $u$ in ${\mathcal{V}}$ such that $a(u,v)=\langle L,v\rangle$ , for all $v\in{\mathcal{V}}$ ; 4. (iv)

for all $L\in{\mathcal{V}}^{\prime}$ there exists $v$ in ${\mathcal{V}}$ such that $a(u,v)=\overline{\langle L,u\rangle}$ , for all $u\in{\mathcal{V}}$ .

Proof.

$(i)\Longleftrightarrow(ii)$ : this follows from Theorem 5.2 and Theorem 2.4. The other equivalences follow from Corollary 4.6. ∎

6. The Aubin-Nitsche trick revisited

In this section we want to prove that on suitable Hilbert spaces containing the space ${\mathcal{V}}$ continuously the approximation speed in the Galerkin approximation can be improved. We refer also to [28] for related, but different results in this direction.

Let ${\mathcal{V}}$ be a separable Hilbert space over ${\mathbb{K}}={\mathbb{R}}$ or ${\mathbb{C}}$ , and $a:{\mathcal{V}}\times{\mathcal{V}}\to{\mathbb{K}}$ a sesquilinear form satisfying

[TABLE]

Let $({\mathcal{V}}_{n})_{n\in{\mathbb{N}}^{*}}$ be an approximating sequence of ${\mathcal{V}}$ . We assume that (BNB) holds; i.e. there exists $\beta>0$ such that

[TABLE]

Given $L\in{\mathcal{V}}^{\prime}$ and $n\in{\mathbb{N}}^{*}$ , let $u_{n}\in{\mathcal{V}}_{n}$ be the solution of

[TABLE]

and $u\in{\mathcal{V}}$ the solution of

[TABLE]

Note that, by subtracting (6.3) and (6.2), we obtain the following Galerkin orthogonality:

[TABLE]

We know from Proposition 2.5 and Proposition 2.6 that

[TABLE]

for all $n\in{\mathbb{N}}^{*}$ . We want to improve this estimate if the given data $L\in{\mathcal{V}}^{\prime}$ is in a suitable subspace of ${\mathcal{V}}^{\prime}$ .

Let ${\mathcal{X}}\hookrightarrow{\mathcal{V}}^{\prime}$ ; i.e. ${\mathcal{X}}$ is a Banach space such that ${\mathcal{X}}\subset{\mathcal{V}}^{\prime}$ and

[TABLE]

for all $f\in{\mathcal{X}}$ and some $c_{\mathcal{X}}>0$ . We define for $n\in{\mathbb{N}}^{*}$

[TABLE]

where the distance is taken in ${\mathcal{V}}$ . Thus

[TABLE]

where ${\mathcal{A}}:{\mathcal{V}}\to{\mathcal{V}}^{\prime}$ is the isomorphism given by

[TABLE]

Thus, if $u$ is the solution of (6.3) and $u_{n}$ the approximate solution of (6.2), then, if $L\in{\mathcal{X}}$ , we have the estimate

[TABLE]

which has the advantage of being uniform for $L$ in the unit ball of ${\mathcal{X}}$ .

Remark 6.1.

Let $Q_{n}:{\mathcal{X}}\to{\mathcal{V}},L\mapsto u_{n}$ be the solution operator for (6.2). Then (6.8) says that

[TABLE]

We can characterize when $\gamma_{n}({\mathcal{X}})\to 0$ as $n\to\infty$ .

Proposition 6.2.

One has

[TABLE]

Proof.

Denote by $P_{n}:{\mathcal{V}}\to{\mathcal{V}}_{n}$ the orthogonal projection onto ${\mathcal{V}}_{n}$ . Then

[TABLE]

where $j:{\mathcal{X}}\to{\mathcal{V}}^{\prime}$ is the canonical injection. If $j$ is compact, then $K:={{\mathcal{A}}}^{-1}\circ j(B_{\mathcal{X}})$ , where $B_{\mathcal{X}}$ is the unit ball of ${\mathcal{X}}$ , is relatively compact in ${\mathcal{V}}$ . Now, $P_{n}$ converges strongly to the identity of ${\mathcal{V}}$ . Since $\|P_{n}\|\leq 1$ , this convergence is uniform on compact subsets of ${\mathcal{X}}$ . This shows that $\gamma_{n}({\mathcal{X}})\to 0$ as $n\to\infty$ .

Conversely, if $\gamma_{n}({\mathcal{X}})\to 0$ , then ${{\mathcal{A}}}^{-1}\circ j$ is compact as limit of finite rank operators. Then also $j$ is compact. ∎

Similarly, we define

[TABLE]

where ${\mathcal{A}}^{*}:{\mathcal{V}}\to{\mathcal{V}}^{*}$ is given by

[TABLE]

As before we have $\gamma_{n}^{*}({\mathcal{X}})$ defined as $\gamma_{n}({\mathcal{X}})$ but with $a$ replaced by the adjoint form $a^{*}$ of $a$ . Thus we have for all $w\in{{\mathcal{A}}}^{*-1}{\mathcal{X}}$ ,

[TABLE]

Now we apply the Aubin–Nitsche trick in the following proof. In contrast to the literature [12] we allow non-selfadjoint forms and also let $L\in{\mathcal{X}}$ where ${\mathcal{X}}\hookrightarrow{\mathcal{V}}^{\prime}$ is arbitrary. However, as usual, we fix a Hilbert space ${\mathcal{H}}$ such that ${\mathcal{V}}\hookrightarrow{\mathcal{H}}$ with dense range. Thus we have the Gelfand triple

[TABLE]

Now we let ${\mathcal{X}}\hookrightarrow{\mathcal{V}}^{\prime}$ be another Banach space in which we choose the given data $L$ , whereas our error estimate is done with respect to the norm of ${\mathcal{H}}$ .

Theorem 6.3.

Let $L\in{\mathcal{X}}$ and let $u$ be the solution of (6.3), $u_{n}$ the solution of (6.2). Then

[TABLE]

for all $n\in{\mathbb{N}}^{*}$ .

Proof.

Let $n\in{\mathbb{N}}^{*}$ . Then, on the footsteps of Aubin–Nitsche, we consider the solution $w\in{\mathcal{V}}$ of

[TABLE]

Then, by (6.11), for any $\chi\in{\mathcal{V}}_{n}$ ,

[TABLE]

where in the last identity we used the Galerkin orthogonality (6.4).

Since $\chi\in{\mathcal{V}}_{n}$ is arbitrary, this implies that

[TABLE]

Now we use (6.8) and (6.9) to deduce

[TABLE]

Consequently, we obtain

[TABLE]

∎

7. Applications

7.1. Selfadjoint positive operators with compact resolvent

As an illustration, we apply Theorem 6.3 to selfadjoint positive operators with compact resolvent. Let ${\mathcal{V}},{\mathcal{H}}$ be infinite dimensional, separable Hilbert spaces over ${\mathbb{K}}={\mathbb{R}}$ or ${\mathbb{C}}$ such that ${\mathcal{V}}$ is compactly injected in ${\mathcal{H}}$ and dense in ${\mathcal{H}}$ . Thus we have the Gelfand triple

[TABLE]

Let $a:{\mathcal{V}}\times{\mathcal{V}}\to{\mathbb{K}}$ be continuous, symmetric and coercive. Then the operator ${\mathcal{A}}:{\mathcal{V}}\to{\mathcal{V}}^{\prime}$ given by

[TABLE]

is invertible. Moreover, there exist an orthonormal basis $(e_{n})_{n\geq 0}$ of ${\mathcal{H}}$ and $\lambda_{n}\in{\mathbb{R}}$ such that

[TABLE]

and

[TABLE]

(see e.g. [2, Satz 4.49]) and

[TABLE]

Passing to an equivalent scalar product we may and will assume that

[TABLE]

Thus $|a(u,v)|\leq\|u\|_{{\mathcal{V}}}\|v\|_{{\mathcal{V}}}$ and $\sup_{\|v\|_{\mathcal{V}}=1}|a(u,v)|=\|u\|_{\mathcal{V}}$ ; i.e. we have $M=\beta=1$ in the above estimates.

Consider ${\mathcal{V}}_{n}=\mathop{\rm Span}\nolimits\{e_{0},\cdots,e_{n-1}\},n=1,2,\cdots$ . Then $({\mathcal{V}}_{n})_{n\in{\mathbb{N}}^{*}}$ is an approximating sequence of ${\mathcal{V}}$ . We define for $s\in[-1,1]$

[TABLE]

which is a Hilbert space for the norm

[TABLE]

Then it is easy to see that ${\mathcal{V}}_{-1}={\mathcal{V}}^{\prime}$ , ${\mathcal{V}}_{0}={\mathcal{H}}$ , ${\mathcal{V}}_{1}={\mathcal{V}}$ with identity of the norms. Morever, for $s\in(0,1)$ ,

[TABLE]

(the complex interpolation space) and for $s\in(-1,0)$ ,

[TABLE]

Lemma 7.1.

One has for $s\in[-1,1]$ ,

[TABLE]

In particular,

[TABLE]

Proof.

Let $\widehat{e_{n}}=\frac{1}{\sqrt{\lambda_{n}}}e_{n}$ . Then $(\widehat{e_{n}})_{n\geq 0}$ is an orthonormal basis of ${\mathcal{V}}$ . For $u\in{\mathcal{V}}$ ,

[TABLE]

Thus

[TABLE]

defines the orthogonal projection of ${\mathcal{V}}$ onto ${\mathcal{V}}_{n}$ . Moreover, in ${\mathcal{V}}$ one has

[TABLE]

Let $f\in{\mathcal{V}}_{s}$ , $u={{\mathcal{A}}}^{-1}f$ . Then

[TABLE]

Thus

[TABLE]

since $(\lambda_{k})_{k\geq 0}$ is increasing.

Taking $f=e_{n}$ , one sees that $\gamma_{n}({\mathcal{V}}_{s})^{2}\geq\frac{\lambda_{n}^{-1}}{\lambda_{n}^{s}}=\lambda_{n}^{-s-1}$ . ∎

Now let $f\in{\mathcal{V}}_{s}$ , where $-1\leq s\leq 1$ and let $u={\mathcal{A}}^{-1}f$ . Let $u_{n}\in{\mathcal{V}}$ such that

[TABLE]

i.e. $u_{n}$ is the approximate solution. Then by Theorem 6.3

[TABLE]

Thus we obtain the following error estimate

[TABLE]

Remark 7.2.

In this special case one can compute the error directly. In fact $u=\sum_{k=0}^{\infty}\frac{1}{\lambda_{k}}\langle f,e_{k}\rangle_{{\mathcal{H}}}e_{k}$ and $u_{n}=\sum_{k=0}^{n-1}\langle f,e_{k}\rangle_{{\mathcal{H}}}e_{k}$ . Thus

[TABLE]

which is exactly the estimate (7.1). This means that Theorem 6.3 gives the best possible estimate of the error.

Let us provide an example of application of (7.1). Let ${\mathbb{K}}={\mathbb{C}}$ , ${\mathcal{H}}=L^{2}(0,2\pi)$ with norm $\|u\|_{{\mathcal{H}}}^{2}=\frac{1}{2\pi}\int_{0}^{2\pi}|u(t)|^{2}dt$ . Let ${\mathcal{V}}=\{u\in H^{1}(0,2\pi):u(0)=u(2\pi)\}$ with norm

[TABLE]

Then the injection ${\mathcal{V}}\hookrightarrow{\mathcal{H}}$ is compact. Let $a:{\mathcal{V}}\times{\mathcal{V}}\to{\mathbb{C}}$ be given by

[TABLE]

Let $f\in L^{2}(0,2\pi)$ . Then there exists a unique $u\in H^{2}(0,2\pi)$ such that

[TABLE]

In fact, $u$ is the unique element of ${\mathcal{V}}$ such that $a(u,v)=\langle f,v\rangle$ for all $v\in{\mathcal{V}}$ .

For $u\in{\mathcal{H}}$ , let $\widehat{u}(k)=\frac{1}{2\pi}\int_{0}^{2\pi}u(t)e^{-ikt}dt$ be the $k$ -th Fourier coefficient. Then

[TABLE]

Let $e_{k}(t)=e^{ikt},t\in(0,2\pi)$ . Then $(e_{k})_{k\in{\mathbb{Z}}}$ is an orthonormal basis of ${\mathcal{H}}$ and $\widehat{u}(k)=\langle u,e_{k}\rangle_{{\mathcal{H}}}$ . Let ${\mathcal{V}}_{n}=\mathop{\rm Span}\nolimits\{e_{k}:|k|<n\}$ and let $u_{n}$ be the approximate solution i.e.

[TABLE]

Then our estimate shows that

[TABLE]

Let $0<s\leq 1$ and ${\mathcal{V}}_{s}:=\{u\in L^{2}(0,2\pi):\sum_{k\in{\mathbb{Z}}}(1+k^{2})^{s}|\widehat{u}(k)|^{2}<\infty\}$ . If $f\in{\mathcal{V}}_{s}$ , then by (7.1),

[TABLE]

7.2. Finite elements for the Poisson problem

In this section we want to apply our results to show the convergence of a numerical approximation via triangularization for the solution of a Poisson problem where coercivity is violated but essential coercivity holds. For simplicity we choose ${\mathbb{K}}={\mathbb{R}}$ throughout this section. Let $\Omega\subset{\mathbb{R}}^{d}$ be an open, bounded, convex set and let $a_{ij}:\Omega\to{\mathbb{R}}$ ( $1\leq i,j\leq d$ ) be Lipschitz continuous functions such that

[TABLE]

for all $x\in\Omega$ , where $\alpha>0$ . Moreover, let $b_{j},c_{j}\in W^{1,\infty}(\Omega)$ for $j=1,\cdots,d$ and $b_{0}\in L^{\infty}(\Omega)$ . We consider the operator $A$ given by

[TABLE]

Note that $A:H^{2}(\Omega)\to L^{2}(\Omega)$ is linear and continuous.

Our aim is to study the Poisson equation

[TABLE]

where $f\in L^{2}(\Omega)$ is given and a solution $u\in H_{0}^{1}(\Omega)\cap H^{2}(\Omega)$ is to be determined and calculated by approximation. We will impose the uniqueness condition

[TABLE]

We use the continuous, coercive form

[TABLE]

given by

[TABLE]

and also the perturbed form $a$ given by

[TABLE]

Note that the adjoint form $a^{*}$ defined by $a^{*}(u,v)=a(v,u)$ has the same form as $a$ . This is the reason why we also consider the coefficients $c_{j}$ .

Then the following well posedness result holds.

Theorem 7.3.

i)

The form $a$ is essentially coercive. 2. ii)

Assume (7.4). Then for each $f\in L^{2}(\Omega)$ there exists a unique solution $u\in H^{1}_{0}(\Omega)\cap H^{2}(\Omega)$ of (7.3).

Proof.

a) We first show $H^{2}$ -regularity. Let $u\in H^{1}_{0}(\Omega)$ , $f\in L^{2}(\Omega)$ such that $a(u,v)=\int_{\Omega}fv$ for all $v\in H^{1}_{0}(\Omega)$ . Then $u\in H^{2}(\Omega)$ and $Au=f$ . In fact, let

[TABLE]

Then $g\in L^{2}(\Omega)$ and $a_{0}(u,v)=\int_{\Omega}gv$ for all $v\in H^{1}_{0}(\Omega)$ . Now it follows from the classical $H^{2}$ -result of Kadlec [16] (see [13, Theorem 3.2.1.2]) that $u\in H^{2}(\Omega)$ . It clearly follows that $Au=f$ .

b) We show that $a$ is essentially coercive. Let $u_{n}\rightharpoonup 0$ as $n\to\infty$ in $H_{0}^{1}(\Omega)$ and $a(u_{n},u_{n})\to 0$ as $n\to\infty$ . Then $D_{j}u_{n}\rightharpoonup 0$ as $n\to\infty$ in $L^{2}(\Omega)$ . Since the embedding of $H_{0}^{1}(\Omega)$ in $L^{2}(\Omega)$ is compact, it follows that $u_{n}\to 0$ in $L^{2}(\Omega)$ . Consequently

[TABLE]

Thus also $a_{0}(u_{n},u_{n})\to 0$ as $n\to\infty$ . Since $a_{0}$ is coercive this implies $\|u_{n}\|_{H^{1}}\to 0$ as $n\to\infty$ .

c) The form $a$ satisfies uniqueness. In fact, let $u\in H_{0}^{1}(\Omega)$ such that $a(u,v)=0$ for all $v\in H_{0}^{1}(\Omega)$ . Then $u\in H^{2}(\Omega)$ by part a) of the proof. Hence $u=0$ by our assumption (7.4).

d) Let $f\in L^{2}(\Omega)$ . It follows from Corollary 4.6 that there exists a unique $u\in H_{0}^{1}(\Omega)$ such that $a(u,v)=\langle f,v\rangle_{L^{2}}$ for all $v\in H_{0}^{1}(\Omega)$ . Now a) implies that $u\in H^{2}(\Omega)$ and $Au=f$ .

∎

Concerning the uniqueness property, we make the following remark.

Remark 7.4 (Eigenvalues and uniqueness).

Replace the operator $A$ by $A_{\lambda}:=A-\lambda\mathop{\rm Id}\nolimits$ (i.e. $b_{0}$ by $b_{0}-\lambda$ ) where $\lambda\in{\mathbb{R}}$ . Then there exists a finite or countable infinite set such that

[TABLE]

*where $1<N\leq\infty$ and $\lambda_{n}\in{\mathbb{R}}$ , $\lim_{n\to\infty}\lambda_{n}=\infty$ if $N=\infty$ .

If $b_{1}=\cdots=b_{d}=c_{1}=\cdots=c_{d}=0$ and $b_{0}\geq 0$ , then $\lambda_{n}>0$ for all $n\in{\mathbb{N}}^{*}$ and then we are in the coercive case. But in general there will be also negative eigenvalues. The uniqueness condition (7.4) for $A$ is equivalent to saying that $\lambda_{n}\neq 0$ for all $n\in{\mathbb{N}}^{*}$ .*

Our final aim is to show that the finite element method yields an approximation of the solution of (7.3).

For that purpose we assume that $d=2$ and that $\Omega$ is a convex polygon. Let $\{\tau_{h}\}_{h>0}$ be a quasi-uniform admissible triangularization of $\Omega$ (see [2, Definition 9.26]). In particular each $\tau_{h}$ consists of finitely many triangles covering $\Omega$ of outer radius $r_{T}\leq h$ .

For $h>0$ , we consider the corresponding finite element space $V_{h}$ (see [2, Equation (9.35)]). Thus $V_{h}$ consists of those continuous functions on $\overline{\Omega}$ which vanish at $\partial\Omega$ and are affine on each triangle $T\in\tau_{h}$ .

The following fundamental estimates are classical (see e.g. [2, Korollar 9.28]) .

Proposition 7.5.

There exists a constant $c>0$ such that for all $h\in(0,1)$ and for each $v\in H^{2}(\Omega)$ ,

[TABLE]

where $|v|^{2}_{H^{2}(\Omega)}:=\int_{\Omega}(|D_{1}^{2}v|^{2}+2|D_{1}D_{2}v|^{2}+|D_{2}v|^{2})$ .

Note that Proposition 7.5 shows how we can approximate functions in $H^{2}(\Omega)$ by finite elements and so far there is no relation with the solutions of the Poisson equation.

We assume the uniqueness condition (7.4). Then by Theorem 5.2, since the form $a$ is essentially coercive, there exists $h_{0}\in(0,1]$ such that for $0<h\leq h_{0}$ and $u\in V_{h}$

[TABLE]

Let $f\in L^{2}(\Omega)$ . Since $V_{h}$ is finite dimensional, it follows from (7.6) that for all $0<h\leq h_{0}$ , there exists a unique $u_{h}\in V_{h}$ such that

[TABLE]

The finite elements $(u_{h})_{0<h\leq h_{0}}$ are the approximation of the solution of (7.3) we are interested in. They converge in $H^{1}(\Omega)$ with convergence order $1$ and in $L^{2}(\Omega)$ with convergence order $2$ . More precisely, the following is our main theorem of this section.

Theorem 7.6.

Let $f\in L^{2}(\Omega)$ and consider the approximate solutions $u_{h}$ , $0<h\leq h_{0}$ . Then there exist $0<h_{1}\leq h_{0}$ and constants $c_{1},c_{2}$ independent of $f$ such that

[TABLE]

and

[TABLE]

where $u$ is the solution of (7.3).

Proof.

Applying the closed graph theorem in the situation of Theorem 7.3, we find a constant $c_{3}>0$ such that

[TABLE]

whenever $f\in L^{2}(\Omega)$ and $u$ solves (7.3).

By Theorem 5.2, there exist $\gamma>0$ , $0<h_{1}\leq h_{0}$ , both independent of $f$ , such that

[TABLE]

for all $0<h\leq h_{1}$ . Thus (7.5) implies that for $0<h\leq h_{1}$ ,

[TABLE]

Now (7.8) follows from (7.10).

Next we establish the $L^{2}$ -estimate (7.9). For that we compute using (7.5),

[TABLE]

Since $|w|_{H^{2}(\Omega)}\leq\|w\|_{H^{2}(\Omega)}$ , it follows from (7.10) that $\gamma_{h}({\mathcal{H}})\leq cc_{3}h$ for all $h>0$ .

The same estimate is true for $\gamma_{h}^{*}({\mathcal{H}})$ . Now assume that (7.9) is false. Then there exists a sequence $h_{n}\downarrow 0$ as $n\to\infty$ such that (7.9) does not hold for all $h=h_{n}$ and any constant $c_{2}$ . This contradicts Theorem 6.3. ∎

Remark 7.7.

There are other methods to approximate the solution of a non-coercive advection-diffusion equation as (7.3). In fact, Le Bris, Legoll and Madiot [19] use the Banach-Nečas-Babuska lemma (instead of essential coercivity as we do) and a special measure to construct an approximation.

The advantage is that no initial mesh $h_{1}$ has to be considered; on the other hand there seems to be no such precise error estimate as our quadratic convergence obtained in Theorem 7.6 even though numerical examples are given in [19].

Still, another approach (based on Fredholm perturbation) is presented by Christensen [9], which also involves the Babuska inf-sup condition.

Finally, let us mention the works by Droniou, Gallouët and Herbin [10], based on finite volume methods, which also present the advantage to provide an approximate solution for this problem on any admissible mesh.

One of the first results on the Galerkin method in a special non-coercive case are due to Schatz [27] and Schatz–Wang [28].

8. Supplement: saddle point problems

Brezzi’s contribution [4] is a version of (BNB) which implies the convergence of the Galerkin approximation in the case of saddle point problems. Let us consider the case where ${\mathcal{W}}$ and ${\mathcal{Y}}$ are real Hilbert spaces and $\widehat{a}~{}:~{}{\mathcal{W}}\times{\mathcal{W}}\to{\mathbb{R}}$ and $\widehat{b}~{}:~{}{\mathcal{W}}\times{\mathcal{Y}}\to{\mathbb{R}}$ are continuous bilinear forms in the sense that there exists $M>0$ with

[TABLE]

and

[TABLE]

Then, given $(f,g)\in{\mathcal{W}}^{\prime}\times{\mathcal{Y}}^{\prime}$ , the continuous saddle point problem consists in finding $(w,p)\in{\mathcal{W}}\times{\mathcal{Y}}$ such that

[TABLE]

Example 8.1.

An important example is the Stokes problem (motivating some investigation by Ladyžhenskaya [18]), with ${\mathcal{W}}=(H^{1}_{0}(\Omega))^{d}$ , where $d$ is the space dimension, ${\mathcal{Y}}=L^{2}_{0}(\Omega)$ (the space of $L^{2}$ -functions with null average),

[TABLE]

and

[TABLE]

The approximation of the saddle point problem is then generally done by a mixed method [4, 5], letting, for $n=1,2,\ldots$ , $({\mathcal{W}})_{n\in{\mathbb{N}}^{\star}}$ and $({\mathcal{Y}})_{n\in{\mathbb{N}}^{\star}}$ be approximating sequences in the spaces ${\mathcal{W}}$ and ${\mathcal{Y}}$ , respectively, in the sense of Definition 2.1, and looking for $(w_{n},p_{n})\in{\mathcal{W}}_{n}\times{\mathcal{Y}}_{n}$ such that

[TABLE]

We call this the approximate saddle point problem.

The following result shows that conditions (8.1) (which are Brezzi’s conditions [4, Hypotheses H1 and H2]) are sufficient for the convergence of the solutions of the approximate saddle point problems. This is proved by Brezzi [4, Theorem 2.1], where a solution is assumed to exist. However, similar to the proof of our Proposition 2.5, one can show that Brezzi’s conditions imply existence and uniqueness of the continuous saddle point problem. Indeed, following the proof of (8.2) given in the proof of [4, Theorem 2.1], letting $w=0$ and $p=0$ , we get a bound on the approximate solution, and a solution of the continuous problem can be obtained by passing to the limit of a weakly converging subsequence. Uniqueness follows from the estimate (8.2) proved by Brezzi. For $n=1,2,\ldots$ , define

[TABLE]

and assume that ${\mathcal{W}}_{0,n}^{*}:={\mathcal{W}}_{0,n}\setminus\{0\}\neq\emptyset$ and ${\mathcal{Y}}_{n}^{*}:={\mathcal{Y}}_{n}\setminus\{0\}\neq\emptyset$ for all $n\in{\mathbb{N}}^{*}$ .

Theorem 8.2 (Brezzi).

Assume that there exists $\beta>0$ such that

[TABLE]

Then, given $(f,g)\in{\mathcal{W}}^{\prime}\times{\mathcal{Y}}^{\prime}$ , there exists a unique solution $(w,p)$ of the continuous saddle point problem and for each $n\in{\mathbb{N}}^{*}$ a unique solution $(w_{n},p_{n})$ of the approximate saddle point problem. Moreover,

[TABLE]

where the constant $c$ depends only on $\beta$ and $M$ .

The saddle point problem can be cast in our framework by letting ${\mathcal{V}}={\mathcal{U}}={\mathcal{W}}\times{\mathcal{Y}}$ , $u=(w,p)$ , $v=(z,q)$ and

[TABLE]

Given $(f,g)\in{\mathcal{V}}^{\prime}={\mathcal{W}}^{\prime}\times{\mathcal{Y}}^{\prime}$ , define $L\in{\mathcal{V}}^{\prime}$ by

[TABLE]

Then $u=(w,p)$ is a solution of the continuous saddle point problem if and only if (1.1) is satisfied. Moreover, letting ${\mathcal{V}}_{n}={\mathcal{U}}_{n}={\mathcal{W}}_{n}\times{\mathcal{Y}}_{n}$ , a vector $u_{n}=(w_{n},p_{n})\in{\mathcal{V}}_{n}$ satisfies (1.2) if and only if $(w_{n},p_{n})$ is a solution of the approximate saddle point problem. Thus our Theorem 2.4 shows that the convergence property expressed in Brezzi’s Theorem is equivalent to (BNB) for the form $a$ and the approximating sequence $({\mathcal{V}}_{n})$ . We can use this to show the following converse result of Brezzi’s Theorem.

Theorem 8.3.

Assume that, given $(f,g)\in{\mathcal{W}}^{\prime}\times{\mathcal{Y}}^{\prime}$ , for each $n\in{\mathbb{N}}^{*}$ , there is a unique solution $(w_{n},p_{n})$ of the discrete saddle point problem and that $\sup_{n\in{\mathbb{N}}^{*}}(\|w_{n}\|_{W}+\|p_{n}\|_{Y})<\infty.$ Then Brezzi’s conditions (8.1) hold.

Proof.

We know from Theorem 2.4 and Proposition 2.5 that (BNB) is satisfied for some $\beta>0$ . We endow the space ${\mathcal{V}}$ with the norm $\|u\|_{{\mathcal{V}}}=\big{(}\|w\|_{{\mathcal{W}}}^{2}+\|p\|_{{\mathcal{Y}}}^{2}\big{)}^{1/2}$ for $u=(w,p)$ (it is then a Hilbert space as well). Let $n\in{\mathbb{N}}^{\star}$ be given. We then have,

[TABLE]

Let us first choose, for any $p\in{\mathcal{Y}}_{n}^{\star}$ , $u=(0,p)$ , which means that $w=0\in{\mathcal{W}}_{n}$ . Let $(z,q)\in{\mathcal{W}}_{n}\times{\mathcal{Y}}_{n}\setminus\{(0,0)\}$ attaining the supremum value in (8.3). We then have, from the definition of $a$ in this framework of a saddle point problem,

[TABLE]

which implies that $z\neq 0$ and

[TABLE]

This proves (8.1). $(iii)$ , and thus that the operator $\widehat{\mathcal{B}}_{n}~{}:~{}{\mathcal{W}}_{n}\to{\mathcal{Y}}_{n}$ , defined for all $z\in{\mathcal{W}}_{n}$ by

[TABLE]

is bijective from ${\mathcal{W}}_{0,n}^{\perp}$ to ${\mathcal{Y}}_{n}$ .

Let $w\in{\mathcal{W}}_{0,n}^{\star}$ and let $p\in{\mathcal{Y}}_{n}$ be defined by

[TABLE]

Choose an element $(z,q)\in{\mathcal{W}}_{n}\times{\mathcal{Y}}_{n}\setminus\{(0,0)\}$ attaining the supremum value in (8.3) for this choice of $u=(w,p)$ . We then write $z=z_{0}+z_{1}$ , with $z_{0}\in{\mathcal{W}}_{0,n}$ and $z_{1}\in{\mathcal{W}}_{0,n}^{\perp}$ , which can be written as $z_{1}=\widehat{\mathcal{B}}_{n}^{(-1)}q_{1}$ for some $q_{1}\in{\mathcal{Y}}_{n}$ . We have

[TABLE]

Moreover, $\widehat{b}(z_{0},p)=\widehat{b}(w,q)=0$ since $z_{0}\in{\mathcal{W}}_{0,n}$ and $w\in{\mathcal{W}}_{0,n}$ , and

[TABLE]

by definition of $p$ and of $z_{1}$ . Hence

[TABLE]

This implies that $z_{0}\neq 0$ , and therefore $z_{0}\in{\mathcal{W}}_{0,n}^{\star}$ is such that

[TABLE]

where we take into account that $\|z\|_{{\mathcal{W}}}\geq\|z_{0}\|_{{\mathcal{W}}}$ by Pythagore’s theorem. This concludes the proof of (8.1). $(i)$ .

The equivalence between $(BNB)$ and $(BNB^{\star})$ allows to obtain the proof of (8.1). $(ii)$ (with the same $\beta$ , see Proposition 2.9), following the same path.

∎

In conclusion, Brezzi’s conditions (8.1) are equivalent to the well posedness of the continuous saddle point problem together with the convergence of the approximate solutions to the solution, and they are also equivalent to (BNB) for the form $a$ and the approximating sequence $({\mathcal{V}}_{n})$ of ${\mathcal{V}}$ .

Note that [5, Chapter II, Remark 2.11] provides a comment on the fact that (8.1). $(iii)$ is a necessary condition.

Acknowledgments: We are most grateful to Gilles Lancien about a discussion on the approximation property and pointing out the survey article of Casazza [7] to us. We also thank the anonymous referee for useful and inspiring comments. This research is partly supported by the Bézout Labex, funded by ANR, reference ANR-10-LABX-58.

Bibliography32

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] W. Arendt, A. F. M. ter Elst, J. B. Kennedy, and M. Sauter. The Dirichlet-to-Neumann operator via hidden compactness. J. Funct. Anal. , 266(3):1757–1786, 2014.
2[2] W. Arendt and K. Urban. Partielle Differenzialgleichungen. Eine Einführung in analytische und numerische Methoden. Berlin: Springer Spektrum, 2nd edition edition, 2018.
3[3] I. Babuška. Error-bounds for finite element method. Numer. Math. , 16:322–333, 1970/71.
4[4] F. Brezzi. On the existence, uniqueness and approximation of saddle-point problems arising from Lagrangian multipliers. Rev. Française Automat. Informat. Recherche Opérationnelle Sér. Rouge , 8(R-2):129–151, 1974.
5[5] F. Brezzi and M. Fortin. Mixed and hybrid finite element methods , volume 15 of Springer Series in Computational Mathematics . Springer-Verlag, New York, 1991.
6[6] F. E. Browder. Nonlinear operators and nonlinear equations of evolution in Banach spaces. In Nonlinear functional analysis (Proc. Sympos. Pure Math., Vol. XVIII, Part 2, Chicago, Ill., 1968) , pages 1–308, 1976.
7[7] P. G. Casazza. Chapter 7 - approximation properties. In W. Johnson and J. Lindenstrauss, editors, Handbook of the Geometry of Banach Spaces , volume 1 of Handbook of the Geometry of Banach Spaces , pages 271 – 316. Elsevier Science B.V., 2001.
8[8] L. Chesnel and P. jun. Ciarlet. T 𝑇 T -coercivity and continuous Galerkin methods: application to transmission problems with sign changing coefficients. Numer. Math. , 124(1):1–29, 2013.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Galerkin approximation of linear problems

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

Contents

2. Petrov–Galerkin approximation

Definition 2.1** (Approximating sequences of Banach spaces).**

Remark 2.2**.**

Definition 2.3** (Convergence of Galerkin approximation).**

Theorem 2.4**.**

Proposition 2.5**.**

Proof.

Proposition 2.6**.**

Proof.

Remark 2.7**.**

Proposition 2.8**.**

Proof.

Proof of Theorem 2.4.

Proposition 2.9**.**

Proof.

3.

Definition 3.1** (Approximation property and Schauder decomposition).**

Theorem 3.2**.**

Proof of Theorem 3.2.

4. Essentially coercive forms

Remark 4.1**.**

Definition 4.2** (Essential coercivity).**

Theorem 4.3**.**

Proof.

Lemma 4.4**.**

Proof.

Theorem 4.5**.**

Proof.

Corollary 4.6**.**

Proof.

Remark 4.7**.**

Corollary 4.8**.**

Example 4.9**.**

Remark 4.10**.**

Proof.

5. Characterization of the universal Galerkin property

Definition 5.1** (Universal Galerkin property).**

Theorem 5.2**.**

Proof.

Corollary 5.3**.**

Proof.

6. The Aubin-Nitsche trick revisited

Remark 6.1**.**

Proposition 6.2**.**

Proof.

Theorem 6.3**.**

Proof.

7. Applications

7.1. Selfadjoint positive operators with compact resolvent

Lemma 7.1**.**

Proof.

Remark 7.2**.**

7.2. Finite elements for the Poisson problem

Theorem 7.3**.**

Proof.

Remark 7.4** (Eigenvalues and uniqueness).**

Proposition 7.5**.**

Theorem 7.6**.**

Proof.

Remark 7.7**.**

8. Supplement: saddle point problems

Example 8.1**.**

Theorem 8.2** (Brezzi).**

Theorem 8.3**.**

Proof.

Definition 2.1 (Approximating sequences of Banach spaces).

Remark 2.2.

Definition 2.3 (Convergence of Galerkin approximation).

Theorem 2.4.

Proposition 2.5.

Proposition 2.6.

Remark 2.7.

Proposition 2.8.

Proposition 2.9.

Definition 3.1 (Approximation property and Schauder decomposition).

Theorem 3.2.

Remark 4.1.

Definition 4.2 (Essential coercivity).

Theorem 4.3.

Lemma 4.4.

Theorem 4.5.

Corollary 4.6.

Remark 4.7.

Corollary 4.8.

Example 4.9.

Remark 4.10.

Definition 5.1 (Universal Galerkin property).

Theorem 5.2.

Corollary 5.3.

Remark 6.1.

Proposition 6.2.

Theorem 6.3.

Lemma 7.1.

Remark 7.2.

Theorem 7.3.

Remark 7.4 (Eigenvalues and uniqueness).

Proposition 7.5.

Theorem 7.6.

Remark 7.7.

Example 8.1.

Theorem 8.2 (Brezzi).

Theorem 8.3.