On the Convergence of Adaptive Iterative Linearized Galerkin Methods

Pascal Heid; Thomas P. Wihler

arXiv:1905.06682·math.NA·May 17, 2019

On the Convergence of Adaptive Iterative Linearized Galerkin Methods

Pascal Heid, Thomas P. Wihler

PDF

TL;DR

This paper develops a unified convergence theory for adaptive iterative linearized Galerkin methods, encompassing various iterative schemes like Zarantonello, Kačanov, and Newton, applied to nonlinear equations in Hilbert spaces.

Contribution

It introduces an abstract convergence framework for adaptive ILG schemes that unifies multiple iterative methods within a general discretization context.

Findings

01

Convergence of the unified ILG approach is established theoretically.

02

The framework is validated through numerical experiments on nonlinear conservation laws.

03

Comparison of different linearization schemes demonstrates the effectiveness of the unified method.

Abstract

A wide variety of different (fixed-point) iterative methods for the solution of nonlinear equations exists. In this work we will revisit a unified iteration scheme in Hilbert spaces from our previous work that covers some prominent procedures (including the Zarantonello, Ka\v{c}anov and Newton iteration methods). In combination with appropriate discretization methods so-called (adaptive) iterative linearized Galerkin (ILG) schemes are obtained. The main purpose of this paper is the derivation of an abstract convergence theory for the unified ILG approach (based on general adaptive Galerkin discretization methods) proposed in our previous work. The theoretical results will be tested and compared for the aforementioned three iterative linearization schemes in the context of adaptive finite element discretizations of strongly monotone stationary conservation laws.

Figures8

Click any figure to enlarge with its caption.

Equations227

u \in X : F (u) = 0 in X^{⋆},

u \in X : F (u) = 0 in X^{⋆},

u \in X : ⟨ F (u), v ⟩_{X^{⋆} \times X} = 0 for all v \in X,

u \in X : ⟨ F (u), v ⟩_{X^{⋆} \times X} = 0 for all v \in X,

⟨ F (u) - F (v), w ⟩_{X^{⋆} \times X} \leq L_{F} ∥ u - v ∥_{X} ∥ w ∥_{X},

⟨ F (u) - F (v), w ⟩_{X^{⋆} \times X} \leq L_{F} ∥ u - v ∥_{X} ∥ w ∥_{X},

ν ∥ u - v ∥_{X}^{2} \leq ⟨ F (u) - F (v), u - v ⟩_{X^{⋆} \times X},

ν ∥ u - v ∥_{X}^{2} \leq ⟨ F (u) - F (v), u - v ⟩_{X^{⋆} \times X},

u = u - A [v]^{- 1} F (u) .

u = u - A [v]^{- 1} F (u) .

u^{n + 1} = u^{n} - A [u^{n}]^{- 1} F (u^{n}), n \geq 0.

u^{n + 1} = u^{n} - A [u^{n}]^{- 1} F (u^{n}), n \geq 0.

u^{n + 1} \in X : A [u^{n}] u^{n + 1} = A [u^{n}] u^{n} - F (u^{n}), n \geq 0.

u^{n + 1} \in X : A [u^{n}] u^{n + 1} = A [u^{n}] u^{n} - F (u^{n}), n \geq 0.

f : X \to X^{⋆}, f (u) := A [u] u - F (u),

f : X \to X^{⋆}, f (u) := A [u] u - F (u),

A [u^{n}] u^{n + 1} = f (u^{n}), n \geq 0.

A [u^{n}] u^{n + 1} = f (u^{n}), n \geq 0.

a (u; v, w) := ⟨ A [u] v, w ⟩_{X^{⋆} \times X}, v, w \in X .

a (u; v, w) := ⟨ A [u] v, w ⟩_{X^{⋆} \times X}, v, w \in X .

a (u^{n}; u^{n + 1}, w) = ⟨ f (u^{n}), w ⟩_{X^{⋆} \times X} \forall w \in X .

a (u^{n}; u^{n + 1}, w) = ⟨ f (u^{n}), w ⟩_{X^{⋆} \times X} \forall w \in X .

a (u; v, v) \geq α ∥ v ∥_{X}^{2} \forall v \in X,

a (u; v, v) \geq α ∥ v ∥_{X}^{2} \forall v \in X,

a (u; v, w) \leq β ∥ v ∥_{X} ∥ w ∥_{X} \forall v, w \in X,

a (u; v, w) \leq β ∥ v ∥_{X} ∥ w ∥_{X} \forall v, w \in X,

(u^{n + 1}, \cdot)_{X} = (u^{n}, \cdot)_{X} - δ ⟨ F (u^{n}), \cdot ⟩_{X^{⋆} \times X}, n \geq 0,

(u^{n + 1}, \cdot)_{X} = (u^{n}, \cdot)_{X} - δ ⟨ F (u^{n}), \cdot ⟩_{X^{⋆} \times X}, n \geq 0,

A [u^{n}] u^{n + 1} = g, n \geq 0,

A [u^{n}] u^{n + 1} = g, n \geq 0,

F^{'} (u^{n}) u^{n + 1} = F^{'} (u^{n}) u^{n} - δ (u^{n}) F (u^{n}), n \geq 0,

F^{'} (u^{n}) u^{n + 1} = F^{'} (u^{n}) u^{n} - δ (u^{n}) F (u^{n}), n \geq 0,

u_{N}^{⋆} \in X_{N} : ⟨ F (u_{N}^{⋆}), v ⟩_{X^{⋆} \times X} = 0 \forall v \in X_{N} .

u_{N}^{⋆} \in X_{N} : ⟨ F (u_{N}^{⋆}), v ⟩_{X^{⋆} \times X} = 0 \forall v \in X_{N} .

u_{N}^{n + 1} \in X_{N} : a (u_{N}^{n}; u_{N}^{n + 1}, v) = ⟨ f (u_{N}^{n}), v ⟩_{X^{⋆} \times X} \forall v \in X_{N},

u_{N}^{n + 1} \in X_{N} : a (u_{N}^{n}; u_{N}^{n + 1}, v) = ⟨ f (u_{N}^{n}), v ⟩_{X^{⋆} \times X} \forall v \in X_{N},

∥ u^{⋆} - u^{n} ∥_{X} \leq C_{\ref e q : d i scr e t eer r or es t ima t e} u^{n} - u^{n - 1}_{X}, with C_{\ref e q : d i scr e t eer r or es t ima t e} := 1 + \nicefrac β ν,

∥ u^{⋆} - u^{n} ∥_{X} \leq C_{\ref e q : d i scr e t eer r or es t ima t e} u^{n} - u^{n - 1}_{X}, with C_{\ref e q : d i scr e t eer r or es t ima t e} := 1 + \nicefrac β ν,

ν u^{⋆} - u^{n - 1}_{X}^{2}

ν u^{⋆} - u^{n - 1}_{X}^{2}

ν u^{⋆} - u^{n - 1}_{X}^{2} \leq a (u^{n - 1}; u^{n - 1} - u^{n}, u^{n - 1} - u^{⋆}) \leq β u^{n} - u^{n - 1}_{X} u^{n - 1} - u^{⋆}_{X},

ν u^{⋆} - u^{n - 1}_{X}^{2} \leq a (u^{n - 1}; u^{n - 1} - u^{n}, u^{n - 1} - u^{⋆}) \leq β u^{n} - u^{n - 1}_{X} u^{n - 1} - u^{⋆}_{X},

u^{⋆} - u^{n - 1}_{X} \leq β ν^{- 1} u^{n} - u^{n - 1}_{X} .

u^{⋆} - u^{n - 1}_{X} \leq β ν^{- 1} u^{n} - u^{n - 1}_{X} .

∥ u^{⋆} - u^{n} ∥_{X} \leq u^{n} - u^{n - 1}_{X} + u^{⋆} - u^{n - 1}_{X} \leq C_{\ref e q : d i scr e t eer r or es t ima t e} u^{n} - u^{n - 1}_{X},

∥ u^{⋆} - u^{n} ∥_{X} \leq u^{n} - u^{n - 1}_{X} + u^{⋆} - u^{n - 1}_{X} \leq C_{\ref e q : d i scr e t eer r or es t ima t e} u^{n} - u^{n - 1}_{X},

\frac{ν}{2} ∥ u^{⋆} - u ∥_{X}^{2} \leq H (u) - H (u^{⋆}) \leq \frac{L _{F}}{2} ∥ u^{⋆} - u ∥_{X}^{2} \forall u \in X .

\frac{ν}{2} ∥ u^{⋆} - u ∥_{X}^{2} \leq H (u) - H (u^{⋆}) \leq \frac{L _{F}}{2} ∥ u^{⋆} - u ∥_{X}^{2} \forall u \in X .

φ^{'} (t) = ⟨ H^{'} (u^{⋆} + t (u - u^{⋆})), u - u^{⋆} ⟩_{X^{⋆} \times X} = ⟨ F (u^{⋆} + t (u - u^{⋆})), u - u^{⋆} ⟩_{X^{⋆} \times X} .

φ^{'} (t) = ⟨ H^{'} (u^{⋆} + t (u - u^{⋆})), u - u^{⋆} ⟩_{X^{⋆} \times X} = ⟨ F (u^{⋆} + t (u - u^{⋆})), u - u^{⋆} ⟩_{X^{⋆} \times X} .

H (u) - H (u^{⋆})

H (u) - H (u^{⋆})

= \int_{0}^{1} ⟨ F (u^{⋆} + t (u - u^{⋆})) - F (u^{⋆}), u - u^{⋆} ⟩_{X^{⋆} \times X} d t .

H (u) - H (u^{⋆})

H (u) - H (u^{⋆})

H (u) - H (u^{⋆}) \geq \frac{ν}{2} ∥ u^{⋆} - u ∥_{X}^{2} .

H (u) - H (u^{⋆}) \geq \frac{ν}{2} ∥ u^{⋆} - u ∥_{X}^{2} .

H (u) - H (u^{⋆}) \leq \frac{L _{F}}{2} ∥ u^{⋆} - u ∥_{X}^{2} .

H (u) - H (u^{⋆}) \leq \frac{L _{F}}{2} ∥ u^{⋆} - u ∥_{X}^{2} .

H (u^{n - 1}) - H (u^{n}) \geq C_{H} u^{n} - u^{n - 1}_{X}^{2} \forall n \geq 1,

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

On the Convergence of

Adaptive Iterative Linearized Galerkin Methods

Pascal Heid and Thomas P. Wihler

[email protected] and [email protected]

Mathematics Institute, University of Bern, Sidlerstrasse 5, CH-3012 Bern, Switzerland

Abstract.

A wide variety of different (fixed-point) iterative methods for the solution of nonlinear equations exists. In this work we will revisit a unified iteration scheme in Hilbert spaces from our previous work [12] that covers some prominent procedures (including the Zarantonello, Kačanov and Newton iteration methods). In combination with appropriate discretization methods so-called (adaptive) iterative linearized Galerkin (ILG) schemes are obtained. The main purpose of this paper is the derivation of an abstract convergence theory for the unified ILG approach (based on general adaptive Galerkin discretization methods) proposed in [12]. The theoretical results will be tested and compared for the aforementioned three iterative linearization schemes in the context of adaptive finite element discretizations of strongly monotone stationary conservation laws.

Key words and phrases:

Numerical solution methods for quasilinear elliptic PDE, monotone problems, fixed point iterations, linearization schemes, Kačanov method, Newton method, Galerkin discretizations, adaptive mesh refinement, convergence of adaptive finite element methods

2010 Mathematics Subject Classification:

35J62, 47J25, 47H05, 47H10, 49M15, 65J15, 65N12, 65N30, 65N50

The authors acknowledge the financial support of the Swiss National Science Foundation (SNF), Grant No. 200021 182524

1. Introduction

In this paper we analyze the convergence of adaptive iterative linearized Galerkin (ILG) methods for nonlinear problems with strongly monotone operators. To set the stage, we consider a real Hilbert space $X$ with inner product $(\cdot,\cdot)_{X}$ and induced norm denoted by $\|\cdot\|_{X}$ . Then, given a nonlinear operator $\mathsf{F}:\,X\to X^{\star}$ , we focus on the equation

[TABLE]

where $X^{\star}$ denotes the dual space of $X$ . In weak form, this problem reads

[TABLE]

with $\left<\cdot,\cdot\right>_{X^{\star}\times X}$ signifying the duality pairing in $X^{\star}\times X$ . For the purpose of this work, we suppose that $\mathsf{F}$ satisfies the following conditions:

(F1)

The operator $\mathsf{F}$ is Lipschitz continuous, i.e. it exists a constant $L_{\mathsf{F}}>0$ such that

[TABLE]

for all $u,v,w\in X$ . 2. (F2)

The operator $\mathsf{F}$ is strongly monotone, i.e. there is a constant $\nu>0$ such that

[TABLE]

for all $u,v\in X$ .

Given the properties (F1) and (F2), the main theorem of strongly monotone operators states that (1) has a unique solution $u^{\star}\in X$ ; see, e.g., [16, §3.3] or [18, Theorem 25.B].

Iterative linearization

The existence of a solution to the nonlinear equation (1) can be established in a constructive way. This can be accomplished, for instance, by transforming (1) into an appropriate fixed-point form, which, in turn, induces a potentially convergent fixed-point iteration scheme. To this end, following our approach in [12], for some given $v\in X$ , we consider a linear and invertible preconditioning operator $\mathsf{A}[v]:\,X\to X^{\star}$ . Then, applying $\mathsf{A}[v]^{-1}$ to (1) leads to the fixed-point equation

[TABLE]

For any suitable initial guess $u^{0}\in X$ , the above identity motivates the iteration scheme

[TABLE]

Equivalently, we have

[TABLE]

For given $u^{n}\in X$ , we emphasize that the above problem of solving for $u^{n+1}$ is linear; consequently, we call (3) an iterative linearization scheme for (1). Letting

[TABLE]

we may write

[TABLE]

In order to discuss the weak form of (5), for a prescribed $u\in X$ , we introduce the bilinear form

[TABLE]

Then, based on $u^{n}\in X$ , the solution $u^{n+1}\in X$ of (5) can be obtained from the weak formulation

[TABLE]

Throughout this paper, for any $u\in X$ , we assume that the bilinear form $a(u;\cdot,\cdot)$ is uniformly coercive and bounded. The latter two assumptions refer to the fact that there are two constants $\alpha,\beta>0$ independent of $u\in X$ , such that

[TABLE]

and

[TABLE]

respectively. In particular, owing to the Lax-Milgram Theorem, these properties imply the well-posedness of the solution $u^{n+1}\in X$ of the linear equation (7), for any given $u^{n}\in X$ .

Let us briefly review some prominent procedures that can be cast into the framework of the linearized fixed-point iteration (7): For instance, we point to the Zarantonello iteration given by

[TABLE]

with $\delta>0$ being a sufficiently small parameter; cf. Zarantonello’s original report [17], or the monographs [16, §3.3] and [18, §25.4]. A further example is the Kačanov scheme which reads

[TABLE]

in the special case that $g=\mathsf{A}[u]u-\mathsf{F}(u)$ is independent of $u$ . Finally, we mention the (damped) Newton method which is defined by

[TABLE]

for a damping parameter $\delta(u^{n})>0$ . Here $\mathsf{F}^{\prime}$ signifies the Gâteaux derivative of $\mathsf{F}$ (provided that it exists). For any of the above three iterative procedures, we emphasize that convergence to the unique solution of (1) can be guaranteed under suitable conditions; see our previous work [12] for details.

The ILG approach

Consider a finite dimensional subspace $X_{N}\subset X$ . Then, the Galerkin approximation of (2) in $X_{N}$ reads as follows:

[TABLE]

We note that (13) has a unique solution $u_{N}^{\star}\in X_{N}$ since the restriction $\mathsf{F}|_{X_{N}}$ still satisfies the conditions (F1) and (F2) above. The iterative linearized Galerkin (ILG) approach is based on discretizing the iteration scheme (7). Specifically, a Galerkin approximation ${u}_{N}^{n+1}\in X_{N}$ of $u_{N}^{\star}$ , based on a prescribed initial guess ${u}_{N}^{0}\in X_{N}$ , is obtained by solving iteratively the linear discrete problem

[TABLE]

for $n\geq 0$ . For the resulting sequence $\{u_{N}^{n}\}_{n\geq 0}\subset X_{N}$ of discrete solutions it is possible, based on elliptic reconstruction techniques (cf., e.g., [13, 14]), to obtain general (abstract) a posteriori estimates for the difference to the exact solution, $u^{\star}\in X$ , of (1), i.e. for $\|u^{\star}-u_{N}^{n+1}\|_{X}$ , $n\geq 0$ , see [12, §3]. Based on such a posteriori error estimators, an adaptive ILG algorithm that exploits an efficient interplay of the iterative linearization scheme (14) and automatic Galerkin space enrichments was proposed in [12, §4]; see also [6]. We refer to some related works in the context of (inexact) Newton schemes [1, 2, 9, 8], or of the Kačanov iteration [4, 11].

Goal of this paper

The convergence of an adaptive Kačanov algorithm, which is based on a finite element discretization, for the numerical solution of quasi-linear elliptic partial differential equations has been studied in [11]. Furthermore, more recently, the authors of [10] have proposed and analyzed an adaptive algorithm for the numerical solution of (1) within the specific context of a finite element discretization of the Zarantonello iteration (10). The latter paper includes an analysis of the convergence rate which is related to the work [5] on optimal convergence for adaptive finite element methods within a more general abstract framework. The purpose of the current paper is to generalize the adaptive ILG algorithm from [10] to the framework of the unified iterative linearization scheme (5); furthermore, arbitrary (conforming) Galerkin discretizations will be considered. In order to provide a convergence analysis for the ILG scheme (14) within this general abstract setting, we will follow along the lines of [10], however, we emphasize that some significant modifications in the analysis are required. Indeed, whilst the theory in [10] relies on a contraction argument for the Zarantonello iteration, this favourable property is not available for the general iterative linearization scheme (5). To address this difficulty, we derive a contraction-like property instead. This observation will then suffice to establish the convergence of the adaptive ILG scheme, and to (uniformly) bound the number of linearization steps on each (fixed) Galerkin space similar to [10]; we note that the latter property constitutes a crucial ingredient with regards to the (linear) computational complexity of adaptive iterative linearized finite element schemes.

Outline

Section 2 contains a convergence analysis of the unified iteration scheme (5). On that account we will encounter a contraction-like property, which is key for the subsequent analysis of the convergence rate of the adaptive ILG algorithm in Section 3. Here, in addition, a (uniform) bound of the iterative linearization steps on each discrete space will be shown. In Section 4, we will test our ILG algorithm in the context of finite element discretizations of stationary conservation laws. Finally, we add a few concluding remarks in Section 5.

2. Iterative linearization

In this first section we will address the convergence of the linearized iteration (5). We begin with the following a posteriori error estimate.

Lemma 2.1.

Consider the sequence $\{u^{n}\}_{n\geq 0}\subset X$ generated by the iteration (5). If $\mathsf{F}$ satisfies (F1)–(F2), and $a(u;\cdot,\cdot)$ , for $u\in X$ , fulfils (8)–(9), then it holds the bound

[TABLE]

for any $n\geq 1$ .

Proof.

By invoking (F2), and since $u^{\star}$ is the (unique) solution of (1), for $n\geq 1$ , we find that

[TABLE]

Employing (4), (7), and (9), we further get

[TABLE]

and thus

[TABLE]

By the triangle inequality, this leads to

[TABLE]

which completes the proof. ∎

Remark 2.2.

We note that the above result equally holds if (2) and (5) are restricted to any closed subspace of $X$ .

2.1. Potentials

In addition to (F1) and (F2), let us make a further assumption on the (nonlinear) operator $\mathsf{F}$ from (1), which will play an essential role in the ensuing analysis.

(F3)

The operator $\mathsf{F}$ possesses a potential, i.e. it exists a Gâteaux differentiable functional $\mathsf{H}:X\to\mathbb{R}$ such that $\mathsf{H}^{\prime}=\mathsf{F}$ .

We notice the following relation between the norm $\|\cdot\|_{X}$ and the potential $\mathsf{H}$ .

Lemma 2.3.

Suppose that the operator $\mathsf{F}$ satisfies (F1)–(F3), and denote by $u^{\star}\in X$ the unique solution of (1). Then, we have the estimate

[TABLE]

In particular, $\mathsf{H}$ takes its minimum at $u^{\star}$ .

Proof.

For fixed $u\in X$ , define the real-valued function $\varphi(t):=\mathsf{H}(u^{\star}+t(u-u^{\star}))$ , for $t\in[0,1]$ . Taking the derivative leads to

[TABLE]

By invoking the fundamental theorem of calculus and implementing (1), this yields

[TABLE]

Applying the assumptions (F1) and (F2) we can bound the integrand from above and below, respectively. Indeed, the strong monotonicity (F2) implies that

[TABLE]

and therefore,

[TABLE]

Likewise, by invoking (F1) instead of (F2), we find that

[TABLE]

Combining the above bounds leads to (16). ∎

2.2. Contractivity

In order to state and prove the main result of this section, see Theorem 2.7 below, we impose a monotonicity condition on the sequence generated by the iterative linearization scheme (5).

(F4)

There is a constant $C_{\mathsf{H}}>0$ such that the sequence defined by (5) fulfils the bound

[TABLE]

where $\mathsf{H}$ is the potential of $\mathsf{F}$ introduced in (F3).

Proposition 2.4.

Suppose that (F1)–(F4) are satisfied. Furthermore, let the bilinear form $a(\cdot;\cdot,\cdot)$ from (7) be coercive and bounded, cf. (8) and (9), respectively. Then, for any $k\geq 1$ , the sequence $\{u^{n}\}_{n\geq 0}$ from (5) satisfies the estimate

[TABLE]

with

[TABLE]

Moreover, the contraction-like property

[TABLE]

holds true for any $n\geq 2$ .

Before turning to the proof of the above proposition, we establish an auxiliary result.

Lemma 2.5.

Consider a sequence $\{a_{j}\}_{j=1}^{\infty}\subset[0,\infty)$ which satisfies the estimate

[TABLE]

for some constant $c>0$ . Then, it holds the bound $a_{j}\leq c^{-1}(1+c)^{2-j}a_{1}$ , for any $j\geq 2$ .

Proof.

Let us define the sequence $b_{k}:=\sum_{j=k}^{\infty}a_{j}$ , $k\geq 1$ . Using (21), we note that

[TABLE]

for all $k\geq 1$ . By induction, this implies that $b_{2}\geq(c+1)^{k-2}b_{k}$ for any $k\geq 2$ . Therefore, we infer that

[TABLE]

Rearranging terms completes the proof. ∎

Proof of Proposition 2.4.

Let $n>k\geq 1$ be arbitrary. Then, we note the telescope sum

[TABLE]

Thus, by virtue of (17), we infer that

[TABLE]

We aim to bound the left-hand side. To this end, we employ Lemma 2.3, which implies that

[TABLE]

This, together with Lemma 2.1, leads to

[TABLE]

Combining (22) and (23) yields

[TABLE]

Letting $n\to\infty$ , we obtain (18). Moreover, upon setting $c:=C_{\ref{eq:sumbound}}^{-1}$ and $a_{j}:=\|u^{j}-u^{j-1}\|^{2}_{X}$ , $j\geq 1$ , the bound (18) takes the form (21). Hence, applying Lemma 2.5, we deduce (20). ∎

From (20) we immediately obtain the following result.

Corollary 2.6.

Under the assumptions of Proposition 2.4, it follows that $\left\|u^{n}-u^{n-1}\right\|_{X}$ is a null sequence as $n\to\infty$ .

2.3. Convergence

We are now ready to state and prove the main result of this section.

Theorem 2.7.

Suppose that (F1)–(F4) as well as (8) and (9) hold true. Then, the sequence $\{u^{n}\}_{n\geq 0}$ obtained from the iterative linearization procedure (5) converges to the unique solution $u^{\star}\in X$ of (1).

Proof.

Combining Lemma 2.1 and Corollary 2.6 directly implies the convergence of the linearized iteration scheme (5). ∎

Remark 2.8.

In the proof of Theorem 2.7 the application of Lemma 2.1 can be replaced by using [12, Proposition 2.1] instead. We note that the latter result does not require property (F1) to hold. Indeed, assume that (F2), (8) and (9) are satisfied, and that $u\mapsto a(u;u,\cdot)$ and $u\mapsto f(u)$ are continuous mappings from $X$ into its dual space $X^{\star}$ with respect to the weak topology on $X^{\star}$ . Then, if the sequence $\{u^{n}\}_{n\geq 0}$ defined by (5) satisfies $\|u^{n}-u^{n-1}\|_{X}\to 0$ as $n\to\infty$ , it converges to the unique solution $u^{\star}\in X$ of (1).

2.4. Some remarks on condition (F4)

Suppose that the assumptions (F1)–(F3) are satisfied, and consider the sequence $\{u^{n}\}_{n\geq 0}$ generated by the iteration (5). Analogously as in the proof of Lemma 2.3, for fixed $n\geq 1$ , we define the real-valued function $\varphi(t):=\mathsf{H}(u^{n-1}+t(u^{n}-u^{n-1}))$ , for $t\in[0,1]$ . Then, it holds the identity

[TABLE]

Using (4) and (7), we note that

[TABLE]

Hence,

[TABLE]

Consequently, if the bilinear form $a(u;\cdot,\cdot)$ , for any given $u\in X$ , is uniformly coercive with constant $\alpha>\nicefrac{{L_{\mathsf{F}}}}{{2}}$ , cf. (8), where $L_{\mathsf{F}}$ refers to the Lipschitz constant occurring in (F1), then we obtain that

[TABLE]

i.e. (17) is satisfied with $C_{\mathsf{H}}=\alpha-\nicefrac{{L_{\mathsf{F}}}}{{2}}>0$ .

Proposition 2.9.

If $\mathsf{F}$ satisfies (F1)–(F3), and the bilinear form $a(\cdot;\cdot,\cdot)$ from the unified iteration scheme (5) is coercive with coercivity constant $\alpha>\nicefrac{{L_{\mathsf{F}}}}{{2}}$ , cf. (8), then (F4) holds true.

Remark 2.10.

For the Zarantonello iteration scheme (10) we note that $a(u;v,w)=\delta^{-1}(v,w)_{X}$ , for $u,v,w\in X$ , in (6). Then, we have that

[TABLE]

which, upon using Proposition 2.9, shows that (F4) is satisfied for any $\delta\in(0,\nicefrac{{2}}{{L_{\mathsf{F}}}})$ . Under suitable assumptions, a similar observation can be made for the Newton method (12) provided that the damping parameter $\delta(u^{n})$ is chosen sufficiently small; cf. [12, Theorem 2.6].

Remark 2.11.

The above Proposition 2.9 delivers a sufficient condition for (F4). We note, however, that it is not necessary. In particular, if the coercivity constant $\alpha$ in (8) is much smaller than the Lipschitz constant $L_{\mathsf{F}}$ from (F1), then the bound on $\alpha$ in Proposition 2.9 is violated. Nonetheless, in that case, we can still satisfy (17) by imposing alternative assumptions; cf., e.g., (K2) in [12].

3. Adaptive ILG Discretizations

In this section, following the recent approach [10], we will present an adaptive ILG algorithm that exploits an interplay of the unified iterative linearization procedure (5) and abstract adaptive Galerkin discretizations thereof, cf. (14). Moreover, we will establish the (linear) convergence of the resulting sequence of approximations to the unique solution of (1), and comment on the uniform boundedness of the iterative linearization steps on each discrete space. We proceed along the ideas of [10, §4 and §5], and generalize those results to the abstract framework considered in the current paper. Throughout this section, we will assume that any iterative linearization is of the form (7), with (8) and (9) being satisfied.

3.1. Abstract error estimators

We generalize the assumptions on the finite element refinement indicator from [10, §4]. Let us consider a sequence of hierarchical finite dimensional Galerkin subspaces $\{X_{N}\}_{N\geq 0}\subset X$ , i.e.

[TABLE]

Suppose that, for any $N\geq 0$ , there is a computable error estimator

[TABLE]

which satisfies the following two properties:

(A1)

For all $u,v\in X_{N}$ it holds that

[TABLE] 2. (A2)

The error of the discrete solution $u_{N}^{\star}\in X_{N}$ from (13) is controlled by the a posteriori error bound

[TABLE]

where $u^{\star}\in X$ is the exact solution of (1).

Here, $C_{\ref{eq:a1}},C_{\ref{eq:a2}}\geq 1$ are two constants.

The following result shows that the two estimators for ${u}_{N}^{n}$ and $u_{N}^{\star}$ are equivalent once the linearization error is small enough.

Lemma 3.1.

Suppose that $\mathsf{F}$ satisfies (F1)–(F2), and that the a posteriori estimator fulfils (A1). Furthermore, for some $n\geq 1$ , assume that

[TABLE]

with a constant $\lambda\in(0,C_{\ref{eq:clambda}}^{-1})$ , where

[TABLE]

Then, we have that

[TABLE]

Moreover, the two error estimators $\eta_{N}({u}_{N}^{n})$ and $\eta_{N}(u_{N}^{\star})$ are equivalent in the sense that

[TABLE]

Proof.

Owing to Lemma 2.1 and Remark 2.2, and due to (28), it holds that

[TABLE]

Invoking the Lipschitz continuity (A1), we obtain

[TABLE]

Since $\lambda<C_{\ref{eq:clambda}}^{-1}$ we have that $\lambda C_{\ref{eq:discreteerrorestimate}}C_{\ref{eq:a1}}=\lambda C_{\ref{eq:clambda}}<1$ . Hence, manipulating the above inequality yields

[TABLE]

Combining the two inequalities (32) and (33) gives the bound (30). Moreover, applying (A1), as before, and using (32), we infer that

[TABLE]

Similarly, employing (33), it follows that

[TABLE]

This completes the argument. ∎

3.2. Adaptive ILG algorithm

We focus on the adaptive algorithm from [10], which was studied in the context of finite element discretizations of the Zarantonello iteration (10). It is closely related to the general adaptive ILG scheme in [12]. The key idea is the same in both algorithms: On a given Galerkin space, we iterate the linearization scheme (14) as long as the linearization error dominates. Once the ratio of the linearization error and the a posteriori error bound is sufficiently small, we enrich the Galerkin space in a suitable way.

Remark 3.2.

We emphasize that we do not know (a priori) if the while loop of Algorithm 1 always terminates after finitely many steps. Moreover, it may happen that $\eta_{N}(u_{N})=0$ , for some $N\geq 0$ , i.e. the algorithm terminates. Let us provide two comments on this issue:

(a)

Suppose that there is an enrichment $X_{N}$ of $X_{0}$ generated by the above Algorithm 1 such that $\Xi^{n}_{N}>\lambda\Upsilon^{n}_{N}$ for all $n\geq 0$ ; in this situation, the while loop will never end. Given the assumptions of Proposition 2.4, it follows from Corollary 2.6 that $\Xi^{n}_{N}\to 0$ as $n\to\infty$ . In addition, by virtue of Theorem 2.7 (applied to the discrete setting (13) and (14)), we have that ${u}_{N}^{n}\to u_{N}^{\star}$ as $n\to\infty$ . Then, invoking the reliability (A2) and the continuity (A1), we conclude that

[TABLE]

It follows that $u^{\star}=u_{N}^{\star}$ , and therefore ${u}_{N}^{n}\to u^{\star}$ as $n\to\infty$ . In particular, Algorithm 1 will generate an approximate solution which, for sufficiently large $n$ , is arbitrarily close to the exact solution of (1). 2. (b)

If, for some $n,N\in\mathbb{N}$ , the while loop terminates, then we have the bound $\Xi^{n}_{N}\leq\lambda\Upsilon^{n}_{N}$ . Thus, in the special situation where $\Upsilon^{n}_{N}=\eta_{N}(u_{N}^{n})=0$ , we directly obtain that $\Xi^{n}_{N}=0$ . Then, employing Lemma 2.1 and Remark 2.2, we find that $\left\|u_{N}^{\star}-{u}_{N}^{n}\right\|_{X}\leq C_{\ref{eq:discreteerrorestimate}}\|{u}_{N}^{n}-{u}_{N}^{n-1}\|_{X}=0$ , i.e. $u_{N}^{\star}={u}_{N}^{n}$ . Consequently, recalling (A2), we deduce that

[TABLE]

We obtain that ${u}_{N}^{n}=u^{\star}$ , i.e. the exact solution of (1) is found.

3.3. Perturbed contractivity

We will now turn to the proof of the convergence of Algorithm 1. More precisely, we will show that the sequence $u_{N}$ generated by the above ILG procedure converges, under certain assumptions, to the exact solution $u^{\star}$ of (1). In view of Remark 3.2 we may assume that the while loop always terminates after finitely many steps with $\eta_{N}(u_{N})>0$ for all $N\geq 0$ .

We begin with the following result, which corresponds to [10, Proposition 4.10]. Since we consider general Galerkin discretizations, an additional assumption, cf. the perturbed contraction (34) below, is imposed.

Proposition 3.3.

Let (F1)–(F2) and (A1) be satisfied, and $\lambda\in(0,C_{\ref{eq:clambda}}^{-1})$ be given. Moreover, for each $N\geq 0$ , assume that the while loop of Algorithm 1 terminates after finitely many steps, thereby yielding an output $u_{N}\in X_{N}$ , with $\eta_{N}(u_{N})>0$ . Furthermore, suppose that there are constants $0<q_{\ref{eq:contractionperb}}<1$ and $C_{\ref{eq:contractionperb}}>0$ such that it holds the perturbed contraction bound

[TABLE]

where $u_{N}^{\star}\in X_{N}$ is the unique solution of (13). Then, we have that $\eta_{N}(u_{N})\to 0$ as $N\to\infty$ .

Proof.

Set $X_{\infty}:=\overline{\bigcup_{N\geq 0}X_{N}}$ , and denote by $u_{\infty}^{\star}\in X_{\infty}$ the solution of the weak formulation

[TABLE]

For any $N\geq 0$ , Galerkin orthogonality reads $\left<\mathsf{F}(u_{\infty}^{\star})-\mathsf{F}(u_{N}^{\star}),v\right>_{X^{\star}\times X}=0$ for all $v\in X_{N}$ . Thus, by the Lipschitz continuity (F1) and strong monotonicity (F2) we find that

[TABLE]

for any $v\in X_{N}$ . This results in the Céa type estimate

[TABLE]

Recalling the nestedness (24) of the Galerkin spaces, and exploiting the definition of $X_{\infty}$ , the above bound (35) directly implies that $u_{N}^{\star}\to u_{\infty}^{\star}$ for $N\to\infty$ . Consequently, we deduce that $\left\|u_{N+1}^{\star}-u_{N}^{\star}\right\|_{X}\to 0$ for $N\to\infty$ . Hence, by (34), the estimator $\eta_{N}(u_{N}^{\star})^{2}$ , $N\geq 0$ , is contractive up to a non-negative perturbation which tends to 0. This implies that $\eta_{N}(u_{N}^{\star})\to 0$ as $N\to\infty$ , see, e.g., [3, Lemma 2.3]. Since $u_{N}=u_{N}^{n}$ satisfies $\left\|{u}_{N}^{n}-{u}_{N}^{n-1}\right\|_{X}\leq\lambda\eta_{N}({u}_{N}^{n})$ by construction of Algorithm 1, Lemma 3.1 yields the equivalence of $\eta_{N}(u_{N}^{\star})$ and $\eta_{N}(u_{N})$ . Hence, we conclude that $\eta_{N}(u_{N})\to 0$ as $N\to\infty$ . ∎

Remark 3.4.

We emphasize that the perturbed contraction property (34) is satisfied, for instance, in the special case of the finite element method; see [10] for details.

Combining the above Proposition 3.3 and Lemma 3.1 leads to the following result.

Corollary 3.5.

Given the same assumptions as in Proposition 3.3 and, additionally, (A2), then $u_{N}\to u^{\star}$ for $N\to\infty$ , where the sequence $\{u_{N}\}_{N\geq 0}$ is generated by the Algorithm 1, and $u^{\star}$ is the unique solution of (1).

Proof.

Let $u_{N}={u}_{N}^{n}\in X_{N}$ , $n\geq 1$ , be the output of Algorithm 1 based on the Galerkin space $X_{N}$ . Then, by virtue of (30), and due to (A2) and (31), we have

[TABLE]

with

[TABLE]

Applying Proposition 3.3 completes the proof. ∎

3.4. Linear convergence

In this section we show the linear convergence of the output sequence $\{u_{N}\}_{N\geq 0}$ generated by Algorithm 1. Our analysis follows closely the work [10, Theorem 5.3]. Again, we formulate and prove the result within a more general setting, and, for this purpose, under the additional assumption (34) as before.

Letting

[TABLE]

with $\nu>0$ from (F2), we introduce the quantity

[TABLE]

where $u^{\star}\in X$ and $u_{N}^{\star}\in X_{N}$ are the (unique) solution of (1) and its Galerkin approximation from (13), respectively. By virtue of Lemma 2.3, provided that (F1)–(F3) hold, we observe that

[TABLE]

for any $N\geq 0$ .

Theorem 3.6.

Let $\mathsf{F}$ satisfy (F1)–(F3), and assume (A1)–(A2). Furthermore, suppose that there are constants $0<q_{\ref{eq:contractionperb}}<1$ and $C_{\ref{eq:contractionperb}}>0$ such that (34) holds true. Then, upon setting

[TABLE]

with $\gamma$ from (38), and with $\lambda\in(0,C_{\ref{eq:clambda}}^{-1})$ , the following contraction property holds: If the while loop of Algorithm 1 terminates after finitely many steps with $\eta_{N}(u_{N})>0$ , for all $N\geq 0$ , then we have the (linear) contraction property

[TABLE]

Moreover, there exists a constant $C_{\ref{eq:linearconvergence}}>0$ such that

[TABLE]

i.e. the error estimators decay at a linear rate.

Proof.

Given any integers $N\geq K\geq 0$ , and corresponding Galerkin subspaces $X_{K}\subseteq X_{N}\subset X$ . Then, using Lemma 2.3, with $X$ being replaced by $X_{N}$ , we have that

[TABLE]

Furthermore, recalling (39), and using the perturbed contraction assumption (34), we obtain that

[TABLE]

Invoking (43), and applying the definition of $\gamma$ from (38), we arrive at

[TABLE]

Here, owing to the reliability assumption (A2), and upon implementing (16), we note that

[TABLE]

Thus, it follows that

[TABLE]

Noticing that $1-2\gamma(q_{\ref{eq:qDelta}}-q_{\ref{eq:contractionperb}})L_{\mathsf{F}}^{-1}C_{\ref{eq:a2}}^{-2}=q_{\ref{eq:qDelta}}$ , yields (41). Hence, by induction, it holds that $\Delta_{N+K}\leq q_{\ref{eq:qDelta}}^{K}\Delta_{N}$ . From this inequality, together with (31) and the fact that $\mathsf{H}(u_{N+K}^{\star})-\mathsf{H}(u^{\star})\geq 0$ , cf. (16), we conclude that

[TABLE]

Employing again (16) and making use of the reliability condition (A2), this leads to

[TABLE]

Therefore, once again applying (31), we deduce (42). ∎

3.5. Uniform bound for the number of linearization steps

Given that the while loop of Algorithm 1 terminates after finitely many steps for all $N\geq 0$ , we will show that the number of iterative linearization steps (14) on each Galerkin space $X_{N}$ , which will be denoted by $\#\mathrm{It}(N)$ , can be (uniformly) bounded. As before we assume that $\eta_{N}(u_{N})>0$ for all $N\geq 0$ . The following result is in particular [10, Proposition 4.6], however, we will adapt the proof to the effect that it applies for the contraction-like property (20) (instead of the contraction of the Zarantonello iteration exploited in [10]).

Proposition 3.7.

Suppose (F1)–(F4) and (A1)–(A2). Let $\lambda\in(0,C_{\ref{eq:clambda}}^{-1})$ be the adaptivity parameter from Algorithm 1. Suppose that the while loop of Algorithm 1 terminates after finitely many steps with $\eta_{N}(u_{N})>0$ for all $N\geq 0$ . Then, the number of iterative linearization steps $\#\mathrm{It}(N)$ on $X_{N}$ satisfies the estimate

[TABLE]

for all $N\geq 1$ , where

[TABLE]

Proof.

We split the proof into two parts.

Part 1: Let

[TABLE]

and choose $n\in\mathbb{N}$ , $n\geq 2$ , minimal such that

[TABLE]

We denote by $k:=\#\mathrm{It}(N)$ the number of linearization steps on $X_{N}$ , and aim to show that $k\leq n$ . To this end, we assume by contradiction that $k>n>1$ . By definition of Algorithm 1, $k\geq 1$ is the minimal number such that $\left\|{u}_{N}^{k}-{u}_{N}^{k-1}\right\|_{X}\leq\lambda\eta_{N}({u}_{N}^{k})$ . Invoking (20), we find that

[TABLE]

Let us bound $\left\|{u}_{N}^{1}-{u}_{N}^{0}\right\|_{X}^{2}$ . From (F4) and (16) we obtain

[TABLE]

Hence, we infer the bound

[TABLE]

Since $u_{N}^{0}=u_{N-1}$ , we apply (36) to deduce that

[TABLE]

Therefore, we arrive at

[TABLE]

Invoking the stability (A1), we find that

[TABLE]

Restricting (20) to $X_{N}$ and shifting indices, yields that

[TABLE]

Combining the previous inequalities leads to

[TABLE]

Inserting this inequality into (45) yields

[TABLE]

A straightforward manipulation leads to

[TABLE]

which contradicts the minimality of $k$ , wherefore it must hold that $k\leq n$ .

Part 2: Set

[TABLE]

We show that

[TABLE]

where $\lceil M\rceil$ denotes the smallest integer larger than or equal to $M$ . We verify that

[TABLE]

This proves the lower bound in (46). In a similar way, the upper bound is obtained. Indeed, multiplying (47) by $\lambda$ , we have that

[TABLE]

Thence, a simple manipulation leads to

[TABLE]

which immediately implies the claim. In summary, using Part 1 of the proof, we conclude that $\#\mathrm{It}(N)=k\leq\lceil M\rceil$ . ∎

Remark 3.8.

We note that Proposition 3.7 does not assert a uniform bound for the number of linearization steps since the right-hand side of (44) depends on $N$ . For many sensible error estimators, however, it holds the contraction property

[TABLE]

for $N$ large enough, with a contraction constant $0<q_{\ref{eq:qest}}<1$ . Furthermore, we may assume a reliability and efficiency estimate:

[TABLE]

Then, combining (48), (49), and (31), we obtain $\eta_{N}(u_{N})\simeq\eta_{N-1}(u_{N-1})$ , and consequently, a uniform bound on the number of linearization steps on each Galerkin subspace $X_{N}$ is guaranteed.

Remark 3.9.

As was mentioned earlier, this result is one of the key parts in the computational complexity analysis of the ILG Algorithm 1. Indeed, following along the lines of [10, §6] and replacing [10, Proposition 4.6] by the above Proposition 3.7, the almost optimal computational work of Algorithm 1, under suitable assumptions, can be established in the context of finite element method discretizations.

4. Numerical experiments

In this section we test our ILG Algorithm 1 with two numerical experiments in the context of finite element discretizations of stationary conservation laws.

4.1. Model problem

On an open, bounded and polygonal domain $\Omega\subset\mathbb{R}^{2}$ , with Lipschitz boundary $\Gamma=\partial\Omega$ , let us consider the second-order elliptic partial differential equation

[TABLE]

Here, we choose $X:=H^{1}_{0}(\Omega)$ to be the standard Sobolev space of $H^{1}$ -functions on $\Omega$ with zero trace along $\Gamma$ ; the inner product and norm on $X$ are defined, respectively, by $(u,v)_{X}:=(\nabla u,\nabla v)_{L^{2}(\Omega)}$ and $\left\|u\right\|_{X}:=\|\nabla u\|_{L^{2}(\Omega)}$ , for $u,v\in X$ . We suppose that $g\in X^{\star}=H^{-1}(\Omega)$ in (50) is given, and the diffusion parameter $\mu\in C^{1}([0,\infty))$ fulfils the monotonicity property

[TABLE]

with constants $M_{\mu}\geq m_{\mu}>0$ . Under this condition the nonlinear operator $\mathsf{F}:\,H^{1}_{0}(\Omega)\to H^{-1}(\Omega)$ from (50) can be shown to satisfy (F1) and (F2), with $\nu=m_{\mu}$ and $L_{\mathsf{F}}=3M_{\mu}$ ; see [18, Proposition 25.26]. Moreover, $\mathsf{F}$ has a potential $\mathsf{H}:X\to\mathbb{R}$ given by

[TABLE]

where $\psi(s):=\nicefrac{{1}}{{2}}\int_{0}^{s}\mu(t)\,\mathsf{d}t$ , $s\geq 0$ . The weak form of the boundary value problem (50) in $X$ reads:

[TABLE]

In [12, Section 5.1] the convergence of the Zarantonello, Kačanov, and Newton iteration for the nonlinear boundary value problem (50) was examined.

4.2. Discretization and refinement indicator

For the sake of discretizing (52), and thereby of obtaining an ILG formulation for (50), we will use a conforming finite element framework. We consider a sequence of hierarchical, regular and shape-regular meshes $\{\mathcal{T}_{N}\}_{N\geq 1}$ that partition the domain $\Omega$ into open and disjoint triangles $T\in\mathcal{T}_{N}$ such that $\overline{\Omega}=\bigcup_{T\in\mathcal{T}_{N}}T$ . Moreover, we consider the finite element space

[TABLE]

where we signify by $\mathcal{P}_{1}(T)$ the space of all affine functions on $T\in\mathcal{T}_{N}$ . The mesh refinements in Algorithm 1 are obtained by means of the newest vertex bisection and the Dörfler marking strategy, see [15] and [7], respectively.

For an edge $e\subset\partial T^{+}\cap\partial T^{-}$ , which is the intersection of (the closures of) two neighbouring elements $T^{\pm}\in\mathcal{T}_{N}$ , we signify by $\left\llbracket\bm{v}\right\rrbracket|_{e}=\bm{v}^{+}|_{e}\cdot\bm{n}_{T^{+}}+\bm{v}^{-}|_{e}\cdot\bm{n}_{T^{-}}$ the jump of a (vector-valued) function $\bm{v}$ along $e$ , where $\bm{v}^{\pm}|_{e}$ denote the traces of the function $\bm{v}$ on the edge $e$ taken from the interior of $T^{\pm}$ , respectively, and $\bm{n}_{T^{\pm}}$ are the unit outward normal vectors on $\partial T^{\pm}$ , respectively. For $u\in X_{N}$ we define the local refinement indicator, for each $T\in\mathcal{T}_{N}$ , and the global error indicator, respectively, by

[TABLE]

This error estimator satisfies the assumptions (A1)–(A2) for the problem under consideration; we refer to [10, §8.3] for details.

4.3. Experiments

We revisit two experiments from [12], whereby we test the (modified) adaptive ILG Algorithm 1. We consider the L-shaped domain $\Omega=(-1,1)^{2}\setminus([0,1]\times[-1,0])$ , and start the computations with an initial mesh consisting of 192 uniform triangles. The procedure is run until the number of elements exceeds $10^{6}$ . Moreover, we will always choose the initial guess $u_{0}^{0}\equiv 0$ .

4.3.1. Smooth solution

We consider the nonlinear diffusion coefficient $\mu(t)=(t+1)^{-1}+\nicefrac{{1}}{{2}}$ , for $t\geq 0$ , and select $g$ in (50) such that the analytical solution of (52) is given by the smooth function $u^{\star}(x,y)=\sin(\pi x)\sin(\pi y)$ . It is straightforward to verify that $\mu$ fulfils the bounds (51) so that the assumptions (F1)–(F3) are satisfied. In addition, the convergence of the Zarantonello (10), Kačanov (11), and Newton (12) iterative linearization procedures are guaranteed (with the parameter $\delta=0.85$ and $\delta=1$ in case of the Zarantonello and Newton method, respectively); see [12]. A priori, for the Newton method, we remark that choosing the damping parameter $\delta=1$ (potentially resulting in quadratic convergence of the iterative linearization close to the solution) might lead to a divergent iteration for the given boundary value problem; for this reason, a prediction and correction strategy, which guarantees convergence (and which does not cause any correction of the damping parameter in the current experiments), is presented in [12, Remark 2.8].

In Figure 1 we plot the error estimators (solid lines) and true errors (dashed lines) of our three linearization schemes against the number of elements $|\mathcal{T}_{N}|$ in the triangulation. In addition, the dashed line without any markers is the graph of the function $|\mathcal{T}_{N}|^{-\nicefrac{{1}}{{2}}}$ . We observe the optimal convergence rate $\mathcal{O}(|\mathcal{T}_{N}|^{-\nicefrac{{1}}{{2}}})$ for both (almost) uniform and adaptive mesh refinements corresponding to the parameters $\theta_{*}=0$ and $\theta_{*}=0.5$ , respectively, in the Dörfler marking strategy, see [7, §4.2, Eq. (M∗)].

In Figure 2 we can observe the uniformly bounded number of linearization steps on a given mesh for any of the three considered fixed-point methods, and for both choices of the adaptivity parameter $\lambda=0.1$ and $\lambda=0.001$ in Algorithm 1. For the value $\lambda=0.1$ , the number of linearization steps on a given mesh does not differ essentially between the three considered iteration schemes. However, there is a remarkable difference for the choice $\lambda=0.001$ . The Newton iteration clearly outperforms the other two fixed-point methods, which is not surprising because of the local quadratic convergence regime. In addition, we can also observe that the Kačanov method is superior to the Zarantonello iteration in view of the number of linearization steps.

4.3.2. Nonsmooth solution

In our second experiment, we consider the nonlinear diffusion parameter $\mu(t)=1+\mathrm{e}^{-t}$ , for $t\geq 0$ . Again, it is easily seen that $\mu$ satisfies (51). Moreover, it can be shown that the three linearization schemes under consideration will converge for appropriate choices of the parameter $\delta$ , see [12]. We choose $g$ in (50) such that the analytical solution is given by

[TABLE]

where $r$ and $\varphi$ are polar coordinates. This is the prototype singularity for (linear) second-order elliptic problems with homogeneous Dirichlet boundary conditions in the L-shaped domain; in particular, we note that the gradient of $u^{\star}$ is unbounded at the origin. We let $\delta=0.5$ for the Zarantonello iteration, and use the damping parameter $\delta=1$ for the Newton method as in the experiment before.

For the choice $\theta_{*}=0.5$ in Dörfler’s marking procedure we retain the (almost) optimal convergence rate for both the error and the estimator. Due to the singularity, however, the convergence rate is reduced when the mesh is (almost) uniformly refined (i.e. corresponding to the value $\theta_{*}=0$ ). For the number of linearization steps on each Galerkin space, we can make the same observations as for the smooth case from before.

5. Conclusions

We have established a contraction-like property of the unified iteration scheme (5), which is key for the convergence analysis of the adaptive ILG Algorithm 1. In particular, we were able to generalize some of the results from [10] including the linear convergence of the general ILG procedure and the (uniform) boundedness of the number of linearization steps on each Galerkin space. We underline that the latter property constitutes an important stepping stone for the analysis of optimal computational complexity, cf. [10, §6 and §7].

Bibliography18

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. Amrein and T. P. Wihler, An adaptive Newton-method based on a dynamical systems approach , Commun. Nonlinear Sci. Numer. Simul. 19 (2014), no. 9, 2958–2973.
2[2] by same author, Fully adaptive Newton-Galerkin methods for semilinear elliptic partial differential equations , SIAM J. Sci. Comput. 37 (2015), no. 4, A 1637–A 1657.
3[3] M. Aurada, S. Ferraz-Leite, and D. Praetorius, Estimator reduction and convergence of adaptive BEM , Appl. Numer. Math. 62 (2012), no. 6, 787–801.
4[4] C. Bernardi, J. Dakroub, G. Mansour, and T. Sayah, A posteriori analysis of iterative algorithms for a nonlinear problem , J. Sci. Comput. 65 (2015), no. 2, 672–697.
5[5] C. Carstensen, M. Feischl, M. Page, and D. Praetorius, Axioms of adaptivity , Comput. Math. Appl. 67 (2014), no. 6, 1195–1253.
6[6] S. Congreve and T. P. Wihler, Iterative Galerkin discretizations for strongly monotone problems , Journal of Computational and Applied Mathematics 311 (2017), 457–472.
7[7] W. Dörfler, A convergent adaptive algorithm for Poisson’s equation , SINUM 33 (1996), 1106–1124.
8[8] L. El Alaoui, A. Ern, and M. Vohralík, Guaranteed and robust a posteriori error estimates and balancing discretization and linearization errors for monotone nonlinear problems , Comput. Methods Appl. Mech. Engrg. 200 (2011), no. 37-40, 2782–2795.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

On the Convergence of

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

Iterative linearization

The ILG approach

Goal of this paper

Outline

2. Iterative linearization

Lemma 2.1**.**

Proof.

Remark 2.2**.**

2.1. Potentials

Lemma 2.3**.**

Proof.

2.2. Contractivity

Proposition 2.4**.**

Lemma 2.5**.**

Proof.

Proof of Proposition 2.4.

Corollary 2.6**.**

2.3. Convergence

Theorem 2.7**.**

Proof.

Remark 2.8**.**

2.4. Some remarks on condition (F4)

Proposition 2.9**.**

Remark 2.10**.**

Remark 2.11**.**

3. Adaptive ILG Discretizations

3.1. Abstract error estimators

Lemma 3.1**.**

Proof.

3.2. Adaptive ILG algorithm

Remark 3.2**.**

3.3. Perturbed contractivity

Proposition 3.3**.**

Proof.

Remark 3.4**.**

Corollary 3.5**.**

Proof.

3.4. Linear convergence

Theorem 3.6**.**

Proof.

3.5. Uniform bound for the number of linearization steps

Proposition 3.7**.**

Proof.

Remark 3.8**.**

Remark 3.9**.**

4. Numerical experiments

4.1. Model problem

4.2. Discretization and refinement indicator

4.3. Experiments

4.3.1. Smooth solution

4.3.2. Nonsmooth solution

5. Conclusions

Lemma 2.1.

Remark 2.2.

Lemma 2.3.

Proposition 2.4.

Lemma 2.5.

Corollary 2.6.

Theorem 2.7.

Remark 2.8.

Proposition 2.9.

Remark 2.10.

Remark 2.11.

Lemma 3.1.

Remark 3.2.

Proposition 3.3.

Remark 3.4.

Corollary 3.5.

Theorem 3.6.

Proposition 3.7.

Remark 3.8.

Remark 3.9.