The Prager-Synge theorem in reconstruction based a posteriori error   estimation

Fleurianne Bertrand; Daniele Boffi

arXiv:1907.00440·math.NA·July 2, 2019·75 Years of Mathematics of Computation

The Prager-Synge theorem in reconstruction based a posteriori error estimation

Fleurianne Bertrand, Daniele Boffi

PDF

TL;DR

This paper reviews the Prager-Synge hypercircle method and its influence on a posteriori error estimation, focusing on the Braess-Schöberl estimator for the Poisson problem, and demonstrates convergence and optimality of adaptive finite element schemes.

Contribution

It provides a comprehensive review of the Prager-Synge theorem's application to a posteriori error estimation and proves convergence and optimality of related adaptive algorithms.

Findings

01

The Braess-Schöberl estimator effectively estimates errors in Poisson problems.

02

Adaptive finite element schemes based on these estimators converge.

03

The algorithms achieve optimal error reduction.

Abstract

In this paper we review the hypercircle method of Prager and Synge. This theory inspired several studies and induced an active research in the area of a posteriori error analysis. In particular, we review the Braess--Sch\"oberl error estimator in the context of the Poisson problem. We discuss adaptive finite element schemes based on two variants of the estimator and we prove the convergence and optimality of the resulting algorithms.

Equations208

div C ε (u) = f,

div C ε (u) = f,

(\nabla u, \nabla v) = (f, v) \forall v \in H_{0}^{1} (Ω) .

(\nabla u, \nabla v) = (f, v) \forall v \in H_{0}^{1} (Ω) .

{(σ, τ) + (div τ, u) = 0 (div σ, v) = - (f, v) \forall τ \in H (div; Ω) \forall v \in L^{2} (Ω) .

{(σ, τ) + (div τ, u) = 0 (div σ, v) = - (f, v) \forall τ \in H (div; Ω) \forall v \in L^{2} (Ω) .

{(σ, τ) + (div τ, u) = ⟨ τ \cdot n, g_{D} ⟩ ∣_{Γ_{D}} (div σ, v) = - (f, v) \forall τ \in H_{Γ_{D}} (div; Ω) \forall v \in L^{2} (Ω),

{(σ, τ) + (div τ, u) = ⟨ τ \cdot n, g_{D} ⟩ ∣_{Γ_{D}} (div σ, v) = - (f, v) \forall τ \in H_{Γ_{D}} (div; Ω) \forall v \in L^{2} (Ω),

⟨ τ \cdot n, g_{D} ⟩ ∣_{Γ_{D}} = \int_{Γ_{D}} g_{D} τ \cdot n d s .

⟨ τ \cdot n, g_{D} ⟩ ∣_{Γ_{D}} = \int_{Γ_{D}} g_{D} τ \cdot n d s .

(\nabla u, \nabla v) = (f, v) - ⟨ g_{N}, v ⟩ ∣_{Γ_{N}} \forall v \in u \in H_{Γ_{D}}^{1} (Ω),

(\nabla u, \nabla v) = (f, v) - ⟨ g_{N}, v ⟩ ∣_{Γ_{N}} \forall v \in u \in H_{Γ_{D}}^{1} (Ω),

(σ, σ^{*}) = - (div σ^{*}, u) = (f, u) = - (div σ, u) = (σ, σ)

(σ, σ^{*}) = - (div σ^{*}, u) = (f, u) = - (div σ, u) = (σ, σ)

(σ, σ - σ^{*}) = 0.

(σ, σ - σ^{*}) = 0.

σ^{''} = \nabla v .

σ^{''} = \nabla v .

(σ, σ^{''}) = (σ, \nabla v) = - (div σ, v) = (f, v)

(σ, σ^{''}) = (σ, \nabla v) = - (div σ, v) = (f, v)

(σ^{*}, σ^{''}) = (σ^{*}, \nabla v) = - (div σ^{*}, v) = (f, v),

(σ^{*}, σ^{''}) = (σ^{*}, \nabla v) = - (div σ^{*}, v) = (f, v),

(σ - σ^{*}, σ^{''}) = 0

(σ - σ^{*}, σ^{''}) = 0

(σ - σ^{''}, σ - σ^{*}) = 0,

(σ - σ^{''}, σ - σ^{*}) = 0,

∥ σ^{''} ∥ \leq ∥ σ ∥ \leq ∥ σ^{*} ∥,

∥ σ^{''} ∥ \leq ∥ σ ∥ \leq ∥ σ^{*} ∥,

∥\nabla u - \nabla v ∥^{2} + ∥\nabla u - σ^{*} ∥^{2} = ∥\nabla v - σ^{*} ∥^{2}

∥\nabla u - \nabla v ∥^{2} + ∥\nabla u - σ^{*} ∥^{2} = ∥\nabla v - σ^{*} ∥^{2}

(\nabla u_{h}, \nabla v_{h}) = (f, v_{h}) \forall v \in V_{h} .

(\nabla u_{h}, \nabla v_{h}) = (f, v_{h}) \forall v \in V_{h} .

∣\nabla u - \nabla u_{h} ∣ \leq ∥ q - \nabla u_{h} ∥,

∣\nabla u - \nabla u_{h} ∣ \leq ∥ q - \nabla u_{h} ∥,

div q^{Δ} = - Π^{k} f - div \nabla u_{h}

div q^{Δ} = - Π^{k} f - div \nabla u_{h}

[[q^{Δ} \cdot n]]_{E} = - [[\nabla u_{h} \cdot n]]_{E} \forall E \in E_{I},

R T^{Δ} (T) = {q \in R T^{k} (T) for all T \in T},

R T^{Δ} (T) = {q \in R T^{k} (T) for all T \in T},

R T^{k} (T) = {p \in P^{k + 1} (T) : p (x) = \hat{p} (x) + x \tilde{p}, \hat{p} \in (P^{k} (T))^{d}, \tilde{p} \in P^{k} (T)}

R T^{k} (T) = {p \in P^{k + 1} (T) : p (x) = \hat{p} (x) + x \tilde{p}, \hat{p} \in (P^{k} (T))^{d}, \tilde{p} \in P^{k} (T)}

ω_{ν} := ⋃ {T \in T : ν \mbox i s a v er t e x o f T} .

ω_{ν} := ⋃ {T \in T : ν \mbox i s a v er t e x o f T} .

1 \equiv ν \in V \sum ϕ_{ν} \mbox o n Ω.

1 \equiv ν \in V \sum ϕ_{ν} \mbox o n Ω.

q^{Δ} = ν \in V \sum ϕ_{ν} q^{Δ} = ν \in V \sum q_{ν}^{Δ},

q^{Δ} = ν \in V \sum ϕ_{ν} q^{Δ} = ν \in V \sum q_{ν}^{Δ},

⎩ ⎨ ⎧ div q_{ν}^{Δ} = - ((f + Δ u_{h}), ϕ_{ν})_{T} [[q_{ν}^{Δ} \cdot n]] = - ([[\nabla u_{h} \cdot n]], ϕ_{ν})_{E} q_{ν}^{Δ} \cdot n = 0 in each T \in ω_{ν} on each interior edge E of ω_{ν} on \partial ω_{ν} .

⎩ ⎨ ⎧ div q_{ν}^{Δ} = - ((f + Δ u_{h}), ϕ_{ν})_{T} [[q_{ν}^{Δ} \cdot n]] = - ([[\nabla u_{h} \cdot n]], ϕ_{ν})_{E} q_{ν}^{Δ} \cdot n = 0 in each T \in ω_{ν} on each interior edge E of ω_{ν} on \partial ω_{ν} .

η_{T}^{Δ} (u_{h}) = ∥ q^{Δ} (u_{h}) ∥_{0, T} η^{Δ} (u_{h}, T) = (T \in T \sum (η_{T}^{Δ} (u_{h}))^{2})^{1/2},

η_{T}^{Δ} (u_{h}) = ∥ q^{Δ} (u_{h}) ∥_{0, T} η^{Δ} (u_{h}, T) = (T \in T \sum (η_{T}^{Δ} (u_{h}))^{2})^{1/2},

η_{ν}^{\hexstar \hexagon} (u_{h}) = ∥ q_{ν}^{Δ} (u_{h}) ∥_{0, ω_{ν}} η^{\hexstar \hexagon} (u_{h}, T) = (T \in T \sum ν \in V_{T} \sum (η_{ν}^{\hexstar \hexagon} (u_{h}))^{2})^{1/2} .

η_{ν}^{\hexstar \hexagon} (u_{h}) = ∥ q_{ν}^{Δ} (u_{h}) ∥_{0, ω_{ν}} η^{\hexstar \hexagon} (u_{h}, T) = (T \in T \sum ν \in V_{T} \sum (η_{ν}^{\hexstar \hexagon} (u_{h}))^{2})^{1/2} .

∥∣ u - u_{ℓ} ∥ ∣^{2} \leq η^{2} (u_{l}, T) + osc_{T}^{2} (f)

∥∣ u - u_{ℓ} ∥ ∣^{2} \leq η^{2} (u_{l}, T) + osc_{T}^{2} (f)

η^{2} (u_{ℓ}, T) \leq ∥∣ u - u_{ℓ} ∥ ∣^{2} + osc_{T}^{2} (f)

η^{2} (u_{ℓ}, T) \leq ∥∣ u - u_{ℓ} ∥ ∣^{2} + osc_{T}^{2} (f)

R_{T} (v) = (f + Δ v) ∣_{T}

R_{T} (v) = (f + Δ v) ∣_{T}

J_{E} (v) = ([[\nabla v]] \cdot n) ∣_{E},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

The Prager–Synge theorem in reconstruction based a posteriori error estimation

Fleurianne Bertrand

Institut für Mathematik, Humboldt Universität zu Berlin, Unter den Linden 6, 10099 Berlin, Germany

[email protected]

and

Daniele Boffi

Dipartimento di Matematica “F. Casorati”, University of Pavia, Italy and Department of Mathematics and System Analysis, Aalto University, Finland

[email protected]

(Date: June 2019)

Abstract.

In this paper we review the hypercircle method of Prager and Synge. This theory inspired several studies and induced an active research in the area of a posteriori error analysis. In particular, we review the Braess–Schöberl error estimator in the context of the Poisson problem. We discuss adaptive finite element schemes based on two variants of the estimator and we prove the convergence and optimality of the resulting algorithms.

2010 Mathematics Subject Classification:

65N30, 65N50

The first author gratefully acknowledges support by the German Research Foundation (DFG) in the Priority Programme SPP 1748 Reliable simulation techniques in solid mechanics under grant number BE6511/1-1.

The second author is member of the INdAM Research group GNCS and his research is partially supported by IMATI/CNR and by PRIN/MIUR

1. Introduction

In this paper we review the hypercircle method introduced by Prager and Synge [PS47] and some of its consequences for the a posteriori analysis of partial differential equations. We believe that it is useful to discuss a paper that has been the object of several studies and has induced an active research in the area of a posteriori analysis of partial differential equations. On the one hand, it turns out that the hypercircle method is well appreciated by people working in the field, but less known by applied mathematicians with a less deep knowledge of a posteriori error analysis. On the other hand, we think that it is useful to discuss the consequences of the hypercircle method for a posteriori error analysis after some years of active research in the field, which has led in particular to a nowadays mature study of adaptive finite element schemes. The hypercircle method provides a natural way to get guaranteed upper bounds for the error associated to Galerkin approximations; the corresponding lower bounds are more difficult to obtain and have been widely investigate in the literature.

It is now interesting to address the question whether an error estimator based on the hypercircle technique provides an optimally convergent method when combined with an adaptive strategy. This topic is less studied (see [KS11, CN12]) and we shall see that the answer to this question is not immediate.

The hypercircle method, originally developed for elasticity problems, can be used for several examples of PDEs. Starting from the pioneer work of Ladevèze and Leguillon [LL83], the Prager–Synge idea has led to several applications to the finite element approximation of elliptic problems [AO93, DM99, RSS04, RSS07, BS08b, BPS09b, Bra09, Ver09, Voh10, Voh11, CZ12b, Kim12, CM13] and of problems in elasticity [Bra13, Zha06, BMS10]. Other examples of applications include discontinuous Galerkin approximation of elliptic problems [BFH14] or for convection-diffusion problems [ESV10]; finite element approximation of convection-diffusion and reaction-diffusion problems has been studied in [CFPV09, DEV13]. The Stokes problem and two phase fluid-flow have been considered in [HSV12, DPVY15]. An intense activity is related to multiscale and mortar elements [PVWW13, TW13] as well as to porous media and porous elasticity [MN17, RDPE*+*17, VY18]. Obstacle and contact problems have been studied in [BHS08, WW10, HW12]. Further examples of applications include Maxwell’s equations [CNT17], $hp$ finite elements [DEV16], and eigenvalue problems [CDM*+*17, LO13, BBS19]. An interesting unified approach is provided in [EV15a] where the $p$ -robustness of the error estimator is considered.

The hypercircle technique leads naturally to two methods: the so called gradient reconstruction (related to the construction of the function $\nabla v$ of Figure 1) and the equilibrated flux approach (related to the construction of the function $\boldsymbol{\sigma}^{*}$ of Figure 1).

We develop our study starting from the case of the Laplace operator and we shall focus on the equilibrated flux approach. More precisely, we are going to discuss what is generally known as Braess–Schöberl error estimator [BS08a]. For this estimator an a posteriori error analysis is well known which has been shown to be robust in the degree of the used polynomial [BS08a, BPS09a]. We refer the interested reader in particular to the nice unified framework presented in [EV15b] for more details on these results and for a complete survey of the use of equilibrated flux recovery in various applications.

The convergence analysis of the adaptive finite element method driven by non-residual error estimators has been performed in [KS11] and [CN12]. Both references start from the remark that it is not possible to expect in general a contraction property of the error and the estimator between two consecutive refinement levels. Since [KS11] is based on an assumption on the oscillations that might not be satisfied in our case (i.e., the oscillations are dominated by the estimator), in this paper we adopt the abstract setting of [CN12]. The Braess–Schöberl estimator is considered in [CN12, Section 3.5] where it is claimed that, up to oscillations, it is equivalent to the standard residual error estimator. We shall see that this property is not so immediate and that the consequence analysis has to be performed with particular care. In our paper we consider two variants of the Braess–Schöberl estimator: the first one is the most standard and it is based on single elements (we denote it by $\eta^{\Delta}$ ); the second one is more elaborate and is based on patches of elements (denoted by $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}{}}$ ). The estimator $\eta^{\Delta}$ has been introduced in [BS08a], while $\eta^{\mathrlap{{\small\hexstar}}}{\hexagon}{}$ has been considered in [BPS09a]. We are going to show that actually $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}{}}$ is equivalent, up to oscillations, to a residual estimator arranged on patches of elements (see Section 4). We could not prove an analogous result for the estimator $\eta^{\Delta}$ , which we analyze directly in Section 6. In both cases we have to pay attention to the appropriate definition and to the analysis of the oscillation terms. Oscillations are defined on patches of elements and the theory of [CN12] is modified accordingly. In turn, we present a clean theory where the convergence and the optimality of the adaptive schemes based on $\eta^{\Delta}$ and on $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}{}}$ is rigorously proved.

The structure of the paper is the following: in Section 2 we recall the main results of the Prager–Synge hypercircle theory [PS47], in Section 3 we review the equilibrated flux reconstruction by Braess and Schöberl [BS08a, BPS09a], in Section 4 we show the equivalence of the estimator $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}{}}$ with the standard residual one. We are then ready to recall in Section 5 the main ingredients of the theory of [CN12] and to apply it to the adaptive finite element method based on $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}{}}$ . Finally, Section 6 shows how to apply directly the theory of [CN12] to the estimator $\eta^{\Delta}$ without bounding it in terms of the standard residual estimator.

2. The Prager–Synge theory and its application to error estimates

We start this section by reviewing the main aspects of the hypercircle theory introduced by Prager and Synge in [PS47]. The theory was developed for the mixed elasticity equation: the problem under consideration was to seek $\mathfrak{u}\in{\bf H}^{1}_{\Gamma_{D}}({\Omega})$ with

[TABLE]

where $C$ is the linear relationship between the stress and the strain.

In this paper we deal with the Poisson problem where a simplified version of the Prager–Synge theory can be applied.

Given a polytopal domain $\Omega$ in $\mathbb{R}^{d}$ and $f\in L^{2}(\Omega)$ , our problem is to find $u\in H^{1}_{0}(\Omega)$ such that

[TABLE]

In this context, it is convenient to describe the Prager–Synge theory with the help of the mixed Laplacian equations. More precisely, let us consider the following problem: given $f\in L^{2}(\Omega)$ , find $u\in L^{2}(\Omega)$ and $\boldsymbol{\sigma}\in H({\operatorname{div}};\Omega)$ such that

[TABLE]

Problem (2.3) corresponds to the case of homogeneous boundary conditions on $u$ . Clearly, more general boundary conditions can be considered. For the sake of completeness, we write down explicitly the general formulation associated to mixed boundary conditions $u=g_{D}$ on $\Gamma_{D}$ and $\boldsymbol{\sigma}\cdot{\mathbf{n}}=g_{N}$ on $\Gamma_{N}$ , where $\partial\Omega$ is split in a Dirichlet part $\Gamma_{D}$ and in a Neumann part $\Gamma_{N}$ . Let $H_{\Gamma_{D}}({\operatorname{div}};\Omega)$ and $H_{\Gamma_{D},g}({\operatorname{div}};\Omega)$ denote the subspaces of vectorfields in $H({\operatorname{div}};\Omega)$ with normal component vanishing or equal to $g_{N}$ , respectively, on $\Gamma_{N}$ . Then the problem is: find $u\in L^{2}(\Omega)$ and $\boldsymbol{\sigma}\in H_{\Gamma_{D},g}({\operatorname{div}};\Omega)$ such that

[TABLE]

where the brackets in the first equation represent the duality pairing between $H^{1/2}(\Gamma_{D})$ and $H^{-1/2}(\Gamma_{D})$ which, in the case of smooth functions, can be interpreted as

[TABLE]

In this more general setting the analogue of (2.2) reads: find $u\in H^{1}_{\Gamma_{D},g}(\Omega)$ such that

[TABLE]

where $u\in H^{1}_{\Gamma_{D}}(\Omega)$ and $u\in H^{1}_{\Gamma_{D},g}(\Omega)$ denote the subspace of $H^{1}(\Omega)$ with boundary conditions on $\Gamma_{D}$ vanishing or equal to $g_{D}$ , respectively.

All the following theory could be stated in this general setting, but for the sake of readability we present it in the case when $\Gamma_{N}=\emptyset$ (so that $\Gamma_{D}=\partial\Omega)$ and $g_{D}=0$ .

The equilibrium condition. Let $\boldsymbol{\sigma}^{*}$ be any function in $H({\operatorname{div}};\Omega)$ satisfying the equilibrium equation ${\operatorname{div}}\boldsymbol{\sigma}^{*}=-f$ ; then it is easily seen that

[TABLE]

from which the following orthogonality is obtained

[TABLE]

Equation (2.4) says that $\boldsymbol{\sigma}$ lies on a hypersphere having $\boldsymbol{\sigma}^{*}$ for diameter. The center of the sphere is denoted by $K$ in Figure 1.

Gradients of $\mathbf{H^{1}_{0}(\Omega)}$ . Let now $\boldsymbol{\sigma}^{\prime\prime}$ be the gradient of any function $v$ in $H^{1}_{0}(\Omega)$

[TABLE]

It follows that

[TABLE]

and that

[TABLE]

which imply

[TABLE]

The orthogonality stated in (2.5) can be expressed by saying that $\boldsymbol{\sigma}$ and $\boldsymbol{\sigma}^{*}$ lie on the same hyperplane orthogonal to $\boldsymbol{\sigma}^{\prime\prime}$ .

Putting together the orthogonalities of Equations (2.4) and (2.5) leads to the conclusion that $\boldsymbol{\sigma}$ and $\boldsymbol{\sigma}^{*}$ lie on the hypercircle $\Gamma$ given by the intersection of the hypersphere defined by (2.4) and the hyperplane given by (2.5). Moreover, let $\widehat{\boldsymbol{\sigma}^{\prime\prime}}$ be the foot of $\boldsymbol{\sigma}^{\prime\prime}$ on the hyperplane; since $\boldsymbol{\sigma}-\boldsymbol{\sigma}^{*}$ is orthogonal to $\boldsymbol{\sigma}$ and $\widehat{\boldsymbol{\sigma}^{\prime\prime}}$ is the orthogonal projection of $\boldsymbol{\sigma}$ onto $\boldsymbol{\sigma}^{\prime\prime}$ , we have the following orthogonality

[TABLE]

which implies that the segment connecting $\boldsymbol{\sigma}^{*}$ to $\widehat{\boldsymbol{\sigma}^{\prime\prime}}$ is a diameter of the hypercircle $\Gamma$ . The center of this hypercircle is denoted by $C$ in Figure 1.

The conclusion of this construction, summarized in Figure 1, is an energy bound with constant one which we state in the following theorem.

Theorem 2.1.

Let $\boldsymbol{\sigma}$ be the second component of the solution to (2.3); let $\boldsymbol{\sigma}^{*}$ be any function in $H({\operatorname{div}};\Omega)$ which satisfies the equilibrium condition $\boldsymbol{\sigma}^{*}=-f$ in $\Omega$ and let $\boldsymbol{\sigma}^{\prime\prime}$ be the gradient of any function in $H^{1}_{0}(\Omega)$ . Then

[TABLE]

where $\widehat{\boldsymbol{\sigma}}^{\prime\prime}$ is the multiple of $\boldsymbol{\sigma}^{\prime\prime}$ lying in the hyperplane orthogonal to $\boldsymbol{\sigma}^{\prime\prime}$ and containing $\boldsymbol{\sigma}$ (see Figure 1).

We now state another important consequence of the previous geometrical construction which applies to problem (2.2) and which is usually referred to as Prager–Synge theorem.

Theorem 2.2.

Let $u$ be the solution of problem (2.2). Then it holds

[TABLE]

for all $v\in H^{1}_{0}(\Omega)$ and all $\boldsymbol{\sigma}^{*}\in H({\operatorname{div}};\Omega)$ satisfying the equilibrium condition ${\operatorname{div}}\boldsymbol{\sigma}^{*}=-f$ .

Proof.

From the orthogonalities defining the hypersphere and the hyperplane $(\boldsymbol{\sigma},\boldsymbol{\sigma}-\boldsymbol{\sigma}^{*})=(\boldsymbol{\sigma}-\boldsymbol{\sigma}^{*},\boldsymbol{\sigma}^{\prime\prime})=0$ if follows immediately $(\boldsymbol{\sigma}-\boldsymbol{\sigma}^{\prime\prime},\boldsymbol{\sigma}-\boldsymbol{\sigma}^{*})=0$ which gives the results with the identifications $\nabla u=\boldsymbol{\sigma}$ and $\nabla v=\boldsymbol{\sigma}^{\prime\prime}$ . ∎

The Prager-Synge theorem has been used in order to obtain error estimates in various contexts, starting from [LL83]. We describe the application of Theorem 2.2 in the case of the conforming finite element approximation of problem (2.2). Let $V_{h}$ be a finite dimensional subspace of $H^{1}_{0}(\Omega)$ and consider the discrete problem: find $u_{h}\in V_{h}$ such that

[TABLE]

We are going to consider a standard conforming $V_{h}$ , so that $u_{h}\in\mathcal{P}^{k}(\mathcal{T})$ , the space of continuous piecewise polynomials of degree less than or equal to $k$ .

A direct application of Theorem 2.2 with $v=u_{h}$ and $\boldsymbol{\sigma}^{*}=\mathbf{q}$ gives

[TABLE]

where $\mathbf{q}$ is any function in $H({\operatorname{div}};\Omega)$ with ${\operatorname{div}}\mathbf{q}=-f$ in $\Omega$ . It turns out that the right hand side in (2.8) is a reliable error estimator with constant one. Clearly, this fundamental idea leads to a viable approach only if it is possible to construct $\mathbf{q}$ in a practical way. This is what is generally called equilibrated flux reconstruction.

*Remark 2.3**.*

In the case when $f$ is piecewise polynomial, a possible (not practical) definition of $\mathbf{q}$ could be obtained by solving an approximation of the mixed problem (2.3), so that $\mathbf{q}$ is a discretization of $\boldsymbol{\sigma}$ . If $f$ is a generic function, a standard oscillation term will show up. A smart modification of this intuition is behind the Braess-Schöberl construction presented later in this paper.

Ainsworth and Oden in [AO00, Chap. 6.4] show that $\mathbf{q}$ can be efficiently constructed by solving local problems. Let $\mathcal{E}_{I}$ be the set of the interior edges of a shape-regular triangulation $\mathcal{T}$ . We will also denote by $\mathcal{E}_{B}$ the set of the boundary edges. In the case when problem (2.7) is solved with polynomials of degree $k$ , the reconstruction proposed in [AO00] seeks $\mathbf{q}^{\Delta}=\mathbf{q}-\nabla u_{h}$ such that

[TABLE]

where $\Pi^{k}f$ denotes the $L^{2}$ projection of $f$ onto polynomials of degree $k$ .

3. The Braess–Schöberl construction

In [BS08a] Braess and Schöberl show how to realize the above conditions (2.9a) and (2.9b) by exploiting some basic properties of the Raviart–Thomas finite element spaces. The resulting estimator is commonly called the Braess–Schöberl error estimator.

The local problems can be solved on patches around vertices of the mesh. The construction has been extended to different problems and geometrical configurations, thus allowing for a very powerful and general equilibration procedure. The reconstruction aims at defining $\mathbf{q}^{\Delta}$ in the broken Raviart–Thomas space of order $k$ , that is

[TABLE]

where the Raviart–Thomas element is given by

[TABLE]

and $\mathbb{P}^{k}(K)$ denotes the space of polynomials of degree at most $k$ on the domain $K$ . Clearly, since $u_{h}\in\mathcal{P}^{k}(\mathcal{T})$ , we will have that $\mathbf{q}=\mathbf{q}^{\Delta}-\nabla u_{h}$ belongs to $RT^{k}(\mathcal{T}):=RT^{\Delta}(\mathcal{T})\cap H({\operatorname{div}},\Omega)$ by virtue of the jump conditions (2.9b).

The Braess-Schöberl reconstruction is performed as follows. Let $\mathcal{V}$ denote the set of vertices of the triangulation, $\nu\in\mathcal{V}$ a vertex, and $\omega_{\nu}$ the patch of elements sharing the vertex $\nu$

[TABLE]

Let $\phi_{\nu}$ be the continuous piecewise linear Lagrange function with $\phi_{\nu}(\nu)=1$ and whose support is $\omega_{\nu}$ , (that is, the hat function equal to one at the node $\nu$ ), so that the following partition of unity property holds

[TABLE]

Hence $\mathbf{q}^{\Delta}$ can be decomposed into functions living on vertex patches, i.e.

[TABLE]

where $\text{supp}(\mathbf{q}^{\Delta}_{\nu})=\omega_{\nu}$ and $\mathbf{q}^{\Delta}_{\nu}\cdot{\mathbf{n}}=0\text{ on }\partial\omega_{\nu}$ .

Since each facet belongs to two elements the conditions (2.9a) and (2.9b) mean that the function $\mathbf{q}^{\Delta}_{\nu}$ has to fulfill

[TABLE]

It is common to use a notation where the dependence on the discrete solution $u_{h}$ is made explicit, so that in general we are going to denote the reconstruction by $\mathbf{q}^{\Delta}(u_{h})$ or its contribution coming from a patch $\mathbf{q}^{\Delta}_{\nu}(u_{h})$ .

Two options are now given for the design of an error indicator based on the above reconstruction. The first one, introduced in [BS08a], considers directly the quantity $\mathbf{q}^{\Delta}(u_{h})$ on each single element

[TABLE]

while the second on, presented in [BPS09a], is based on patches of elements

[TABLE]

The estimators $\eta^{\Delta}_{T}(u_{h})$ and $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}}_{\nu}(u_{h})$ are clearly not equivalent. People usually tend to consider $\eta^{\Delta}$ as the standard Breass–Schöberl estimator, but it is clear that for the analysis sometimes $\eta^{\mathrlap{{\small\hexstar}}}{\hexagon}{}$ may be more convenient.

An a posteriori analysis for both estimators is available in the sense that both satisfy a global reliability

[TABLE]

and a global efficiency

[TABLE]

up to oscillations (see, in particular, [Bra13, Theorems 9.4 and 9.5], and [BS08a, BPS09a]). The definition of the oscillation terms need particular attention. We shall comment on that in the next sections.

Explicit formulas in the case $d=2$ for the computation of $\mathbf{q}^{\Delta}_{\nu}$ are given in [BKMSa]. The direct construction is extended to $d=3$ in [CZ12a].

4. Equivalence with the residual error estimator

In this section we are going to show that, up to an oscillation term, the estimator $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}{}}$ is equivalent to an estimator based on the standard residual error estimator.

A crucial step for the analysis of the convergence of the adaptive scheme based on the Braess–Schöberl error estimator is its local equivalence with a standard residual error estimator. This fact has been observed (without rigorous proof) in [CN12, Section 3.5] and it has been used (without oscillations) in [KS11, Equation 2.17]. The interested reader is referred to [CFPP14, Section 8] for a more elaborate discussion about the equivalence between residual and non-residual error estimators.

The standard residual estimator for Laplace equation is based on two contributions: the element and jump residuals

[TABLE]

where $T$ is an element of the triangulation $\mathcal{T}$ and $E$ is a facet in the set of facets $\mathcal{E}$ . The residual estimator for $T\in\mathcal{T}$ then reads

[TABLE]

where $J_{\partial T}(u_{h})$ is viewed as a piecewise function over $\partial T$ and where as usual $h_{T}$ denotes the diameter of the element $T$ .

It is well known that the error estimator defines a functional $R(u_{h})\in(H_{0}^{1}(\Omega))^{\prime}$ as follows

[TABLE]

The global residual error estimator on a triangulation $\mathcal{T}$ is usually defined by adding up the local contributions

[TABLE]

Unfortunately, no equivalence holds in general between $\eta^{\mathrlap{{\small\hexstar}}}{\hexagon}{}(u_{h},T)$ and $\eta_{res}(u_{h},T)$ ; a crucial difference between the two estimators is that if an element $T$ belonging to the patch $\omega_{\nu}$ is refined and the discrete solution $u_{h}$ doesn’t change, then the error is not reduced, but the estimator $\eta_{res}(u_{h},T)$ decreases because of the reduction of the mesh-size; on the other hand, $\eta^{\mathrlap{{\small\hexstar}}}{\hexagon}{}(u_{h},\omega_{\nu})$ may not decrease since it is based on the equilibration procedure that might generate a reconstruction that is not different from the one computed on the coarser mesh.

An interesting alternative, described in [BPS09a] for piecewise constant $f$ , consists in building a residual error estimator which is based on element patches, so that the comparison with $\eta^{\mathrlap{{\small\hexstar}}}{\hexagon}{}$ is more natural. This leads, for every node $\nu$ with corresponding Lagrangian function $\phi_{\nu}$ , to the following definition

[TABLE]

We denote the corresponding global estimator by

[TABLE]

with

[TABLE]

The next lemma states the local equivalence between $\eta^{\mathrlap{{\small\hexstar}}}{\hexagon}{}$ and the patchwise residual estimator $\tilde{\eta}^{res}$ .

Lemma 4.1.

Let $u_{h}$ be the solution of the variational formulation (2.7) and consider a node $\nu$ of the triangulation $\mathcal{T}$ . Then, it holds

[TABLE]

up to the oscillation term $\sum\limits_{T\in\omega_{\nu}}\|h_{T}(id-\Pi^{k-1}_{T})(f)\|_{T}$ , that is

[TABLE]

Proof.

Let us start with the upper bound (4.9a). When $f$ is piecewise polynomial of degree ${k-1}$ , from [BPS09a, Theorem 7] we have

[TABLE]

and thus, using standard scaling arguments,

[TABLE]

If now $f$ is a generic function in $L^{2}(\Omega)$ , then the first term in (4.10) transforms into

[TABLE]

so that it remains to show that

[TABLE]

Indeed

[TABLE]

and

[TABLE]

since $\max\limits_{\mathbf{x}\in T}(\phi_{\nu}(\mathbf{x}))=1$ . This implies

[TABLE]

Let us now show how to prove the lower bound (4.9b). Recall that (3.5) implies

[TABLE]

for any $v\in H^{1}(\omega_{\nu})$ satisfying either zero boundary conditions or $(v,1)_{\omega_{\nu}}=0$ in the case when $\nu$ is an internal node. Now, take

[TABLE]

with $\tilde{v}$ is defined as follows

[TABLE]

where $\phi^{3}_{T}$ denotes the cubic Lagrange bubble function corresponding to the barycenter of $T$ and $\phi^{k+2}_{E}$ one of the Lagrange functions of degree $k+2$ associated to the edge $E$ . Since the norm of $v$ is bounded, we have

[TABLE]

Moreover, we have that

[TABLE]

By inserting the expression for $\tilde{v}$ and by evaluating the different terms separately we finally obtain

[TABLE]

∎

5. Optimal convergence rate for $\eta^{\mathrlap{{\small\hexstar}}}{\hexagon}{}$

In this section we recall the abstract theory developed in [CN12] for the analysis of AFEM formulations where nonresidual estimators are used and we show how to use it for the analysis of the AFEM based on the Braess–Schöberl error estimator. The interested reader is referred to [CN12, Sections 4–6] for all details of the theory. The main results, stated in Theorems 5.1 and 6.6, are the contraction property for the total error (which guarantees the convergence of the AFEM procedure) and the quasioptimality of the rate of convergence in terms of number of degrees of freedom.

As usual when dealing with adaptive schemes, we use a notation that takes into account the levels of refinement instead of the mesh size. We denote by $\mathcal{T}_{0}$ the initial triangulation of ${\Omega}$ and by $u_{\ell}$ the discretization of $u$ on the triangulation $\mathcal{T}_{\ell}$ obtained from $\mathcal{T}_{0}$ after $\ell$ refinements. For some of the remaining notation we will adopt the one from [CN12].

Contraction property. If $u$ is the solution of problem (2.2) and $u_{j}$ is the solution of the corresponding discrete problem after $j$ refinements, the contraction property states the existence of constants $\gamma>0$ , $0<\alpha<1$ , and $\mathcal{J}\in\mathbb{N}$ such that

[TABLE]

where the norm ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}$ denotes the $H^{1}$ -seminorm (equivalent to the norm in $H^{1}_{0}(\Omega)$ ). The main difference with respect to the standard contraction property commonly used in this context is that in general there might not be a contraction between two consecutive refinement levels $j$ and $j+1$ , but contraction is guaranteed every $\mathcal{J}$ levels.

Quasioptimal decay rate. The quasioptimality in terms of degrees of freedom is described as usual in the framework of approximation classes. The triple $(u,f,\mathbf{D})$ , of the solution, the right hand side, and the other data of problem (2.2), is in the approximation class $\mathbb{A}_{s}$ if

[TABLE]

where the total error $\sigma(N;v,f,\mathbf{D})$ , in the set $\mathbb{T}_{N}$ of conforming triangulations generated from $\mathcal{T}_{0}$ with at most $N$ elements more than $\mathcal{T}_{0}$ , is defined as

[TABLE]

With this notation, the quasioptimal decay rate is expressed by the following formula

[TABLE]

where the constant $C$ is independent of $j$ . We refer the interested reader to [CN12] for more detail on the constant $C$ , especially for its dependence on $s$ . Clearly, $C$ will depend in particular on the initial triangulation $\mathcal{T}_{0}$ and on the integer $\mathcal{J}$ appearing in the above contraction property.

The assumptions needed in order to get (5.1) and (5.3) are divided into three main groups: assumptions related to the a posteriori error estimators, assumptions related to the oscillations, and assumptions related to the design of the adaptive finite element method. We are going to use the newest vertex bisection algorithm for the refinement of the mesh (see, for instance, [Ste08]). While assumptions on oscillations and on the design of AFEM do not change when residual or nonresidual a posteriori estimators are used, the main modification for the analysis of nonresidual estimators is given by the verification of the assumptions related to the a posteriori error estimators. For this reason, we focus in this section only on these assumptions (see [CN12, Assumption 4.1]), which are the main object of our analysis in the present paper. We will also make more precise the reduction assumption about the oscillations (see condition [H5] later on). We adopt the notation of the previous section and we state the assumptions for a generic error estimator $\eta(u_{\ell},\mathcal{T})$ . In [CN12] there are some typos ( $V$ instead of $U$ , for instance) that we have corrected here.

[CN12] considers a closed set called $K$ -element made of elements or sides and denoted by $\mathcal{K}_{\mathcal{T}}$ . We restrict to the case when $K$ is a triangle. The following definition of refined set of order $j$ is needed between to (not necessarily consecutive) meshes $\mathcal{T}_{\ell}$ and $\mathcal{T}_{m}$

[TABLE]

where the generation $g(T)$ of $T\in\mathcal{T}$ is the number of bisections needed to create $T$ from the initial triangulation $\mathcal{T}_{0}$ .

The four assumptions related to the a posteriori error estimator state the existence of four constants $C_{re}$ , $C_{ef}$ , $C_{dre}$ , and $C_{def}$ and of an index $j^{\star}$ such that the following four conditions are satisfied.

**[H1] Global upper bound (reliability): **

[TABLE]

**[H2] Global lower bound (efficiency): **

[TABLE]

**[H3] Localized upper bound (discrete reliability): **

[TABLE]

**[H4] Discrete local lower bound (discrete efficiency): **

[TABLE]

In particular, it is clear that conditions H1 and H2 are satisfied by the estimators we are considering (see (3.8) and (3.9))

*Remark 5.1**.*

Actually, in [CN12] the conditions H3 and H4 are stated with $\textit{osc}_{\mathcal{T}}(u_{l},\mathcal{R}^{1}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}})$ instead of $\textit{osc}_{\mathcal{T}}(u_{l},\mathcal{R}^{j^{\star}}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}})$ . In our case, for technical reasons that will be apparent soon, we have to use $j^{\star}$ levels of refinements for the oscillations as well. The proof presented in [CN12] carries over to this situation with the natural modifications.

In H1-H4, particular attention has to be paid to the oscillation terms. When we are using polynomials of degree $k$ for the solution of the discrete problem (2.7), we usually define the oscillation terms by introducing the projection $\Pi_{k-1}$ onto polynomials of degree $k-1$ . The standard oscillation term would then read

[TABLE]

On the other hand, we will consider the estimator $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}{}}$ built on patches and for this reason it makes sense to introduce a corresponding definition of patch oscillations:

[TABLE]

where

[TABLE]

The critical assumption related to the oscillations (see [CN12, Assuption 4.2(a)]) is the following one.

**[H5] Oscillation reduction: **

there exists a constant $\lambda\in]0,1[$ such that

[TABLE]

*Remark 5.2**.*

We need to modify the original assumption of [CN12] by replacing $\textit{osc}_{\mathcal{T}_{l}}(f,\mathcal{R}^{{1}}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}})$ with $\textit{osc}_{\mathcal{T}_{l}}(f,\mathcal{R}^{{j^{\star}}}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}})$ . A simple example for the necessity of this modification is to consider a triangulation $\mathcal{T}_{\ell}$ and the triangulation $\mathcal{T}_{\ell+1}$ obtained with a minimal refinement, so that only two triangles belong to $\mathcal{R}^{1}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{\ell+1}}$ . This refinement is marked in red in Figure 3 and we can see that

[TABLE]

Repeating this argument, we have also for some $k>1$ that

[TABLE]

This is illustrated in Figure 3 with the green refinement leading to the set $\mathcal{R}^{2}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{\ell+1}}$ . The same holds for $\mathcal{R}^{3}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{\ell+1}}$ (yellow refinement). We see in Figure 4, with the notation $\mathcal{S}^{k}:=supp(\eta^{\mathrlap{{\small\hexstar}}}{\hexagon}{}(u_{h},\mathcal{R}^{k}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{\ell+1}}))$ , that only the fourth refinement leads to a reduction of the support of the estimator.

In the rest of this section we are going to show that hypotheses [H1-H4] hold true for $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}}$ and $\textit{osc}^{\mathrlap{{\small\hexstar}}}{\hexagon}$ , in the case when $d=2$ , with $j^{\star}$ defined in the following lemma.

Lemma 5.3.

Assume that the triangulation $\mathcal{T}_{\ell}$ is shape-regular. Let $\mathcal{T}_{m}$ be a triangulation obtained from $\mathcal{T}_{\ell}$ after $m-\ell$ refinements with the newest vertex bisection strategy (see, for instance, [Ste08]). Then there exists $j^{\star}$ such that $\mathcal{R}^{j^{\star}}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ satisfies the following property: all triangles in $\omega_{\nu}$ , for all $\nu\in\mathcal{V}_{T}$ , and all their edges have an interior node that is a vertex of a triangle of $\mathcal{T}_{m}$ .

Proof.

Let $n^{\star}$ denote the maximum number of triangles in a patch in the triangulation $\mathcal{T}_{\ell}$ . The shape-regularity of $\mathcal{T}_{\ell}$ implies that $n^{\star}$ is bounded.

We observe that if $T\in\mathcal{R}^{2}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ then the adjacent triangles of $T$ belong at least to $\mathcal{R}^{1}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ ; this is illustrated in Figure 7. If moreover $T\in\mathcal{R}^{3}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ then $T$ has the interior node property, but two of the adjacent triangles could still belong only to $\mathcal{R}^{1}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ as is it shown in Figure 7.

However, $T\in\mathcal{R}^{4}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ implies that the two adjacent triangles belong at least to $\mathcal{R}^{2}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ (see Figure 7). Similarly, $T\in\mathcal{R}^{6}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ implies that the two adjacent triangles belong to $\mathcal{R}^{3}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ . Repeating this argument shows that for $j^{\star}=3n^{\star}/4$ all triangles in the patch have the interior node property, and all facets have an interior node. ∎

We are now showing that conditions [H1-H4] hold true for the residual error estimator defined on patches $\tilde{\eta}^{res}$ ; thanks to the equivalence proved in Section 4 the same conditions will hold for the Braess–Schöberl estimator $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}{}}$ as well.

Lemma 5.4 (H3 — discrete reliability for $\tilde{\eta}^{res}$ ).

Let $\mathcal{T}_{m}$ be a refinement of $\mathcal{T}_{\ell}$ . Then

[TABLE]

Proof.

It is well-known (see [Ste07, Theorem 4.1]) that the discrete reliability properties holds true for the standard residual error estimator, that is

[TABLE]

Clearly, the extension to $\tilde{\eta}^{res}$ is straightforward. ∎

Lemma 5.5 (H4 — discrete efficiency for $\tilde{\eta}^{res}$ ).

Let $\mathcal{T}_{m}$ be a refinement of $\mathcal{T}_{\ell}$ and let $j^{\star}$ be the index introduced in Lemma 5.3. Then it holds

[TABLE]

Proof.

From the definition of $j^{\star}$ we have that if $T$ belongs to $\mathcal{R}^{j^{\star}}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ then $T$ and its edges have the interior node property. Moreover, $\tilde{\omega}_{T}:=\{T^{\prime}\ :\ T^{\prime}\cap\omega_{T}\neq 0\}$ is contained in $\mathcal{R}^{1}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ .

Therefore, we can use the fact that the standard residual estimator $\eta^{res}$ is discretely efficient, that is,

[TABLE]

It follows

[TABLE]

∎

The next lemma is related to the oscillation reduction stated in condition [H5]. For completeness, we show the condition met both by the standard oscillation term and by the patchwise oscillation; in our analysis we are going to use the latter one.

Lemma 5.6 (H5 — Oscillation reduction).

Let $\mathcal{T}_{m}$ be a refinement of $\mathcal{T}_{\ell}$ and let $j^{\star}$ be the index introduced in Lemma 5.3. Then it holds

[TABLE]

and

[TABLE]

Proof.

The first statement is equivalent to

[TABLE]

where $\mathcal{T}_{m}^{\star}=\mathcal{T}_{m}\backslash\mathcal{T}_{l}$ .

Consider a triangle $T_{m}$ in $\mathcal{T}_{m}^{\star}$ which originates from the triangle $\mathfrak{T}_{\ell}(T_{m})$ in $\mathcal{T}_{\ell}$ . Our refinement strategy guarantees that the mesh size is reduced, so that $h_{T_{m}}\leq\gamma h_{\mathfrak{T}_{\ell}(T_{m})}$ for a positive $\gamma<1$ . Then, it holds

[TABLE]

So we have (5.7) with $\lambda=1-\gamma^{2}$ .

The second statement is equivalent to

[TABLE]

Recall that the definition of $j^{\star}$ implies that for any $T$ in $\mathcal{R}^{j^{\star}}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ all the triangles in $\omega_{T}$ belong to $\mathcal{R}^{1}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ . Therefore,

[TABLE]

∎

We are now in the position of stating our main result concerning the convergence of AFEM based on the Braess–Schöberl error estimator.

Theorem 5.7.

Let $u$ be the solution of Problem (2.2) and consider a SOLVE–ESTIMATE–MARK–REFINE strategy satisfying the following properties.

(1)

In the solve module the solution is computed exactly. 2. (2)

The estimate module makes use of the Braess–Schöberl error estimator $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}{}}$ defined on patches and takes into account the total error (5.2) with the patchwise oscillation term $\textit{osc}^{\mathrlap{{\small\hexstar}}}{\hexagon}{}$ . 3. (3)

The mark module is the usual Dörfler marking strategy. 4. (4)

The refine module is performed using the newest vertex bisection algorithm and it is slightly modified from the standard routines, as described in **[CN12]**, using the iteration counter $j^{\star}$ defined in Lemma 5.3, so that the interior node property is satisfied.

Then the sequence of discrete solutions $\{u_{\ell}\}$ converges to $u$ with the quasioptimal decay rate

[TABLE]

(see (5.3)).

Proof.

As explained above, we need to show that the five conditions H1-H5 are satisfied. We have already observed that H1 (global reliability) and H2 (global efficiency) are proved in [Bra13, Theorems 9.4 and 9.5] (see (3.8) and (3.9)).

The equivalence between the estimator $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}{}}$ and the patchwise residual estimator $\tilde{\eta}^{res}$ (see Section 4), together with Lemmas 5.4 and 5.5, leads directly to the localized bounds H3 (discrete reliability) and H4 (discrete efficiency) for the estimator $\eta^{\mathrlap{{\small\hexstar}}{\hexagon}{}}$ . Finally, [H5] (oscillation reduction) has been proved in Lemma 5.6 (see Equation (5.8)). ∎

*Remark 5.8**.*

Another way to prove a result analogue to the one presented in Theorem 5.7 would be to use the theory developed in [KS11]. In such theory, the refine module is not modified from the standard routines, while the mark module acts on patches instead of on single elements. Unfortunately, the theory of [KS11] assumes that the oscillations are dominated by the error estimator, which might not be true in our case; a possible fix would be the use of a separate marking strategy as in [CR17].

6. Optimal convergence rate for $\eta^{\Delta}$

In this section we see how the results of the previous section can be extended to $\eta^{\Delta}$ , which is the error estimator usually referred to as Braess–Schöberl estimator.

Even if the estimator is constructed element by element, we keep using the oscillation term $\textit{osc}^{\mathrlap{{\small\hexstar}}}{\hexagon}{}(u_{\ell},\mathcal{T}_{\ell})$ defined on patches of elements. This is needed, in particular, for the proof of the discrete efficiency (see Lemma 6.2).

It is clear from the above discussion that, in order to apply the theory of [CN12], the two crucial properties are H3 (discrete reliability) and H4 (discrete efficiency). We are not going to use the equivalence with any residual-type error estimator, but we are showing these properties directly in the next two lemmas.

Lemma 6.1 (Discrete Reliability).

Let $\mathcal{T}_{m}$ be a refinement of $\mathcal{T}_{\ell}$ , then

[TABLE]

Proof.

Since $u_{m}$ is the solution of (2.7) on $\mathcal{T}_{m}$ and $u_{m}-u_{l}$ is piecewise polynomial of degree $k$ on $\mathcal{T}_{m}$ as well, we have

[TABLE]

Since $u_{\ell}$ is the solution of (2.7) on $\mathcal{T}_{\ell}$ , we have

[TABLE]

where $\mathcal{I}_{\mathcal{T}_{\ell}}$ is the Lagrange interpolation operator with respect to the triangulation $\mathcal{T}_{\ell}$ . From the results of [BPS09a] we obtain

[TABLE]

Outside the refined set $\mathcal{R}^{1}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ we have $u_{m}=\mathcal{I}_{\mathcal{T}}u_{m}$ . This include the boundary $\partial\mathcal{R}^{1}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ , so that $\nabla u_{m}=\nabla\mathcal{I}_{\mathcal{T}_{\ell}}u_{m}$ on $\Omega\backslash\mathcal{R}^{1}_{\mathcal{T}_{\ell}\rightarrow\mathcal{T}_{m}}$ . Therefore,

[TABLE]

From the identity

[TABLE]

we obtain

[TABLE]

Dividing by $\|\nabla(u_{m}-u_{l})\|_{\Omega}$ finishes the proof.

∎

Lemma 6.2 (Discrete Efficiency).

Let $\mathcal{T}_{m}$ be a refinement of $\mathcal{T}_{\ell}$ , then

[TABLE]

where $j^{\star}$ is defined in Lemma 5.3.

Proof.

This result is a consequence of the following inequality

[TABLE]

and of the analogous result for the patchwise estimator $\eta^{\mathrlap{{\small\hexstar}}}{\hexagon}{}$ . ∎

We have then proved all the conditions that allow us to state a theorem analogue to 5.7 in the case of the standard estimator $\eta^{\Delta}$ .

Theorem 6.3.

Let $u$ be the solution of (2.2) and consider the adaptive strategy as in the Theorem 5.7 with the standard Braess–Schöberl error estimator $\eta^{\Delta}$ and the oscillation term $\textit{osc}^{\mathrlap{{\small\hexstar}}}{\hexagon}{}$ . Then the sequence of discrete solutions $\{u_{\ell}\}$ converges with the quasioptimal decay rate

[TABLE]

(see (5.3)).

Before concluding this section, we would like to briefly comment on the elasticity problem (2.1) for which the Prager–Synge theory has been developed. In that case the symmetric gradients of the constitutive equation give an additional term in the integration by parts needed for the Prager–Synge Theorem 2.2. The anti-symmetric part of the equilibrated stress has therefore to be controlled. Clearly, symmetric $H({\operatorname{div}})$ -conforming stress spaces such as the Arnold–Winther elements (see [AW02]) can be used as in [NWW08] or [AR10]. Another possibility is to impose the symmetric condition in a weak form [BKMSb]. For non-conforming elements the reconstruction procedure simplifies to an element-based reconstruction as shown in [BMS18].

7. Conclusion

In this paper we discussed the equilibrated flux reconstruction by Braess and Schöberl [BS08a, BPS09a], stemming from the classical Prager–Synge hypercircle theory [PS47]. We recalled the a posteriori error analysis for both an elementwise estimator $\eta^{\Delta}$ and a patchwise estimator $\eta^{\mathrlap{{\small\hexstar}}}{\hexagon}{}$ , and we showed how to adapt the abstract theory of [CN12] in order to prove the optimal convergence of the adaptive scheme based on those estimators.

Bibliography55

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[AO 93] M. Ainsworth and J. T. Oden, A unified approach to a posteriori error estimation using element residual methods , Numer. Math. 65 (1993), 23–50.
2[AO 00] Mark Ainsworth and J. Tinsley Oden, A posteriori error estimation in finite element analysis , Wiley, New York, 2000.
3[AR 10] M. Ainsworth and R. Rankin, Guaranteed computable error bounds for conforming and nonconforming finite element analyses in planar elasticity , Int. J. Numer. Meth. Engng. 82 (2010), 1114–1157.
4[AW 02] D. N. Arnold and R. Winther, Mixed finite elements for elasticity , Numer. Math. 92 (2002), 401–419.
5[BBS 19] F. Bertrand, D. Boffi, and R. Stenberg, Asymptotically exact a posteriori error analysis for the mixed Laplace eigenvalue problem , Comput. Methods Appl. Math. (2019), to appear.
6[BFH 14] D. Braess, T. Fraunholz, and R. H. W. Hoppe, An equilibrated a posteriori error estimator for the interior penalty discontinuous Galerkin method , SIAM J. Numer. Anal. 52 (2014), no. 4, 2121–2136. MR 3249368
7[BHS 08] Dietrich Braess, Ronald H. W. Hoppe, and Joachim Schöberl, A posteriori estimators for obstacle problems by the hypercircle method , Comput. Vis. Sci. 11 (2008), no. 4-6, 351–362. MR 2425501
8[BKM Sa] Fleurianne Bertrand, Bernhard Kober, Marcel Moldenhauer, and Gerhard Starke, Equilibrated stress reconstruction and a posteriori error estimation for linear elasticity .

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

The Prager–Synge theorem in reconstruction based a posteriori error estimation

Abstract.

2010 Mathematics Subject Classification:

1. Introduction

2. The Prager–Synge theory and its application to error estimates

Theorem 2.1**.**

Theorem 2.2**.**

Proof.

Remark 2.3*.*

3. The Braess–Schöberl construction

4. Equivalence with the residual error estimator

Lemma 4.1**.**

Proof.

5. Optimal convergence rate for η\hexstar\hexagon\eta^{\mathrlap{{\small\hexstar}}}{\hexagon}{}η\hexstar\hexagon

Remark 5.1*.*

Remark 5.2*.*

Lemma 5.3**.**

Proof.

Lemma 5.4** (H3 — discrete reliability for η~res\tilde{\eta}^{res}η~​res).**

Proof.

Lemma 5.5** (H4 — discrete efficiency for η~res\tilde{\eta}^{res}η~​res).**

Proof.

Lemma 5.6** (H5 — Oscillation reduction).**

Proof.

Theorem 5.7**.**

Proof.

Remark 5.8*.*

6. Optimal convergence rate for ηΔ\eta^{\Delta}ηΔ

Lemma 6.1** (Discrete Reliability).**

Proof.

Lemma 6.2** (Discrete Efficiency).**

Proof.

Theorem 6.3**.**

7. Conclusion

Theorem 2.1.

Theorem 2.2.

*Remark 2.3**.*

Lemma 4.1.

5. Optimal convergence rate for $\eta^{\mathrlap{{\small\hexstar}}}{\hexagon}{}$

*Remark 5.1**.*

*Remark 5.2**.*

Lemma 5.3.

Lemma 5.4 (H3 — discrete reliability for $\tilde{\eta}^{res}$ ).

Lemma 5.5 (H4 — discrete efficiency for $\tilde{\eta}^{res}$ ).

Lemma 5.6 (H5 — Oscillation reduction).

Theorem 5.7.

*Remark 5.8**.*

6. Optimal convergence rate for $\eta^{\Delta}$

Lemma 6.1 (Discrete Reliability).

Lemma 6.2 (Discrete Efficiency).

Theorem 6.3.