Lower a posteriori error estimates on anisotropic meshes

Natalia Kopteva

arXiv:1906.05703·math.NA·March 3, 2020

Lower a posteriori error estimates on anisotropic meshes

Natalia Kopteva

PDF

TL;DR

This paper reviews existing lower a posteriori error bounds on anisotropic meshes, demonstrates their limitations, and introduces a new approach that provides sharper bounds for finite element approximations of the Laplace equation.

Contribution

It proposes a novel method to obtain sharper lower a posteriori error bounds on anisotropic meshes, improving the efficiency of error estimation in finite element analysis.

Findings

01

Standard bounds are not sharp on anisotropic meshes

02

Numerical example confirms limitations of existing bounds

03

New approach yields sharper lower error bounds

Abstract

Lower a posteriori error bounds obtained using the standard bubble function approach are reviewed in the context of anisotropic meshes. A numerical example is given that clearly demonstrates that the short-edge jump residual terms in such bounds are not sharp. Hence, for linear finite element approximations of the Laplace equation in polygonal domains, a new approach is employed to obtain essentially sharper lower a posteriori error bounds and thus to show that the upper error estimator in the recent paper [N. Kopteva, Numer. Math., 137 (2017), 607-642] is efficient on certain anisotropic meshes.

Tables3

Table 1. Table 1: Lower error estimators ( 2.1 ) for test problem with u = sin ⁡ ( π a x ) 𝑢 𝜋 𝑎 𝑥 u=\sin(\pi ax) in Ω = ( 0 , 1 ) 2 Ω superscript 0 1 2 \Omega=(0,1)^{2} .

	$a = 1$			$a = 3$
	$N = 20$	$N = 40$	$N = 80$	$N = 20$	$N = 40$	$N = 80$
	Errors ${‖ \nabla (u_{h} - u) ‖}_{Ω}$ (odd rows) & ${‖ h_{T} (f - f^{I}) ‖}_{Ω}$ (even rows)
$M = 2 N$	1.01e-1	5.04e-2	2.52e-2	9.00e-1	4.52e-1	2.27e-1
	3.51e-4	4.39e-5	5.49e-6	2.83e-2	3.55e-3	4.45e-4
$M = 8 N$	1.01e-1	5.04e-2	2.52e-2	9.00e-1	4.52e-1	2.27e-1
	9.74e-5	1.22e-5	1.52e-6	7.86e-3	9.86e-4	1.23e-4
$M = 32 N$	1.01e-1	5.04e-2	2.52e-2	9.00e-1	4.52e-1	2.27e-1
	2.45e-5	3.07e-6	3.84e-7	1.98e-3	2.48e-4	3.11e-5
$M = 128 N$	1.01e-1	5.04e-2	2.52e-2	9.00e-1	4.52e-1	2.27e-1
	6.14e-6	7.67e-7	9.59e-8	4.95e-4	6.21e-5	7.77e-6
	$ℰ$ using $ϱ_{S} = \frac{\| S \|}{diam (ω_{S})}$ (odd rows) & Effectivity Indices (even rows)
$M = 2 N$	2.80e-1	1.40e-1	7.02e-2	2.46e+0	1.25e+0	6.31e-1
	2.78	2.79	2.79	2.73	2.77	2.78
$M = 8 N$	1.30e-1	6.51e-2	3.26e-2	1.14e+0	5.82e-1	2.93e-1
	1.29	1.29	1.29	1.26	1.29	1.29
$M = 32 N$	6.24e-2	3.13e-2	1.57e-2	5.46e-1	2.80e-1	1.41e-1
	0.62	0.62	0.62	0.61	0.62	0.62
$M = 128 N$	3.09e-2	1.55e-2	7.74e-3	2.71e-1	1.38e-1	6.95e-2
	0.31	0.31	0.31	0.30	0.31	0.31
	$ℰ$ using $ϱ_{S} = 1$ (odd rows) & Effectivity Indices (even rows)
$M = 2 N$	3.81e-1	1.91e-1	9.55e-2	3.34e+0	1.71e+0	8.58e-1
	3.79	3.79	3.79	3.71	3.77	3.79
$M = 8 N$	3.51e-1	1.76e-1	8.79e-2	3.06e+0	1.57e+0	7.90e-1
	3.48	3.49	3.49	3.40	3.47	3.49
$M = 32 N$	3.48e-1	1.74e-1	8.73e-2	3.04e+0	1.56e+0	7.84e-1
	3.46	3.46	3.47	3.38	3.44	3.46
$M = 128 N$	3.48e-1	1.74e-1	8.72e-2	3.04e+0	1.56e+0	7.84e-1
	3.46	3.46	3.46	3.38	3.44	3.46

Table 2. Table 2: Lower error estimators ( 2.1 ) for test problem with u = sin ⁡ ( x / μ ) e − y / ε 𝑢 𝑥 𝜇 superscript 𝑒 𝑦 𝜀 u=\sin(x/\mu)e^{-y/\varepsilon} for μ = 2 ε 2 𝜇 2 superscript 𝜀 2 \mu=2\varepsilon^{2} in Ω = ( 0 , 1 ) × ( 0 , ε ) Ω 0 1 0 𝜀 \Omega=(0,1)\times(0,\varepsilon) using M = 2 N 𝑀 2 𝑁 M=2N .

	$ε = 2^{- 2}$	$ε = 2^{- 3}$	$ε = 2^{- 4}$	$ε = 2^{- 2}$	$ε = 2^{- 3}$	$ε = 2^{- 4}$
	Errors ${‖ \nabla (u_{h} - u) ‖}_{Ω}$			${‖ h_{T} (f - f^{I}) ‖}_{Ω}$
$N = 320$	1.66e-2	1.60e-1	1.74e+0	2.51e-7	2.79e-5	2.67e-3
$N = 640$	8.30e-3	8.01e-2	8.73e-1	3.13e-8	3.49e-6	3.34e-4
	$ℰ$ using $ϱ_{S} = \frac{\| S \|}{diam (ω_{S})}$			Effectivity Indices
$N = 320$	3.68e-2	2.30e-1	1.48e+0	2.22	1.44	0.85
$N = 640$	1.84e-2	1.15e-1	7.47e-1	2.22	1.44	0.86
	$ℰ$ using $ϱ_{S} = 1$			Effectivity Indices
$N = 320$	5.76e-2	5.55e-1	5.92e+0	3.47	3.46	3.40
$N = 640$	2.88e-2	2.78e-1	3.01e+0	3.47	3.47	3.45

Table 3. Table 3: Lower error estimators ( 2.1 ) for test problem with u = sin ⁡ ( ( 2 y − x ) / ε ) 𝑢 2 𝑦 𝑥 𝜀 u=\sin((2y-x)/\varepsilon) in Ω = ( 0 , 1 ) × ( 0 , ε ) Ω 0 1 0 𝜀 \Omega=(0,1)\times(0,\varepsilon) using N = M 𝑁 𝑀 N=M .

	$N = 160$	$N = 320$	$N = 640$	$N = 160$	$N = 320$	$N = 640$
	Errors ${‖ \nabla (u_{h} - u) ‖}_{Ω}$			${‖ h_{T} (f - f^{I}) ‖}_{Ω}$
$ε = 2^{- 4}$	2.29e-1	1.14e-1	5.72e-2	7.17e-5	8.97e-6	1.12e-6
$ε = 2^{- 5}$	6.67e-1	3.34e-1	1.67e-1	4.29e-4	5.36e-5	6.71e-6
$ε = 2^{- 6}$	1.90e+0	9.59e-1	4.80e-1	2.49e-3	3.12e-4	3.90e-5
	$ℰ$ using $ϱ_{S} = \frac{\| S \|}{diam (ω_{S})}$ (odd rows)			Corresponding $\overset{̊}{ℰ}$ (odd rows)
	Effectivity Indices (even rows)			$\overset{̊}{ℰ} / ℰ$ (even rows)
$ε = 2^{- 4}$	7.59e-1	3.80e-1	1.90e-1	6.16e-2	3.09e-2	1.54e-2
	3.32	3.32	3.32	0.08	0.08	0.08
$ε = 2^{- 5}$	2.19e+0	1.10e+0	5.50e-1	1.31e-1	6.61e-2	3.31e-2
	3.28	3.29	3.29	0.06	0.06	0.06
$ε = 2^{- 6}$	6.20e+0	3.13e+0	1.57e+0	2.67e-1	1.36e-1	6.82e-2
	3.26	3.27	3.27	0.04	0.04	0.04
	$ℰ$ using $ϱ_{S} = 1$ (odd rows)			Corresponding $\overset{̊}{ℰ}$ (odd rows)
	Effectivity Indices (even rows)			$\overset{̊}{ℰ} / ℰ$ (even rows)
$ε = 2^{- 4}$	7.96e-1	3.98e-1	1.99e-1	2.46e-1	1.24e-1	6.18e-2
	3.48	3.48	3.48	0.31	0.31	0.31
$ε = 2^{- 5}$	2.31e+0	1.16e+0	5.80e-1	7.43e-1	3.74e-1	1.87e-1
	3.46	3.47	3.47	0.32	0.32	0.32
$ε = 2^{- 6}$	6.55e+0	3.31e+0	1.66e+0	2.14e+0	1.09e+0	5.46e-1
	3.44	3.46	3.46	0.33	0.33	0.33

Equations102

- △ u = f (x, y) \mbox f or (x, y) \in Ω, u = 0 \mbox o n \partial Ω,

- △ u = f (x, y) \mbox f or (x, y) \in Ω, u = 0 \mbox o n \partial Ω,

⟨ \nabla u_{h}, \nabla v_{h} ⟩ = ⟨ f, v_{h} ⟩ \forall v_{h} \in S_{h},

⟨ \nabla u_{h}, \nabla v_{h} ⟩ = ⟨ f, v_{h} ⟩ \forall v_{h} \in S_{h},

\|\nabla(u_{h}-u)\|_{2\,;\Omega}\leq C\,\Bigl{\{}\sum_{S\in{\mathcal{S}}\backslash\partial\Omega}\!\!\!|\omega_{S}|J_{S}^{2}+\sum_{T\in{\mathcal{T}}}\bigl{\|}H_{T}f^{I}\bigr{\|}^{2}_{2\,;T}+\,\bigl{\|}f-f^{I}\bigr{\|}^{2}_{2\,;\Omega}\Bigr{\}}^{1/2}\!\!,

\|\nabla(u_{h}-u)\|_{2\,;\Omega}\leq C\,\Bigl{\{}\sum_{S\in{\mathcal{S}}\backslash\partial\Omega}\!\!\!|\omega_{S}|J_{S}^{2}+\sum_{T\in{\mathcal{T}}}\bigl{\|}H_{T}f^{I}\bigr{\|}^{2}_{2\,;T}+\,\bigl{\|}f-f^{I}\bigr{\|}^{2}_{2\,;\Omega}\Bigr{\}}^{1/2}\!\!,

\displaystyle\|\nabla(u_{h}-u)\|_{2\,;\Omega}\leq C\,\Bigl{\{}\sum_{S\in{\mathcal{S}}\backslash\partial\Omega}\!\!\!|\omega_{S}|J_{S}^{2}

\displaystyle\|\nabla(u_{h}-u)\|_{2\,;\Omega}\leq C\,\Bigl{\{}\sum_{S\in{\mathcal{S}}\backslash\partial\Omega}\!\!\!|\omega_{S}|J_{S}^{2}

\displaystyle{}+\sum_{T\in{\mathcal{T}}}\bigl{\|}H_{T}{\rm osc}(f^{I}\,;T)\bigr{\|}^{2}_{2\,;T}\,\Bigr{\}}^{1/2}\!\!.

{\mathcal{E}}:=\Bigl{\{}\sum_{S\in{\mathcal{S}}\backslash\partial\Omega}\!\!\!\varrho_{S}\,|\omega_{S}|J_{S}^{2}+\|h_{T}f^{I}\|_{\Omega}^{2}\Bigr{\}}^{1/2}\!\!\lesssim\|\nabla(u_{h}-u)\|_{\Omega}+\|h_{T}(f-f^{I})\|_{\Omega},

{\mathcal{E}}:=\Bigl{\{}\sum_{S\in{\mathcal{S}}\backslash\partial\Omega}\!\!\!\varrho_{S}\,|\omega_{S}|J_{S}^{2}+\|h_{T}f^{I}\|_{\Omega}^{2}\Bigr{\}}^{1/2}\!\!\lesssim\|\nabla(u_{h}-u)\|_{\Omega}+\|h_{T}(f-f^{I})\|_{\Omega},

\varrho_{S}=\left\{\begin{array}[]{cll}\frac{|S|}{{\rm diam}(\omega_{S})},&&\mbox{\cite[cite]{\@@bibref{Authors Phrase1YearPhrase2}{KunVer00}{\@@citephrase{(}}{\@@citephrase{)}}} using bubble functions (see also \S\ref{ssec_kun_})},\\[8.5359pt] 1,&&\mbox{see Theorem~{}\ref{theo_lower_struct} in \S\ref{sec_struct}}.\end{array}\right.

h_{T} ∥ f^{I} ∥_{T}

h_{T} ∥ f^{I} ∥_{T}

\frac{∣ S ∣}{diam ( ω _{S} )} ∣ ω_{S} ∣ J_{S}^{2}

\|f^{I}\|_{T}^{2}\lesssim\Bigl{(}h_{T}^{-1}\|\nabla(u_{h}-u)\|_{T}+\|f-f^{I}\|_{T}\Bigr{)}\,\|w\|_{T}\,.

\|f^{I}\|_{T}^{2}\lesssim\Bigl{(}h_{T}^{-1}\|\nabla(u_{h}-u)\|_{T}+\|f-f^{I}\|_{T}\Bigr{)}\,\|w\|_{T}\,.

∣ S ∣ J_{S}^{2} ≃ \int_{S} w [\partial_{ν} u_{h}]_{S} = ⟨ \nabla u_{h}, \nabla w ⟩ = ⟨ \nabla (u_{h} - u), \nabla w ⟩ + ⟨ f, w ⟩ .

∣ S ∣ J_{S}^{2} ≃ \int_{S} w [\partial_{ν} u_{h}]_{S} = ⟨ \nabla u_{h}, \nabla w ⟩ = ⟨ \nabla (u_{h} - u), \nabla w ⟩ + ⟨ f, w ⟩ .

|S|J_{S}^{2}\lesssim\sum_{T\in\omega_{S}}\underbrace{\Bigl{(}h_{T}^{-1}\|\nabla(u_{h}-u)\|_{T}+\|f\|_{T}\Bigr{)}}_{{}\lesssim h_{T}^{-1}\,{\mathcal{Y}}^{T}_{\tiny\mbox{(\ref{lower_f})}}}\underbrace{\|w\|_{T}}_{\simeq(h_{T}|S|)^{1/2}|J_{S}|},\hskip 28.45274pt

|S|J_{S}^{2}\lesssim\sum_{T\in\omega_{S}}\underbrace{\Bigl{(}h_{T}^{-1}\|\nabla(u_{h}-u)\|_{T}+\|f\|_{T}\Bigr{)}}_{{}\lesssim h_{T}^{-1}\,{\mathcal{Y}}^{T}_{\tiny\mbox{(\ref{lower_f})}}}\underbrace{\|w\|_{T}}_{\simeq(h_{T}|S|)^{1/2}|J_{S}|},\hskip 28.45274pt

H_{z} := diam (ω_{z}), h_{z} := H_{z}^{- 1} ∣ ω_{z} ∣, γ_{z} := S_{z} ∖ \partial Ω.

H_{z} := diam (ω_{z}), h_{z} := H_{z}^{- 1} ∣ ω_{z} ∣, γ_{z} := S_{z} ∖ \partial Ω.

h_{z} ≪ H_{z}, \mbox an d ∣ T ∣ ≃ ∣ ω_{z} ∣ \forall T \subset ω_{z} .

h_{z} ≪ H_{z}, \mbox an d ∣ T ∣ ≃ ∣ ω_{z} ∣ \forall T \subset ω_{z} .

\overset{˚}{S}

\overset{˚}{S}

Y_{D}

E_{D}

\overset{˚}{E}_{D}

\overset{˚}{E}_{Ω_{i}} ≲ Y_{Ω_{i}} \forall i = 1, \dots, n - 1.

\overset{˚}{E}_{Ω_{i}} ≲ Y_{Ω_{i}} \forall i = 1, \dots, n - 1.

E_{Ω_{i}} ≲ Y_{Ω_{i}} \forall i = 0, \dots, n, E_{Ω} ≲ Y_{Ω} .

E_{Ω_{i}} ≲ Y_{Ω_{i}} \forall i = 0, \dots, n, E_{Ω} ≲ Y_{Ω} .

\bigl{|}J_{{S}^{+}}-J_{{S}^{-}}\bigr{|}\lesssim h_{z}H_{z}^{-1}\!\!\!\sum_{S\in\gamma_{z}\backslash{\mathcal{P}}_{i}}\!\!\!|J_{S}|.

\bigl{|}J_{{S}^{+}}-J_{{S}^{-}}\bigr{|}\lesssim h_{z}H_{z}^{-1}\!\!\!\sum_{S\in\gamma_{z}\backslash{\mathcal{P}}_{i}}\!\!\!|J_{S}|.

|\omega_{z}|\,\bigl{|}{\textstyle\frac{H_{z}}{h_{z}}}(J_{S+}-J_{S^{-}})\bigr{|}^{2}\lesssim{\mathcal{Y}}_{\omega_{z}}^{2},

|\omega_{z}|\,\bigl{|}{\textstyle\frac{H_{z}}{h_{z}}}(J_{S+}-J_{S^{-}})\bigr{|}^{2}\lesssim{\mathcal{Y}}_{\omega_{z}}^{2},

\overset{˚}{E}_{i}^{2} = S \subset P \sum ∣ ω_{S} ∣ J_{S}^{2} = H \int_{P} J_{S}^{2}, Y_{i} = ∥\nabla (u_{h} - u) ∥_{2; Ω_{i}} + H ∥ osc (f; T) ∥_{2; Ω_{i}} .

\overset{˚}{E}_{i}^{2} = S \subset P \sum ∣ ω_{S} ∣ J_{S}^{2} = H \int_{P} J_{S}^{2}, Y_{i} = ∥\nabla (u_{h} - u) ∥_{2; Ω_{i}} + H ∥ osc (f; T) ∥_{2; Ω_{i}} .

=: ψ_{1} ⟨ \nabla (u_{h} - u), \nabla v ⟩

=: ψ_{1} ⟨ \nabla (u_{h} - u), \nabla v ⟩

= =: Ψ + \frac{1}{2} H^{- 1} \overset{˚}{E}_{i}^{2} ⟨ \nabla u_{h}, \nabla (v - \frac{1}{2} v_{h})⟩ - =: ψ_{2} ⟨ f, v - \frac{1}{2} v_{h} ⟩ .

H\bigl{(}|\psi_{1}|+|\psi_{2}|+|\Psi|\bigr{)}\lesssim{\mathcal{Y}}_{i}\,(\mathring{{\mathcal{E}}}_{i}+{\mathcal{Y}}_{i}\bigr{)}.

H\bigl{(}|\psi_{1}|+|\psi_{2}|+|\Psi|\bigr{)}\lesssim{\mathcal{Y}}_{i}\,(\mathring{{\mathcal{E}}}_{i}+{\mathcal{Y}}_{i}\bigr{)}.

v_{h}(z):={\textstyle\frac{1}{2}}\!\!\!\sum_{S\in\gamma_{z}\cap{\mathcal{P}}}\!\!\!\!\!J_{S}\;\;\forall\,z\in{\mathcal{P}}\backslash\partial\Omega,\quad\;\;\hat{v}_{h}(x,y):=v_{h}\bigl{(}x_{i}+{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}2}[x-x_{i}],y\bigr{)}.

v_{h}(z):={\textstyle\frac{1}{2}}\!\!\!\sum_{S\in\gamma_{z}\cap{\mathcal{P}}}\!\!\!\!\!J_{S}\;\;\forall\,z\in{\mathcal{P}}\backslash\partial\Omega,\quad\;\;\hat{v}_{h}(x,y):=v_{h}\bigl{(}x_{i}+{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}2}[x-x_{i}],y\bigr{)}.

\displaystyle\Bigl{|}\int_{S}\!(\hat{v}_{h}-{\textstyle\frac{1}{2}}v_{h})\Bigr{|}

\displaystyle\Bigl{|}\int_{S}\!(\hat{v}_{h}-{\textstyle\frac{1}{2}}v_{h})\Bigr{|}

∥ H \nabla v_{h} ∥_{2; Ω_{i}} + ∥ v_{h} ∥_{2; Ω_{i}}

\displaystyle\sum_{S\subset{\mathcal{P}}}|\omega_{S}|\Bigl{\{}{\textstyle\frac{H}{|S|}}{\rm osc}(v_{h}\,;S)\Bigr{\}}^{2}

∣ ψ_{2} ∣ = ∣ ⟨ f - \hat{f}, v ⟩ ∣

∣ ψ_{2} ∣ = ∣ ⟨ f - \hat{f}, v ⟩ ∣

⟨ \nabla u_{h}, \nabla (v - \frac{1}{2} v_{h})⟩ = \frac{1}{2} \int_{P} J_{S} v_{h} + S \subset Ω_{i} \ P \sum \int_{S} J_{S} (v - \frac{1}{2} v_{h}) .

⟨ \nabla u_{h}, \nabla (v - \frac{1}{2} v_{h})⟩ = \frac{1}{2} \int_{P} J_{S} v_{h} + S \subset Ω_{i} \ P \sum \int_{S} J_{S} (v - \frac{1}{2} v_{h}) .

Ψ = \frac{1}{2} \int_{P} J_{S} (v_{h} - J_{S}) + S \subset Ω_{i} \ P \sum J_{S} \int_{S} (v - \frac{1}{2} v_{h}) .

Ψ = \frac{1}{2} \int_{P} J_{S} (v_{h} - J_{S}) + S \subset Ω_{i} \ P \sum J_{S} \int_{S} (v - \frac{1}{2} v_{h}) .

|\Psi|\lesssim H^{-1/2}\mathring{{\mathcal{E}}}_{i}\,\,\|v_{h}-J_{S}\|_{2\,;{\mathcal{P}}}+\Bigl{\{}\sum_{S\subset\Omega_{i}\backslash{\mathcal{P}}}\!\!\!|\omega_{S}|J_{S}^{2}\Bigr{\}}^{1/2}\|\partial_{y}v_{h}\|_{2\,;\Omega_{i}}\,.

|\Psi|\lesssim H^{-1/2}\mathring{{\mathcal{E}}}_{i}\,\,\|v_{h}-J_{S}\|_{2\,;{\mathcal{P}}}+\Bigl{\{}\sum_{S\subset\Omega_{i}\backslash{\mathcal{P}}}\!\!\!|\omega_{S}|J_{S}^{2}\Bigr{\}}^{1/2}\|\partial_{y}v_{h}\|_{2\,;\Omega_{i}}\,.

S \subset P_{0} \sum ∣ ω_{S} ∣ J_{S}^{2} ≲ Y_{ω_{P}}^{2}, \mbox w h er e ω_{P} := \cup_{z \in P} ω_{z} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

11institutetext: N. Kopteva 22institutetext: Department of Mathematics and Statistics, University of Limerick, Limerick, Ireland

22email: [email protected]

Lower a posteriori error estimates

on anisotropic meshes††thanks: The author was partially supported by Science Foundation Ireland grant SFI/12/IA/1683.

Natalia Kopteva

Abstract

Lower a posteriori error bounds obtained using the standard bubble function approach are reviewed in the context of anisotropic meshes. A numerical example is given that clearly demonstrates that the short-edge jump residual terms in such bounds are not sharp. Hence, for linear finite element approximations of the Laplace equation in polygonal domains, a new approach is employed to obtain essentially sharper lower a posteriori error bounds and thus to show that the upper error estimator in the recent paper Kopt_NM_17 is efficient on partially structured anisotropic meshes.

Keywords:

Anisotropic triangulation Lower a posteriori error estimate Estimator efficiency

MSC:

65N15 65N30

1 Introduction

The purpose of this paper is to address the efficiency of a posteriori error estimators on anisotropic meshes, which essentially reduces to obtaining sharp lower a posteriori error bounds. For shape-regular meshes such lower error bounds can be found in AinsOd_2000 ; Ver_book_13 . For anisotropic meshes, the situation is more delicate, as we shall now elaborate.

For unstructured anisotropic meshes, both upper and lower a posteriori error estimates were obtained in Kunert2000 ; Kun01 ; KunVer00 for the Laplace equation and for a singularly perturbed reaction-diffusion equation; see also (Ver_book_13, , §4.5). We also refer the reader to Mich_Perrotto , where the reliability and efficiency of a residual-type estimator from Picasso_2003 based on the Zienkiewicz-Zhu recovery procedure was established on anisotropic meshes under an $\eta$ % superconvergence type condition (explained, e.g., in (AinsOd_2000, , §4.8)). It should be noted that although the lower error bounds in Kunert2000 ; Kun01 ; KunVer00 involve the same estimators as the corresponding upper bounds, however the error constants in the upper bounds include the so-called matching functions. The latter depend on the unknown error and take moderate values only when the mesh is either isotropic, or, being anisotropic, is aligned correctly to the solution, while, in general, they may be as large as mesh aspect ratios.

The presence of such matching functions in the estimator is clearly undesirable. It is entirely avoided in the more recent papers Kopt15 ; Kopt_NM_17 ; Kopt17 , where upper a posteriori error estimates on anisotropic meshes were obtained for singularly perturbed semilinear reaction-diffusion equations in the energy norm and in the maximum norm.

Interestingly, the efficiency of the estimators in Kopt15 ; Kopt_NM_17 ; Kopt17 cannot be established using the standard bubble function approach, employed in Kunert2000 ; Kun01 ; KunVer00 . To be more precise, this approach (which will be reviewed in §2) leads to lower error bounds with significantly smaller weights at the short-edge jump residual terms than those in the upper bounds.

The main findings of the present paper are as follows.

•

Lower a posteriori error bounds obtained using the standard bubble function approach, such as in Kunert2000 ; Kun01 ; KunVer00 , will be reviewed in the context of anisotropic meshes. Numerical examples will be given in §2 that clearly demonstrate that the short-edge jump residual terms in such bounds are not sharp.

•

Hence, we shall present a new approach that yields essentially sharper lower a posteriori error bounds and thus shows that the upper error estimator in Kopt_NM_17 is efficient on partially structured anisotropic meshes.

Note that mild restrictions on the structure of the mesh are not uncommon in the literature when, for example, recovery type a posterior error estimators are considered xu_Zhang , and, as discussed in §5.1, such restrictions are not unreasonable when an anisotropic mesh is generated starting from a regular mesh.

Compared to Kopt15 ; Kopt_NM_17 ; Kopt17 , to simplify the presentation, we shall restrict the consideration to the simpler Laplace equation and consider the problem

[TABLE]

posed in a, possibly non-Lipschitz, polygonal domain $\Omega\subset\mathbb{R}^{2}$ . We also assume that $f\in L_{\infty}(\Omega)$ (for a less smooth $f$ , see Remarks 2.2 and 4.3).

Linear finite element approximations of (1.1) will be considered. Let $S_{h}\subset H_{0}^{1}(\Omega)\cap C(\bar{\Omega})$ be a piecewise-linear finite element space relative to a triangulation $\mathcal{T}$ , and let the computed solution $u_{h}\in S_{h}$ satisfy

[TABLE]

where $\langle\cdot,\cdot\rangle$ denotes the $L_{2}(\Omega)$ inner product.

To give an idea of the results in Kopt_NM_17 , under the assumptions on the mesh described in §3, one upper error estimate reduces to (Kopt_NM_17, , Theorems 6.1 and 7.4)

[TABLE]

where $C$ is independent of the diameters and the aspect ratios of elements in $\mathcal{T}$ . Here $\mathcal{S}$ is the set of edges in $\mathcal{T}$ , $J_{S}$ is the standard jump in the normal derivative of $u_{h}$ across any interior edge $S\in{\mathcal{S}}\backslash\partial\Omega$ , and $\omega_{S}$ is the patch of two elements sharing $S$ . We also use $H_{T}:={\rm diam}(T)$ , which may be significantly larger than $h_{T}:=2H_{T}^{-1}|T|$ , and the standard piecewise-linear Lagrange interpolant $f^{I}\in S_{h}$ of $f$ .

Furthermore, under some additional assumptions on the orientation of mesh elements surrounding sequences of anisotropic nodes connected by short edges, a sharper upper estimator was obtained in (Kopt_NM_17, , Theorem 6.2):

[TABLE]

To relate (1.3) and (1.4) to interpolation error bounds, as well as to possible adaptive-mesh construction strategies, note that $|J_{S}|$ may be interpreted as approximating the diameter of $\omega_{S}$ under the metric induced by the squared Hessian matrix of the exact solution (while $f^{I}$ approximates $\triangle u$ ).

Our task in this paper will be to establish the efficiency of the upper estimator in (1.4) up to data oscillation. As was already mentioned, the standard bubble function approach yields unsatisfactory lower bounds, with the weight $\frac{|S|}{{\rm diam}(\omega_{S})}|\omega_{S}|$ at $J_{S}^{2}$ (rather than a simpler and more natural $|\omega_{S}|$ in (1.4)). Remark 2.4 sheds some light on our approach to remedying this.

The paper is organized as follows. In §2, we review lower a posteriori error bounds obtained using the standard bubble function approach. In particular, numerical examples are given that demonstrate that the short-edge jump residual terms in such bounds are not sharp. The remainder of the paper is devoted to obtaining sharper lower error bounds In §3, we describe basic triangulation assumptions. Then in §4, we present a version of the analysis for partially structured meshes, while the case of more general anisotropic meshes is addressed in §5.

Notation. We write $a\simeq b$ when $a\lesssim b$ and $a\gtrsim b$ , and $a\lesssim b$ when $a\leq Cb$ with a generic constant $C$ depending on $\Omega$ and $f$ , but not on the diameters and the aspect ratios of elements in $\mathcal{T}$ . Also, for $\mathcal{D}\subset\bar{\Omega}$ and $1\leq p\leq\infty$ , let $\|\cdot\|_{p\,;\mathcal{D}}=\|\cdot\|_{L_{p}(\mathcal{D})}$ and $\|\cdot\|_{\mathcal{D}}=\|\cdot\|_{2\,;\mathcal{D}}$ , and also ${\rm osc}(v\,;\mathcal{D})=\sup_{\mathcal{D}}v-\inf_{\mathcal{D}}v$ for $v\in L_{\infty}(\mathcal{D})$ . Whenever quantities such as ${\rm osc}(\cdot\,;T)$ or $H_{T}$ appear in volume integrals or related norms, or $J_{S}$ appears in line integrals or related norms, they are understood as piecewise-constant functions.

2 Standard lower error bounds are not sharp on anisotropic meshes

This section is devoted to lower error bounds, such as in Kunert2000 ; Kun01 ; KunVer00 , obtained using the standard bubble function approach. Numerical examples will be given in §2.1 that clearly demonstrate that the short-edge jump residual terms in such bounds are not sharp. These examples also suggest that the jump residual terms in our upper estimators (1.3) and (1.4) have correct weights (the efficiency of the latter will be theoretically justified in §§4-5). Furthermore, in §2.2, we shall review the bubble function approach when applied to anisotropic meshes and discuss its deficiencies with a view of changing the paradigm for deriving upper bounds for jump residuals associated with short edges (in particular, see Remarks 2.3 and 2.4).

2.1 Numerical examples

Our first test problem is (1.1) with the exact solution $u=\sin(\pi ax)$ (for $a=1,3$ ) and the corresponding $f$ in $\Omega=(0,1)^{2}$ . We employ the triangulation obtained by drawing diagonals from the tensor product of the uniform grids $\{\frac{i}{N}\}_{i=0}^{N}$ and $\{\frac{j}{M}\}_{j=0}^{M}$ in the $x$ - and $y$ -directions respectively (with all diagonals having the same orientation). A standard quadrature with $f$ replaced in (1.2) by its Lagrange interpolant $f^{I}\in S_{h}$ will be used in numerical experiments.

For this problem, we compare two lower error estimates: obtained using the standard bubble function approach KunVer00 (see also Lemma 2.1 in §2.2) and the one obtained in §4 (see Theorem 4.1). They can be described by

[TABLE]

(To be more precise, when $\varrho_{S}=1$ is used, the term $\|h_{T}(f-f^{I})\|_{\Omega}$ in the right-hand side of (2.1a) should be replaced by a larger $\|H_{T}\,{\rm osc}(f\,;T)\|_{\Omega}$ ; see §4 for details.) Importantly, the choice $\varrho_{S}=1$ , which will be theoretically justified in §§4-5, is consistent with the jump residual terms in our upper error estimates (1.3) and (1.4).

To address whether the lower error estimator ${\mathcal{E}}$ in (2.1a) is sharp, the errors $\|\nabla(u_{h}-u)\|_{\Omega}$ (as well as $\|h_{T}(f-f^{I})\|_{\Omega}$ ) are compared with ${\mathcal{E}}$ in Table 1. (In these computations $\nabla u$ and $f$ are replaced, respectively, by their piecewise-linear and piecewise-quadratic interpolants.)

Clearly, the standard lower estimator with $\varrho_{S}=\frac{|S|}{{\rm diam}(\omega_{S})}$ is not sharp. Not only its effectivity indices strongly depend on the ratio $M/N$ , but, perhaps more alarmingly, ${\mathcal{E}}$ converges to zero as $M/N$ increases, i.e. when the mesh is anisotropically refined in the wrong direction (while the error remains almost independent of $M/N$ ). By contrast, the estimator of §4, with $\varrho_{S}=1$ , performs quite well, with the effectivity indices stabilizing.

When comparing the two estimators, note that their weights are similar when $|S|\simeq{\rm diam}\,\omega_{S}$ ; however, they become dramatically different when $|S|\ll{\rm diam}(\omega_{S})$ , i.e. for short edges. Hence, our numerical experiments clearly suggest that it is the short-edge jump residual terms in the standard lower error estimator that are not sharp.

Next, consider a two-scale exact solution $u=\sin(x/\mu)e^{-y/\varepsilon}$ with $\mu=2\varepsilon^{2}$ , which exhibits a boundary layer in variable $y$ and smaller-scale oscillations in variable $x$ . To simplify the setting, we consider a version of problem (1.1) with this exact solution only in the boundary-layer domain $\Omega=(0,1)\times(0,\varepsilon)$ , with the corresponding $f$ and Dirichlet boundary conditions. The two lower estimators from (2.1) are compared in Table 2 on the mesh constructed similarly to the first test problem, with $M=2N$ , only now the 1d grid in the $y$ -direction is $\{\varepsilon\frac{j}{M}\}_{j=0}^{M}$ . Thus, the mesh is correctly adapted in the $y$ -direction, but ignores the oscillations in the $x$ -direction, i.e. it is anisotropic, but incorrectly aligned. As $\varepsilon$ takes smaller values, the errors increase, which is not adequately detected by the estimator with $\varrho_{S}=\frac{|S|}{{\rm diam}(\omega_{S})}$ , the effectivity indices of which deteriorate (although more moderately than in Table 1; see Remark 2.1 for further discussion). The estimator with $\varrho_{S}=1$ again performs quite well, with all effectivity indices close to $3.45$ .

Finally, in Table 3, the two estimators are tested for a one-scale exact solution $u=\sin((2y-x)/\varepsilon)$ on the anisotropic mesh which is incorrectly aligned in the $x$ -direction only. To simplify the setting, we again consider a version of (1.1) in $\Omega=(0,1)\times(0,\varepsilon)$ and use $N=M$ . As discussed in Remark 2.1 below, both estimators exhibit stable effectivity indices. However, the bulk contribution of the short-edge residuals, computed as $\mathring{{\mathcal{E}}}:=\bigl{\{}\sum_{S\in\mathring{\mathcal{S}}}\varrho_{S}\,|\omega_{S}|J_{S}^{2}\bigr{\}}^{1/2}$ with $\mathring{S}:=\bigl{\{}|S|<\frac{1}{2}{\rm diam}(\omega_{S})\bigr{\}}$ , becomes negligible for $\varrho_{S}=\frac{|S|}{{\rm diam}(\omega_{S})}$ (unlike the case $\varrho_{S}=1$ ). This is undesirable, as may lead to the erroneous interpretation that the mesh is aligned correctly and possibly requires further refinement only in the $y$ -direction. (As here we compare $\mathring{{\mathcal{E}}}$ with the overall estimator ${\mathcal{E}}$ , it is important to note for the component $\|h_{T}f^{I}\|_{\Omega}$ of ${\mathcal{E}}$ that $\|h_{T}f^{I}\|_{\Omega}/\mathring{\mathcal{E}}$ was $\approx 1.43$ for the first estimator and $\varepsilon=2^{-4}$ , and did not exceed $1$ in all other computations for this problem.)

Remark 2.1

From the point of view of interpolation, if the anisotropic elements are aligned in the $x$ -direction, roughly speaking, one may expect that $|J_{S}|$ gives an approximation to $O(h_{y}|\partial_{y}^{2}u|+h_{x}|\partial^{2}_{xy}u|)$ for long edges and $O(h_{x}|\partial_{x}^{2}u|+h_{y}|\partial^{2}_{xy}u|)$ for short edges, where $h_{x}$ and $h_{y}$ are the mesh sizes respectively in the $x$ - and $y$ -directions. In all our computations $h_{y}\ll h_{x}$ . For the first test problem, $\partial^{2}_{xy}u=\partial_{y}^{2}u=0$ , which explains why the contributions of the short-edge residuals with correct weights are crucial for the overall efficiency of the estimator. In our second test, $h_{y}|\partial^{2}_{xy}u|$ is dominated by $h_{x}|\partial_{x}^{2}u|$ , but not as significantly, which is reflected in a more moderate deterioration of the estimator efficiency whenever $\varrho_{S}\ll 1$ for short edges. For the final test, $h_{x}|\partial^{2}_{xy}u|\simeq h_{x}|\partial_{x}^{2}u|$ , so $|J_{S}|$ takes similar in magnitude values for short and long edges; hence, even when the bulk contribution of short-edge residuals is almost nullified by $\varrho_{S}\ll 1$ , the overall estimator efficiency remains adequate.

2.2 Lower error bounds using the standard bubble approach

Here, for completeness, and with a view of motivating the new approach of §§4-5, we prove a version of the lower error bound from (KunVer00, , Theorem 5.1); see also (Ver_book_13, , Theorem 4.37). Similar bounds can also be found in (Kunert2000, , Theorem 2) for the 3d case, and in (Kun01, , Theorem 4.3) for a singularly perturbed equation; see also (Ver_book_13, , §4.5). Note also that Lemma 2.1 below gives a version of the lower error bounds from (Kun01, , Theorem 4.3), while in the earlier literature the weight $\varrho_{S}=\frac{|S|}{{\rm diam}(\omega_{S})}$ in the bounds of type (2.2b) was replaced by the smaller $\varrho_{S}^{2}$ .

Lemma 2.1

Let $\mathcal{T}$ satisfy the maximum angle condition, and let $|T|\simeq|\omega_{S}|$ $\forall\,T\subset\omega_{S}$ , $S\in{\mathcal{S}}\backslash\partial\Omega$ . Then for a solution $u$ of (1.1) and any $u_{h}\in S_{h}$ , one has

[TABLE]

Proof

(i) On any $T\in\mathcal{T}$ , consider $w:=f^{I}\,\phi_{1}\phi_{2}\phi_{3}$ , where $\{\phi_{i}\}_{i=1}^{3}$ are the standard hat functions associated with the three vertices of $T$ . Now, a standard calculation yields $\|f^{I}\|_{T}^{2}\simeq\langle f^{I},w\rangle$ . Note also that, in view of (1.1) and also $\triangle u_{h}=0$ on $T$ , one has $\langle f^{I},w\rangle=\langle\nabla(u-u_{h}),\nabla w\rangle-\langle f-f^{I},w\rangle$ . Next, invoking $\|\nabla w\|_{T}\lesssim h_{T}^{-1}\|w\|_{T}$ , one arrives at

[TABLE]

The first desired result (2.2a) follows in view of $\|w\|_{T}\lesssim\|f^{I}\|_{T}$ .

(ii) For each of the two triangles $T\subset\omega_{S}$ , introduce a triangle $\widetilde{T}\subseteq T$ with an edge $S$ such that $|\widetilde{T}|\simeq h_{T}|S|$ . Next, set $w:=J_{S}\,\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\widetilde{\phi}_{1}\widetilde{\phi}_{2}$ , where $\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\widetilde{\phi}_{1}$ and $\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\widetilde{\phi}_{2}$ are the hat functions associated with the end points of $S$ on the obtained triangulation $\{\widetilde{T}\}_{T\subset\omega_{S}}$ (with $w:=0$ on each $T\backslash\widetilde{T}$ for $T\subset\omega_{S}$ ). A standard calculation using $\triangle u_{h}=0$ in $T\subset\omega_{S}$ and (1.1), yields

[TABLE]

Next, invoking $\|\nabla w\|_{T}\lesssim h_{T}^{-1}\|w\|_{T}$ for any $T\subset\omega_{S}$ , we arrive at

[TABLE]

where ${\mathcal{Y}}^{T}_{\scriptsize\mbox{(\ref{lower_f})}}$ denotes the right-hand side of (2.2a), and the latter bound was also employed for the estimation of $\|f\|_{T}$ . The second desired bound (2.2b) follows in view of $h_{T}=|T|/H_{T}\simeq|\omega_{S}|/{\rm diam}(\omega_{S})$ . $\Box$

Remark 2.2

The piecewise-linear Lagrange interpolant $f^{I}$ of $f$ used in (2.2) may be replaced by any, possibly discontinuous, quasi-interpolant of $f$ (such as the piecewise-constant approximation of $f$ by its element average values).

Remark 2.3 (Deficiency of the bubble function approach)

An inspection of the above proof shows that it is sharp in the sense that it cannot be tweaked to remove the weight $\frac{|S|}{{\rm diam}(\omega_{S})}$ in (2.2b); see also Appendix A. More precisely, for such an improvement, one would need $h_{T}\simeq|\omega_{S}|/|S|$ in (2.3), which is not the case for short edges.

Remark 2.4 (Preview of the new approach)

The bubble function in the proof of (2.2b) may be viewed as a simplest local cut-off function. However, in the case of anisotropic mesh elements, its gradient is not consistent with the diameter of the local patch. To remedy this, when dealing with short edges in §§4-5 below, we shall switch to a cut-off function, the support of which comprises a larger local patch of anisotropic elements (rather than a two-triangle patch) and has an interior diameter $\simeq{\rm diam}(\omega_{S})$ . (Such local patches are highlighted in grey in Fig. 1 (left) and Fig. 2.) Unsurprisingly, this approach brings new challenges. For example, we have to deal with multiple edges inside this larger patch; in particular, we need to find a way to (almost) eliminate the jump residuals associated with the long edges. But this change of the paradigm will lead to essentially sharper lower error bounds of type (2.1a) with $\varrho_{S}=1$ .

3 Basic triangulation assumptions

In the remainder of the paper, we shall use $z=(x_{z},y_{z})$ , $S$ and $T$ to denote particular mesh nodes, edges and elements, respectively, while $\mathcal{N}$ , $\mathcal{S}$ and $\mathcal{T}$ will denote their respective sets. For each $z\in\mathcal{N}$ , let $\omega_{z}$ be the patch of elements surrounding $z$ , ${\mathcal{S}}_{z}$ the set of edges originating at $z$ , and

[TABLE]

Throughout the paper we make the following triangulation assumptions.

•

Maximum Angle condition. Let the maximum interior angle in any triangle $T\in\mathcal{T}$ be uniformly bounded by some positive $\alpha_{0}<\pi$ .

•

Local Element Orientation condition. For any $z\in\mathcal{N}$ , there is a rectangle $\omega^{*}_{z}\supset\omega_{z}$ such that $|\omega^{*}_{z}|\simeq|\omega_{z}|$ .

•

Also, let the number of triangles containing any node be uniformly bounded.

Note that the above conditions are automatically satisfied by shape-regular triangulations.

Additionally, we restrict our analysis to the following two node types defined using a fixed small constant $c_{0}$ (to distinguish between anisotropic and isotropic elements), with the notation $a\ll b$ for $a<c_{0}b$ .

(1) Anisotropic Nodes, the set of which is denoted by ${\mathcal{N}}_{\rm ani}$ , are such that

[TABLE]

Note that the above implies that ${\mathcal{S}}_{z}$ contains at most two edges of length ${}\lesssim h_{z}$ (see also Fig. 2).

(2) Regular Nodes, the set of which is denoted by ${\mathcal{N}}_{\rm reg}$ , are those surrounded by shape-regular mesh elements.

The above imposes a gradual transition between anisotropic and isotropic elements, i.e. the set ${\mathcal{N}}_{\rm ani}\cap{\mathcal{N}}_{\rm reg}$ is not necessarily empty. (To simplify the presentation, here we exclude more general node types, such as in Kopt15 ; Kopt_NM_17 ; Kopt17 , with both anisotropic and isotropic mesh elements allowed to appear within the same patch $\omega_{z}$ .)

Next, recall that $\omega_{S}$ is the patch of two elements sharing $S$ , and introduce the set of short edges

[TABLE]

Remark 3.1

By Lemma 2.1, one has $\bigl{\{}\sum_{S\subset{\mathcal{D}}\backslash\mathring{\mathcal{S}}}|\omega_{S}|J_{S}^{2}+\|h_{T}\,f^{I}\|_{\mathcal{D}}\bigr{\}}^{1/2}\lesssim{\mathcal{Y}}_{{\mathcal{D}}}$ . Indeed, this follows from (2.2) combined with ${\mathcal{Y}}^{T}_{\scriptsize\mbox{(\ref{lower_f})}}\leq{\mathcal{Y}}_{T}$ , where ${\mathcal{Y}}^{T}_{\scriptsize\mbox{(\ref{lower_f})}}$ denotes the right-hand side of (2.2a). Hence, for ${{\mathcal{E}}}_{\mathcal{D}}\lesssim{\mathcal{Y}}_{{\mathcal{D}}}$ , it suffices to prove that $\mathring{{\mathcal{E}}}_{\mathcal{D}}\lesssim{\mathcal{Y}}_{{\mathcal{D}}}$ .

4 Estimator efficiency on a partially structured anisotropic mesh

4.1 Lower error bound on a partially structured anisotropic mesh

To illustrate our approach in a simpler setting, we first present a version of the analysis for a simpler, partially structured, anisotropic mesh in a square domain $\Omega=(0,1)^{2}$ . So, throughout this section, we make the following triangulation assumptions.

A1.

Let $\{x_{i}\}_{i=0}^{n}$ be an arbitrary mesh on the interval $(0,1)$ in the $x$ direction. Then, let each $T\in\mathcal{T}$ , for some $i$ ,

(i) have the shortest edge on the line segment ${\mathcal{P}}_{i}:=\{x=x_{i},\,y\in[0,1]\}$ ;

(ii) have a vertex on ${\mathcal{P}}_{i+1}$ or ${\mathcal{P}}_{i-1}$ (see Fig. 1, left). 2. A2.

Let ${\mathcal{N}}={\mathcal{N}}_{\rm ani}$ , i.e. each mesh node $z$ satisfies (3.1). 3. A3.

Global Element Orientation condition. For any $z\in\mathcal{N}$ , there is a rectangle $\omega^{*}_{z}\supset\omega_{z}$ with sides parallel to the coordinate axes such that $|\omega^{*}_{z}|\simeq|\omega_{z}|$ .

These conditions essentially imply that all mesh elements are anisotropic and aligned in the $x$ -direction.

Theorem 4.1

Let $u$ and $u_{h}$ solve, respectively, (1.1) and (1.2) under conditions A1–A3. Then in $\Omega_{i}:=(x_{i-1},x_{i+1})\times(0,1)$ , using the notation (3.2), one has

[TABLE]

The remainder of this section will be devoted to the proof of this result.

Corollary 4.2

Under the conditions of Theorem 4.1, with $\Omega_{0}$ and $\Omega_{n}$ defined using $x_{-1}:=x_{0}$ and $x_{n+1}:=x_{n}$ , one has

[TABLE]

Proof

Combining (4.1) with $\mathring{{\mathcal{E}}}_{\Omega_{0}}=\mathring{{\mathcal{E}}}_{\Omega_{n}}=0$ (as there are no short edges in $\Omega_{0}\cup\Omega_{n}$ ) and Remark 3.1, we conclude that ${{\mathcal{E}}}_{\Omega_{i}}\lesssim{\mathcal{Y}}_{\Omega_{i}}$ $\forall\,i$ . The final bound ${{\mathcal{E}}}_{\Omega}\lesssim{\mathcal{Y}}_{\Omega}$ follows. $\Box$

Remark 4.1 (Estimator efficiency)

It follows from (Kopt_NM_17, , Theorems 5.1 and 7.4) that if $f(0,y)=f(1,y)$ , then $\|\nabla(u_{h}-u)\|_{\Omega}\lesssim{{\mathcal{E}}}_{\Omega}+\|H_{T}\,{\rm osc}(f\,;T)\|_{\Omega}+\|f-f^{I}\|_{\Omega}$ . Comparing this upper error bound with ${{\mathcal{E}}}_{\Omega}\lesssim{\mathcal{Y}}_{\Omega}$ from Corollary 4.2, we conclude that the error estimator ${{\mathcal{E}}}_{\Omega}$ is efficient up to data oscillation.

4.2 Preliminary results for partially structured meshes

The following result will be useful in the proof of Theorem 4.1.

Lemma 4.3

(i) If $z\in{\mathcal{P}}_{i}\backslash\partial\Omega$ for some $1\leq i\leq n-1$ , with $\gamma_{z}\cap{\mathcal{P}}_{i}$ formed by the two edges $S^{-}$ and $S^{+}$ , then

[TABLE]

(ii) If $z\in{\mathcal{P}}_{i}\cap\partial\Omega$ for some $1\leq i\leq n-1$ , with $\gamma_{z}\cap{\mathcal{P}}_{i}$ formed by a single edge $S^{+}\!$ , then (4.2) holds true with $J_{{S}^{-}}\!$ replaced by [math].

Proof

(i) As $z\not\in\partial\Omega$ , so $\sum_{S\in\gamma_{z}}\llbracket\nabla u_{h}\rrbracket_{S}=0$ , where $\llbracket\nabla u_{h}\rrbracket_{S}$ denotes the jump in $\nabla u_{h}$ across any edge $S$ in $\gamma_{z}$ evaluated in the anticlockwise direction about $z$ . Multiplying this relation by the unit vector ${\mathbf{i}}_{x}$ in the $x$ -direction, and noting that $\llbracket\nabla u_{h}\rrbracket_{{S}^{\pm}}\cdot{\mathbf{i}}_{x}=\pm J_{{S}^{\pm}}$ , one gets the desired assertion. Here we also use the observation that for $S\in\gamma_{z}\backslash{\mathcal{P}}_{i}$ , one has $|\llbracket\nabla u_{h}\rrbracket_{S}\cdot{\mathbf{i}}_{x}|\simeq|J_{S}\,{\boldsymbol{\nu}}_{S}\cdot{\mathbf{i}}_{x}|$ , where ${\boldsymbol{\nu}}_{S}$ is a unit normal vector to $S$ , for which A3 implies $|{\boldsymbol{\nu}}_{S}\cdot{\mathbf{i}}_{x}|\lesssim h_{z}H_{z}^{-1}$ .

(ii) Now $z\in\partial\Omega$ , so extend $u_{h}$ to $\mathbb{R}^{2}\backslash\Omega$ by [math] and imitate the above proof with the modification that now $\sum_{S\in{\mathcal{S}}_{z}}\llbracket\nabla u_{h}\rrbracket_{S}=0$ . When dealing with the two edges on $\partial\Omega$ , note that for $S\in{\mathcal{S}}_{z}\cap\partial\Omega$ , one gets ${\boldsymbol{\nu}}_{S}\cdot{\mathbf{i}}_{x}=0$ . $\Box$

Corollary 4.4

Under the conditions of Lemma 4.3, one has

[TABLE]

where ${\mathcal{Y}}_{\omega_{z}}$ is from (3.2a), and if $z\in{\mathcal{P}}_{i}\cap\partial\Omega$ , then $J_{{S}^{-}}\!$ in (4.3) is replaced by [math].

Proof

In view of (4.2), the left-hand side in (4.3) is $\lesssim\sum_{S\in\gamma_{z}\backslash{\mathcal{P}}_{i}}|\omega_{S}|J_{S}^{2}$ , where we also used $|\omega_{S}|\simeq|\omega_{z}|$ $\forall\,S\in\gamma_{z}$ . Next, note that the set of edges $\{S\in\gamma_{z}\backslash{\mathcal{P}}_{i}\}$ can be described as $\{S\subset\omega_{z}\backslash\mathring{\mathcal{S}}\}$ , so, by Remark 3.1, the desired assertion follows. $\Box$

Remark 4.2

The minimal rectangle $\omega_{z}^{*}$ from condition A3 is defined by $\omega_{z}^{*}=(x_{i-1},x_{i+1})\times(y_{z}^{-},y_{z}^{+})$ , where $(y_{z}^{-},y_{z}^{+})$ is the range of $y$ within $\omega_{z}$ . For this rectangle, the above conditions (in particular A3) imply that $y^{+}_{z}-y_{z}^{-}\simeq h_{z}$ . Furthermore, there is $k\lesssim 1$ such that $\omega_{z}^{*}\subset\omega_{z}^{(k)}$ $\forall\,z\in\mathcal{N}$ , where $\omega_{z}^{(0)}:=\omega_{z}$ , and $\omega_{z}^{(j+1)}$ denotes the patch of elements in/touching $\omega_{z}^{(j)}$ . This conclusion is illustrated on Fig. 1 (right). (Note that $k=1$ if our partially structured triangulation is non-obtuse.)

4.3 Proof of Theorem 4.1

Proof

Throughout the proof we shall use the somewhat simplified notation ${\mathcal{Y}}_{i}:={\mathcal{Y}}_{\Omega_{i}}$ and $\mathring{\mathcal{E}}_{i}:=\mathring{\mathcal{E}}_{\Omega_{i}}$ , and also will frequently drop the index $i$ and write ${\mathcal{P}}:={\mathcal{P}}_{i}=\{x=x_{i},\,y\in[0,1]\}$ , and $\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}H:=H_{i}:=\frac{1}{2}(x_{i+1}-x_{i-1})$ . With this notation, $\mathring{\mathcal{S}}\cap\Omega_{i}={\mathcal{P}}$ , so, taking into consideration the structure of the mesh (see Fig. 1, left), (3.2c) and (3.2a) with ${\mathcal{D}}=\Omega_{i}$ can be rewritten as

[TABLE]

Next, note that for any $v\in H_{0}^{1}(\Omega)$ and $v_{h}\in S_{h}$ , a standard calculation using (1.1), (1.2) yields

[TABLE]

As this immediately implies $\mathring{{\mathcal{E}}}^{2}_{i}\lesssim H(|\psi_{1}|+|\psi_{2}|+|\Psi|)$ , it suffices to prove that

[TABLE]

The desired assertion (4.1) will indeed follow, in view of ${\mathcal{Y}}_{i}\,\mathring{{\mathcal{E}}}_{i}\leq\theta\mathring{{\mathcal{E}}}_{i}^{2}+\frac{1}{4}\theta^{-1}{\mathcal{Y}}_{i}^{2}$ with a sufficiently small positive constant $\theta$ .

The remainder of the proof is split into three parts. In part (i), we shall describe appropriate non-standard $v_{h}$ and $v$ , which will be crucial for (4.6) to hold true. Certain sufficient conditions for the latter will be established in part (ii), and then shown to be satisfied in part (iii).

(i) Crucially, in (4.5), we require that $v_{h}\in S_{h}$ and $v:=\hat{v}_{h}\not\in S_{h}$ both have support in $\Omega_{i}$ and satisfy

[TABLE]

Note that $\gamma_{z}\cap{\mathcal{P}}$ , which appears in the definition of nodal values of $v_{h}$ , includes exactly two short edges, while, to be more precise, $\hat{v}_{h}$ has support in ${\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\hat{\Omega}_{i}}:=(x_{i-1/2},x_{i+1/2})\times(0,1)\subset\Omega_{i}$ .

(ii) We claim that for (4.6), and hence for the desired assertion (4.1), it suffices to prove that the following conditions are satisfied:

[TABLE]

Indeed, for $\psi_{1}$ from (4.5), by (4.4), one immediately has $|\psi_{1}|\lesssim{\mathcal{Y}}_{i}\|\nabla v\|_{2\,;\Omega_{i}}$ . Here, by (4.7), $\|\nabla v\|_{2\,;\Omega_{i}}=\|\nabla\hat{v}_{h}\|_{2\,;\Omega_{i}}\simeq\|\nabla v_{h}\|_{2\,;\Omega_{i}}$ , for which we have (4.8b). Combining these observations, one gets the desired bound on $\psi_{1}$ in (4.6).

Next, for $\psi_{2}$ from (4.5), set $\hat{f}(x,y):=f(x_{i}+{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}2}[x-x_{i}],y)$ (similarly to $\hat{v}_{h}$ in (4.7)). Then $\frac{1}{2}\langle f,v_{h}\rangle=\langle\hat{f},\hat{v}_{h}\rangle=\langle\hat{f},v\rangle$ , so

[TABLE]

Here we also used $\|v\|_{2\,;\Omega_{i}}=\|\hat{v}_{h}\|_{2\,;\Omega_{i}}\simeq\|v_{h}\|_{2\,;\Omega_{i}}$ (in view of (4.7)), while the bound on $\|f-\hat{f}\|_{2\,;\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\hat{\Omega}_{i}}$ follows from Remark 4.2. Combining the above with $H\|{\rm osc}(f\,;T)\|_{2\,;\Omega_{i}}\lesssim{\mathcal{Y}}_{i}$ (in view of (4.4)) and the bound in (4.8b) on $\|v_{h}\|_{2\,;\Omega_{i}}$ yields the desired bound on $\psi_{2}$ in (4.6).

Finally, consider $\Psi$ , the most delicate term in (4.5). To check that the corresponding bound in (4.6) follows from (4.8), note that in each triangle $T\in{\mathcal{T}}\cap\Omega_{i}$ , one has $\triangle u_{h}=0$ , so $\int_{T}\nabla u_{h}\cdot\nabla(v-\frac{1}{2}v_{h})=\int_{\partial T}\nabla u_{h}\cdot{\boldsymbol{\nu}}(v-\frac{1}{2}v_{h})$ . Note also that $v=v_{h}=0$ on $\partial\Omega_{i}$ , so $\langle\nabla u_{h},\nabla(v-{\textstyle\frac{1}{2}}v_{h})\rangle=\sum_{S\subset\Omega_{i}}\int_{S}J_{S}(v-{\textstyle\frac{1}{2}}v_{h})$ . It also follows from (4.7) that $v-{\textstyle\frac{1}{2}}v_{h}={\textstyle\frac{1}{2}}v_{h}$ on ${\mathcal{P}}$ . Combining these observations, one gets

[TABLE]

Now, subtracting $\frac{1}{2}H^{-1}\mathring{{\mathcal{E}}}_{i}^{2}=\frac{1}{2}\int_{{\mathcal{P}}}J_{S}^{2}$ (in view of (4.4)) yields

[TABLE]

So, using (4.4) for the first term, and (4.8a) combined with Remark 4.2 for the second, one gets

[TABLE]

When dealing with the second term, we also used $|\omega_{z}^{*}|\simeq|\omega_{z}|\simeq|\omega_{S}|$ for any edge $S$ originating at $z\in{\mathcal{P}}$ . For the first term in (4.11), in view of (4.7), $\|v_{h}-J_{S}\|_{2\,;{\mathcal{P}}}\lesssim\|{\rm osc}(v_{h}\,;S)\|_{2\,;{\mathcal{P}}}\lesssim H^{-1/2}{\mathcal{Y}}_{i}$ , where the latter bound follows from (4.8c) combined with $\frac{H}{|S|}\gtrsim 1$ and $|\omega_{S}|\simeq H|S|$ $\forall\,S\subset{\mathcal{P}}$ . The second term in (4.11) is bounded by ${\mathcal{Y}}_{i}\cdot H^{-1}\!(\mathring{{\mathcal{E}}}_{i}+{\mathcal{Y}_{i}})$ , where we used Remark 3.1 and (4.8b). Combining these findings yields the desired bound on $\Psi$ in (4.6).

(iii) To complete the proof, it remains to establish the three bounds on $v_{h}$ in (4.8). To establish (4.8a), for any $S\subset\gamma_{z}\backslash{\mathcal{P}}$ starting at $z=(x_{i},y_{z})$ , let $S^{\prime}:={\rm proj}_{y=y_{z}}S$ , the projection of $S$ onto the line $y=y_{z}$ . Then, by (4.7), $\int_{S^{\prime}}\hat{v}_{h}=\frac{1}{2}\int_{S^{\prime}}v_{h}$ . On the other hand, by A3, one has $\Bigl{|}\int_{S}v_{h}-\frac{|S|}{|S^{\prime}|}\int_{S^{\prime}}v_{h}\Bigr{|}\lesssim\|\partial_{y}v_{h}\|_{1\,;\omega_{z}^{*}}$ and a similar bound on $\hat{v}_{h}$ (see, e.g., (Kopt_NM_17, , Lemma 7.1)). Combining these observations, and also noting that $\|\partial_{y}\hat{v}_{h}\|_{1\,;\omega_{z}^{*}}\simeq\|\partial_{y}v_{h}\|_{1\,;\omega_{z}^{*}}$ , yields (4.8a).

For (4.8b), first, note that $v_{h}\in S_{h}$ has support in $\Omega_{i}$ , so $\|v_{h}\|^{2}_{2\,;\Omega_{i}}\simeq H\|v_{h}\|^{2}_{2\,;{\mathcal{P}}}\lesssim H\|J_{S}\|^{2}_{2\,;{\mathcal{P}}}={\mathcal{E}}_{i}^{2}$ , where we used (4.7) and then (4.4). Furthermore, $\|\nabla v_{h}\|_{2\,;\Omega_{i}}\lesssim\|\partial_{y}v_{h}\|_{2\,;\Omega_{i}}+H^{-1}\|v_{h}\|_{2\,;\Omega_{i}}$ . So it remains to bound $\|\partial_{y}v_{h}\|^{2}_{2\,;\Omega_{i}}$ , for which we note that $|\partial_{y}v_{h}|=|S|^{-1}{\rm osc}(v_{h}\,;S)$ on any $T$ having an edge $S\subset{\mathcal{P}}$ (while otherwise $\partial_{y}v_{h}=0$ ). Assuming that (4.8c) is true, one then gets $\|H\,\partial_{y}v_{h}\|^{2}_{2\,;\Omega_{i}}\lesssim{\mathcal{Y}}_{i}^{2}$ . Combining our findings, we conclude that (4.8b) follows from (4.8c).

Finally, to establish (4.8c), recall (4.3) and combine it with the definition of $v_{h}$ in (4.7) and the observation that $\sum_{z\in{\mathcal{P}}_{i}}{\mathcal{Y}}_{\omega_{z}}^{2}\lesssim{\mathcal{Y}}_{i}^{2}$ . $\Box$

Remark 4.3 (Non-smooth $f$ )

An inspection of the above proof shows that (4.1) remains valid if in ${\mathcal{Y}}_{\Omega_{i}}$ , which appears in the right-hand side, the term $\|H_{T}\,{\rm osc}(f\,;T)\|_{\Omega_{i}}$ is replaced by $\|H_{T}(f-\bar{f}_{i})\|_{\Omega_{i}}+\|h_{T}(f-f^{I})\|_{\Omega_{i}}$ , where $\bar{f}_{i}=\bar{f}_{i}(y)$ is an arbitrary function of variable $y$ . To be more precise, the bound (4.9) for $\psi_{2}$ can be replaced by $|\psi_{2}|=|\langle f-\bar{f}_{i},v-\frac{1}{2}v_{h}\rangle|\lesssim\|f-\bar{f}_{i}\|_{2\,;\Omega_{i}}\,\|v_{h}\|_{2\,;\Omega_{i}}$ . Additionally, we use a sharper version of the bound (4.3), with $H_{T}\,{\rm osc}(f\,;T)$ in the right-hand side term ${\mathcal{Y}}_{\omega_{z}}$ replaced by $h_{T}(f-f^{I})$ . (This version of (4.3) holds true as a similar improvement applies to the bound of Remark 3.1.) Note that if $f\in H^{1}(\Omega)$ , then $\bar{f}_{i}(y)$ may be chosen equal to a 1d local average of $f$ , and if $f\in L_{2}(\Omega)$ , then $\bar{f}_{i}(y)$ may be piecewise-constant with local 2d average values, while $f^{I}$ may be a quasi-interpolant described in Remark 2.2.

5 Estimator efficiency on more general meshes

5.1 Main result

Under the triangulation assumptions of §3, each anisotropic node shares the local patch orientation with its anisotropic neighbours. So it is not unreasonable to expect that an anisotropic mesh may include small clusters of anisotropic elements sharing the same orientation. In fact, (in particular, if a locally anisotropic mesh was generated starting from some regular mesh) one may assume that the entire anisotropic part of the mesh can be split into sufficiently large, and possibly overlapping, clusters of anisotropic elements with interior cluster diameters $\simeq{\rm diam}(\omega_{z})$ .

Hence, in this section, lower error bounds will be given for small patches of elements surrounding what will be called a local anisotropic path (also see Fig. 2 (left)).

Definition. A Local Anisotropic Path ${\mathcal{P}}$ is a simple polygonal curve formed by a subset of short edges, together with their endpoints, that does not touch any corners of $\Omega$ , has 2 endpoints (the set of the latter is denoted $\partial{\mathcal{P}}$ ), and satisfies the following conditions:

•

Any node $z\in{\mathcal{P}}$ is anisotropic in the sense (3.1) and satisfies $H_{z}\simeq H_{\mathcal{P}}$ for some $H_{\mathcal{P}}$ associated with ${\mathcal{P}}$ , and also $|\gamma_{z}\cap{\mathcal{P}}|\simeq h_{z}$ (so $\gamma_{z}\cap{\mathcal{P}}$ is formed by at most two short edges).

•

Path Element Orientation condition. There exists a path-specific cartesian coordinate system $(\xi,\eta)=(\xi_{\mathcal{P}},\eta_{\mathcal{P}})$ such that for any node $z\in{\mathcal{P}}$ , there is a rectangle $\omega^{*}_{z}\supset\omega_{z}$ with sides parallel to the coordinate axes and $|\omega^{*}_{z}|\simeq|\omega_{z}|$ .

Theorem 5.1 (Short-edge jump residual terms)

Suppose that ${\mathcal{P}}_{0}\subset{\mathcal{P}}$ , where ${\mathcal{P}}_{0}$ and ${\mathcal{P}}$ are local anisotropic paths that share a coordinate system $(\xi,\eta)$ , and also $\partial{\mathcal{P}}\cap\partial\Omega\subset\partial{\mathcal{P}}_{0}$ and ${\rm dist}(\partial{\mathcal{P}}\backslash\partial\Omega,\,\partial{\mathcal{P}}_{0}\backslash\partial\Omega)\simeq H_{\mathcal{P}}$ . Then for $u$ and $u_{h}$ satisfying respectively (1.1) and (1.2), with the notation (3.2), one has

[TABLE]

The remainder of this section is devoted to the proof of this result.

Corollary 5.2

Under the conditions of Theorem 5.1, one has

[TABLE]

Proof

As (5.1) is equivalent to $\mathring{{\mathcal{E}}}_{\omega_{{\mathcal{P}}_{0}}}\lesssim{\mathcal{Y}}_{\omega_{{\mathcal{P}}}}$ , combining the latter with Remark 3.1 immediately yields the desired result. $\Box$

Remark 5.1 (Estimator efficiency)

It follows from (Kopt_NM_17, , §6.1 and Theorem 7.4) that under conditions on the mesh described in §3 and some additional assumptions on the orientation of anisotropic mesh elements, the error bound (1.4) holds true, i.e. $\|\nabla(u_{h}-u)\|_{\Omega}\lesssim{{\mathcal{E}}}_{\Omega}+\|H_{T}\,{\rm osc}(f\,;T)\|_{\Omega}+\|f-f^{I}\|_{\Omega}$ . Note that for any regular node $z\in{\mathcal{N}}_{\rm reg}$ , (2.2) yields a standard bound ${{\mathcal{E}}}_{\omega_{z}}\lesssim{\mathcal{Y}}_{\omega_{z}}$ . Now, suppose that all anisotropic nodes in ${\mathcal{N}}_{\rm ani}\backslash{\mathcal{N}}_{\rm reg}$ can be split into disjoint sets, each forming a local anisotropic path of type ${\mathcal{P}}_{0}$ in Theorem 5.1,

and any node in $\mathcal{N}$ belongs to at most a finite number of the respective paths of type ${\mathcal{P}}$ .

Then, in view of Corollary 5.2, one gets ${{\mathcal{E}}}_{\Omega}\lesssim{\mathcal{Y}}_{\Omega}$ , i.e. the error estimator ${{\mathcal{E}}}_{\Omega}$ is efficient up to data oscillation.

Remark 5.2 (Singular perturbation case)

Note that the upper a posteriori error bounds in Kopt_NM_17 were obtained for more general singularly perturbed semilinear reaction-diffusion equations, solutions of which typically exhibit sharp boundary and interior layers, so anisotropic meshes are frequently employed in their numerical solution. With regard to the lower error bounds for such equations, the standard bubble-function approach was employed in Kun01 , and, as was shown in §2, the resulting estimates are not sharp even in the regular regime. Sharper lower bounds of type (5.1) will be generalized to this case in a forthcoming paper.

5.2 Preliminary results for a local anisotropic path

To prove Theorem 5.1, we shall use a version of Lemma 4.3, in which we shall consider the normalized version of $J_{S}$ defined by

[TABLE]

Here ${\mathcal{P}}$ is a local anisotropic path associated with the coordinate system $(\xi,\eta)$ , ${\mathbf{i}}_{\xi}$ is the unit vector in the $\xi$ -direction, and ${\boldsymbol{\nu}}_{S}$ is a unit vector normal to $S$ , while $|{\boldsymbol{\nu}}_{S}\cdot{\mathbf{i}}_{\xi}|\simeq 1$ follows from $S$ being a short edge and the path element orientation condition. It may be helpful to note that $J_{S}^{\prime}$ equals a signed jump of $\partial_{\xi}u_{h}$ across $S$ .

Lemma 5.3

*Let ${\mathcal{P}}$ be a local anisotropic path associated with the coordinate system $(\xi,\eta)$ , and $J_{S}^{\prime}$ from (5.2).

(i) For any node $z\in{\mathcal{P}}\backslash\partial{\mathcal{P}}$ , with $\gamma_{z}\cap{\mathcal{P}}$ formed by two edges $S^{-}$ and $S^{+}$ ,*

[TABLE]

(ii) If $z\in\partial{\mathcal{P}}\cap\partial\Omega$ , with $\gamma_{z}\cap{\mathcal{P}}$ formed by a single edge $S^{+}\!$ , then (5.3) holds true with $J^{\prime}_{{S}^{-}}\!$ replaced by [math].

Proof

(i) As $z\in{\mathcal{N}}_{\rm ani}\backslash\partial\Omega$ , so $\sum_{S\in\gamma_{z}}\llbracket\nabla u_{h}\rrbracket_{S}=0$ , where $\llbracket\nabla u_{h}\rrbracket_{S}$ denotes the jump in $\nabla u_{h}$ across any edge $S$ in $\gamma_{z}$ evaluated in the anticlockwise direction about $z$ . Multiply this relation by the unit vector ${\mathbf{i}}_{\xi}$ in the $\xi$ -direction, and note that the quantities ${\boldsymbol{\nu}}_{S}\cdot{\mathbf{i}}_{\xi}$ for $S=S^{\pm}$ have opposite signs (in view of the path element orientation condition combined with the maximum angle condition), so $|(\llbracket\nabla u_{h}\rrbracket_{{S}^{-}}+\llbracket\nabla u_{h}\rrbracket_{{S}^{+}})\cdot{\mathbf{i}}_{\xi}|=|J^{\prime}_{{S}^{+}}-J^{\prime}_{{S}^{-}}|$ . Note also that for $S\in\gamma_{z}\backslash{\mathcal{P}}$ , one has $|S|\simeq H_{z}$ and $|{\boldsymbol{\nu}}_{S}\cdot{\mathbf{i}}_{\xi}|\lesssim h_{z}H_{z}^{-1}$ (again, in view of the path element orientation condition combined with the maximum angle condition), so $|\llbracket\nabla u_{h}\rrbracket_{S}\cdot{\mathbf{i}}_{\xi}|=|J_{S}\,{\boldsymbol{\nu}}_{S}\cdot{\mathbf{i}}_{\xi}|\lesssim h_{z}H_{z}^{-1}|J_{S}|$ . Combining theses observations yields the desired assertion (5.3).

(ii) Now $z\in{\mathcal{N}}_{\rm ani}\cap\partial\Omega$ , and $z$ is not a corner of $\partial\Omega$ . First, suppose that ${\mathcal{S}}_{z}\cap\partial\Omega$ is parallel to the $\xi$ -axis. Then extend $u_{h}$ to $\mathbb{R}^{2}\backslash\Omega$ by [math] and imitate the above proof with the modification that now $\sum_{S\in{\mathcal{S}}_{z}}\llbracket\nabla u_{h}\rrbracket_{S}=0$ . When dealing with the two edges on $\partial\Omega$ , note that for $S\in{\mathcal{S}}_{z}\cap\partial\Omega$ , one gets ${\boldsymbol{\nu}}_{S}\cdot{\mathbf{i}}_{\xi}=0$ .

Finally, suppose ${\mathcal{S}}_{z}\cap\partial\Omega$ is not parallel to the $\xi$ -axis; then introduce a $\widetilde{\xi}$ -axis parallel to ${\mathcal{S}}_{z}\cap\partial\Omega$ . Now the above argument yields a version of (5.3) with $J^{\prime}_{{S}^{+}}-J^{\prime}_{{S}^{-}}$ replaced by $\widetilde{J}^{\prime}_{S}:=J_{S}|{\boldsymbol{\nu}}_{S}\cdot{\mathbf{i}}_{\widetilde{\xi}}|$ . The desired result follows as $\widetilde{J}^{\prime}_{S}\simeq J_{S}\simeq J_{S}^{\prime}$ . The latter follows from $|{\boldsymbol{\nu}}_{S}\cdot{\mathbf{i}}_{\widetilde{\xi}}|\simeq 1$ , in view of the path element orientation condition combined with the maximum angle condition. $\Box$

Corollary 5.4

Under the conditions of Lemma 5.3, one has

[TABLE]

where ${\mathcal{Y}}_{\omega_{z}}$ is from (3.2a), and if $z\in{\mathcal{P}}\cap\partial\Omega$ , then $J_{{S}^{-}}\!$ in (5.4) is replaced by [math].

Proof

Imitate the proof of Corollary 4.4. $\Box$

Remark 5.3

Similarly to the case of a partially structured mesh (see Remark 4.2 and Fig. 1 (right)), there is $k\lesssim 1$ such that each rectangle $\omega_{z}^{*}$ from the above path element orientation condition satisfies $\omega_{z}^{*}\cap\omega_{\mathcal{P}}\subset\omega_{z}^{(k)}$ for all $z\in{\mathcal{P}}$ .

5.3 Proof of Theorem 5.1

We generalize the proof of Theorem 4.1.

Proof

Without loss of generality, let $\partial{\mathcal{P}}=\{z_{0},z_{1}\}$ such that $z_{0}\in\partial\Omega$ and $z_{1}\not\in\partial\Omega$ (see Fig. 2). Also, to simplify the presentation, let the $\xi$ -axis be parallel to $\partial\Omega$ at $z_{0}$ (otherwise, see Remark 5.4).

Set $H:=H_{\mathcal{P}}\simeq H_{{\mathcal{P}}_{0}}$ . A certain weight $\rho_{S}\in[0,1]$ will be associated with each $S\subset{\mathcal{P}}$ , and it will be imposed that $\rho_{S}=1$ $\forall\,S\subset{\mathcal{P}}_{0}$ . Hence, it suffices to prove that

[TABLE]

where $J_{S}^{\prime}$ is from (5.2). Then, indeed, in view of $H|S|\simeq|\omega_{S}|$ and $J_{S}^{\prime}\simeq J_{S}$ , (5.5) immediately implies the desired assertion (5.1).

Next, note that for any $v\in H_{0}^{1}(\Omega)$ and $v_{h}\in S_{h}$ , a standard calculation using (1.1), (1.2) yields

[TABLE]

As this immediately implies ${{\widetilde{\mathcal{E}}}}_{{\mathcal{P}}}^{2}\lesssim H(|\psi_{1}|+|\psi_{2}|+|\Psi|)$ , to get (5.5) (and hence the desired assertion (5.1)), it suffices to prove that

[TABLE]

The remainder of the proof is split into three parts. In part (i), we shall describe appropriate weights $\{\rho_{S}\}$ and non-standard functions $v_{h}$ and $v$ , which will be crucial for (5.7) to hold true. Certain sufficient conditions for the latter will be established in part (ii), and then shown to be satisfied in part (iii).

(i) We start by introducing a smooth monotone cut-off function $\rho$ of the arc-length parameter $l$ of ${\mathcal{P}}$ such that

[TABLE]

Here for the final relation, recall that ${\rm dist}(\partial{\mathcal{P}}\backslash\partial\Omega,\partial{\mathcal{P}}_{0}\backslash\partial\Omega)\simeq H$ and let $\rho$ be quadratic near its zeros.

Next, introduce

[TABLE]

where $J_{S}^{\prime}=J_{S}|{\boldsymbol{\nu}}_{S}\cdot{\mathbf{i}}_{\xi}|$ is from (5.2) (and also appears in (5.5)), and ${{\mathcal{P}}}_{S}$ denotes the patch of (at most three) edges in ${\mathcal{P}}$ touching $S$ (so $S\subset{{\mathcal{P}}}_{S}\subset{\mathcal{P}}$ ).

Finally, in (5.6), we let $v_{h}\in S_{h}$ , with support in $\omega_{{\mathcal{P}}}$ , and $v:=\hat{v}_{h}\in H^{1}_{0}(\Omega)$ satisfy

[TABLE]

Here the function $\bar{\xi}=\bar{\xi}_{{\mathcal{P}}}(\eta)\in C(\mathbb{R})$ describes the curve ${\mathcal{P}}$ for the range of $\eta$ in ${\mathcal{P}}$ , and is constant outside this range. Without loss of generality, $\omega_{z_{1}}^{*}\subset\Omega$ , so $\hat{v}_{h}$ has support in $\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\omega_{{\mathcal{P}}}$ . (Otherwise, in view of Remark 5.3, shorten ${\mathcal{P}}$ by $k$ short edges starting from $z_{1}$ , where $k\lesssim 1$ .)

For $\bar{\xi}_{{\mathcal{P}}}(\eta)$ in (5.10), note that $|\bar{\xi}^{\prime}_{{\mathcal{P}}}|\lesssim 1$ (in view of the path element orientation condition combined with the maximum angle condition). This observation implies that $\hat{v}_{h}$ is well-defined in $\Omega$ , and $\|\nabla\hat{v}_{h}\|_{2\,;{\Omega}}\simeq\|\nabla v_{h}\|_{2\,;{\omega_{\mathcal{P}}}}$ , as well as $\|\hat{v}_{h}\|_{2\,;{\Omega}}\simeq\|v_{h}\|_{2\,;{\omega_{\mathcal{P}}}}$ .

Note also a few useful properties, which follow from (5.9) and (5.10):

[TABLE]

To check (5.11a), note that $v_{h}$ is linear on $S\subset{\mathcal{P}}$ , so $|S|^{-1}\int_{S}v_{h}$ is between $\rho_{S}\min_{{\mathcal{P}}_{S}}\{J^{\prime}_{S}\}$ and $\rho_{S}\max_{{\mathcal{P}}_{S}}\{J^{\prime}_{S}\}$ , so this assertion follows. For (5.11b), we note that ${\textstyle\frac{1}{2}}\rho(z)\leq\rho_{S}$ for any $S\in\gamma_{z}\cap{\mathcal{P}}$ , so $|v_{h}(z)|\leq\sum_{S\in\gamma_{z}\cap{\mathcal{P}}}|\rho_{S}J^{\prime}_{S}|$ . Finally, $\forall\,z\in\partial S$ , where $S\subset{\mathcal{P}}$ , one has $|v_{h}(z)-\rho(z)J_{S}^{\prime}|\leq\rho(z)\delta_{S}\leq\delta_{S}$ , so ${\rm osc}(v_{h}\,;S)\leq{\rm osc}(\rho\,;S)|J^{\prime}_{S}|+2\delta_{S}$ . Here $|J_{S}^{\prime}|\leq|J_{S}|$ , while the final relationship in (5.8) yields ${\rm osc}(\rho\,;S)\lesssim\frac{|S|}{H}\sqrt{\rho_{S}}$ (where we also used $\sup_{S}{\rho}\leq 2\rho_{S}$ as $\rho(l)$ is monotone).

(ii) We claim that for (5.7), and hence for the desired assertion (5.1), it suffices to prove that the following conditions (which give a version of (4.8)) are satisfied:

[TABLE]

Note that $\psi_{1}$ and $\psi_{2}$ are shown to satisfy (5.7) using (5.12) in a very similar manner to the corresponding bounds in part (ii) of the proof of Theorem 4.1, only for $\psi_{2}$ we now employ $\hat{f}:=f\bigl{(}{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\bar{\xi}_{{\mathcal{P}}}(\eta)+2[\xi-\bar{\xi}_{{\mathcal{P}}}(\eta)]},\eta\bigr{)}$ and then Remark 5.3.

To show that $\Psi$ also satisfies (5.7), first, we get a version of (4.10) with $\Omega_{i}$ replaced by $\omega_{{\mathcal{P}}}$ . Next, subtracting $\frac{1}{2}H^{-1}{{\widetilde{\mathcal{E}}}}_{{\mathcal{P}}}^{2}=\frac{1}{2}\sum_{S\subset{\mathcal{P}}}{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}|S|}\rho_{S}J_{S}J^{\prime}_{S}$ (in view of the definition of ${{\widetilde{\mathcal{E}}}}_{{\mathcal{P}}}$ in (5.5)) yields

[TABLE]

So, using the definition of ${{\widetilde{\mathcal{E}}}}_{{\mathcal{P}}}$ combined with $J_{S}\simeq J_{S}^{\prime}$ for the first term, and combined with Remark 5.3 for the second, one gets

[TABLE]

When dealing with the second term, we also used $|\omega_{z}^{*}|\simeq|\omega_{z}|\simeq|\omega_{S}|$ for any edge $S$ originating at $z\in{\mathcal{P}}$ . For the first term in (5.13), $\|\delta_{S}\|_{2\,;{\mathcal{P}}}\lesssim H^{-1/2}{\mathcal{Y}_{\omega_{\mathcal{P}}}}$ , which follows from (5.12c) combined with $\frac{H}{|S|}\gtrsim 1$ and $|\omega_{S}|\simeq H|S|$ $\forall\,S\subset{\mathcal{P}}$ . The second term in (5.13) is bounded by ${\mathcal{Y}_{\omega_{\mathcal{P}}}}\cdot H^{-1}\!({{\widetilde{\mathcal{E}}}}_{{\mathcal{P}}}+{\mathcal{Y}_{\omega_{\mathcal{P}}}})$ , where we used Remark 3.1 and (5.12b). Combining these findings yields the desired bound on $\Psi$ in (5.7).

(iii) To complete the proof, it remains to establish the three bounds on $v_{h}$ in (5.12). The first bound (5.12a) is obtained similarly to (4.8a). Only now for any $S\subset\gamma_{z}\backslash{\mathcal{P}}$ starting at $z=(\xi_{z},\eta_{z})$ , we use $S^{\prime}:={\rm proj}_{\eta=\eta_{z}}S$ , the projection of $S$ onto the line $\eta=\eta_{z}$ , and also $\|\partial_{\eta}\hat{v}_{h}\|_{1\,;\omega_{z}^{*}}\lesssim\|\nabla v_{h}\|_{1\,;\omega_{z}^{*}}=\|\nabla v_{h}\|_{1\,;\omega_{z}^{*}\cap\omega_{{\mathcal{P}}}}$ .

For (5.12b), first, note that $v_{h}\in S_{h}$ with support in $\omega_{{\mathcal{P}}}$ , so $\|v_{h}\|^{2}_{2\,;\omega_{{\mathcal{P}}}}\simeq H\|v_{h}\|^{2}_{2\,;{\mathcal{P}}}\lesssim H\|\rho_{S}J^{\prime}_{S}\|^{2}_{2\,;{\mathcal{P}}}\leq{{\widetilde{\mathcal{E}}}}_{{\mathcal{P}}}^{2}$ , where we used (5.11b) and also the definition of ${{\widetilde{\mathcal{E}}}}_{{\mathcal{P}}}$ in (5.5). Furthermore, on any $S\subset{\mathcal{P}}$ one has $|H\,\partial_{l}u_{h}|=\frac{H}{|S|}{\rm osc}(v_{h}\,;S)$ , so

[TABLE]

Here $H\|\sqrt{\rho_{S}}J_{S}\|^{2}_{2\,;{{\mathcal{P}}}}\lesssim{{\widetilde{\mathcal{E}}}}_{{\mathcal{P}}}$ , while $H\|\frac{H}{|S|}\delta_{S}\|^{2}_{2\,;{{\mathcal{P}}}}\lesssim{\mathcal{Y}}_{\omega_{\mathcal{P}}}^{2}$ assuming that (5.12c) is true. Combining our findings, we conclude that (5.12b) follows from (5.12c).

Finally, (5.12c) is obtained similarly to (4.8c). To be more precise we recall (5.4) and combine it with the definition of $\delta_{S}$ in (5.9) and the observation that $\sum_{T\subset\omega_{{\mathcal{P}}}}{\mathcal{Y}}_{T}^{2}\lesssim{\mathcal{Y}}_{\omega_{{\mathcal{P}}}}^{2}$ . $\Box$

Remark 5.4

If in the proof of Theorem 5.1 $z_{0}\in\partial\Omega\cap\partial{\mathcal{P}}$ is such that the $\xi$ -axis is not parallel to $\partial\Omega$ at $z_{0}$ , then one needs to tweak the definition of $v_{h}$ so that its support is in $\omega_{{\mathcal{P}}}\backslash\omega^{*}_{z_{0}}$ (rather than in $\omega_{{\mathcal{P}}}$ ). This modification is required to ensure that $\hat{v}_{h}$ has support in $\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\omega_{{\mathcal{P}}}$ . For this, $\rho$ remains unchanged (i.e. equal to $1$ ) on ${\mathcal{P}}$ near $\partial\Omega$ , while we now set $v_{h}(z):=0$ for any $z\in{\mathcal{P}}\cap\omega^{*}_{z_{0}}$ . Note that the evaluations will remain without major changes as ${\mathcal{P}}\cap\omega^{*}_{z_{0}}$ includes a finite number of edges (in view of Remark 5.3), so ${\rm osc}(v_{h}\,;S)$ for the edge $S\subset{\mathcal{P}}\backslash\omega^{*}_{z_{0}}$ closest to $\partial\Omega$ will involve ${\rm osc}(J_{S}^{\prime}\,;{\mathcal{P}}\cap\omega^{*}_{z_{0}})$ , the estimation of which will require a finite number of applications of (5.4).

6 Conclusion

We have reviewed lower a posteriori error bounds obtained using the standard bubble function approach in the context of anisotropic meshes. Numerical examples have been given in §2 that clearly demonstrate that the short-edge jump residual terms in such bounds are not sharp. Hence, in §§4–5, for linear finite element approximations of the Laplace equation in polygonal domains, a new approach has been presented that yields essentially sharper lower a posteriori error bounds and thus shows that the upper error estimator (1.4) from the recent paper Kopt_NM_17 is efficient on partially structured anisotropic meshes.

Appendix A Generalized proof of (2.2b) for the case $\frac{|S|}{{\rm diam}(S)}\ll 1$

The purpose of this section is to illustrate Remark 2.3 by giving a more general version of the proof of (2.2b) in Lemma 2.1, which shows that the latter proof cannot be tweaked to remove the weight $\frac{|S|}{{\rm diam}(\omega_{S})}$ in (2.2b).

Proof of (2.2b) for the case $\frac{|S|}{{\rm diam}(S)}\ll 1$ . As (2.2b) is obtained in part (ii) of the proof of Lemma 2.1, we generalize only this part. Also, we shall consider only the case $\frac{|S|}{{\rm diam}(S)}\ll 1$ . Hence, in view of the conditions of Lemma 2.1, one has $|S|\simeq h_{T}$ $\forall\,T\subset\omega_{S}$ .

(ii) For each of the two triangles $T\subset\omega_{S}$ , introduce a triangle $\widetilde{T}\subseteq T$ with an edge $S$ such that $|\widetilde{T}|\simeq\kappa h_{T}|S|$ . In the original proof, we used $\kappa=1$ , while now, to allow more flexibility, it is assumed that $0<\kappa h_{T}\lesssim{\rm diam}(S)$ .

Next, set $w:=J_{S}\,\widetilde{\phi}_{1}\widetilde{\phi}_{2}$ , where $\widetilde{\phi}_{2}$ and $\widetilde{\phi}_{2}$ are the hat functions associated with the end points of $S$ on the obtained triangulation $\{\widetilde{T}\}_{T\subset\omega_{S}}$ (with $w:=0$ on each $T\backslash\widetilde{T}$ for $T\subset\omega_{S}$ ). A standard calculation using $\triangle u_{h}=0$ in $T\subset\omega_{S}$ and (1.1), yields

[TABLE]

Next, invoking $\|\nabla w\|_{T}\lesssim\max\{1,\kappa^{-1}\}\,h_{T}^{-1}\,\|w\|_{T}$ for any $T\subset\omega_{S}$ , we arrive at

[TABLE]

Finally, a calculation using $h_{T}\simeq|S|$ yields

[TABLE]

To minimize the weight $\max\{\kappa^{1/2},\kappa^{-1/2}\}$ at $\|\nabla(u_{h}-u)\|_{\omega_{S}}$ in the right-hand side, one needs $\kappa=1$ , i.e. as in the original proof of (2.2b)! Hence, we get (2.2b) with the same, i.e. unimproved, weights. $\Box$

Bibliography11

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1) Ainsworth, M., Oden, J. T.: A posteriori error estimation in finite element analysis. Wiley-Interscience, New York (2000)
2(2) Kopteva, N.: Maximum-norm a posteriori error estimates for singularly perturbed reaction-diffusion problems on anisotropic meshes. SIAM J. Numer. Anal. 53 , 2519–2544 (2015)
3(3) Kopteva, N.: Energy-norm a posteriori error estimates for singularly perturbed reaction-diffusion problems on anisotropic meshes. Numer. Math. 137 , 607–642 (2017)
4(4) Kopteva, N.: Fully computable a posteriori error estimator using anisotropic flux equilibration on anisotropic meshes. ar Xiv:1704.04404 (2017)
5(5) Kunert, G.: An a posteriori residual error estimator for the finite element method on anisotropic tetrahedral meshes. Numer. Math. 86 , 471–490 (2000)
6(6) Kunert, G.: Robust a posteriori error estimation for a singularly perturbed reaction-diffusion equation on anisotropic tetrahedral meshes. Adv. Comput. Math. 15 , 237–259 (2001)
7(7) Kunert, G., Verfürth, R.: Edge residuals dominate a posteriori error estimates for linear finite element methods on anisotropic triangular and tetrahedral meshes. Numer. Math. 86 , 283–303 (2000)
8(8) Micheletti, S., Perotto, S.: Reliability and efficiency of an anisotropic Zienkiewicz-Zhu error estimator. Comput. Methods Appl. Mech. Engrg. 195 , 799–835 (2006)

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Lower a posteriori error estimates

Abstract

Keywords:

MSC:

1 Introduction

2 Standard lower error bounds are not sharp on anisotropic meshes

2.1 Numerical examples

Remark 2.1

2.2 Lower error bounds using the standard bubble approach

Lemma 2.1

Proof

Remark 2.2

Remark 2.3 (Deficiency of the bubble function approach)

Remark 2.4 (Preview of the new approach)

3 Basic triangulation assumptions

Remark 3.1

4 Estimator efficiency on a partially structured anisotropic mesh

4.1 Lower error bound on a partially structured anisotropic mesh

Theorem 4.1

Corollary 4.2

Proof

Remark 4.1 (Estimator efficiency)

4.2 Preliminary results for partially structured meshes

Lemma 4.3

Proof

Corollary 4.4

Proof

Remark 4.2

4.3 Proof of Theorem 4.1

Proof

Remark 4.3 (Non-smooth fff)

5 Estimator efficiency on more general meshes

5.1 Main result

Theorem 5.1** (Short-edge jump residual terms)**

Corollary 5.2

Proof

Remark 5.1 (Estimator efficiency)

Remark 5.2 (Singular perturbation case)

5.2 Preliminary results for a local anisotropic path

Lemma 5.3

Proof

Corollary 5.4

Proof

Remark 5.3

5.3 Proof of Theorem 5.1

Proof

Remark 5.4

6 Conclusion

Appendix A Generalized proof of (2.2b) for the case ∣S∣diam(S)≪1\frac{|S|}{{\rm diam}(S)}\ll 1diam(S)∣S∣​≪1

Remark 4.3 (Non-smooth $f$ )

Theorem 5.1 (Short-edge jump residual terms)

Appendix A Generalized proof of (2.2b) for the case $\frac{|S|}{{\rm diam}(S)}\ll 1$