Residual estimates for post-processors in elliptic problems
Andreas Dedner, Jan Giesselmann, Tristan Pryer, Jennifer K Ryan

TL;DR
This paper develops a unified framework for a posteriori error estimation in elliptic problems, enhancing the reliability and efficiency of various post-processing techniques including superconvergent methods.
Contribution
It introduces a general post-processing operator that improves error bounds for multiple existing reconstruction techniques in elliptic boundary value problems.
Findings
Optimal error control achieved for superconvergent and other reconstruction operators
The proposed framework applies to popular methods like SIAC filter and Superconvergent Patch Recovery
Numerical tests confirm the theoretical error estimates and improvements
Abstract
In this work we examine a posteriori error control for post-processed approximations to elliptic boundary value problems. We introduce a class of post-processing operator that `tweaks' a wide variety of existing post-processing techniques to enable efficient and reliable a posteriori bounds to be proven. This ultimately results in optimal error control for all manner of reconstruction operators, including those that superconverge. We showcase our results by applying them to two classes of very popular reconstruction operators, the Smoothness-Increasing Accuracy-Enhancing filter and Superconvergent Patch Recovery. Extensive numerical tests are conducted that confirm our analytic findings.
| -error | 2 | 2 | 4 | 3 | 4 | 5 | 4 | 6 | 6 |
|---|---|---|---|---|---|---|---|---|---|
| -error | 1 | 2 | 3 | 2 | 4 | 4 | 3 | 5 | 5 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\newsiamremark
remarkRemark \newsiamremarkhypothesisHypothesis
\headers
Residual estimates for post-processors in elliptic problems
Andreas Dedner
Mathematics Institute, University of Warwick, Coventry CV4 7AL UK. (). [email protected]
Jan Giesselmann Department of Mathematics, TU Darmstadt, 64293 Darmstadt, Germany. (). [email protected]
Tristan Pryer
Mathematical Sciences, University of Bath, Bath BS2 7AY, UK. Corresponding author. (). [email protected]
Jennifer K Ryan
Applied Mathematics & Statistics, Colorado School of Mines, Golden, Colorado 80401, USA. (). [email protected]
Abstract
In this work we examine a posteriori error control for post-processed approximations to elliptic boundary value problems.
We introduce a class of post-processing operator that “tweaks” a wide variety of existing post-processing techniques to enable efficient and reliable a posteriori bounds to be proven. This ultimately results in optimal error control for all manner of reconstruction operators, including those that superconverge.
We showcase our results by applying them to two classes of very popular reconstruction operators, the Smoothness-Increasing Accuracy-Conserving filter and Superconvergent Patch Recovery. Extensive numerical tests are conducted that confirm our analytic findings.
1 Introduction
Post-Processing techniques are often used in numerical simulations for a variety of reasons from visualisation purposes [BMBS95] to designing superconvergent approximations [BS77] through to becoming fundamental building blocks in constructing numerical schemes [GP18, GZZ18, CGZZ17]. Another application of these operators is that they are a very useful component in the a posteriori analysis for approximations of partial differential equations (PDEs) [AO00, Ver96]. The goal of an a posteriori error bound is to computationally control the error committed in approximating the solution to a PDE. In order to illustrate the ideas, let denote the solution to some PDE and let denote a numerical approximation. Then, the simplest possible use of post-processing in a posteriori estimates is to compute some from and to use
[TABLE]
as an error estimator.
However, a key observation here (and in several more sophisticated approaches) is that must be at least as good of an approximation of the solution as is. In fact, in many cases, is actually expected to be a better approximation. This raises a natural question: If an adaptive algorithm computes (on any given mesh) not only but also and is a better approximation of than is, why is and not considered as the “primary” approximation of ? Indeed, the focus of this paper is to consider as the primary approximation of . We are therefore interested in control of the error and in adaptivity based on an a posteriori estimator for . Specifically, we aim to provide reliable and efficient error control for .
Note that our goal is not to try to construct “optimal” superconvergent post-processors. Rather we try to determine, from the a posteriori viewpoint, the accuracy of some given post-processed solution and to determine how this is useful for the construction of adaptive numerical schemes based on an error tolerance for .
There are several examples of superconvergent post-processors, includig SIAC and superconvergent patch recovery. More details on the history, properties and implementation of these methods will be provided in Sections 4.1 and 4.2. However, our a posteriori analysis, aims at being applicable for a wide variety of post-processors and, therefore, we avoid special assumptions that are only valid for specific post-processors. Indeed, our analysis makes only very mild assumptions on the post-processing operator. Specifically, we only require that:
The post-processed solution belongs to a finite dimensional space that contains piecewise polynomials, although it does not necessarily need to be piecewise polynomial itself. 2. 2.
The post-processed solution should be piecewise smooth over the same triangulation, or a sub-triangulation, of the finite element approximation.
Given a post-processor, that satisfies these rather mild assumptions, we perturb it slightly and call the result . This is to ensure an orthogonality condition holds which then allows us to show various desirable properties including:
The orthogonal post-processor provides a better approximation than the original post-processor, i.e. in the energy norm, see Lemma 3.5. 2. 2.
The orthogonal post-processor has an increased convergence order in the norm. Practically, this is not always the case for the original post-processor, see Lemma 3.8. 3. 3.
Efficient and reliable a posteriori bounds are available for the error committed by the orthogonal post-processor.
This, motivates us to consider (and not or ) as the primary approximation. Since the improved accuracy of , compared to , stems from superconvergence it is much more sensitive with respect to smoothness of the exact solution, i.e. in regions where the exact solution is we expect to be much more accurate than whereas in places where the exact solution is less regular, e.g. has kinks, and are expected to have similar accuracy. Therefore, meshes constructed based on error estimators for will usually not be optimal when used for computing in the sense that the ratio of degrees of freedom to error would be much better for other meshes, this is elaborated upon in Section 5.
We will demonstrate the good approximation properties of our modification strategy for post-processors and the benefits of basing mesh adaptation on an estimator for in a series of numerical experiments. In order to highlight the versatility of our approach, we conduct experiments based on two popular post-processing techniques: The Smoothness Increasing Accuracy Conserving (SIAC) filter and superconvergent patch recovery (SPR). Background on these methods is provided in Sections 4.1 and 4.2 respectively.
The rest of the paper is structured as follows: In §2 we introduce the model elliptic problem and its dG approximation. We also recall some standard results for this method. In §3, for a given reconstruction, we perturb it so it satisfies Galerkin orthogonality and show some a priori type results. We then study a posteriori results and give upper and lower bounds for a residual type estimator. In §4 we describe the two families of post-processor that we consider in this work. Finally, in §5 we perform extensive numerical tests on the SIAC and SPR post-processors to show the performance of the a posteriori bounds, the effect of smoothness of the solution on the post-processors and to study adaptive methods driven by these estimators.
2 Problem setup and notation
Let , be bounded with Lipschitz boundary . We denote by , , the standard Lebesgue spaces and , the Sobolev spaces of real-valued functions defined over . Further we denote the space of functions in with vanishing trace on .
For we consider the problem
[TABLE]
where is a uniformly positive definite diffusion tensor and . Weakly, the problem reads: find such that
[TABLE]
Let be a triangulation of into disjoint simplicial or box-type (quadrilateral/hexahedral) elements such that . Let be the set of edges which we split into the set of interior edges and the set of boundary edges .
We introduce the standard broken Sobolev spaces. For we define
[TABLE]
and we will use the notation
[TABLE]
as an elementwise norm for the broken space.
For we denote the set of all polynomials over of total degree at most by . For , we consider the finite element space
[TABLE]
Let be an arbitrary scalar function. For any interior edge there are two adjacent triangles and we can consider the traces of from respectively. We denote the outward normal of by and define average and jump operators for one by
[TABLE]
For boundary edges there is only one trace of and one outward pointing normal vector and we define
[TABLE]
For vector valued functions we define jumps and averages on interior edges by
[TABLE]
As before, for boundary edges, we define jumps and averages using traces from the interior only. Note that \left\llbracket\boldsymbol{v}\right\rrbracket,\mathrel{\ooalign{\cr\kern 1.0pt{{}}v\mathrel{\ooalign{}}\cr\kern-1.0pt}}\in\operatorname{L}^{2}(\mathscr{E}) and \left\llbracket v\right\rrbracket,\mathrel{\ooalign{\cr\kern 1.0pt{{}}\boldsymbol{v}\mathrel{\ooalign{}}\cr\kern-1.0pt}}\in[\operatorname{L}^{2}(\mathscr{E})]^{d}.
For any triangle we define and collect these values into an element-wise constant function with . We denote the radius of the largest ball inscribed in by . For every edge we denote by h_{e}=\mathrel{\ooalign{\cr\kern 1.0pt{{}}h\mathrel{\ooalign{}}\cr\kern-1.0pt}}, i.e., the mean of diameters of adjacent triangles. For our analysis we will assume that belongs to a family of triangulations which is quasi-uniform and shape-regular. Let us briefly recall the definitions of these two notions: The triangulation is called
- •
shape-regular if there exists so that
[TABLE]
- •
quasi-uniform if there exists so that
[TABLE]
Note that for shape-regular triangulations we have inverse and trace inequalities [DPE12, Lemmas 1.44, 1.46]. Note that the quasi-uniformity assumption is only required for the first part of our analysis, in §3.1 and can be relaxed in §3.2.
In this work we will consider a standard interior penalty method to approximate solutions of (2.2). We consider the Galerkin method to seek such that
[TABLE]
where is given by
[TABLE]
Note that the bilinear form (2.18) is stable provided is large enough, see [ABCM02].
Remark 2.1** (Continuous Galerkin methods).**
*Note that if we restrict test and trial functions to then all jumps on interior edges vanish and (2.17), (2.18) reduces to a (continuous) finite element method with weakly enforced boundary data. Our analysis is equally valid in this case. *
We introduce two dG norms
[TABLE]
which are equivalent provided is sufficiently large and conclude this section by stating a-priori estimates for the Galerkin method as is standard in the literature [ABCM02, KP03].
Theorem 2.2** (Error bounds for the dG approximation).**
Let for be the solution of (2.1) and be the unique solution to the problem (2.17). Then,
[TABLE]
Further, for , we have the a posteriori error bound
[TABLE]
where
[TABLE]
*and is a constant depending on the shape-regularity and quasi-uniformity constants of and depends only upon the shape-regularity. Here is a computable residual that we refer to during our numerical simulations. *
3 The orthogonal reconstruction, a priori and a posteriori error estimates
In this section, we derive robust and efficient error estimates. We make the assumption that we have access to a computable reconstruction, generated from our numerical solution , where is required to contain the original finite element space, that is . We are unable to provide reliable a posteriori error estimates for directly, but we can modify, and, as we shall demonstrate, improve any such reconstruction such that a robust and efficient error estimate can be obtained for the modified reconstruction.
We split this section into two parts, the first subsection contains the definition of the improved reconstruction and some of its properties. In particular, we study this from an a priori viewpoint, show that it satisfies Galerkin orthogonality as well as some desirable a priori bounds. Throughout this subsection we assume that and the underlying mesh is quasi-uniform. In the second part we derive reliable and efficient a posteriori estimates under the weaker assumption that and the mesh is shape regular.
3.1 Improved reconstruction
In the following assume that solves (2.2) and let be a reconstruction of the discrete solution , e.g. a SIAC reconstruction as described in Section 4.1 or obtained by some patch recovery operator as described in Section 4.2.
Definition 3.1**.**
Let denote the Ritz projection with respect to , i.e.,
[TABLE]
We define the improved reconstruction as
[TABLE]
Remark 3.2**.**
We make the following remarks:
The finite element approximation from (2.17) satisfies . 2. 2.
We work under the assumption that a post-processor is already being computed. To realise we are required to solve the original elliptic problem a second time with a different forcing term. This means the improved reconstruction is computable at a small additional cost to . Once has been computed, can be computed by solving a discrete elliptic problem over . A typical scenario is that the user already has a good scheme for computing , and that the cost of computing (after the post-processing to obtain ) is just that of solving the same system as that for with a different right hand side. This means the assembly and preconditioning, perhaps ILU or AMG, can be reused without change.
Estimating the cost of computing is more complicated and will depend on the method used and the implementation. While our implementation for solving and the correction are optimized (and implemented in C++) the computation of is a proof of concept implementation in Python and is therefore not competitive.
The improved reconstruction is computable at a small additional cost to . Once has been computed, can be computed by solving a discrete elliptic problem over . A typical scenario is that the user already has a good scheme for computing , and that the cost of computing (after the post-processing to obtain ) is just that of solving the same system as that for with a different righthand side, e.g if an decomposition of the system matrix was determined for computing this decomposition can be reused. 3. 3.
Note that
[TABLE]
where is the identity mapping, i.e., the error of is the Ritz-projection of the error of onto the orthogonal complement of . 4. 4.
Even if is continuous, this does not necessarily hold for as may contain discontinuous functions.
One of the key properties of the improved reconstruction is that it satisfies a Galerkin orthogonality result.
Lemma 3.3** (Galerkin orthogonality).**
The reconstruction from (3.2) satisfies Galerkin orthogonality, i.e.,
[TABLE]
Proof 3.4**.**
For any , we have using (3.3)
[TABLE]
*by definition of the Ritz projection, as required. *
Now, we show that with respect to the new reconstruction indeed improves upon :
Lemma 3.5** (Better approximation of the improved reconstruction).**
Let be defined by (3.2), then the following holds:
[TABLE]
*In (3.6) the inequality is an equality if and only if , i.e., if the original reconstruction itself satisfies Galerkin orthogonality. *
Proof 3.6**.**
Since the images of and are orthogonal with respect to , Pythagoras’ theorem implies
[TABLE]
*We have used (3.3) in the third step. Note that if is not Galerkin orthogonal then leading to a strict inequality in the first step. This completes the proof. *
Remark 3.7**.**
*One appealing feature of the new reconstruction that results from Galerkin orthogonality is that if the reconstruction has some superconvergence properties in the energy norm this is inherited by and also immediately implies an additional order of accuracy in . This results from an Aubin-Nitsche trick being available. *
Lemma 3.8** (Dual bounds).**
Let be a convex polygonal domain and let be defined by (3.2), then there exists a constant (only depending on the shape regularity of the mesh) such that
[TABLE]
Proof 3.9**.**
Let solve
[TABLE]
which implies
[TABLE]
Thus, by choosing in (3.9) we obtain
[TABLE]
for any where the last equality follows from Galerkin orthogonality (3.4). Thus, choosing as the best approximation of in the piecewise linear subspace of , we obtain
[TABLE]
*by elliptic regularity of the dual problem, concluding the proof. *
3.2 A posteriori error estimates
Now that we have shown some fundamental results on the improved reconstruction, we relax the regularity requirements on in this subsection allowing for weak solutions to (2.1), that is, . With that in mind we modify the definition of such that it is a suitable extension over to
[TABLE]
for and where is the lifting operator that we recall from [DPE12, Section 4.3.1]
[TABLE]
The lifting operators satisfy the stability estimate, [DPE12, Lemma 4.34],
[TABLE]
For test and trial functions in (which contains by assumption) the new definition of is equivalent to the one given in (2.18). Therefore for any function the Ritz projection given in Definition 3.1 remains the same still satisfying for all . But note that we no longer have and Galerkin orthogonality for no longer holds in general, it only holds for a conforming subspace of :
Lemma 3.10**.**
For and it holds that
[TABLE]
Proof 3.11**.**
By definition of we have that
[TABLE]
since . Now, notice that by definition
[TABLE]
*as is an element of and, hence, continuous, as required. *
Let a quantity of interest be given by the linear functional , the dual space of . Note that where the latter is the dual space of . We begin by deriving an error representation formula. Following [HSW05], we split into a continuous part and a discontinuous part so that
[TABLE]
Theorem 3.12** (Dual error representation).**
Let be the solution of (2.2) and let be given by (3.2), then
[TABLE]
where denotes the scalar product and is the solution of the dual problem
[TABLE]
*and is an arbitrary function in . *
Proof 3.13**.**
By definition of , we have, for any ,
[TABLE]
*where we made use of Galerkin orthogonality, Lemma 3.10, in the last step. *
Theorem 3.14** (Primal error estimate).**
There exists some constant depending on mesh geometry and polynomial degree such that
[TABLE]
where
[TABLE]
Proof 3.15**.**
Since
[TABLE]
where is some constant depending on only, it is sufficient to show that is bounded by the right hand side of (3.22).
In Theorem 3.12 we may choose
[TABLE]
Note that, by definition, so that, if contains discontinuous functions, . Nevertheless, satisfies the stability estimate
[TABLE]
Then, for any , Theorem 3.12 implies
[TABLE]
Integrating by parts in (3.27) and using (3.14) we obtain
[TABLE]
From [HPS04, Theorem 5.3] we obtain
[TABLE]
with a constant which is independent of but depends on the shape regularity of the mesh and the polynomial degree and we also note that
[TABLE]
We insert (3.29) and (3.30) into (3.28) and apply trace inequality and Cauchy-Schwarz inequality and obtain
[TABLE]
Now, we choose as the Clément interpolant of so that
[TABLE]
*and insert (3.32) into (3.31) to obtain the assertion of the theorem. *
The error estimator, derived in Theorem 3.14, is locally efficient in the following sense:
Theorem 3.16** (Local efficiency).**
Assume and are piecewise polynomial on . Then, there exists a constant independent of such that for any and any the following estimates hold:
[TABLE]
and
[TABLE]
*where denotes the union of cells sharing common edge . *
Proof 3.17**.**
*Both proofs are standard and follow [Ver96]. *
Remark 3.18** (Data oscillation).**
*In case or are not polynomial the right hand side of (3.33) contains additional data oscillation terms. *
4 Post-Processors
In order to show the versatility of our results, we consider two families of reconstruction operators. Namely, the Smoothness-Increasing Accuracy-Conserving (SIAC) post-processing [Tho77, BS77, Rya15] as well as patch reconstruction via the Zienkiewicz and Zhu [ZZ92, ZZ98] Superconvergent Patch Recovery (SPR) technique. Below we outline the procedure for performing these reconstructions as well as error estimates for the ideal case.
4.1 SIAC post-processors
One example of a superconvergent post-processor that we examine is the Smoothness Increasing Accuracy Conserving (SIAC) filter. The SIAC filter has its roots in an accuracy-enhancing post-processor developed by Bramble and Schatz [BS77]. The original analysis was done for finite element approximations for elliptic equations. This technique has desirable qualities including its locality, allowing for efficient parallel implementations, and its effectiveness in almost doubling the order of accuracy rather than increasing the order of accuracy by one or two orders. This post-processor was also explored from a Fourier perspective and for derivative filtering by Thomeé [Tho77] and Ryan and Cockburn [Rya09].
SIAC filters are an extension of the above ideas and have traditionally been used to reduce the error oscillations and recover smoothness in the solution and its derivatives for visualization purposes [MRK10, WRKH09, SCKR08] or to extract accuracy out of existing code [Rya05]. It has been extended to a variety of PDEs as well as meshes [JXR12]. A quasi-interpolant perspective on SIAC can be found in [MRK16]. The important property of these filters is that, in addition to increasing the smoothness, for smooth initial data and linear problems, the filtered solution is more accurate than the DG solution. To combat the high computational cost of the tensor-product nature of the multi-dimensional kernel, a line filter was introduced in [DSRMK17].
For ease of presentation the following discussion only details the design of the filter and presents a-priori error estimates for the case of a smooth solution. Although the discussion is limited to one-dimension, it can be extended to Cartesian meshes in more than one space dimension using a tensor product approach. More advanced applications of the multi-dimensional SIAC post-processor are the Hexagonal SIAC [MJRK17] or Line SIAC [DSRMK17].
The basic idea is that the reconstruction is done via convolution post-processing:
[TABLE]
where is the mesh size of the numerical scheme and is the scaling of the post-processor. The convolution kernel, is defined as
[TABLE]
This is a linear combination of shifted copies of some function, The function weights are real scalars, For the kernel, is chosen to satisfy consistency as well as moment requirements, i.e., polynomial reproduction conditions, which are necessary for preserving the accuracy of the Galerkin scheme and is chosen for smoothness requirements. We focus on kernels built from B-splines which are defined via the B-Spline recurrence relation:
[TABLE]
for .
Remark 4.1** (Kernel scaling.).**
*For cartesian grids, the kernel scaling is typically chosen to be the element size, . In adaptive meshes or structured triangular meshes and tetrahedral meshes, is typically chosen to be the length of the mesh pattern [MJRK11]. For Line SIAC, the kernel scaling is taken to be the element diagonal [DSRMK17], for unstructured meshes, the kernel scaling is taken to be largest element side [MKRK13, MRK14]. *
It can be shown that when the solution is sufficiently smooth the post-processed numerical solution is a superconvergent approximation.
In particular, if , then the Galerkin solution converges in Theorem 2.2 as
[TABLE]
If we choose and then
[TABLE]
see Theorem 1 in [Tho77, BS77], which for constitutes an improvement. It is possible to obtain the same estimates in by taking higher order B-Splines.
In this paper, in order to apply the post-processor globally, we mirror the underlying approximation as an odd function at the boundary as discussed in [BS77].
Remark 4.2** (Impact of in-cell regularity of ).**
If Theorem 3.14 does not hold. Still, as long as similar results can be obtained by slightly modifying the proof of Theorem 3.14. One interesting example is SIAC reconstruction with . In this case, for any the restriction contains several kinks, i.e. there are hypersurfaces (points for , lines for ) across which is continuous but not differentiable. However, for any there exists a triangulation of of , such that . If we follow the steps of the proof of Theorem 3.14 we realise that integration by parts can only be carried out on elements of and each term in the error bound needs to be replaced by
[TABLE]
*where denotes the set of interior edges of . Efficiency of this modified estimator can be shown along the same lines as in Theorem 3.16 but bubble functions with respect to the elements and edges in the sub-triangulation need to be used. *
4.2 Superconvergent Patch Recovery
The second post-processing operator we study is based on the superconvergent patch recovery (SPR) technique. This was originally studied numerically and showed a type of superconvergence for elliptic equations using finite element approximations [ZZ87]. The mathematical theory behind this recovery technique was addressed by Zhang and Zhu [ZZ95] for the two-point boundary value problems and for two-dimensional problems and extended to parabolic problems in [LW06, LP12]. The superconvergent patch recovery method works by recovering the derivative approximation values for one element from patches surrounding the nodes of that element using a least squares fitting of the superconvergent values at the nodes and edges. In typical derivative recovery, the derivative approximation is a continuous piecewise polynomial of some given degree. For overlapping patches, the recovered derivative is just an average of the approximations obtained on the surrounding patches. Unlike SIAC post-processing, this recovery technique does not rely on translation invariance for the high-order recovery. The superconvergent patch recovery technique has been shown to work well for elliptic equations that have a smooth solution, and for less smooth solutions with a suitably refined mesh.
The usual application of this technique is for gradient recovery. However, in this article we apply this technique to recover function values.
As mentioned, we suitably modify the algorithm to construct a function with . The construction of , given some finite element function , is carried out in two steps:
Construct a polynomial of order at each node of the mesh using a least squares fitting of function values of evaluated at suitable points in elements surrounding . 2. 2.
Given an element we use linear interpolation of the values of for the three nodes of to compute .
There are many approaches for constructing the polynomials in the first step at a given node with surrounding triangles . For our experiments, we use the following approach. For , we construct a quadratic polynomial by fitting the values of at the nodes of all . For a piecewise quadratic () we also use the midpoints of all edges of the . Finally for our tests with we evaluate at two points on each edge chosen symmetrically around the midpoint of the edge (we use the Lobatto points with local coordinates ) and also add the value of at the barycentre of the . This is depicted in Figure 1. To guarantee that we have enough function values to compute the least squares fits, we add a second layer of triangles around if necessary, e.g., at boundary nodes.
Note that this procedure is similar, although not the same, as the approach investigated in [ZN05]. Another related procedure was proposed in [Ova07].
5 Numerical Results
In this section we study the numerical behaviour of the error indicators proposed for the SIAC and SPR post-processing operators. We compare this behaviour with the true error on some typical model problems. The computational work was done in the DUNE package [BBD*+*08] based on the new Python frontend for the DUNE-FEM module [DKNO10, DN18].
5.1 Smoothness-Increasing Accuracy-Conserving post-processors
The implementation of the post-processor is done through simple matrix-vector multiplication and is discussed in [Mir12].
We first investigate the behaviour of the error and the residual estimator for the problem (2.1) in one space dimension with , i.e. the Laplace problem
[TABLE]
where the forcing function is chosen so that the exact solution is
[TABLE]
on the interval . We show both the and errors for the Galerkin approximation , the SIAC postprocessed approximation, and the orthogonal postprocessor, . We also show the two residual indicators (from Theorem 2.2) and from (Theorem 3.14). For the basis we consider the continuous Lagrange polynomials for and impose the boundary conditions weakly with a penalty parameter , where is the polynomial degree and is the grid spacing. Additional experiments were conducted using a discontinuous Galerkin approximation, but no significant differences in the outcome where found and therefore do not include the results. We solve the resulting linear system using an exact solver [Dav04] to avoid issues with stopping tolerances.
We will mainly focus on but also show results for and . The SIAC postprocessing is constructed using a continuous B-spline, as well as setting This leads to an inner stencil of elements. We also tested other choices of for but the above choice provided the best results and these are the results shown.
In Figure 2 we show the errors for for a series of grid refinement levels starting with intervals and doubling that number on each level. In Figure 2 we plot the corresponding Experimental Orders of Convergence (EOCs). As can clearly be seen, SIAC postprocessing () improves the convergence rate in from to and in from to . While in the Galerkin orthogonality trick only leads to a small improvement in the error, in we see an improvement of a full order leading to a convergence rate of . As expected from the theory the residual indicators follow the errors of and closely. The efficiency index is comparable between and .
For a better understanding of how the error is reduced by utilizing SIAC postprocessing and the Galerkin orthogonality treatment, we show the pointwise errors of the approximations in Figure 3. It is evident that the function values are much smoother when applying SIAC. The move from and does reintroduce small scale errors, but at a far lower level compared to the original approximation, . As expected from the errors, the differences in are less pronounced.
In Figures 4 we show errors and EOCs for . Due to the very low errors on the final grid the actual convergence rates for and is not clear. However, the improvement due to the Galerkin orthogonality trick, especially in is quite noticable as it reduced the error by two orders of magnitude.
We next show results for in Figure 5. Again, there is a clear reduction in the values of the errors from to to in together with an improvement in the convergence rate due to the SIAC postprocessing. This improvement in the convergence rate is about order. While the convergence rate going from to seems to only be half an order on the higher grid resolution, it is important to note that the error using is still significantly smaller than the error between the exact solution and by at least a factor of . Hence the results do not contradict the theory. In , SIAC leads to no improvement while the convergence rate of the error using is at least half an order higher. Overall the improvement in the convergence rate is not quite as good as for the higher polynomial degrees. The following tests summarized in Figure 6 show that the weak form of the boundary conditions is responsible for the reduced order improvement. The figure shows results using a hyperpenalty of the form . Applying this hyperpenalty term leads to improvements that are again more in line with our observations for higher order polynomials. We note that strong enforcement of the boundary conditions also lead to similar results.
We summarize our results for the smooth problem in Table 1. It can be clearly seen that the step from to , which requires solving one additional low order problem, is quite advantageous and increases the convergence rate in the norm by at least one. In the linear case this improves by two and by one in the norm. This makes it highly efficient in this case, at least when implementing the hyperpenalization or strong constraints to enforce the Dirichlet boundary conditions. The reason for this restriction will be investigated further in future work. For the actual EOCs of the postprocessed solutions are difficult to determine and therefore we provide approximate numbers. In this case, SIAC shows a higher order in compared to the norm. The Galerkin orthogonalty trick does not improve the rate further, but note that the overall error is still a factor of smaller. Additionally note that in the other cases where there is no improvement in the rate, the error is reduced by enforcing Galerkin orthogonality, e.g., in the norm with the error is still reduced by about a factor of two. In addition, the orthogonality of allows us to compute a reliable and efficient error estimator with a comparable efficiency index to the error estimator for using .
We conclude our investigations of the SIAC reconstruction and the residual estimates by studying problems with less smooth solutions. We change the forcing function so that the exact solution is of the form
[TABLE]
where
[TABLE]
is the smooth function from previous studies. We show results for polynomial degree . We again iplement a simple penalty term at the boundary. Note that the solution is in at and only in for . Overall the solution is an element of but not of , i.e., it is not smooth enough to achieve optimal convergence rates for when the mesh is not aligned. Even when the mesh is aligned, as in our experiments, we do not expect an increase of the convergence rate using the SIAC reconstruction as can be seen in Figure 7. The local loss of regularity at and is clearly visible for the pointwise errors of the two reconstructions as shown in Figure 8. Examining the errors in the original approximation, , the reduced smoothness is hardly visible. However, in both of the reconstructions a jump in the error is clearly visible. At , where the solution is still in the error in increases approximately by two orders while at it is close to four orders of magnitude larger since the solution is only at this point. The lack of smoothness is also identified by the residual indicator , the spatial distribution of which is shown in Figure 9 together with the distribution of . It is worthwhile to note that the region of the ’reduced smoothness’ is better isolated by than by . Hence it would be easier for an adaptive algorithm to separate these different smoothness regions which would lead to more optimal meshes. The picture clearly shows that does not ’see’ the kink so that an adaptive algorithm would either refine the whole non-constant region or nothing at all depending on the tolerance. In contrast, with (and the right algorithm) refinement could be isolated to the kinks.
5.2 Superconvergent Patch Recovery
In the following we solve
[TABLE]
in a two dimensional domain where the forcing function is chosen by prescribing an exact solution . This function is also used to prescribe Dirichlet boundary conditions on all of . In the first example we chose a smooth exact solution with a scalar diffusion coefficient , while for the second test we use a solution with a corner singularity and .
In the following we show results using a DG scheme on a triangular grid. The grid is refined by splitting each element into four elements. In the final examples with local adaptivity, this leads to a grid with hanging nodes. We also carried out experiments using a continuous ansatz space with very simular results.
Note that in all figures depicting errors and EOCs, the -axis shows the number of degrees of freedom for . While the other approximations have a larger number of degrees of freedom, the global problem that has to be solved, i.e. solving the linear system for and for , scales with the number of degrees of freedom for and thus this seems a reasonable indication of the computational complexity.
For our first test we choose , and . We start with an initial grid which is slightly irregular as shown in Figure 10. This is to avoid any superconvergence effects due to a structured layout of the triangles.
Figure 11 shows and errors and EOCs for the three approximations with polynomial degrees . It can be seen that, in general, the postprocessor improves the EOC by an order of in the norm and that the EOC of the improved postprocessor is at least as good. While the actual error of can be larger on coarser grids than the error computed with , the error using is significantly better in all cases. Focusing now on the norm, we see that when computing the error using , the EOC is one order better then the convergence rate in the norm, as expected. For , this is also true when using , while for the EOC is only in this case, and an increase to is only achieved with the improved postprocessor . The same observation can be made when using the SIAC postprocessor in the previous section.
Using the same problem setting, we investigate the performance of an adaptive algorithm in Figure 12. We use a modified equal distribution strategy where elements are marked for refinement when the local indicator exceeds . We compute the local indicator on either or on the improved reconstruction . The advantage of basing the marking strategy on is clearly demonstrated. While marking with respect to and then using the postprocessor only on the final solution (filled upward triangles) leads to a significant reduction of the final error, the difference in the convergence rate between and results in a finer grid than necessary for a given tolerance. A reduction in the number of degrees of freedom by a factor of to can be easily achieved by using .
For our final test we study a reentrant corner type problem, i.e., using a regular triangulation. First we choose the well known exact solution leading to . Since the solution is not even we can not expect the postprocessed solution to have an increased convergence rate. This is confirmed by our numerical tests summarized in Figure 13. Due to the reduced smoothness and the simplicity of the solution away from the corner, the postprocessing does not only not improve the EOC but can even lead to a slight increase in the overall error clearly noticeable in the error for the case. This is even more obvious when the postprocessor, , is used directly. Alternatively, going from to leads to an approximation which is very close to the original in all cases. Although the results for the globally refined grid are not that promising, the postprocessing nevertheless has considerable benefits when adapting the grid using the residual indicator based on . Indeed Figure 14 shows that, for a given number of dofs, mesh adaptation based on produces an approximation which has a much smaller error than (on a mesh constructed using ).
For a more challenging test, especially for , we construct the forcing function so that the exact solution is , where is the solution to the above corner problem and . The function still has the same corner singularity but is also smooth. However, the challenging nature of this solution is that it has large gradients towards the outer boundaries. Results for are summarized in Figures 15 and 17. The final grids for are shown in Figure 16 using and to mark cells for refinement. In both cases steps were needed and the resulting grids have and cells ( and degrees of freedom), respectively. While the corner is highly refined in both cases, the regions that are smooth but strongly varying in their solution are far less refined when using . When using the final errors are and while adaptivity based on results in errors of the size and . Because of the corner singularity, using the postprocessor after finishing the refinement (based on ) does not lead to a significant improvement while basing the adaptive process on leads to an almost identical error while requiring only of the cells.
Figure 18 shows the efficiency index for all three test cases on globally refined grids. The results seem to indicate that there is only a slight increase in the efficiency index compared to .
6 Summary Discussion
In this article, we provide a strategy for improving existing post-processing strategies for numerical solutions of a model elliptic problem. The main idea is to modify the post-processed solution so that it satisfies Galerkin orthogonality. We prove various a priori type results showing desirable convergence properties of the orthogonal post-processor including an increased order of accuracy in the norm. We supported the analysis with numerical examples using two types of post-processors – that of SIAC and SPR – approximating smooth and non-smooth solutions.
In addition to the a priori results, we provide a reliable and efficient a posteriori error estimator for the orthogonal post-processed solution, it should be noted tht no such estimator is available for . We demonstrate in several examples that much more efficient meshes are obtained when adaptation is based on than when refinement is based on and the post-processor is only applied to the numerical solution on the final mesh.
Acknowledgement
This work was initiated during the authors’ stay in Edinburgh with an ICMS “research-in-group” grant. J.G. thanks the German Research Foundation (DFG) for support of the project via DFG grant GI1131/1-1. Work performed while the fourth author was visiting Heinrich Heine University, Düsseldorf, Germany and supported by a DAAD fellowship as well as the U.S. Air Force Office of Scientific Research (AFOSR), Computational Mathematics Program, under grant number FA9550-18-1-0486.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[ABCM 02] Douglas N. Arnold, Franco Brezzi, Bernardo Cockburn, and L. Donatella Marini. Unified analysis of discontinuous Galerkin methods for elliptic problems. SIAM J. Numer. Anal. , 39(5):1749–1779, 2001/02.
- 2[AO 00] Mark Ainsworth and J. Tinsley Oden. A posteriori error estimation in finite element analysis . Pure and Applied Mathematics (New York). Wiley-Interscience [John Wiley & Sons], New York, 2000.
- 3[BBD + 08] P. Bastian, M. Blatt, A. Dedner, C. Engwer, R. Klöfkorn, R. Kornhuber, M. Ohlberger, and O. Sander. A generic grid interface for parallel and adaptive scientific computing. II. Implementation and tests in DUNE. Computing , 82(2-3):121–138, 2008.
- 4[BMBS 95] Steven E Benzley, Karl Merkley, Ted D Blacker, and Larry Schoof. Pre-and post-processing for the finite element method. Finite elements in analysis and design , 19(4):243–260, 1995.
- 5[BS 77] J. H. Bramble and A. H. Schatz. Higher order local accuracy by averaging in the finite element method. Math. Comp. , 31(137):94–111, 1977.
- 6[CGZZ 17] Hongtao Chen, Hailong Guo, Zhimin Zhang, and Qingsong Zou. A C 0 superscript 𝐶 0 C^{0} linear finite element method for two fourth-order eigenvalue problems. IMA J. Numer. Anal. , 37(4):2120–2138, 2017.
- 7[Dav 04] Timothy A Davis. Algorithm 832: Umfpack v 4. 3—an unsymmetric-pattern multifrontal method. ACM Transactions on Mathematical Software (TOMS) , 30(2):196–199, 2004.
- 8[DKNO 10] A. Dedner, R. Klöfkorn, M. Nolte, and M. Ohlberger. A generic interface for parallel and adaptive discretization schemes: abstraction principles and the DUNE-FEM module. Computing , 90:165–196, 2010.
