A multi-level ADMM algorithm for elliptic PDE-constrained optimization problems
Xiaotong Chen, Xiaoliang Song, Zixuan Chen, Bo Yu

TL;DR
This paper introduces a multi-level ADMM algorithm for efficiently solving elliptic PDE-constrained optimization problems with box constraints, combining mesh refinement, inexact subproblem solutions, and convergence guarantees.
Contribution
It proposes a novel multi-level ADMM method with mesh refinement and inexact subproblem solving, along with convergence analysis and complexity results.
Findings
The mADMM algorithm converges with a rate of O(1/k).
Numerical experiments demonstrate high efficiency of the proposed method.
The multi-level strategy improves computational performance over fixed mesh approaches.
Abstract
In this paper, the elliptic PDE-constrained optimization problem with box constraints on the control is studied. To numerically solve the problem, we apply the 'optimize-discretize-optimize' strategy. Specifically, the alternating direction method of multipliers (ADMM) algorithm is applied in function space first, then the standard piecewise linear finite element approach is employed to discretize the subproblems in each iteration. Finally, some efficient numerical methods are applied to solve the discretized subproblems based on their structures. Motivated by the idea of the multi-level strategy, instead of fixing the mesh size before the computation process, we propose the strategy of gradually refining the grid. Moreover, the subproblems in each iteration are solved inexactly. Based on the strategies above, an efficient convergent multi-level ADMM (mADMM) algorithm is proposed. We…
| h | EOC | Index | mADMM | ihADMM | classical ADMM | ||
|---|---|---|---|---|---|---|---|
| 635 | 0.00168 | - | residual | 8.46e-07 | 9.29e-07 | 8.32e-07 | |
| CPU times/s | 0.21 | 0.61 | 0.95 | ||||
| iter | 14 | 27 | 63 | ||||
| 2629 | 5.57e-04 | 1.5927 | residual | 6.90e-07 | 8.78e-07 | 2.47e-07 | |
| CPU times/s | 0.43 | 1.73 | 2.13 | ||||
| iter | 15 | 26 | 30 | ||||
| 10697 | 2.05e-04 | 1.4420 | residual | 7.14e-07 | 8.83e-07 | 9.03e-07 | |
| CPU times/s | 1.23 | 7.40 | 30.46 | ||||
| iter | 13 | 25 | 54 | ||||
| 43153 | 1.13e-04 | 0.8593 | residual | 4.29e-07 | 7.16e-07 | 6.93e-07 | |
| CPU times/s | 4.45 | 43.13 | 755.84 | ||||
| iter | 13 | 25 | 119 | ||||
| 173345 | 6.68e-05 | 0.7584 | residual | 1.22e-07 | 8.69e-07 | 2.68e-07 | |
| CPU times/s | 39.15 | 384.39 | 39646.84 | ||||
| iter | 14 | 24 | 380 | ||||
| 694849 | - | - | residual | 8.7e-08 | 9.29e-07 | 3.58e-05 | |
| CPU times/s | 553.37 | 6451.98 | 289753.72 | ||||
| iter | 15 | 23 | 500 |
| h | EOC | Index | mADMM | ihADMM | classical ADMM | ||
|---|---|---|---|---|---|---|---|
| 225 | 0.0172 | - | residual | 9.26e-07 | 9.88e-07 | 9.98e-07 | |
| CPU times/s | 0.31 | 0.28 | 0.43 | ||||
| iter | 22 | 25 | 120 | ||||
| 961 | 6.71e-03 | 1.3580 | residual | 7.83e-07 | 9.19e-07 | 7.25e-07 | |
| CPU times/s | 0.85 | 0.69 | 0.87 | ||||
| iter | 23 | 26 | 32 | ||||
| 3969 | 2.11e-03 | 1.6691 | residual | 8.62e-07 | 7.10e-07 | 3.34e-07 | |
| CPU times/s | 2.97 | 3.85 | 4.45 | ||||
| iter | 24 | 30 | 32 | ||||
| 16129 | 8.02e-04 | 1.3956 | residual | 9.80e-08 | 7.80e-08 | 8.86e-08 | |
| CPU times/s | 14.92 | 37.39 | 91.15 | ||||
| iter | 23 | 28 | 78 | ||||
| 65025 | 3.58e-04 | 1.1636 | residual | 9.04e-07 | 9.62e-07 | 7.05e-07 | |
| CPU times/s | 22.80 | 151.57 | 2457.02 | ||||
| iter | 21 | 28 | 183 | ||||
| 261121 | 1.81e-04 | 0.9840 | residual | 3.09e-07 | 8.43e-07 | 5.56e-07 | |
| CPU times/s | 168.58 | 1469.13 | 40739.35 | ||||
| iter | 22 | 30 | 283 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Methods in Computational Mathematics · Matrix Theory and Algorithms · Numerical methods for differential equations
A multi-level ADMM algorithm for elliptic PDE-constrained optimization problems
Xiaotong Chen School of Mathematical Sciences, Dalian University of Technology, Dalian, Liaoning 116025, China. ([email protected], [email protected], [email protected]).
Xiaoliang Song Department of Applied Mathematics, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, China. ([email protected]).
Zixuan Chen11footnotemark: 1
Bo Yu11footnotemark: 1
Abstract
In this paper, the elliptic PDE-constrained optimization problem with box constraints on the control is studied. To numerically solve the problem, we apply the ‘optimize-discretize-optimize’ strategy. Specifically, the alternating direction method of multipliers (ADMM) algorithm is applied in function space first, then the standard piecewise linear finite element approach is employed to discretize the subproblems in each iteration. Finally, some efficient numerical methods are applied to solve the discretized subproblems based on their structures. Motivated by the idea of the multi-level strategy, instead of fixing the mesh size before the computation process, we propose the strategy of gradually refining the grid. Moreover, the subproblems in each iteration are solved inexactly. Based on the strategies above, an efficient convergent multi-level ADMM (mADMM) algorithm is proposed. We present the convergence analysis and the iteration complexity results of the proposed algorithm for the PDE-constrained optimization problems. Numerical results show the high efficiency of the mADMM algorithm.
keywords:
PDE-constrained optimization, ADMM, multi-level, convergence analysis
AMS:
49N05, 49M25, 65N12, 68W15
1 Introduction
In this paper, we consider the elliptic PDE-constrained optimization problem with box constraints on the control:
[TABLE]
where is a convex, open and bounded domain with - or polygonal boundary; the desired state and the source term are given; parameters , ; is the uniformly elliptic differential operater given by
[TABLE]
where
By the classical conclusion in the PDE theory, for a given and every , elliptic PDEs involved in the ()
[TABLE]
has a unique weak solution , where denotes the solution operator. It is well-defined and is a continuous linear injective operator[18, Theorem B.4].
As is known to all, there are two possible approaches to tackle optimization problems with PDE constraints numerically. One is First discretize, then optimize, another is First optimize, then discretize [16]. In the first approach, the discretization is applied to the original PDE-constrained optimization problem, while in the second one, the discretization is applied to the KKT system of the PDE-constrained optimization problem. Different from the strategies mentioned above, instead of applying discretized concept to problem () or its KKT system directly, ‘optimize-discretize-optimize’ strategy was proposed by Song in [22]. First the optimization algorithm is given in the sense of continuous function space, then the subproblems in each iteration are discretized by the standard piecewise linear finite element approach. Finally, numerical optimization methods are applied to solve the discretized subproblems numerically. The advantage of this method is that from the optimization algorithm in function space, we can have a better knowledge of the structure of the PDE-constrained optimization problem, which is important for choosing an appropriate discretization format to discretize the subproblems. Moreover, this strategy gives the freedom to discretize the subproblems differently, which makes the subproblems can be solved more effectively.
In this paper, we focus on the alternating direction method of multipliers (ADMM) method, which was originally proposed by Glowinski et al. and Gabay et al. in [9, 11] and has been broadly used in many areas. Motivated by the success of ADMM in solving finite dimensional large scale optimization problem [2, 7, 20, 21], ADMM-type method has been used to solve PDE-constrained optimization problems in function space in [4, 19, 23, 27]. While these works all focus on First discretize, then optimize method, and to the best of our knowledge, very little work has been done to apply First optimize, then discretize to solve PDE-constrained optimization problems by ADMM-type method. We apply ADMM as the outer optimization algorithm of the ‘optimize-discretize-optimize’ strategy, propose a new algorithm to solve PDE-constrained optimization problems efficiently and give the convergence analysis.
First, we briefly give the iterative format of the classical ADMM for the 2-block convex optimization problem with linear constraints:
[TABLE]
where ,, are finite dimensional real Euclidean spaces, , are convex functions (maybe nonsmooth), , are linear mappings, is given. The augmented Lagrangian function is given by the following form:
[TABLE]
where denotes the Lagrange multiplier, is the penalty parameter.
Given , penalty parameter and the step size parameter , then the iterative format of the classical ADMM is as follows:
[TABLE]
The advantage of ADMM is that it separates and into two subproblems. In each subproblem, there is only one variable, the other one is fixed. Thus each subproblem could be solved easily and efficiently. For the classical ADMM, the convergence analysis was first conducted by [8, 9, 10], and for the recent interesting new developments on the convergence analysis of ADMM-type method, see [14, 25, 26].
To apply ADMM-type method to solve the problem (), we use the solution operator and introduce an artificial variable , then we equivalently rewrite problem () as the following reduced form:
[TABLE]
where denotes the reduced cost function, denotes the indicator function of ,
[TABLE]
The augmented Lagrangian function of (RP) is defined as follows:
[TABLE]
where denotes the Lagrangian multiplier, is the penalty parameter.
The classical ADMM in finite dimensional spaces can be extended directly in Hilbert space for problem (RP). Given , parameters , , the iterative scheme is presented as follows:
[TABLE]
Usually, it is expensive and unnecessary to exactly compute the solution of each subproblem even if it is feasible. Thus it is natural to use some iterative methods such as Krylov-based methods to solve the subproblems which are equivalent to large scale or ill-conditioned linear systems. The inexact ADMM algorithm in finite dimension space and its convergence results under certain error criterion have been studied extensively recently (see [2, 20]). Taking the inexactness of the solutions in the function space into account, Song et al. applied the inexact ADMM algorithm to Hilbert space for PDE-constrained optimization problems and presented the convergence results in [24]. Given , parameters , . Let be a sequence satisfying and , the iterative scheme of the inexact ADMM in function space for (RP) is as follows:
[TABLE]
In [24], the discretized problem is considered and the level of discretization is fixed. In this paper, instead of considering the discretized problem, the finite element method is employed to discretize the subproblems in each iteration of the inexact ADMM algorithm. This strategy gives the freedom and flexibility to discretize the subproblems by different discretization schemes. The total error of the proposed algorithm for (RP) is consisted of two parts: the discretization error and the iteration error resulted from inexactly solving the discretized subproblems. For these two errors, our algorithm can be considered as an approximation of exact ADMM in function space, thus we regard it as an inexact ADMM algorithm in function space. In order to guarantee the convergence behavior of our algorithm, we consider controlling the mesh size and the inexactness of solving the subproblems.
In the classical finite element based ADMM-type algorithm to solve the PDE constrained optimization problem, the subproblems are always discretized on a fixed mesh size, which results in large scale optimization problems. Thus it is important to consider reducing the computation cost. Multi-grid method is a modern field of research starting with the works of Brandt [1] and Hackbusch [12, 13]. It is well known that multi-grid method solves elliptic problems with optimal computational complexity. Motivated by the idea of applying multi-grid method to tackle infinite dimension problems by Newton method in [6, 12], we apply the multi-level strategy to our algorithm. It is important to point out that in the initial stage of the algorithm, the iteration precision is required to be relatively low, which means using coarse mesh is sufficient. While as the iteration process proceeds, the iteration precision is supposed to be higher and higher. In this case, using finer mesh is necessary. It is obvious that the strategy of gradually refining the grid can strongly reduce the computation cost and make the algorithm faster than computing the problem on a fixed mesh size.
The main contribution of this paper is that we give the reasonable strategies of gradually refining the grid and inexactly solving the subproblems, and propose an efficient convergent multi-level ADMM (mADMM) algorithm. Specifically, we apply the ADMM algorithm in function space, then employ the standard piecewise linear finite element approach and implement the strategy of gradually refining the grid to the related subproblems appearing in each iteration. For the discretized subproblems, we use inexact strategies to solve them. For example, to solve the smooth subproblems, we can use some iterative methods such as Krylov-based methods. Thanks to the above strategies, we can solve the problem more efficiently and reduce the computation cost significantly. Moreover, we give the convergence analysis as well as the iteration complexity results for the mADMM algorithm.
The paper is organized as follows. In section 2, we apply the inexact ADMM algorithm in function space first, then use finite element method to discretize the associated subproblems and propose the multi-level ADMM (mADMM) algorithm. Section 3 gives the convergence analysis and the iteration complexity of the proposed mADMM algorithm. The numerical results are given in section 4. Section 5 contains a brief summary of this paper.
2 A multi-level ADMM algorithm
In this section, we apply the ‘optimize-discretize-optimize’ strategy and propose an efficient convergent multi-level ADMM (mADMM) algorithm in Algorithm 1. The strategies of gradually refining the grid and inexactly solving the subproblems are given to guarantee the convergence and efficiency of the mADMM algorithm.
2.1 The mADMM algorithm
Based on the inexact ADMM in function space, to numerically solve problem (RP), we consider the finite element method. Specifically, piecewise linear functions are utilized to discretize the variables in the related subproblems appearing in each iteration of the inexact ADMM in function space. We first consider a family of regular and quasi-uniform triangulations of , i.e. . With each element , we define the diameter of the set by and let denotes the diameter of the largest ball contained in . The mesh size of the grid is defined by . We suppose the following standard assumption in the context of error estimates holds (see[15, 16]).
Assumption 2.1**.**
(Regular and quasi-uniform triangulations) There exist two positive constants and such that
[TABLE]
hold for all and all . Moreover, let us define and let and denote its interior and its boundary, respectively. In the case that is a convex polyhedral domain, we have . In the case that is a domain with a - boundary , we assume that is convex and that all boundary vertices of are contained in , such that
[TABLE]
where denotes the measure of the set, and is a constant.
For the state variable and the control variable , we choose the same discretized state space and discretized control space defined by
[TABLE]
where denotes the space of polynomials of degree less than or equal to 1. For given source term and , let the discretized state associated with denoted by , which is defined as the unique solution for the discretized weak formulation of the state equation (1.1):
[TABLE]
Moreover, can be expressed by , where denotes the discretized version of the solution operator . Then we have the following well-known conclusion about error estimates.
Lemma 1**.**
([5], Thm. 4.4.6)* For a given , let be the unique weak solution of the state equation (1.1) and be the unique solution of (2.3). Then there exists a constant independent of , and such that*
[TABLE]
In particular, this implies and .
For the given triangulation with nodes , let be a set of nodal basis functions associated with nodes which spans , and satisfies the properties:
[TABLE]
Then , can be represented as and we have , . The other variables and operators in the subproblems of the inexact ADMM in function space are all discretized by piecewise linear functions similarly.
Before we give the proposed algorithm to solve the problem (RP), we first introduce the definition of node interpolation operator for the following convergence analysis.
Definition 2**.**
For a given regular and quasi-uniform triangulation of with nodes , let denotes a set of associated nodal basis functions. Then the interpolation operator is defined as
[TABLE]
Moreover, about the interpolation error estimate, we have the following result.
Lemma 3**.**
([5], Thm. 3.1.6)* For all and , we have*
[TABLE]
where is a constant which is independent of the mesh size .
In the classical finite element based algorithm, the discretization mesh size is fixed in advance. When computing on the finer mesh, the scale of the discretized problem becomes larger and the computation cost becomes larger. So it is important to consider reducing the computation cost. In the first several iterations of the algorithm, the iteration precision is not that good, which means using coarse mesh will not make the precision worse, but reduce the computation amount. While as the iteration process proceeds, the iteration precision becomes higher and higher. In this case, using finer mesh is necessary. Thus we introduce the idea of gradually refining the grid. Specifically, in the initial iteration we choose coarse mesh and obtain a solution first, then use the interpolation operator to project the obtained solution to the finer mesh. Finally, we apply appropriate numerical methods to solve the subproblems in the finer mesh and obtain a more precise solution, so on and so forth.
Let the mesh size of the th iterate denotes by , then the discretized reduced cost function is defined as follows,
[TABLE]
Let denotes the discretized feasible set,
[TABLE]
Moreover, we define and , where denotes the node interpolation operator. Based on the above representations, we present the iterative scheme of the multi-level ADMM algorithm (mADMM) in Algorithm 1.
Remark 2.1**.**
In order to guarantee the sequence satisfies . In the numerical experiment, we can choose as an example.
Remark 2.2**.**
In order to guarantee the mesh sizes of each iteration satisfy In the numerical experiment, we choose in Example 4.1, in Example 4.2.
2.2 Numerical computation of the subproblems in Algorithm 1
For , , let , be the relative coefficient vectors respectively. Let the -projections of and onto be and , respectively. Similarly, and denote their coefficient vectors. The stiffness matrix and the mass matrix are defined as:
[TABLE]
Furthermore, we define
[TABLE]
Then we can obtain the matrix-vector form of Algorithm 1 in Algorithm 2.
Let denotes the discretized state and the discretized adjoint state respectively. Then the -subproblem at the -th iterate is equivalent to solving the following linear system:
[TABLE]
Since , we eliminate the variable , then the above problem can be rewritten in the following reduced form without any computational cost:
[TABLE]
The equivalent linear system (2.11) can be solved inexactly by some Krylov-based method.
Finally, we give a terminal condition of Algorithm 2. Let be a given accuracy tolerance, we terminate the algorithm when , where the accuracy of a numerical solution is measured by the residual where
[TABLE]
3 Convergence analysis
In this section, based on the convergence results of the inexact ADMM in function space in Theorem 4, we give the convergence analysis and the iteration complexity of the proposed mADMM algorithm.
Theorem 4**.**
([24], Thm. 2.5)*
Suppose that the operator is uniformly elliptic. Let be the KKT point of (P), the sequence is generated by the inexact ADMM algorithm for (RP) with the associated state and adjoint state , then we have*
[TABLE]
Moreover, there exists a constant only depends on the initial point and the optimal solution such that for ,
[TABLE]
where the function defined as
[TABLE]
For the convenience of proving the convergence results, we give the iterative scheme of the multi-level discretized ADMM for (RP) and present a lemma to measure the gap between the solution sequences obtained by the ADMM in function space and the multi-level discretized ADMM.
Given , parameters , . The mesh sizes of each iteration satisfy Then the iterative format of the multi-level discretized ADMM algorithm is as follows:
[TABLE]
Lemma 5**.**
Let the initial point be , then , and
[TABLE]
Proof.
We employ the mathematical induction to prove the conclusion. While , with the definition of and the interpolation error estimate in Lemma 3, we have
[TABLE]
Then we can easily obtain that . The proof is similar to the case , here we omit it.
While , we assume for we have . Then for -subproblems in exact ADMM in function space and the multi-level discretized ADMM, we know and satisfy the following optimality conditions:
[TABLE]
Then subtracting the above two equalities we obtain
[TABLE]
For the multiplier and ,
[TABLE]
we can get the estimate
[TABLE]
For -subproblems, and satisfy the following optimality conditions respectively,
[TABLE]
Then we know from the above two equalities that
[TABLE]
so
[TABLE]
where we define
[TABLE]
For the term , we make use of the decomposition,
[TABLE]
From the well known error estimate in lemma 1 and the property that are bounded linear operators, we have
[TABLE]
Hence, there exists a constant such that
[TABLE]
Similarly, based on the property of the projection operator in Lemma 3, for the term , we have
[TABLE]
and
[TABLE]
For the term ,
[TABLE]
where we used (3.7), the property of the projection operator and the property of the mesh size . Moreover, as the mesh sizes satisfy there exists a constant such that , then we have
[TABLE]
Similarly,
[TABLE]
Then with the fact that operators are bounded linear operators, we know from the equality (3.11) and the estimations of norms of above that
[TABLE]
Moreover, we have
[TABLE]
Hence the conclusion holds for the case and we can get the assertion
[TABLE]
∎
Similar to the Lemma 5, we have the following lemma.
Lemma 6**.**
Let the initial point be , then , and
[TABLE]
Proof.
We employ the mathematical induction to prove the conclusion. The proof is similar to Lemma 5, here we do not talk about it in detail. ∎
To prove the convergence of the mADMM algorithm, let represents the exact solutions of the th iteration of Algorithm 1:
[TABLE]
The following lemma gives the gap between and .
Lemma 7**.**
([24], Lemma 4.4)
For any , we have
[TABLE]
where .
For the convenience of analyzing the non-ergodic iteration complexity, let denotes the KKT point of the discretized reduced problem with the mesh size
[TABLE]
Moreover, we provide a lemma and two propositions which are essential for analyzing the iteration complexity of our mADMM.
Lemma 8**.**
([2], Lemma 6.1)
If a sequence satisfies the following conditions:
[TABLE]
Then we have and .
Proposition 9**.**
Let be the sequence generated by Algorithm 1 and denotes the KKT point of the discretized reduced problem. Then for we have
[TABLE]
Proof.
First, for any , we have the following two important equalities hold
[TABLE]
The proof of the above two equalities can be easily obtained by the definition of norm.
By the optimality conditions of the -subproblem and -subproblem corresponding to and , we have
[TABLE]
Moreover, denotes the KKT point of the discretized reduced problem, so it satisfies the following KKT system
[TABLE]
Then by combining (3.29) and (3.31), we obtain
[TABLE]
Moreover, the subdifferential operator is a maximal monotone operator, so the following inequality holds,
[TABLE]
For the convenience of analyzing, we define . Therefore, we can derive that
[TABLE]
Then adding the above two equalities we obtain
[TABLE]
Next, we estimate the last two terms on the left side separately,
[TABLE]
where we used the equality (3.27).
Moreover, by employing the equality (3.28) and using , we have
[TABLE]
Then, substituting (LABEL:estimate1), (LABEL:estimate2) into (3.38), we can get the assertion of Proposition 9. ∎
Proposition 10**.**
Let be the sequence generated by Algorithm 1, denotes the KKT point of the discretized reduced problem and , defined in (3.24), (3.25), respectively. Then for we have
[TABLE]
Proof.
For the proof of Proposition 10, by substituting and for and in the proof of Proposition 9, we can get the assertion. ∎
Finally, based on the above results, the convergence results of Algorithm 1 is given by the following theorem.
Theorem 11**.**
Suppose that the operator is uniformly elliptic. Let be the KKT point of (), is obtained in the th iterate of Algorithm 1, where we suppose the mesh sizes of each iteration satisfy and the error vector satisfies , Then we have
[TABLE]
Moreover, there exists a constant only depends on the initial point and the optimal solution such that for ,
[TABLE]
where is defined as
[TABLE]
Proof.
By the optimality condition of the -subproblem in the ADMM in function space, we have
[TABLE]
As we know, the error between the inexact solution and exact solution contains two parts, error from gradually refining the grid and error from the inexactly solving the subproblems. We take them into consideration together as a total error, let represents the inexact solution of the th iteration, then from the optimality condition of the -subproblem, we have
[TABLE]
Moreover, by the optimality conditions of the subproblem corresponding to and in Algorithm 1 and multi-level discretized ADMM, we have
[TABLE]
Then we know from the four equalities above that
[TABLE]
Moreover, we have the estimate
[TABLE]
Then we know from (LABEL:equ:estimate), Lemma 5 and Lemma 6 that there exists constant C such that
[TABLE]
We know from Algorithm 1 that the mesh sizes of each iteration satisfy The error vector of the multi-level ADMM satisfy where is the upper bound of , i.e. . Thus we have
[TABLE]
For the discretization error and the iteration error, Algorithm 1 can be considered as an inexact ADMM algorithm in function space, then we know from Theorem 4 that the convergence of Algorithm 1 is guaranteed.
At last, we establish the proof of the iteration complexity results for the sequence generated by the mADMM. First, by the optimality condition for , we have
[TABLE]
Then by the definition of , we derive
[TABLE]
Next, for the convenience of giving an upper bound of , we define the following sequence , and with:
[TABLE]
First, we give an upper bound of . We know from Lemma 10 that so
[TABLE]
We know from the definition of , and Lemma 7 that
[TABLE]
where is a constant. So there exists a constant such that for every
[TABLE]
By Lemma 9, we have
[TABLE]
so there exists a constant such that Hence,
[TABLE]
By the definition of and , we have
[TABLE]
thus
Moreover, we have the estimate that
[TABLE]
where is a constant, we used (3.56), (3.57) and the property .
Then we know from Lemma 9 that
[TABLE]
where we used the property and (3.55). Hence, there exist constants such that
[TABLE]
Finally, by substituting (3.62) to (3.50), there exists a constant ,
[TABLE]
Thus, by Lemma 8, we know that
[TABLE]
holds. Therefore, combining the obtained global convergence results, we complete the whole proof of Theorem 11. ∎
4 Numerical experiments
In this section we illustrate the numerical performance of the proposed multi-level ADMM algorithm for PDE-constrained optimization problems. All our computational results are obtained by MATLAB R2017b running on a computer with 64-bit Windows 7.0 operation system, Intel(R) Core(TM) i7-6700U CPU (3.40 GHz), and 32 GB of memory.
First, we introduce the algorithmic details that are common to all examples. The discretization was carried out by using the standard piecewise linear finite element approach. To present numerical results, it is convenient to introduce the experimental order of convergence (EOC), which for some positive error function is defined by
[TABLE]
We note that if , then . In numerical experiments, we measure the accuracy of an approximate optimal solution by using the corresponding KKT residual error for each algorithm. For the purpose of showing the efficiency of our mADMM, we report the numerical results obtained by running the ihADMM (see [24] for details) and the classical ADMM method to compare with the results obtained by the mADMM. In this case, we terminate all the algorithms when with the maximum number of iterations set to 500. For all numerical examples and all algorithms, we choose zeros as the initial values and the penalty parameter was chosen as . About the step length , we choose .
Example 4.1**.**
([16], Example 3.3) Consider
[TABLE]
where the domain is the unit circle . Set the desired state , the parameters .
In this example, the exact solutions of the problem are unknown in advance. Instead we use the numerical solutions computed on the grid with as reference solutions. As an example, the discretized optimal control on the grid with is presented in Figure 1.
The error of the control w.r.t. the -norm, the EOC for the control, the numerical results for the accuracy of solution, the CPU time and the number of iterations obtained by our mADMM, the ihADMM and the classical ADMM are shown in Table 1. We can see from Table 1 that our mADMM is highly efficient in obtaining an approximate solution compared with the ihADMM and the classical ADMM in terms of the CPU time, especially when the discretization is in a fine level. Furthermore, it should be specially mentioned that the numerical results in terms of iterations illustrate the mesh-independent performance of the mADMM and the ihADMM. However, iterations of the classical ADMM will increase with the refinement of the discretization.
Example 4.2**.**
([17], Example 4.1) Consider
[TABLE]
where , the upper bound is , the lower bound is , and the regularization parameter is . We choose , where , denotes the solution operator. In addition, from the choice of parameters, it is easy to know that is the unique solution of the continuous problem.
The exact control and the discretized optimal control on the grid with are presented in Figure 2.
The error of the control w.r.t. the - norm, the EOC for the control, the numerical results for the accuracy of solution, the CPU time and the number of iterations obtained by our mADMM, the ihADMM and the classical ADMM are shown in Table 2. Experiment results show that our mADMM has evident advantage on CPU time over the ihADMM and the classical ADMM. Furthermore, we also notice that the numerical results in terms of iteration numbers illustrate the mesh-independent performance of the mADMM.
5 Conclusion
In this paper, we employ a multi-level ADMM algorithm to solve optimization problems with PDE constraints. Instead of solving the discretized problems, we apply the ‘optimize-discretize-optimize’ strategy. Such approach has the flexibility that allows us to discretize the subproblems of the inexact ADMM algorithm by different discretization schemes. Motivated by the multi-level strategy, we propose the proper strategy of gradually refining the grid and the strategy of solving the subproblems inexactly. We designed the convergent multi-level ADMM (mADMM) algorithm, which can significantly reduce the computation cost and make the algorithm faster. The convergence analysis and the iteration complexity results is presented. Numerical results demonstrated the efficiency of the proposed mADMM algorithm.
Acknowledgements
We would like to thank Prof. Long Chen very much for his FEM package iFEM [3] in Matlab.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Brandt, Multi-level adaptive solutions to boundary-value problems. Math. Comp. 31 (1977) 333-390.
- 2[2] L. Chen, D.F. Sun and K.C. Toh, An efficient inexact symmetric Gauss-Seidel based majorized ADMM for high-dimensional convex composite conic programming. Math. Program. 161 (2017) 237-270.
- 3[3] L. Chen, i FEM: An Integrated Finite Element Methods Package in MATLAB. Technical Report. University of California at Irvine, Irvine (2009).
- 4[4] Z.X. Chen, X.L. Song, X.P. Zhang and B. Yu, A FE-ADMM algorithm for Lavrentiev-regularized state-constrained elliptic control problem. ESAIM: COCV 25 (2019) E 5.
- 5[5] P.G. Ciarlet, The Finite Element Method for Elliptic Problems. Volume 40 of Classics in Applied Mathematics. Society for Industrial and Applied Mathematics, Philadelphia (2002).
- 6[6] P. Deuflhard, Newton Methods for Nonlinear Problems: Affine Invariance and Adaptive Algorithms. Springer, Berlin (2011).
- 7[7] M. Fazel, T.K. Pong, D.F. Sun and P. Tseng, Hankel matrix rank minimization with applications to system identification and realization. SIAM J. Matrix Anal. Appl. 34 (2013) 946-977.
- 8[8] M. Fortin and R. Glowinski, On decomposition-coordination methods using an augmented Lagrangian. Augmented Lagrangian Methods: Applications to the Solution of Boundary Problems. Elsevier, Amsterdam (1983).
