Sparse approximate solution of fitting surface to scattered points by MLASSO model
Yong-Xia Hao, Chong-Jun Li, Ren-Hong Wang

TL;DR
This paper presents a multilevel LASSO (MLASSO) model for efficiently obtaining sparse surface approximations to scattered data, balancing data fit and sparsity, and highlighting high-gradient regions.
Contribution
The paper introduces a novel MLASSO model solved by ADMM for sparse surface fitting, improving upon existing methods in efficiency and accuracy.
Findings
MLASSO provides a good trade-off between data mismatch and sparsity.
The model effectively captures high-gradient surface regions.
Numerical experiments demonstrate superior performance over AGLASSO and MBA algorithms.
Abstract
The goal of this paper is to achieve a computational model and corresponding efficient algorithm for obtaining a sparse representation of the fitting surface to the given scattered data. The basic idea of the model is to utilize the principal shift invariant (PSI) space and the l1 norm minimization. In order to obtain different sparsity of the approximation solution, the problem is represented as a multilevel LASSO (MLASSO) model with different regularization parameters. The MLASSO model can be solved efficiently by the alternating direction method of multipliers. Numerical experiments indicate that compared to the AGLASSO model and the basic MBA algorithm, the MLASSO model can provide an acceptable compromise between the minimization of the data mismatch term and the sparsity of the solution. Moreover, the solution by the MLASSO model can reflect the regions of the underlying surface…
| Error | RMS | Iterations | Time(sec) | ||
|---|---|---|---|---|---|
| [35] | (24, 64, 193) | 2.1241e-3 | 2.3512e-3 | 3 | 0.0241 |
| Fig.2(a) | (18, 53, 143) | 2.2538e-3 | 4.3462e-3 | 17 | 0.0431 |
| Fig.2(b) | (13, 29, 84) | 3.8215e-3 | 5.6138e-3 | 1322 | 2.0032 |
| Fig.2(c) | (4, 41, 88) | 5.1433e-3 | 7.3768e-3 | 1841 | 2.4456 |
| Fig.2(d) | (15, 22, 49) | 5.3102e-3 | 8.7471e-3 | 2436 | 3.2345 |
| Fig.2(e) | (20, 51, 116) | 5.9973e-1 | 6.2737e-1 | 1 | 0.3917 |
| Fig.2(f) | (24, 61, 181) | 5.0691e-1 | 5.4132e-1 | 1 | 0.2961 |
| Fig.2(g) | (24, 63, 190) | 4.2005e-1 | 4.6263e-1 | 1 | 0.3524 |
| Fig.2(h) | (25, 62, 189) | 3.9211e-1 | 4.3247e-1 | 1 | 0.3012 |
| Error | RMS | Iterations | Time(sec) | ||
|---|---|---|---|---|---|
| [35] | (25, 62, 192, 674) | 1.1185e-3 | 3.3243e-4 | 4 | 0.2191 |
| Fig.3(a) | (17, 37, 50, 138) | 1.4298e-3 | 3.5873e-3 | 267 | 4.3145 |
| Fig.3(b) | (15, 21, 27, 88) | 2.1073e-3 | 4.2481e-3 | 1209 | 11.1452 |
| Fig.3(c) | (7, 41, 69, 63) | 3.1132e-3 | 5.6871e-3 | 1307 | 13.0139 |
| Fig.3(d) | (2, 53, 27, 59) | 3.9413e-3 | 6.1678e-3 | 1183 | 12.7346 |
| Fig.3(e) | (20, 52, 114, 90) | 5.2471e-1 | 6.0124e-1 | 1 | 5.1328 |
| Fig.3(f) | (23, 61, 181, 537) | 5.0109e-1 | 4.7453e-1 | 1 | 5.0129 |
| Fig.3(g) | (24, 48, 112, 544) | 5.1902e-1 | 5.4287e-1 | 1 | 4.9879 |
| Fig.3(h) | (25, 45, 176, 581) | 3.8716e-1 | 5.2634e-1 | 1 | 4.3472 |
| Error | RMS | Iterations | Time(sec) | ||
|---|---|---|---|---|---|
| [35] | (22, 64, 194) | 3.7541e-3 | 7.3274e-3 | 3 | 0.1102 |
| Fig.4(a) | (18, 41, 115) | 4.1637e-3 | 8.3761e-3 | 232 | 0.4257 |
| Fig.4(b) | (16, 41, 53) | 5.0749e-3 | 9.0652e-3 | 678 | 2.3091 |
| Fig.4(c) | (15, 40, 31) | 4.7049e-3 | 9.3981e-3 | 1103 | 1.6027 |
| Fig.4(d) | (14, 33, 49) | 5.2564e-3 | 9.2971e-3 | 1089 | 1.5453 |
| Fig.4(e) | (15, 51, 91) | 3.3797e-1 | 3.5205e-1 | 1 | 1.0274 |
| Fig.4(f) | (20, 62, 183) | 2.2143e-1 | 2.6308e-1 | 1 | 0.7348 |
| Fig.4(g) | (21, 64, 194) | 2.0213e-1 | 2.1453e-1 | 1 | 0.5762 |
| Fig.4(h) | (21, 64, 195) | 1.0254e-1 | 1.2190e-1 | 1 | 0.4209 |
| Error | RMS | Iterations | Time(sec) | ||
|---|---|---|---|---|---|
| [35] | (22, 63, 195, 673) | 2.1342e-3 | 3.2496e-3 | 4 | 0.7326 |
| Fig.5(a) | (16, 28, 45, 73) | 3.0054e-3 | 3.4124e-3 | 378 | 5.0312 |
| Fig.5(b) | (17, 31, 77, 54) | 2.3321e-3 | 3.0061e-3 | 1214 | 9.7821 |
| Fig.5(c) | (15, 25, 56, 24) | 4.1072e-3 | 5.1132e-3 | 2601 | 13.8871 |
| Fig.5(d) | (4, 61, 18, 0) | 5.7461e-3 | 6.3242e-3 | 3261 | 15.3487 |
| Fig.5(e) | (16, 52, 91, 40) | 1.5002e-1 | 2.1392e-1 | 1 | 5.1902 |
| Fig.5(f) | (20, 64, 185, 471) | 1.2301e-1 | 2.0342e-1 | 1 | 4.8106 |
| Fig.5(g) | (21, 64, 180, 589) | 2.0039e-1 | 2.1759e-1 | 1 | 4.7951 |
| Fig.5(h) | (20, 64, 185, 596) | 3.1402e-1 | 3.1132e-1 | 1 | 4.6283 |
| Error | RMS | Iterations | Time(sec) | ||
|---|---|---|---|---|---|
| [35] | (24, 63, 193) | 3.0285e-3 | 5.8147e-3 | 3 | 0.0617 |
| Fig.6(a) | (15, 50, 116) | 3.0273e-3 | 6.3044e-3 | 298 | 0.5042 |
| Fig.6(b) | (13, 33, 60) | 4.1409e-3 | 7.6258e-3 | 739 | 1.0354 |
| Fig.6(c) | (6, 50, 31) | 4.8321e-3 | 7.6578e-3 | 978 | 1.3277 |
| Fig.6(d) | (7, 34, 20) | 5.1157e-3 | 9.2785e-3 | 683 | 1.0317 |
| Fig.6(e) | (13, 19, 18) | 4.6342e-2 | 5.1462e-2 | 1 | 0.5745 |
| Fig.6(f) | (21, 39, 74) | 5.7749e-2 | 6.0324e-2 | 1 | 0.4182 |
| Fig.6(g) | (22, 42, 123) | 6.7765e-2 | 7.0079e-2 | 1 | 0.6242 |
| Fig.6(h) | (24, 45, 138) | 1.1091e-1 | 1.0072e-1 | 1 | 0.1562 |
| Error | RMS | Iterations | Time(sec) | ||
|---|---|---|---|---|---|
| [35] | (25, 63, 192, 674) | 1.6373e-3 | 3.8077e-3 | 4 | 0.7348 |
| Fig.7(a) | (16, 47, 105, 38) | 1.7665e-3 | 5.6282e-3 | 573 | 8.3451 |
| Fig.7(b) | (13, 31, 58, 10) | 2.3442e-3 | 5.6422e-3 | 978 | 13.6259 |
| Fig.7(c) | (7, 25, 96, 0) | 3.6609e-3 | 6.4672-3 | 854 | 11.7498 |
| Fig.7(d) | (0, 51, 36, 0) | 4.3713e-3 | 8.7693e-3 | 1588 | 19.8770 |
| Fig.7(e) | (14, 18, 19, 0) | 4.6868e-2 | 5.3092e-2 | 1 | 5.2829 |
| Fig.7(f) | (20, 38, 73, 119) | 5.1103e-2 | 5.5647e-2 | 1 | 5.1456 |
| Fig.7(g) | (22, 46, 68, 279) | 6.9144e-2 | 7.0093e-2 | 1 | 4.3378 |
| Fig.7(h) | (23, 34, 101, 328) | 1.0138e-1 | 1.1552e-1 | 1 | 3.6595 |
| Error | RMS | Iterations | Time(sec) | ||
|---|---|---|---|---|---|
| [35] | (25, 62, 193) | 3.2254e-3 | 5.0711e-3 | 3 | 0.0834 |
| Fig.8(a) | (17, 41, 110) | 3.7421e-3 | 6.3902e-3 | 19 | 1.1642 |
| Fig.8(b) | (15, 34, 57) | 4.0324e-3 | 6.2097e-3 | 938 | 1.7439 |
| Fig.8(c) | (9, 38, 46) | 4.5021e-3 | 8.1093e-3 | 1463 | 2.4762 |
| Fig.8(d) | (15, 29, 33) | 6.2043e-3 | 8.0072e-3 | 2230 | 2.7803 |
| Fig.8(e) | (21, 45, 126) | 2.0414e-1 | 6.0056e-1 | 1 | 0.7439 |
| Fig.8(f) | (24, 60, 161) | 2.1369e-1 | 3.1057e-1 | 1 | 0.5648 |
| Fig.8(g) | (23, 57, 166) | 3.2451e-1 | 2.2731e-1 | 1 | 0.4846 |
| Fig.8(h) | (24, 58, 179) | 5.3021e-1 | 2.1237e-1 | 1 | 0.4218 |
| Error | RMS | Iterations | Time(sec) | ||
|---|---|---|---|---|---|
| [35] | (25, 63, 194, 675) | 4.1124e-4 | 3.2017e-3 | 4 | 2.1406 |
| Fig.9(a) | (14, 34, 84, 108) | 5.0121e-4 | 5.0512e-3 | 687 | 12.1132 |
| Fig.9(b) | (15, 33, 48, 31) | 1.18974e-3 | 4.6645e-3 | 1152 | 18.4120 |
| Fig.9(c) | (13, 27, 52, 8) | 4.1321e-3 | 7.1452e-3 | 1333 | 18.2461 |
| Fig.9(d) | (12, 22, 70, 6) | 4.2134e-3 | 7.6731e-3 | 2231 | 28.1121 |
| Fig.9(e) | (22, 47, 126, 145) | 1.5782e-1 | 1.7287e-1 | 1 | 5.1631 |
| Fig.9(f) | (24, 58, 163, 491) | 2.2056e-1 | 2.2142e-1 | 1 | 5.4315 |
| Fig.9(g) | (24, 58, 164, 557) | 3.0349e-1 | 3.4397e-1 | 1 | 4.3341 |
| Fig.9(h) | (25, 60, 160, 524) | 5.2570e-1 | 5.3329e-1 | 1 | 3.9764 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical methods in inverse problems · Electromagnetic Scattering and Analysis · Sparse and Compressive Sensing Techniques
\Year
2013 \MonthJanuary \Vol56 \No1 \BeginPage1 \EndPageXX \AuthorMarkYong-Xia Hao et al.
Corresponding author
\Emails
[email protected]; [email protected]; [email protected]
Sparse approximate solution of fitting surface to scattered points by MLASSO model
Yong-Xia Hao
Chong-Jun Li
Ren-Hong Wang
Faculty of Science, Jiangsu University, Zhenjiang 212000, China;
School of Mathematical Sciences, Dalian University of Technology, Dalian 116024, China;
Abstract
The goal of this paper is to achieve a computational model and corresponding efficient algorithm for obtaining a sparse representation of the fitting surface to the given scattered data. The basic idea of the model is to utilize the principal shift invariant (PSI) space and the norm minimization. In order to obtain different sparsity of the approximation solution, the problem is represented as a multilevel LASSO (MLASSO) model with different regularization parameters. The MLASSO model can be solved efficiently by the alternating direction method of multipliers. Numerical experiments indicate that compared to the AGLASSO model and the basic MBA algorithm in [35], the MLASSO model can provide an acceptable compromise between the minimization of the data mismatch term and the sparsity of the solution. Moreover, the solution by the MLASSO model can reflect the regions of the underlying surface where high gradients occur.
keywords:
Sparse solution, Principle shift invariant space, norm minimization, Alternating direction method of multipliers, MLASSO model
\MSC
65D17, 65K99
[TABLE]
\wuhao
1 Introduction
Sparse representation of a function via a linear combination of a small number of functions has recently received a lot of attention in several mathematical fields such as approximation theory [13, 33, 41, 42], compressed sensing, signal and image processing [7, 8, 9, 10] etc. The problem can be described as follows. Consider a linearly dependent set of functions and a function represented as Since the set of functions is not linearly independent, this representation is not unique. The problem is then to find the sparsest solution, i.e., the coefficient vector has as many zero components as possible (referred to minimizing the norm of the vector ). This optimization problem is NP-hard, since the norm is nonconvex and discontinuous. Hence, much attention has been paid to solutions minimizing instead.
In this paper, we consider the problem of reconstructing a surface from scattered data using a sparse representation. The scattered data fitting problem arises in many applications, such as signal processing, computer graphics and neural networks [32]. In a typical scattered data reconstruction problem, we are given a set of scattered points and associated noisy function values , where n is the error vector. Then we seek a function which fits the given data well. There are a lot of existing methods and algorithms in the literature. Various methods can be found in a survey on scattered data interpolation [38]. For approximation methods, B-splines have a solid mathematical foundation and have been used in many literatures, such as [34, 51] etc. Wavelet frames have also been used to reconstruct implicit surfaces from unorganized point sets in [16]. In order to control the local and global fitting error simultaneously, adaptive methods are presented in [11, 44]. The adaptivity is achieved by a portion of the data with a patch, testing the fit for satisfaction within a given tolerance, and subdividing the patch if the tolerance is not met [23]. In addition, several approximation methods employ a multilevel structure to approximate data efficiently. In particular, a multilevel scheme based on B-splines is proposed in [35] to approximate scattered data. These methods run on the approximation space , where are principal shift invariant (PSI) spaces generated by a single compactly supported function . The multilevel approximation procedure is as follows: for each level , the point set is approximated by a function obtained by the least square method, where . The procedure is terminated until certain conditions are satisfied. Then the final approximation surface is
[TABLE]
However, the methods above do not produce the sparse representation of the surface.
In this paper, we present an efficient method to obtain a sparse representation of the fitting surface to the given scattered points. We still choose the space defined above as the approximation space. But instead of using the multilevel scheme, we put all the basis functions of together as a frame of . Denote the basis functions of as , then . We then try to find the fitting surface as:
[TABLE]
Since the functions are linearly dependent, the representation of as above is not unique and we will seek a relatively sparse one. The choice of the space makes a sparse representation of exist and the function can be constructed in a multilevel way. We use the similar approach as those used in compressed sensing, i.e., to use the norm of the coefficient vector as the regularization term. Thus, the problem can be represented by the following minimization
[TABLE]
where are coefficient vectors. The parameters are called the regularization parameters, which serve as a weight to adjust the balance between the two terms. Large values of will lead to a sparse function , at the cost of a potentially large fitting error, while small values of will lead to a small fitting error, but with a potentially not too sparse fitting function . In addition, different values of can lead to different sparsity of . Let f denote the column vector , then the formulation (1.1) is equivalent to the following minimization:
[TABLE]
where is the observation matrix defined by
[TABLE]
For the case , the model reduces to the model presented in [30]. Moreover, the related minimization resulting from our proposed model can be efficiently solved using the alternating direction method of multipliers (ADMM) [54, 24].
This framework combines the ideas developed in compressed sensing with well-known concepts arising in adaptive and multilevel finite element methods. The solution of the minimization problem and the multilevel basis functions are used to control the grid refinement and adaptivity. Since we aim at finding a sparse representation of the surface, we discard those coefficients which are smaller than a certain threshold. Then only large coefficients of the solution are left which indicate important contributions of the underlying surface. Moreover, these large coefficients belong to the parts of the surface that have large fluctuations. It seems a little similar for our method and the approach in [15], both using the PSI space. However, the method in [15] was used to approximate functions expressed as a infinite sum of wavelet decomposition by a finite sum, while we deal with scattered data fitting problem by representing the fitting function as a finite sum directly with certain accuracy and more sparse coefficients. The behavior of our method is demonstrated via four examples: a discontinuous function, a non-smooth function, a smooth function and the Franke test function. In addition, we compare the numerical results with the AGLASSO model and the basic MBA algorithm in [35] followed by the same thresholding.
The rest of the paper is organized as follows. In the next section, we recall the main ingredients of the PSI space which will be used here. Moreover, we will propose the sparsity based regularization model for scattered data fitting. In Section 3, the ADMM algorithm will be applied to solve the minimization problem resulted from the proposed model. Numerical experiments are also performed to illustrate the algorithm. Section 4 is the conclusion.
2 Sparse solution of PSI approach to scattered data approximation
For a given set of scattered points and the corresponding noisy data , our task is to reconstruct a fitting surface with a sparse representation.
2.1 PSI space and regularization
Let be a bounded domain of interest where all data lie in and let be a carefully chosen, compactly supported function (e.g. uniform B-spline, box spline, radial basis functions). Denote
[TABLE]
where is a scaling parameter that controls the refinement of the space. Denote , we then look for a function which fits closely the given data. Then is composed of a sequence of functions as
[TABLE]
where
Here we choose a proper PSI space generated by B-spline as the approximation space since it enjoys desirable properties for data fitting. It has a simple structure and provides good approximation to smooth functions, which leads to simple and accurate algorithms. Moreover, it can be associated to a wavelet or frame system and hence one can solve the fitting problem by making use of the advantages that a wavelet (frame) system can offer [30]. These advantages include sparse approximation of functions in the wavelet (frame) domain, multilevel structure of basis functions, adaptivity to the data, norm equivalence, etc.
Recall that a function is said to satisfy the Strang-Fix conditions of order if
[TABLE]
Denote
[TABLE]
where is the Fourier transform of the function and denotes the Euclidean norm. Then if satisfies the Strang-Fix conditions [14, 31], a PSI space provides good approximation to (see [14, 31]), i.e., satisfies the Strang-Fix conditions of order if and only if for all ,
[TABLE]
Particularly, the B-spline of order satisfies the Strang-Fix conditions of order for all [16]. For more detailed discussions on PSI space, see [14].
Obviously, the union set of the basis functions of is not linearly independent. Thus the representation of is not unique and we want to determine a relatively sparse one, i.e., a representation with as many vanishing coefficients as possible. Every function can be written as
[TABLE]
where
[TABLE]
Let and f denote the column vector and respectively, then the problem can be formulated as follows.
[TABLE]
where is the observation matrix defined by
[TABLE]
Obviously, the model (2.1) balances the fitting accuracy and the norm. In order to achieve a sparse representation, small coefficients are neglected. That is, after obtaining the solution of the model (2.1), we discard the small elements of . Then the final solution only has large values left which indicate important contributions (fluctuations) of the real surface. Furthermore, comparing with the multilevel approximation approach given in [35], our method has the advantages of simplicity. Another important distinction is that it can be interpretable as a sparse strategy for reconstructing scattered data.
2.2 The MLASSO model
The model (2.1) is related to the LASSO model in some extent. Recall that the mathematical model of LASSO is:
[TABLE]
It was proposed originally in [48], and plays a very influential role in variable selection and dimensionality reduction. The Group LASSO (GLASSO) model proposed in [53] solves the convex optimization problem:
[TABLE]
where is a regularization parameter. The GLASSO model was proposed to perform variable selection on groups of variables for linear regression models. It has many applications in areas such as computer vision, data mining, etc. Meier et al. in [39] extended the GLASSO to logistic regression. The GLASSO does not, however, yield sparsity within a group. Moreover, GLASSO suffers from estimation inefficiency and selection inconsistency. To remedy these problems, the adaptive GLASSO method (AGLASSO) is proposed in [49] as:
[TABLE]
Obviously, the model (2.1) reduces to the LASSO model when . Moreover, the model (2.1) acts like the LASSO at the multilevel. Therefore, we denote the model (2.1) as the multilevel LASSO model (MLASSO). Compared with the AGLASSO model, the MLASSO model considers an additional penalty on the norm instead of norm of the regression coefficient vector, and it produces as election of variables with sparsity among different levels. It is known that when norm regularization term is applied to the data set, the resulting surface tends to be smooth without sharp discontinuities but have undesirable oscillations near the discontinuities [30]. Recently, several surface reconstruction approaches have been proposed to preserve surface discontinuity by replacing regularization using more sophisticated regularization, e.g., the Huber approximation of norm of function derivatives in [47], the local kernel regularization in [25] and the non-local means regularization in [16]. In our method, we use the regularization instead, in order to obtain the relatively sparse solution. The regularization term in (2.1) penalizes the roughness of the solution.
2.3 The parameter selection
The determination of the proper value of in the MLASSO model is an important problem and depends on the variance of the noise n, the properties of and . An appropriate choice of the regularization parameters is of vital importance for the quality of the resulting estimate and has been the subject of extensive research [3].
In recent years, there has been a growing interest in sophisticated regularization techniques which use multiple constraints as a mean of improving the quality of the solution [3]. Among them, only a few papers discussed the choice of multiple regularization parameters. However, most of them discuss the case that the regularization term is the norm. For example, a multi-parameter generalization of heuristic L-curve has been proposed in [3], a knowledge of noise (covariance) structure is required for a choice of parameter in [1, 2, 12], some reduction to a single parameter choice is suggested in [6]. At the same time, the discrepancy principle, which is widely used and known as the first parameter choice strategy proposed in the regularization theory [40], has been discussed in a multi-parameter context in [37]. Of course, there might be many different methods to choose the regularization parameters satisfying certain principle. In fact, the choice of the regularization parameters in the regularization modes, such as the LASSO model and the standard Tikhonov regularization model, have not yet solved. No choice is available for all the models. Basically, different models need different methods to decide the parameters, and even the same one may have different methods to choose, such as [55, 49, 27, 50, 28].
For the MLASSO model, instead of discussing similar methods as listed above, we propose two simple criteria for the choice of the parameters according to the sparsity and the support of the basis functions of the different levels. On one hand, since the length of is less than that of , so if one wants to obtain sparser solution, the parameters of the last several levels should be larger. In particular, choosing the same value for all the parameters, i.e., , obtains the global sparsity in the whole space . On the other hand, the support of the basis functions of the -th level is larger than that of the -th level, so if one wants smaller support, the parameters of the first several levels should be larger. Hence, the more appropriate choice of the parameters is to make the parameters of the first and last several levels greater than the middle several levels. We can only give such a qualitative guideline, since quantitative guidance for regularization parameters is rarely available. In this way, the solution has smaller support and sparser representation with good approximation accuracy. Numerical results also confirm this selection method as shown in the Tables 1-8. Moreover, in contrast with the methods listed above, this choice is easier and cheaper in the sense of computational cost.
Based on the above, the proposed method offers some interesting advantages:
-
Related with the LASSO model, the MLASSO model has a strong statistical background;
-
Compared to the LASSO model, the parameters in the MLASSO model can be chosen differently, thus one can attain high flexibility for the approximation accuracy and the sparsity of the solution;
-
Compared to the AGLASSO model, the MLASSO model can provide the sparsity of the solution by choosing different parameters and the distribution of the solution can reflect the large fluctuations of the underlying surface;
-
Utilizing the ADMM algorithm, the MLASSO model can be efficiently solved.
3 Numerical algorithm
3.1 Algorithm
In this section, we use the ADMM algorithm to solve the minimization model (2.1) for experimental evaluation. It turns out that ADMM is equivalent to or closely related ro many famous algorithms, such as the Douglas-Rachford splitting method in PDE literature [18, 19, 36], the Bregman iterative algorithms for problems in signal processing [26] and many others. In particular, we refer to [21, 52, 46, 45] for the relationship between ADMM and the split Bregman iteration scheme which is very influential in the area of image processing. Convergence analysis of the ADMM was given in [20, 29, 5].
In order to solve the MLASSO model (2.1) and guarantee the convergence, we denote
[TABLE]
is a diagonal matrix with as the main diagonal. Then we can rewrite (2.1) as
[TABLE]
By introducing an auxiliary variable , we convert the unconstrained minimization problem (3.1) into a constrained one:
[TABLE]
In this way, the MLASSO model (2.1) is turned into a classical minimization problem. The augmented Lagrangian function of problem (3.2) is
[TABLE]
where is a parameter of the algorithm. Then by applying the ADMM method, given the initialization and the parameters , it results in the following optimization algorithm:
[TABLE]
First of all, the above algorithm is convergent, since it is just the classical ADMM for two block of variables. Secondly, the system can be simply solved. The first step of each iteration in (3.3) is
[TABLE]
This linear system is positive definite and therefore it can be solved by the conjugate gradient method (CG). When the matrix is ill-posed, i.e., its condition number is huge, the convergence rate of the CG will be very slow. Under this case, the preconditioned CG [22, 43, 4] can be used instead to reduce the condition number of the coefficient matrix and improve the convergence speed. The second subproblem has a simple analytical solution based on soft-thresholding operator [17], that is
[TABLE]
where is the soft-thresholding operator defined by
[TABLE]
where
[TABLE]
The complete description of the algorithm for solving the model (2.1) is provided as Algorithm 1 as follows:
Algorithm 1**.**
*(Adapted ADMM for solving the MLASSO model (2.1))
Step 1) Set and the initial values choose appropriate sets of parameters , and two thresholds ;
Step 2) For , perform the iteration (3.3) until convergence;
Step 3) Assume is the solution obtained from Step 2), if , i.e., the absolute value of the -th element of is less than , set ;
Step 4) The final solution X are after the treatment of Step 3).*
In our numerical experiments, the initializations are and the stopping criteria is
[TABLE]
3.2 Numerical experiments
Given a test function with , we first sample data points with certain noises from it, i.e., . The error vector , whose entries consist of the pseudorandom values drawn from the standard uniform distribution on the open interval . Then apply different methods to obtain the approximation function . The difference between and is measured by computing the normalized RMS (root mean square) error [35]. That is,
[TABLE]
where , . Moreover, denote
[TABLE]
as the fitting error of the given scattered points . To demonstrate the accuracy of the proposed algorithm, we perform experiments with four functions: a discontinuous function , a non-smooth function , a smooth function and the Franke test function as follows.
[TABLE]
[TABLE]
[TABLE]
[TABLE]
For the numerical implementation in this paper, we employ the 2D tensor product quadratic B-spline as the function . We show that such a simple system with the Algorithm 1 can be used to effectively reconstruct surface of sparse representation from a scattered data set. The choices of the threshold can be chosen according to the accuracy and sparsity. We consider the four functions with and the 900 scattered points are chosen randomly. Moreover, we compare Algorithm 1 with the basic MBA algorithm presented in [35] under the termination condition and the AGLASSO model, both applying the same thresholding as Algorithm 1 to their results.
We experiment the above three methods for and starting from level 1 with biquadratic B-spline functions respectively. Fig.1 shows the scattered data points and the corresponding approximations of the four functions with Algorithm 1, the method in [35] and the AGLASSO model respectively. Fig.2-9 illustrate the distribution of the support of the B-spline functions with nonzero coefficients for and respectively. The red, green, blue and black rectangles denote the support of the B-splines corresponding to and respectively. Moreover, Tables 1-8 give the approximation accuracy, the iterations, the running time and the norm of the solution with different parameters for , where the length of and is 25, 64, 196 and 625 respectively. In addition, all our calculations are done in Matlab on a laptop with Inter Core i7 (2.90GHZ) CPU and 8.0G RAM.
Discussion 1**.**
The numerical results demonstrate that Algorithm 1 and the method in [35] have almost the same approximation errors, while Algorithm 1 obtains the sparse solution. Compared with the AGLASSO model, Algorithm 1 provides the sparser solutions with less error, though more iterations and more time.
Through the first two steps of Algorithm 1, we can obtain an approximation solution of the MLASSO model with no sparsity and the solution has some big components and some small ones which reflect different importance and contribution. Then by step three, we throw out small components of the computed solution which means we keep only the important ones with great contribution to the solution. Therefore, by choosing appropriate regularization parameters, the final solution can indicate the important parts we are interested in and identify the important features within the selected levels simultaneously of the exact surface since they are all determined by the big components. Experiment results verify this conclusion.
Taken together, Algorithm 1 can reconstruct the test functions reasonably with a sparse representation within a few levels by choosing some appropriate regularization parameters.
4 Conclusion
This paper presents an approach for scattered data fitting using the PSI space and the regularization. It is concluded into the MLASSO model which allows us to balance the accuracy and the sparsity of the fitting surface. The model can be solved using Algorithm 1 with the ADMM algorithm. Numerical examples demonstrate that compared to the basic MBA algorithm in [35] and the AGLASSO model, the MLASSO model provides an efficient, sparse, flexible and reasonable solution. Moreover, the distribution of the basis functions of the sparse solution can identify the regions of the underlying surface where large fluctuations occur.
\Acknowledgements
This work was supported by the National Natural Science Foundation of China (Nos.11526098, 11001037, 11290143, 11471066), the Research Foundation for Advanced Talents of Jiangsu University (No.14JDG034), the Natural Science Foundation of Jiangsu Province (No.BK20160487) and the Fundamental Research Funds for the Central Universities (DUT15LK44).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Bauer, O. Ivanyshyn. Optimal regularization with two interdependent regularization parameters. Inverse Probl., 2007, 23: 331–342
- 2[2] F. Bauer, S. V. Pereverzev. An utilization of a rough approximation of a noise covariance within the framework of multi-parameter regularization. Int. J. Tomogr. Stat., 2006, 4: 1–12
- 3[3] M. Belge, M. E. Kilmer, E. L. Miller. Efficient determination of multiple regularization parameters in a generalized L-curve framework, Inverse Probl., 2002, 18: 1161–1183
- 4[4] M. Benzi. Preconditioning Techniques for Large Linear Systems: A Survey. J. Comput. Phys., 2002, 182: 418-477
- 5[5] S. Boyd, N. Parikh, E. Chu, B. Peleato, J. Eckstein. Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends of Machine Learning, 2010, 3: 1–122
- 6[6] C. Brezinski, M. Redivo-Zaglia, G. Rodriguez, S. Seatzu. Multi-parameter regularization techniques for ill-conditioned linear sysytems. Numer. Math., 2003, 94: 203–228
- 7[7] E. J. Cand e ` ` 𝑒 \grave{e} s, J. Romberg, T. Tao. Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Inform. Theory, 2006, 52(2): 489–509
- 8[8] E. J. Cand e ` ` 𝑒 \grave{e} s, J. Romberg, T. Tao. Stable signal recovery from incomplete and inaccurate measurements. Comm. Pure Appl. Math., 2006, 59(8): 1207–1223
