On the Golub--Kahan bidiagonalization for ill-posed tensor equations with applications to color image restoration
Fatemeh P. A. Beik, Khalide Jbilou, Mehdi Najafi-Kalyani, Lothar, Reichel

TL;DR
This paper introduces the Tensor Golub--Kahan bidiagonalization algorithm combined with Tikhonov regularization to effectively solve ill-posed tensor equations, with applications demonstrated in color image restoration.
Contribution
The paper proposes a novel tensor bidiagonalization algorithm and explores its theoretical properties and practical applications in high-dimensional tensor problems.
Findings
The TGKB algorithm effectively stabilizes solutions to ill-posed tensor equations.
Numerical experiments confirm the algorithm's applicability to color image restoration.
Theoretical analysis reveals the conditioning of tensor equations and the utility of TGKB.
Abstract
This paper is concerned with solving ill-posed tensor linear equations. These kinds of equations may appear from finite difference discretization of high-dimensional convection-diffusion problems or when partial differential equations in many dimensions are discretized by collocation spectral methods. Here, we propose the Tensor Golub--Kahan bidiagonalization (TGKB) algorithm in conjunction with the well known Tikhonov regularization method to solve the mentioned problems. Theoretical results are presented to discuss on conditioning of the Stein tensor equation and to reveal that how the TGKB process can be exploited for general tensor equations. In the last section, some classical test problems are examined to numerically illustrate the feasibility of proposed algorithms and also applications for color image restoration are considered.
| Grid | Level of noise | Method | Iter() | CPU-times(sec) | ||
|---|---|---|---|---|---|---|
| 0.01 | Algorithm 2 | 39 | 2.31 | |||
| Algorithm 3 | 66 | 3.34 | ||||
| HT-BTF | 11 | 3.62 | ||||
| FHT-BTF | 5 | 0.98\bigstrut[b] | ||||
| 0.001 | Algorithm 2 | 134 | 7.53 | |||
| Algorithm 3 | 126 | 6.49 | ||||
| HT-BTF | 24 | 34.84 | ||||
| FHT-BTF | 8 | 2.35 \bigstrut[b] | ||||
| 0.01 | Algorithm 2 | 37 | 7.03 | |||
| Algorithm 3 | 103 | 17.28 | ||||
| HT-BTF | 11 | 12.32 | ||||
| FHT-BTF | 5 | 3.34 | ||||
| 0.001 | Algorithm 2 | 178 | 36.89\bigstrut[t] | |||
| Algorithm 3 | 193 | 36.97 | ||||
| HT-BTF | 21 | 72.95 | ||||
| FHT-BTF | 8 | 8.15\bigstrut[b] | ||||
| 0.01 | Algorithm 2 | 36 | 11.13\bigstrut[t] | |||
| Algorithm 3 | 127 | 35.85 | ||||
| HT-BTF | 11 | 21.45 | ||||
| FHT-BTF | 5 | 5.66 \bigstrut[b] | ||||
| 0.001 | Algorithm 2 | 154 | 58.64\bigstrut[t] | |||
| Algorithm 3 | 231 | 73.47 | ||||
| HT-BTF | 22 | 134.51 | ||||
| FHT-BTF | 7 | 11.65 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Model Reduction and Neural Networks · Advanced Numerical Methods in Computational Mathematics
11footnotetext: Department of Mathematics, Vali-e-Asr University of Rafsanjan, PO Box 518, Rafsanjan, Iran ([email protected] (F. P. A. Beik); [email protected] (M. Najafi-Kalyani)).22footnotetext: Laboratory LMPA, 50 Rue F. Buisson, ULCO calais cedex, France ([email protected]). 33footnotetext: Department of Mathematical Sciences, Kent State University, Kent, OH 44242, USA ([email protected]).
On the Golub–Kahan bidiagonalization for ill-posed tensor equations with applications to color image restoration
Fatemeh P. A. Beik1
Khalide Jbilou2
Mehdi Najafi-Kalyani1 and Lothar Reichel3
Abstract
This paper is concerned with solving ill-posed tensor linear equations. These kinds of equations may appear from finite difference discretization of high-dimensional convection-diffusion problems or when partial differential equations in many dimensions are discretized by collocation spectral methods. Here, we propose the Tensor Golub–Kahan bidiagonalization (TGKB) algorithm in conjunction with the well known Tikhonov regularization method to solve the mentioned problems. Theoretical results are presented to discuss on conditioning of the Stein tensor equation and to reveal that how the TGKB process can be exploited for general tensor equations. In the last section, some classical test problems are examined to numerically illustrate the feasibility of proposed algorithms and also applications for color image restoration are considered.
keywords:
Tensor linear operator equation, Ill-posed problem, Tikhonov regularization, Golub–Kahan bidiagonalization.
AMS:
65F10, 15A24
1 Introduction
This paper deals with solving severely ill-conditioned tensor equations. We are particularly interested in Sylvester and Stein tensor equations. It should be commented the proposed iterative schemes can be used for solving,
[TABLE]
where is an arbitrary linear tensor operator. An ill-posed tensor equation may appear in color image restoration, video restoration, and when solving certain partial differential equations by collocation methods in several space dimensions [3, 17, 18, 19, 21]. Throughout this work, vectors and matrices are respectively denoted by lowercase and capital letters, and tensors of order three (or higher) are represented by Euler script letters. Before stating the main problems, we need to recall the definition of -mode product from [14].
Definition 1**.**
The -mode (matrix) product of a tensor with a matrix is denoted by and is of size
[TABLE]
and its elements are defined as follows:
[TABLE]
The Sylvester and Stein tensor equations are respectively given by
[TABLE]
and
[TABLE]
where the right-hand side tensors and the coefficient matrices () are known, and is the unknown tensor.
Sylvester tensor equation may arise from the discretization of a linear partial differential equation in several space-dimensions by finite differences [1, 3, 8] or by spectral methods [3, 17, 18, 19, 20]. Some discussions on conditioning of (2) under certain conditions are presented in [21] where Najafi et al. proposed using the standard Tikhonov regularization technique in conjunction with global Hessenberg processes in tensor form to solve (2) with perturbed right-hand sides. Some results for perturbation analysis of (3) are given in [16] and a more recent work by Xu and Wang [22] where Eq. (3) is solved by tensor form of the BiCG and BiCR methods. Liang and Zheng [16] established some results for perturbation analysis of (3) in the case is even and with being a Schur stable (all the eigenvalues of lie in the open unite disc). However, presented results rely on the matrix two norm of
More recently, Huang et al. [13] proposed global form of well–known iterative methods in their tensor forms to solve a class of tensor equations via the Einstein product. Here, we comment that the proposed iterative approach in this work can be also used when the mentioned problem in [13] is ill-posed.
In this paper, we first establish some results to analyze the conditioning of (3) motivated by [16, 22]. Then the tensor form of the GKB process is proposed for solving ill-posed tensor equations. More precisely, we illustrate how tensor–based GKB process can be exploited to solve ill-posed problems (2) and (3). To this end, we apply the established results in [3] and generalize exploited techniques of [5]. It is immediate to observe that the results (in Section 3) can be also used for solving ill-posed problem of the general form (1).
The remainder of paper is organized as follows. Before ending this section, we present some symbols and notations used throughout next sections. We further recall the concept of contract product between two tensors. In Section 2, we present some results related to sensitivity analysis of (3). Section 3 is devoted for constructing an approach based on tensor form of GKB and Gauss-type quadrature in conjunction with Tikhonov regularization technique to solve ill-posed tensor equations. In order to illustrate the effectiveness of proposed iterative schemes, some numerical results are reported in Section 4. Finally the paper is ended with a brief conclusion in Section 5.
1.1 Notations
Given a -mode tensor , the notation stands for element of . For a given square matrix with real eigenvalues, we denote the minimum and maximum eigenvalues of by and , respectively. The set of all eigenvalues (spectrum) of is signified by . The symmetric and skew-symmetric parts of are respectively denoted by and , i.e.,
[TABLE]
By condition number of an invertible matrix , we mean “” where is the matrix -norm. The notation is exploited for multi-dimensional Kronecker product. The vector is obtained by using the standard vectorization operator with respect to frontal slices of . The mode- matrization of a given tensor is denoted by which arranges the mode- fibers to be the columns of resulting matrix. We recall that a fiber is defined by fixing every index but one; see [14] for more details.
1.2 Contracted product
The product between two -mode tensors
[TABLE]
is defined as an matrix whose -th entry is
[TABLE]
where
[TABLE]
The product can be mentioned as a special case of the contracted product [9]. More precisely, is the contracted product of -mode tensors and along the first modes. For , it can be observed that
[TABLE]
and for .
We finish this part, by recalling the following two useful results from [3].
Lemma 2**.**
If , and , then we have
[TABLE]
Proposition 3**.**
Suppose that is an -mode tensor with the column tensors and . For an arbitrary -mode tensor with -mode column tensors , the following statement holds
[TABLE]
2 On the sensitivity analysis of Stein tensor equation
In this section, we mainly discuss on conditioning of Stein tensor equation (3). To this end, first, we consider a linear system of equations which is equivalent to (3) and then derive some lower and upper bounds for the condition number of the coefficient matrix of the linear system of equations.
It is well-known that (2) is equivalent to the linear system of equations,
[TABLE]
with , , and
[TABLE]
In addition, it can be observed that
[TABLE]
In view of the above relation, we deduce that (3) corresponds to the following linear system of equations,
[TABLE]
As a result, in view of the fact that“”, the sensitivity analyses of (2) and (3) are closely related to deriving bounds for condition numbers of and , respectively. Basically, for linear system of equations and , we know that
[TABLE]
Also, under the assumption , for the linear system of equations
[TABLE]
the following result exists in the literature
[TABLE]
one may refer to [10] for further details about perturbation analysis for linear system of equations.
In [21], some lower and upper bounds for has been derived under certain conditions. Therefore, in the sequel, we assume that is invertible and limit the discussions to deriving bounds for .
In [22], it is shown that
[TABLE]
Furthermore, for the case , the following upper bound for the condition number is also presented
[TABLE]
Now, we start our results by establishing the following proposition which presents an upper bound for the condition number of under certain condition.
Proposition 4**.**
Assume that , then
[TABLE]
Proof.
For simplicity, let . It is immediate to conclude that
[TABLE]
Evidently, we have It is well-known that
[TABLE]
From the above relation and the fact that
[TABLE]
we get,
[TABLE]
Now we can conclude the result immediately. ∎
For deriving alternative bounds for , we first prove the following two propositions.
Proposition 5**.**
Let and for , then
[TABLE]
Proof.
We prove the assertion by induction. For , using the fact that (for ), we can conclude the result from the following equality (see [23])
[TABLE]
Assume that (10) is true for . Now for , setting
[TABLE]
we get
[TABLE]
Using the assumption of induction for the term , we can conclude the result immediately. ∎
Proposition 6**.**
Assume that . Then,
[TABLE]
and
[TABLE]
where and with and for .
Proof.
It is not difficult to verify that
[TABLE]
Setting and , in view of Proposition 5, we obtain
[TABLE]
and
[TABLE]
which completes the proof immediately. ∎
Remark 7**.**
If the matrices s for are all positive definite, then
[TABLE]
Furthermore, if we have
[TABLE]
then the following upper bound can be derived immediately from Proposition 6,
[TABLE]
Here we recall a useful proposition which is a consequence of Weyl’s Theorem, see [11, Theorem 4.3.1].
Proposition 8**.**
Suppose that are two symmetric matrices. Then,
[TABLE]
Using Proposition 8 and some straightforward algebraic computations, we can prove the following result.
Proposition 9**.**
Let Assume that is an even number and , then
[TABLE]
where
[TABLE]
Here, for a given matrix , the notation stands for the spectral radius of .
Remark 10**.**
A simple conclusion of the above proposition is that if then the matrix is positive definite, i.e., is a symmetric positive definite. In this case, we can obtain an upper bound for . In fact, from (13), it can be seen that
[TABLE]
Therefore, we have
[TABLE]
Now, in view of inequality (9) together with (14) gives an upper bound for the condition number of as follows:
[TABLE]
We end this part by the following remark which is an observation for the case that ’s for are all diagonalizable.
Remark 11**.**
Let be a diagonalizable matrix, i.e, there exists nonsingular matrix associated with such that for . Setting , we have . Hence, if then
[TABLE]
As a result, we get
[TABLE]
where
[TABLE]
In this case, we have the following inequality
[TABLE]
Notice that analogous to the proof of Proposition 4, in the case that , we have
[TABLE]
In addition, with the similar strategy used in [22], if then
[TABLE]
Finally, comment that if the matrices are all positive definite matrices () then
[TABLE]
3 Tensor form of GKB and Gauss-type quadrature
In this section, we briefly describe the implantation of GKB process in tensor framework. For simplicity, in the sequel, we use two linear operators such that
[TABLE]
The adjoint of and are respectively given by
[TABLE]
for . Using the linear operators (15) and (16), the tensor equations (2) and (3) are respectively written by
[TABLE]
We comment that all of the results in this section can be applied for any other linear operator from to
Consider the linear system of equation where . We recall that the well-known GKB process, applied to the matrix , produces the decomposition where and are orthogonal matrices and is a bidiagonal matrix. It is natural to use the process for an arbitrary linear operator over The corresponding approach is called GKB based on tensor format (GKB*-*BTF) which is summarized in Algorithm 1.
In Algorithm 1, suppose that . Moreover, assume that there is no break-down in the algorithm and let be an lower bidiagonal matrix whose nonzero entries are those computed in Lines 1 and 1 of Algorithm 1. In the following, the matrix stands for the matrix extracted from as follows:
[TABLE]
Theorem 12**.**
Let , , and be the -mode tensors with frontal slices , , and for computed by Algorithm 1. Then the following statements hold
[TABLE]
in which is an mode tensor with “” column tensors and is an matrix of the form where is the th column of the identity matrix of order .
Proof.
[TABLE]
Note that the th frontal slice of (17) is given by
[TABLE]
In view of (19) and (20), we can conclude the validity of (17). To derive (18), one may first notice that Lines 1, 1 and 1 gives
[TABLE]
where is assumed to be zero. Now considering the frontal slice of the right-hand side of (18), we can deduce the second assertion. ∎
Remark 13**.**
It is obvious that one may state the above theorem for any linear operator over instead of . In what follows, the results are stated for and it should be commented that all of results remain true, if we replace by or any other linear operators over .
Let the linear system associated with (3) be extremely ill-conditioned. In the case that the right-hand side of (3) contains some noise, it is inefficient to approximate the solution of (3) without any regularization technique. To overcome this, we may use Tikhonov regularization which consists of solving the following minimization problem,
[TABLE]
(over ) instead of solving in which is called the regularization parameter.
Let be an approximate solution where is defined as before. From (17), by Lemma 2 and Proposition 3, we have
[TABLE]
which shows that (21) is equivalent to the following low dimensional minimization problem,
[TABLE]
As a result, the solution of (27) is given by
[TABLE]
Consequently, we have
[TABLE]
Therefore, if we define the function by
[TABLE]
we can conclude the following result.
Proposition 14**.**
Assume that and are positive constants such that . Let be defined by (28). Then any solution of satisfying
[TABLE]
determines a solution of (27) such that
[TABLE]
and satisfies
[TABLE]
Proof.
Eq. (29) follows from (22) immediately. ∎
Proposition 15**.**
Let be defined by (28). Then the function is strictly decreasing and convex for . Moreover,
[TABLE]
In particular, Newton’s method applied to the solution of the equation with initial approximate solution to the left of solution converges monotonically and quadratically.
Proof.
See [7, Proposition 3.6]. ∎
For simplicity, we set Consider the integral
[TABLE]
for suitable function and assume that and are respectively the -point Gauss quadrature and -point Gauss-Radau rule. In [5], it has been discussed that using spectral factorization of , the function can be expressed by
[TABLE]
with .
Analogous to [5], we can deduce that
[TABLE]
It is known from [4] that
[TABLE]
and
[TABLE]
In fact the above two relations show that the and provide lower and upper bounds for (or with ). The bounds are helpful for determining by the discrepancy principle in an inexpensive way. To this end, at step , we find by solving the following nonlinear equation,
[TABLE]
We comment that in view of Proposition 15, one may use Newton’s method efficiently to solve (30). If for the solution , we have
[TABLE]
Then Proposition 14 illustrates that satisfies
[TABLE]
If (31) does not holds, then we need to apply one more step of Algorithm 1 replacing with . As pointed out in [5], the bound (31) can be satisfied for small values of .
Assume that the bound (31) hold then we need to find the vector by solving (27) in which we set where satisfies (30) and (31). Finally, we can determine the approximate solution by
[TABLE]
Based on above discussions, we can construct two approaches based on GKB process to solve (21). These strategies are summarized in Algorithms 2 and 3. In the next section, we numerically examine the feasibility of these algorithms. It turns out that each step of Algorithm 2 requires less CPU-time than Algorithm 3 to be performed.
4 Numerical experiments
In this section, we report some numerical experiments to compare performances of the proposed methods. We limit ourselves to the case in (2) and (3). In all the test problems, the right-hand side tensors are assumed to be contaminated by an error tensor which has normally distributed random entries with zero mean being scaled to have a specific level of noise (). All computations were carried out using Tensor Toolbox [2] in Matlab R2018b with an Intel Core i7-4770K CPU @ 3.50GHz processor and 24GB RAM.
The relative error that we computed is given by
[TABLE]
where denotes the desired solution of the error-free problem and is the -th computed approximation by the proposed algorithms.
In Tables 1, 2, 4 and 6, the iterations were stopped when
[TABLE]
where is user-chosen constant and is the norm of error, i.e., . We comment that the norm in left-hand side of the above relation is computed inexpensively in view of (22).
For comparison with existing approaches in the literature, we use global Hessenberg process in conjunction with Tikhonov regularization based on tensor format (HT*-BTF) and flexible HT-BTF (FHT-BTF) proposed in [21] for which we determine the regularization parameter by discrepancy principle described in [24]. When the coefficient matrices are full, as anticipated, FHT-BTF outperforms other examined algorithms. However, for large and sparse coefficient matrices, FHT-BTF needs more CPU time than Algorithms 2 and 3. Our observations illustrate that FHT-*BTF take a long time with respect to the stopping criterion (32) for large problems. Therefore, for the results reported in Tables 3, 5 and 7, we used an alternative stopping criterion given by,
[TABLE]
where the maximum number of iterations was allowed. In FHT*-BTF method, we used two steps of stabilized biconjugate gradients based on tensor format (BiCGSTAB-*BTF) [8] as the inner iteration; see [21] for further details.
We reported the required number of iterations and consumed CPU-time (in seconds) by algorithms to compute suitable approximate solutions satisfying the stopping criteria. For more clarification, we divide this section into two main parts. In Subsections 4.1 and 4.2, we provide some numerical examples to solve ill-posed problems in the forms (2) and (3), respectively.
To test the performance of algorithms for image restoration, the exact solutions are tensors of sizes 111The corresponding color image is available at https://www.hlevkin.com/TestImages/Boats.ppm and which the second one associated with a hyperspectral image of natural scenes being also used in [21, Example 5.3]. Blurring matrices have the following forms in Subsections 4.1 and 4.2, respectively,
[TABLE]
and
[TABLE]
where s are either Gaussian Toeplitz matrix given by,
[TABLE]
or the uniform Toeplitz matrix defined by
[TABLE]
In literature, (34) and (35) have been used as blurring matrices for testing applications of iterative schemes for image deblurring; see [4, 5, 6, 12] for instance.
4.1 Experimental results for ill-posed Sylvester tensor equations
As a first test problem, we consider (2) in which the coefficient matrices are full and extremely ill-conditioned. This kind of equations may arise from discretization of a fully three-dimensional microscale dual phase lag problem by a mixed-collocation finite difference method; see [17, 18, 19] for further details.
Example 16**.**
Consider (2) with a perturbed right-hand side such that for are defined by
[TABLE]
where with . The same problem was solved by global schemes choosing odd values of for which the coefficient matrices are very well-conditioned; see [3]. Similar to [21, Example 5.4], the value of is chosen to be even which results extremely ill-conditioned coefficient matrices. The error free right-hand side of (2) is constructed so that is its exact solution. The obtained numerical results are disclosed in Table 1.
As can be seen in Table 1, FHT*-*BTF works better than the other approaches, this could be expected as the coefficient matrices are full. Now we present experimental results related to image restoration. In fact, error free right-hand sides in (2) is constructed such that the exact solution is a hyperspectral image. Here the matrices s () are sparse and it is observed that Algorithm 2 surpasses other examined iterative schemes.
Example 17**.**
We consider the case that a tensor of order is the exact solution of (2) which corresponds to a hyperspectral image of natural scenes222http://personalpages.manchester.ac.uk/staff/d.h.foster.The coefficient matrices and are given by (35) with suitable dimensions such that for , and for which result , and .
The obtained numerical results are disclosed in Table 2 for which the algorithms was terminated once (32) satisfied. As pointed out earlier, (F)HT-BTF method can not be efficiently used with respect to stopping criterion (32). Therefore, we rerun all of the algorithms with respect to (33) and report the results in Table 3. As seen, Algorithms 2 and 3 work better than (F)HT-BTF. We further comment that Algorithm 2 consumes less CPU-time than Algorithm 3.
4.2 Experimental results for ill-posed Stein tensor equations
In this subsection, we apply the proposed approaches for solving two ill-posed problems in the form (3). Here, error free right-hand sides are constructed such that exact solutions of (3) are color images. The iterations in the algorithms were stopped in two different ways, i.e., (32) and (33) are used separately.
Example 18**.**
This example is concerned with the restoration of a color image. The “original” exact image333The image is available at https://www.hlevkin.com/TestImages/Boats.ppm is stored by a tensor. We consider (3) in which is given by (34), and are given by (35) with suitable dimensions. Here we set for and for and . It can be seen that and and .
Example 19**.**
We consider the case that a tensor of order is the exact solution of (3). The coefficient matrices and are defined by (35) with suitable dimensions such that for and for and for . Here, we have , and .
The obtained numerical results for Examples 18 and 19 are disclosed in Tables 4, 5, 6 and 7. Similar to what we observed for second example of previous subsection, Algorithm 2 is superior to other examined approaches.
5 Conclusions
In this paper, we first present some results for conditioning of the Stein tensor equation. Then, we proposed the global Golub–Kahan bidiagonalization process with applications for solving ill-posed linear tensor equations such as Sylvester and Stein tensor equations where the iterative schemes can be also implemented for an arbitrary linear operator over . We gave some new theoretical results and present some numerical examples with applications to color image restoration to show the applicability and the effectiveness of the proposed schemes for computing solutions of high quality.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Ballani J and Grasedyck L. A projection method to solve linear systems in tensor format. Numerical Linear Algebra with Applications. 2013; 20 (1): 27–43.
- 2[2] Bader BW and Kolda TG. MATLAB Tensor Toolbox Version 2.5. http://www.sandia.gov/~tgkolda/Tensor Toolbox .
- 3[3] Beik FPA, Movahed FS and Ahmadi-Asl S. On the Krylov subspace methods based on tensor format for positive definite Sylvester tensor equations. Numerical Linear Algebra with Applications. 2016; 23 (3): 444–466.
- 4[4] Bentbib AH, Guide M. El, Jbilou K and Reichel L. A global Lanczos method for image restoration, Journal of Computational and Applied Mathematics. 2016; 300 233–244.
- 5[5] Bentbib AH, Guide M. El, Jbilou K and Reichel L. Global Golub–Kahan bidiagonalization applied to large discrete ill-posed problems. Journal of Computational and Applied Mathematics. 2017; 322 46–56.
- 6[6] Bouhamidi A, Jbilou K, Reichel L, and Sadok H. A generalized global Arnoldi method for ill-posed matrix equations. Journal of Computational and Applied Mathematics. 2012; 236 2078–2089.
- 7[7] Buccini A. Tikhonov–type iterative regularization methods for ill-posed inverse problems:theoretical aspects and applications. Ph D thesis. University of Insubria. http://insubriaspace.cineca.it/bitstream/10277/703/1/Phd_Thesis_Buccinialessandro_completa.pdf
- 8[8] Chen Z and Lu LZ. A projection method and Kronecker product preconditioner for solving Sylvester tensor equations. Science China Mathematics. 2012; 55 (6): 1281–1292.
