Tensor absolute value equations
Shouqiang Du, Liping Zhang, Chiyu Chen, Liqun Qi

TL;DR
This paper introduces tensor absolute value equations, explores their properties, establishes solution existence conditions, and proposes an algorithm with preliminary numerical validation.
Contribution
It generalizes absolute value equations to tensors, links them to tensor complementarity problems, and develops a Levenberg-Marquardt-type algorithm for their solution.
Findings
Established equivalence to tensor complementarity problems
Provided sufficient conditions for solution existence
Demonstrated algorithm efficiency through preliminary results
Abstract
This paper is concerned with solving some structured multi-linear systems, which are called tensor absolute value equations. This kind of absolute value equations is closely related to tensor complementarity problems and is a generalization of the well-known absolute value equations in the matrix case. We prove that tensor absolute value equations are equivalent to some special structured tensor complementary problems. Some sufficient conditions are given to guarantee the existence of solutions for tensor absolute value equations. We also propose a Levenberg-Marquardt-type algorithm for solving some given tensor absolute value equations and preliminary numerical results are reported to indicate the efficiency of the proposed algorithm.
| 0 | 0.8143, 0.2435, 0.9293, 0.3500, 0.1966, 0.2511, 0.6160, 0.4733 | 562.2589 | 1500602.8826 |
| 1 | 0.7407, 0.2435, 0.6880, 0.3545, 0.1072, 0.3670, 0.4555, 0.3271 | 148.1702 | 203193.6101 |
| 2 | 0.4542, 0.4477, 0.4349, 0.4348 -0.2944, 0.7819, 0.4209, 0.3007 | 25.5263 | 23486.6536 |
| 3 | 1.0757, 0.3147, 0.2655, 0.4343 -0.2368, 0.4908, 0.1690, 0.3781 | 20.2932 | 48494.4079 |
| 4 | 1.2158, 0.3379, 0.4481, 0.5825 -0.2075, 0.1750, 0.3290, 0.0197 | 18.6526 | 49990.1630 |
| 5 | 0.8865, 0.3812, 0.3176, 0.5179 -0.3075, 0.3684, 0.4486, 0.2840 | 10.0354 | 20912.3905 |
| 6 | 0.8742, 0.2928, 0.3744, 0.5308 -0.3895, 0.5269, 0.2838, 0.3997 | 2.9292 | 3867.9206 |
| 7 | 0.8798, 0.2888, 0.3406, 0.6301 -0.3722, 0.4799, 0.3198, 0.3293 | 1.3213 | 1522.0099 |
| 8 | 0.8664, 0.2829, 0.3003, 0.6746 -0.3890, 0.4936, 0.3325, 0.3355 | 0.7455 | 1084.3075 |
| 9 | 0.8684, 0.2850, 0.2737, 0.6914 -0.3985, 0.4960, 0.3394, 0.3411 | 0.1766 | 482.0095 |
| 10 | 0.8690, 0.2852, 0.2752, 0.6895 -0.3976, 0.4957, 0.3383, 0.3411 | 0.0144 | 21.2907 |
| 11 | 0.8692, 0.2853, 0.2753, 0.6894 -0.3975, 0.4956, 0.3383, 0.3410 | 0.0029 | 2.4370 |
| 12 | 0.8692, 0.2853, 0.2754, 0.6893 -0.3975, 0.4956, 0.3383, 0.3410 | 0.0002 | 0.1396 |
| 13 | 0.8692, 0.2853, 0.2754, 0.6892 -0.3975, 0.4956, 0.3383, 0.3410 | 0.0001 | 0.0008 |
| 14 | 0.8692, 0.2853, 0.2754, 0.6892 -0.3975, 0.4956, 0.3383, 0.3410 | 0.0000 | 0.0000 |
| diag of | |
|---|---|
| 1 | -1,-1,-1,-1,-1,-1,-1,-1,-1,-1 |
| 2 | -1, 1,-1, 1,-1, 1,-1,-1,-1, 1 |
| 3 | 1, 1,-1,-1, 1, 1,-1,-1,-1,-1 |
| 4 | -1, 1,-1, 1,-1, 1,-1,-1, 1, 1 |
| 5 | 1,-1, 1, 1, 1, 1,-1, 1,-1, 1 |
| Iter. | Time | |||
|---|---|---|---|---|
| 1 | -0.3485,-0.0971,-0.7753,-1.2447,-0.7739, | 0.00000012 | 15 | 0.2135 |
| -0.5628,-0.4868, 0.4480, 0.2925,-0.9003 | ||||
| 2 | 0.4184,-0.0423,-0.2989, 1.0357,-1.0340, | 0.00000022 | 15 | 0.2060 |
| 0.3109,-0.3686,-0.2755,-0.6852, 0.9528 | ||||
| 3 | 0.7454, 0.5055,-0.6641, 0.3093,-0.1769, | 0.00000003 | 18 | 0.2673 |
| 1.1273,-0.4514,-1.1430,-0.0619,-0.2421 | ||||
| 4 | -0.9570, 0.5494,-2.1429,-0.1959,-1.8247, | 0.00000000 | 11 | 0.1355 |
| -0.3996, 0.8803,-0.3457, 0.0458, 0.1694 | ||||
| 5 | 0.3385,-1.1498, 1.0413, 0.3533, 0.7606, | 0.00000006 | 10 | 0.1265 |
| -0.1214,-0.3290,-0.0458,-0.2049, 0.4027 |
| Iter. | Time | |||
|---|---|---|---|---|
| 1 | -0.1040,-0.7455,-0.7363,-0.5619,-0.1842, | 0.00000072 | 20 | 0.2523 |
| -0.5972,-0.2999,-0.1341,-0.2126,-0.8949 | ||||
| 2 | -0.1040, 0.7455,-0.7363, 0.5619,-0.1842, | 0.00000090 | 17 | 0.2050 |
| 0.5972,-0.2999,-0.1341,-0.2126, 0.8949 | ||||
| 3 | 0.1040, 0.7455,-0.7363,-0.5619, 0.1842, | 0.00000091 | 24 | 0.2838 |
| 0.5972,-0.2999,-0.1341,-0.2126,-0.8949 | ||||
| 4 | -0.1040, 0.7455,-0.7363, 0.5619,-0.1842, | 0.00000064 | 16 | 0.1896 |
| 0.5972,-0.2999,-0.1341, 0.2126, 0.8949 | ||||
| 5 | 0.1040,-0.7455, 0.7363, 0.5619, 0.1842, | 0.00000075 | 14 | 0.1638 |
| 0.5972,-0.2999, 0.1341,-0.2126, 0.8949 |
| Iter. | Time | Attempts | |||
|---|---|---|---|---|---|
| 31.00 | 0.6783 | 0.00000098 | 20/100 | ||
| 19.40 | 0.3109 | 0.00000099 | 20/157 | ||
| 19.55 | 0.3456 | 0.00000099 | 20/205 | ||
| 13.65 | 0.1907 | 0.00000082 | 20/244 | ||
| 14.05 | 0.2168 | 0.00000084 | 20/290 | ||
| 68.25 | 1.7492 | 0.00000051 | 20/145 | ||
| 21.75 | 0.3864 | 0.00000099 | 20/216 | ||
| 15.70 | 0.2535 | 0.00000089 | 20/114 | ||
| 14.60 | 0.2121 | 0.00000071 | 20/129 | ||
| 13.60 | 0.1980 | 0.00000099 | 20/114 |
| Iter. | Time | Attempts | |||
|---|---|---|---|---|---|
| 12.67 | 0.1644 | 0.00000070 | 3/20 | ||
| 13.67 | 0.1708 | 0.00000099 | 3/20 | ||
| 11.93 | 0.1516 | 0.00000075 | 14/20 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Matrix Theory and Algorithms · Power System Optimization and Stability
Tensor absolute value equations
Shouqiang Du College of Mathematics and statistic, Qingdao University, Qingdao 266071, P. R. China ([email protected]).
Liping Zhang Corresponding author. Department of Mathematical Sciences, Tsinghua University, Beijing 100084, P. R. China ([email protected]).
Chiyu Chen Department of Mathematical Sciences, Tsinghua University, Beijing 100084, P. R. China ([email protected]).
Liqun Qi Department of Applied Mathematics, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, P. R. China ([email protected]).
Abstract
This paper is concerned with solving some structured multi-linear systems, which are called tensor absolute value equations. This kind of absolute value equations is closely related to tensor complementarity problems and is a generalization of the well-known absolute value equations in the matrix case. We prove that tensor absolute value equations are equivalent to some special structured tensor complementary problems. Some sufficient conditions are given to guarantee the existence of solutions for tensor absolute value equations. We also propose a Levenberg-Marquardt-type algorithm for solving some given tensor absolute value equations and preliminary numerical results are reported to indicate the efficiency of the proposed algorithm.
keywords:
M-tensors, absolute value equations, Levenberg-Marquardt method, tensor complementarity problem
AMS:
15A48, 15A69, 65K05, 90C30, 90C20
1 Introduction
The systems of multi-linear equations can be expressed by tensor-vector products, just as we rewrite linear systems by matrix-vector products. Let be an th-order tensor in and be a vector in . Then a multi-linear equation can be expressed as
[TABLE]
where is a vector in [21] with
[TABLE]
Solving multi-linear systems is always an important problem in engineering and scientific computing [8, 15]. In this paper, we consider the systems of multi-linear absolute value equations, which can be expressed as
[TABLE]
where is a vector in with
[TABLE]
It is easy to see that the system of multi-linear absolute value equations (2) is a generalization of the well-known absolute value equations
[TABLE]
with a matrix . The absolute value equations (AVE) has wide applications in applied science and technology such as optimization physical and economic equilibrium problems [17, 18, 19]. As was shown in [19], the general NP-hard linear complementarity problem [7] which subsumes many mathematical programming problems can be formulated as an AVE. This implies that the AVE is NP-hard in its general form. Analogous to AVE, we call (2) tensor absolute value equations (TAVE). Obviously, the TAVE is also NP-hard. Thus, investigating the existence of solutions for the TAVE is a significant problem.
Recently, Song and Qi [25] introduced a class of complementarity problems, called tensor complementarity problems, where the involved function is defined by some homogenous polynomial of degree with . It is known that the tensor complementarity problem is a generalization of the linear complementarity problem [7]; and a subclass of nonlinear complementarity problems [10]. The tensor complementarity problem was studied recently by many scholars [4, 26]. In [19], it was shown that the AVE is equivalent to a generalized linear complementarity problem. Can we show that the TAVE is equivalent to a generalized tensor complementarity problem? Although some computational methods have been presented for the AVE, it is very difficult to extend these algorithms to solve the TAVE because the TAVE (2) is a nonlinear equation. The Levenberg-Marquardt method is one of the important algorithms for solving nonlinear equations [11]. Can we propose an efficient algorithm such as the Levenberg-Marquardt method for solving the TAVE (2)? To our best knowledge, there is no general answer to these questions. Therefore, we shall focus on some special tensor absolute value equations.
Let be an th-order -dimensional unit tensor, whose entries are if and only if and otherwise zero. A tensor is called a nonnegative tensor if all its entries are nonnegative, denoted . A tensor is called a -tensor, if all its diagonal entries are nonnegative and off-diagonal entries are nonpositive. -tensor is a special class of Z-tensor, which was first introduced and studied in [9, 29]. To define the -tensors, we need to introduce the tensor eigenvalues first. Let be an th-order -dimensional tensor. If a scalar and a nonzero vector satisfy
[TABLE]
where , then we call an eigenvalue of and a corresponding eigenvector. Qi [21] and Lim [16] first defined the eigenvalues of tensors independently. The spectral radius of a tensor is defined by
[TABLE]
A tensor is called an -tensor, if it can be written as with and ; furthermore, it is called a strong -tensor if . One can refer to a survey [3] for the spectral theory of nonnegative tensors. In this paper, we first investigate the existence of solutions for the TAVE (2). We show that the TAVE (2) with positive right-hand side always has a unique solution when is strong -tensor. Another sufficient condition for the existence of solution is also given. Can we compute the solution? We propose an inexact Levenberg-Marquardt method for solving the TAVE (2).
The rest of this paper is organized as follows. In Section 2, we introduce the tensor absolute value equations which is a generalization of absolute value equations with matrix case. In Section 3, some sufficient conditions for the existence of solution to the TAVE are given. In Section 4, we first reformulate the TAVE as a special tensor complementarity problem and then we propose an an inexact Levenberg-Marquardt-type algorithm for solving the TAVE. Some numerical results are reported in Section 5. Finally, some conclusions are given.
Throughout this paper, we assume that . We use small letters x, y,…, for scalars, small bold letters for vectors, capital letters for matrixes, calligraphic letters for tensors, calligraphic letters for diagonal tensor whose diagonal elements are or . All the tensors discussed in this paper are real. denotes the set of all th order -dimensional tensors. Let , then is called a symmetric tensor if its entries are invariant under any permutation of their indices. denotes the set of all symmetric tensors. For such a matrix will denote the transpose of . The identity matrix of arbitrary dimension will be denoted by .
2 Tensor absolute value equations
In this section, we present some basic definitions and properties in absolute value equations, nonlinear complementarity problems, and nonsmooth analysis, which will be used in the sequel.
We recall the absolute value equations (AVE) of the type
[TABLE]
where , and denotes the vector with absolute values of each component of . The AVE (3) has been widely investigated in many literatures such as [17, 18, 19]. In [19], some results about the AVE are given, which we list as follows:
- (i)
The AVE (3) is equivalent to the bilinear program
[TABLE]
and the generalized linear complementarity problem
[TABLE]
- (ii)
Let and . Then
[TABLE]
implies that
[TABLE]
Clearly, the tensor absolute value equation (2) is a generalization of the AVE (3) from the matrix case to the tensor case. Take an equation with the coefficient tensor as an example. The tensor absolute equation
[TABLE]
is a condense form of
[TABLE]
We want to find and that satisfy the above two equations.
The following example shows a specific tensor absolute value equation.
Example 2.1**.**
Let a tensor be defined by , , and zero otherwise. Let . Then the corresponding tensor absolute value equation is
[TABLE]
By simplicity computation, we see that the TAVE (4) in Example 2.1 has no solution. In the next section we will discuss the existence of solution for the TAVE (2). We can extend the result (ii) to the TAVE and obtain a similar condition for the existence of solution to (2).
Below, we introduce the classical nonlinear complementarity problem. The tensor complementarity problem recently introduced in [25] is a special kind of nonlinear complementarity problem. It will be shown in Section 4 that the TAVE (2) can be reformulated as a special kind of generalized tensor complementarity problem.
Definition 1**.**
Given a given mapping , the nonlinear complementarity problem, denoted by NCP(), is to find a vector satisfying
[TABLE]
Many solution methods developed for NCP() or related problems are based on reformulating them as a system of equations using so-called NCP-functions [10]. Here a function is called an NCP-function if
[TABLE]
Given an NCP-function , let us define
[TABLE]
It is obvious that is a solution of NCP() if and only if it solves the system of nonsmooth equations
[TABLE]
For the solution of , we recall some definitions in nonsmooth analysis. Suppose that is a locally Lipschitz function, where is nonempty and open. By Rademacher’s Theorem, is differentiable almost everywhere. Let denote the set of points at which is differentiable. For any , we write for the usual Jacobian matrix of partial derivatives. The -subdifferential of at is the set defined by
[TABLE]
The Clarke’s generalized Jacobian of at is the set defined by
[TABLE]
where “co” denotes the convex hull. Then, is a nonempty convex compact subset of [6]. The function is semismooth [13, 22] at if
[TABLE]
exists for all . If is semismooth at all , we call semismooth on . The function is called strongly semismooth [23] if it is semismooth and for any and ,
[TABLE]
where denotes the directional derivative [2] of at in direction , i.e.,
[TABLE]
Note that if the function is semismooth at , the directional derivative exists for all and
[TABLE]
We now present some NCP-functions which are widely used in nonlinear complementarity problems. For more details about NCP-functions and their smoothing approximations, one can refer to [24, 30] and references therein.
Here we give some well-known NCP-functions as follows:
- •
The min function:
[TABLE]
- •
The Fischer-Burmeister function:
[TABLE]
It has been shown that all these NCP-functions are globally Lipschitz continuous, directionally differentiable, and strongly semismooth [12, 27]. For example, the generalized gradient of is equal to the set of all such that
[TABLE]
where is any vector satisfying .
In Section 4, we will use the Fischer-Burmeister function to reformulate the TAVE (2) as a system of equations and then we will propose an algorithm to solve the system of equations.
We now introduce the tensor complementarity problem which first defined by Song and Qi [25].
Definition 2**.**
Given any given tensor and vector , the tensor complementarity problem, denoted by TCP(), is to find a vector satisfying
[TABLE]
Note that when , the tensor reduces to a matrix, denoted by , and the TCP() becomes: find a vector such that
[TABLE]
which is just the linear complementarity problem [7]. Very recently, a class of -person noncooperative games are in [14], where the utility function of every player is given by a homogeneous polynomial defined by the payoff tensor of that player, which is a natural extension of the bimatrix game where the utility function of every player is given by a quadratic form defined by the payoff matrix of that player. Such a problem is called the multilinear game. The multilinear game is reformulated as a tensor complementarity problem. Some semismooth Newton-type methods are recently proposed for solving the tensor complementarity problems (see, e.g., [5]). In Section 4, we will extend the result (i) to the TAVE (2) and show that the TAVE (2) is equivalent to a bi-multilinear program and a generalized tensor complementarity problem.
3 Existence of solutions
In this section, we give some sufficient conditions for the existence of solutions to the TAVE (2). Specially, we extend the result (ii) about the AVE (3) to the TAVE (2).
We need the following lemmas which are recently established in [8, Theorems 3.2, 3.3, 3.4].
Lemma 3**.**
Let . If is a strong -tensor, then for every positive vector the multilinear system of equations has a unique positive solution.
Lemma 4**.**
Let be a -tensor. Then it is a strong -tensor if and only if the multilinear system of equations has a unique positive solution for every positive vector .
Lemma 5**.**
Let be an -tensor and . If there exists such that , then the multilinear system of equations has a nonnegative solution.
By the above lemmas, we have the following theorems.
Theorem 6**.**
Let . If can be written as with and , then for every positive vector the TAVE (2) has a unique positive solution.
Proof.
Let . Then yields
[TABLE]
which implies that is a strong -tensor. By Lemma 3, the multilinear system of equations
[TABLE]
has a unique positive solution for every positive vector . Hence, for every positive vector , the TAVE (2) has a unique positive solution. ∎
Combining [9, Theorem 3] and Lemma 4, we can rewrite the above theorem into an equivalent condition for being a strong -tensor.
Theorem 7**.**
Let be a -tensor. Then can be written as the form of
[TABLE]
if and only if for every positive vector the TAVE (2) has a unique positive solution.
Proof.
On one hand, by Theorem 6, we have the existence and uniqueness of the positive solution of the TAVE (2) for every positive vector . On the other hand, if for every positive vector the TAVE (2) has a unique positive solution, then there exists a vector such that
[TABLE]
Since is a -tensor, is also a -tensor. Thus, by [9, Theorem 3], is a strong -tensor and then the form of (5) holds. ∎
Remark. The sufficient condition in Theorem 7 can be weakened as follows: if the TAVE (2) has a nonnegative solution for every positive vector , then we also have the form (5). In fact, let be a solution of the TAVE (2). Then there exists such that . By [9, Theorem 3], we can obtain the conclusion.
Theorem 8**.**
Let and be in the form of with and . If there exists a vector such that , then the TAVE (2) has a nonnegative solution.
Proof.
It follows from
[TABLE]
that is an -tensor. By Lemma 5, there is such that
[TABLE]
Thus, we have
[TABLE]
This completes the proof. ∎
We next extend the result (i) about the AVE (3) to the TAVE (2). Here, we assume that is even. We first introduce the product of a tensor and a diagonal tensor.
Definition 9**.**
Let and be a diagonal tensor with diagonal elements . We denote their product, whose elements are defined as
[TABLE]
Obviously, Definition 9 is well-defined due to the assumption that is even.
By simplicity computation, we have the following proposition.
Proposition 10**.**
Let and . We have
[TABLE]
Proof.
Let us define a vector as
[TABLE]
Then by some definitions introduced in Section 1, the th-component of the vector can be written as
[TABLE]
Let . Then by Definition 9, the th-component of the vector can be written as
[TABLE]
Combining (3) and (7), we have
[TABLE]
∎
It is easy to see that
[TABLE]
holds for any vector , because the th-component of the vectors and are in the form of
[TABLE]
Here the sign of is corresponded to the diagonal element or of .
The following theorem is a generalization of the result (ii) from AVE to TAVE.
Theorem 11**.**
Let , and . If the multilinear system of equations
[TABLE]
has a solution, then the tensor absolute value equation
[TABLE]
also has a solution.
Proof.
Let be the solution of the multilinear system of equations (9). Then we have
[TABLE]
Take
[TABLE]
Then (10) can be rewritten as
[TABLE]
which, together with Proposition 10 and (8), implies that is a solution of the tensor absolute value equation
[TABLE]
Thus, we complete the proof. ∎
We give an example to verify the above theorem.
Example 3.1**.**
Let with and zero otherwise, and . Consider the multilinear system of equations
[TABLE]
It is rewritten as
[TABLE]
This implies that is a solution of
[TABLE]
Let be a diagonal tensor with and . Then we have with , , and zero otherwise, i.e., . By Theorem 11, is just a solution of the tensor absolute value equation
[TABLE]
We now verify the conclusion. We rewrite (11) as
[TABLE]
By simplicity computation, the above equation has a solution .
4 Reformulation and algorithm
In this section, we extend the result (i) from AVE to TAVE. We show that the TAVE (2) is equivalent to a bi-multiliear program and a generalized tensor complementarity problem. We first introduce the following definition.
Definition 12**.**
Let and . Define
[TABLE]
The generalized tensor complementarity problem is to find satisfying
[TABLE]
We call the following nonlinear program as a bi-multiliear program:
[TABLE]
Theorem 13**.**
Let and . Then the TAVE (2) is equivalent to the generalized tensor complementarity problem (12) and the bi-multilinear program (13).
Proof.
Clearly, the generalized tensor complementarity problem (12) is equivalent to the bi-multilinear program (13). That is, .
We only need to prove . In fact, . Hence, we have
[TABLE]
This implies that is a feasible solution of (13). Since
[TABLE]
we have
[TABLE]
This completes the proof. ∎
By the above theorem, in order to solve the TAVE (2), we propose an algorithm for solving the generalized tensor complementarity problem (12). Using the Fischer-Burmeister function , we can reformulate (12) as the following equation:
[TABLE]
Hence, is a solution of (2) if and only if . Moreover, is strongly semismooth since the composition of strongly semismooth function is again strongly semismooth [20], and according to the Jacobian chain rule, we have the following result.
Theorem 14**.**
Let . Then the function is strongly semismooth. Moreover, for any , we have
[TABLE]
where and are diagonal matrices in with entries , where denotes the set with being replaced by , and and are given by
[TABLE]
Here, for a tensor and a vector , let be a matrix in whose -th component is defined by
[TABLE]
In order to propose an algorithm for the solution of , we define a merit function as
[TABLE]
We present some properties of the merit function, which can be obtained by [6, Theorem 2.2.4 and Theorem 2.6.6].
Theorem 15**.**
Let . Then the merit function is continuously differentiable with
[TABLE]
for any .
We are now in the position to propose a Levenberg-Marquardt-type algorithm to solve the semismooth system of equations , which is an extension of the nonsmooth inexact Levenberg-Marquardt-type method in [11]. To ensure global convergence, a line search is performed to minimize the smooth merit function . Because the problem with data in a structure of tensor is large scale, and the inexact version is more suited to the large-scale case [11], we have the following algorithm.
Algorithm 4.1**.**
(Inexact Levenberg-Marquardt-type method)**
- Step 0.
Given a starting vector and some scales , , , . Set .
- Step 1.
If , stop. Otherwise, compute .
- Step 2.
Find a solution satisfying
[TABLE]
where is the Levenberg-Marquardt parameter. If the condition
[TABLE]
is not satisfied, set
[TABLE]
- Step 3.
Find the smallest integer such that and
[TABLE]
- Step 4.
Set , , and go to Step 1.
In what follows, we analyze the global convergence of Algorithm 1. We shall assume that Algorithm 1 produce an infinite sequence . By [11, Theorem 15 and Theorem 16], we immediately obtain the following theorems.
Theorem 16**.**
Assume that the sequence is bounded and that the sequence satisfies
[TABLE]
where is a sequence of numbers with and as . Then each accumulation point of is a stationary point of .
Theorem 17**.**
Let the assumptions of Theorem 16 hold. If one of the accumulation points of , denoted , is an isolated solution of the TAVE (2), then
[TABLE]
In the implementation of Algorithm 4.1, the computational most intensive part is the approximation solution of system (14) with . We note that the system is always solvable. In fact, if , the matrix is symmetric positive definite and hence system (14) is surely solvable. If , the matrix reduces to , which is guaranteed to be only positive semidefinite. However, in this case, (14) reduces to the normal gradient equation , is therefore solvable. We now have to specify which element we select at the -th iteration. By Theorem 14, we have that an element of can be obtained in the following way. Let
[TABLE]
be the set of “degenerate indices” and define to be a vector whose components are if and [math] otherwise. Then, the matrix defined by
[TABLE]
where and are diagonal matrices whose -th diagonal elements are given, respectively, by
[TABLE]
and by
[TABLE]
belongs to . In the next section, we compute as the formulation.
5 Numerical results
In this section, we present the numerical performance of Algorithm 4.1 for the TAVE (2). All codes were written by using Matlab Version R2015b and Tensor Toolbox Version 2.6 [1]. The numerical experiments were done on a laptop with an Intel Core i7-4720HQ CPU (2.6GHz) and RAM of 7.89GB.
In the implementation of Algorithm 4.1, we set and the Levenberg-Marquardt parameter . We also set a maximum iteration steps for the algorithm, , .
The first numerical experiment focuses on the behaviour of algorithm’s iteration. We generate a random symmetric nonnegative tensor and a random vector . All entries of and are uniform random numbers in the interval . We calculate in order to make TAVE have at least one solution. Then we use Algorithm 4.1 to solve TAVE: , with a random initial point chosen randomly from which is shown as in table 1. The iteration of Algorithm 4.1 is shown in Table 1. From the table, tends to [math] as the number of iteration increases. And also tends to 0 except that it increases from to . This shows that does converge to 0 but not converge monotonically when the algorithm converges.
The second numerical experiment aims to verify Theorem 11. We first generate a random symmetric nonnegative tensor and a random vector . All entries of are uniform random numbers in the interval . Let . Since is a diagonal tensor whose diagonal elements are or , there are at most different . The first attempt is to generate all these tensors. For each tensor , set (see Definition 9) and . We check whether is equals to for all . The result shows that each is just one of the solution to the corresponding TAVE problem .
The second attempt of the second numerical experiment is to generate five of all tensors randomly and use Algortihm 4.1 to solve the corresponding TAVE. The diagonal elements of the five is shown in Table 2.
We first select the initial points for Algorithm 4.1 by using normal distribution, i.e., entries are from standardized normal distribution independently. Here we call these initial points type-I initial points. The results of corresponding TAVE with type-I initial points is summarized in Table 3. We can easily find out that none of the five is in the form of . Because Algorithm 4.1 is based on the thoughts of Newton method, thus its convergence relies heavily on the initial point. In order to detect solution which is mentioned in Theorem 11 by Algorithm 4.1, we should choose the initial points in another way. For each , we generate type-II initial points by adding a random number chosen from uniform distribution over to . The results of corresponding TAVE with type-II initial points is shown in Table 4. The solutions are exactly in the form of .
In Tables 3 and 4, denotes the experiment No. corresponding to Table 2. denotes the solution vectors returned by Algorithm 4.1. denotes the Euclid norm of . If the norm of is small enough, we can regard as an approximate solution of TAVE. Iter. denotes the number of iteration and Time denotes the time of iteration that finds corresponding by Algorithm 4.1. In the second experiment, we verify Theorem 11 from the instant correctly. Besides, from Table 3 and 4, we find that under the conditions of Theorem 11, the solution may not be the only solution of TAVE . There might be some other solutions, such as the solution in Table 3. To discuss the uniqueness of the positive solution, we conduct our third experiment.
Our third numerical experiment focuses on Theorem 7. Here we first generate a random symmetric nonnegative tensor whose entries are uniform random numbers in the interval . Let , where . Since , the choice of makes sure that . Then let , and satisfies the conditions of Theorem 7, i.e., is strong M-tensor. Tensors and are given in Tables 5 and 6, respectively.
We choose random positive vectors . For each , we find repeatable solutions of TAVE: with random vectors from as initial points repeatedly and summarize the results in Table 7.
In Table 7, denotes the solution of TAVE. denotes random generated mentioned above. Iter. denotes the average number of iteration that finds the corresponding solution successfully. Time denotes the average time of iteration that finds the corresponding solution by Algorithm
- is the maximum norm of all returned by Algorithm 1 whose is the corresponding solution. Attempts has the form N/T, N denotes the number of the corresponding found by Algorithm 4.1 and T denotes the number of initial points in all.
In this experiment, we use “while” loop in Matlab program to guarantee that we can get exact solutions (might be repeatable) for each . According to Table 7, for each , Algorithm 4.1 only returns unique positive solution in all repeatable solutions. This phenomenon fits Theorem 7 very well. Besides, in order to get valid solutions, the initial points we attempt is times more than the valid ones. This means that most of the random initial points fail to find a solution by Algorithm 4.1. The reason might be that the convergence of Newton type method depends on the initial point badly. Theorem 7 shows that under this circumstances, there’s only one unique positive solution of TAVE. If and only if the initial point is in the convergence region of some solution of TAVE, the algorithm will converge. Therefore, it’s harder to find valid solutions if . Table 8 shows the solutions found by Algorithm 4.1 when . The initial points attempted in all is much less.
Moreover, under the circumstances that and is strong M-tensor, whether the unique positive solution of TAVE is the unique solution of TAVE remains a question. In our experiment we haven’t found other solutions except for the unique positive ones.
6 Conclusion
We have introduced tensor absolute value equations. The simple definition is a natural generalization of the definition of absolute value equations in the matrix case. We have established some basic properties for tensor absolute value equations and we reformulate tensor absolute value equations as a generalized tensor complementarity problem. We have proposed some sufficient conditions for the existence of solution to the multilinear equations. We propose an inexact Levenberg-Marquardt-type method (Algorithm 4.1) to solve the tensor absolute value equations and some numerical results have shown that our algorithm is performing well.
There are some questions which are still in study. For example, we known that “The AVE (3) is uniquely solvable for any if the singular values of exceed ” [19]. Can we extend the conclusion to TVAE (2), i.e., the statement “The TAVE (2) is uniquely solvable for any if the singular values of tensor exceed ” is correct or not? This is still an open question.
Acknowledgments
Shouqiang Du’s work was supported by the National Natural Science Foundation of China (Grant No. 11671220, 11401331) and the Nature Science Foundation of Shandong Province (ZR2015AQ013, ZR2016AM29). Liping Zhang’s work was supported by the National Natural Science Foundation of China (Grant No. 11271221). Liqun Qi’s work was supported by the Hong Kong Research Grant Council (Grant No. PolyU 501212, 501913, 15302114 and 15300715).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] B.W. Bader, T.G. Kolda , et al. , MATLAB Tensor Toolbox Version 2.6 (2012). http://www.sandia.gov/ ∼ similar-to \sim tgkolda/Tensor Toolbox/
- 2[2] J.F. Bonnans, R. Cominetti, and A. Shapiro , Second order optimality conditions based on parabolic second order tangent sets , SIAM Journal on Optimization, 9 (1999), pp. 466–493.
- 3[3] K. Chang, L. Qi, and T. Zhang , A survey on the spectral theory of nonnegative tensors , Numerical Linear Algebra with Applications, 20 (2013), pp. 891–912.
- 4[4] M. Che, L. Qi, and Y. Wei , Positive definite tensors to nonlinear complementarity problems , Journal of Optimization Theory and Applications, 168 (2016), pp. 475–487.
- 5[5] M. Che and L. Qi , A semismooth Newton method for tensor eigenvalue complementarity problem , Computational Optimization and Applications, 65 (2016), pp. 109–126.
- 6[6] F.H. Clarke , Optimization and Nonsmooth Analysis , Wiley, New York, 1983.
- 7[7] R.W. Cottle, J.-S. Pang, and R.E. Stone , The Linear Complementarity Problem , Academic Press, Boston, 1992.
- 8[8] W. Ding and Y. Wei , Solving multi-linear systems with M 𝑀 M -Tensors , Journal of Scientific Computing, 68 (2016), pp. 683–715.
