Tensor absolute value equations

Shouqiang Du; Liping Zhang; Chiyu Chen; Liqun Qi

arXiv:1705.06415·math.NA·May 19, 2017

Tensor absolute value equations

Shouqiang Du, Liping Zhang, Chiyu Chen, Liqun Qi

PDF

Open Access

TL;DR

This paper introduces tensor absolute value equations, explores their properties, establishes solution existence conditions, and proposes an algorithm with preliminary numerical validation.

Contribution

It generalizes absolute value equations to tensors, links them to tensor complementarity problems, and develops a Levenberg-Marquardt-type algorithm for their solution.

Findings

01

Established equivalence to tensor complementarity problems

02

Provided sufficient conditions for solution existence

03

Demonstrated algorithm efficiency through preliminary results

Abstract

This paper is concerned with solving some structured multi-linear systems, which are called tensor absolute value equations. This kind of absolute value equations is closely related to tensor complementarity problems and is a generalization of the well-known absolute value equations in the matrix case. We prove that tensor absolute value equations are equivalent to some special structured tensor complementary problems. Some sufficient conditions are given to guarantee the existence of solutions for tensor absolute value equations. We also propose a Levenberg-Marquardt-type algorithm for solving some given tensor absolute value equations and preliminary numerical results are reported to indicate the efficiency of the proposed algorithm.

Tables8

Table 1. Table 1: Iterations of Algorithm 4.1 for a random tensor 𝒜 ∈ S 6 , 8 𝒜 subscript 𝑆 6 8 \mathcal{A}\in S_{6,8} and corresponding 𝐛 𝐛 {\bf b}

$𝐤$	$𝐱^{k}$	$‖ H (𝐱^{k}) ‖$	$‖ \nabla Ψ (𝐱^{k}) ‖$
0	$($ 0.8143, 0.2435, 0.9293, 0.3500, 0.1966, 0.2511, 0.6160, 0.4733 $)^{T}$	562.2589	1500602.8826
1	$($ 0.7407, 0.2435, 0.6880, 0.3545, 0.1072, 0.3670, 0.4555, 0.3271 $)^{T}$	148.1702	203193.6101
2	$($ 0.4542, 0.4477, 0.4349, 0.4348 -0.2944, 0.7819, 0.4209, 0.3007 $)^{T}$	25.5263	23486.6536
3	$($ 1.0757, 0.3147, 0.2655, 0.4343 -0.2368, 0.4908, 0.1690, 0.3781 $)^{T}$	20.2932	48494.4079
4	$($ 1.2158, 0.3379, 0.4481, 0.5825 -0.2075, 0.1750, 0.3290, 0.0197 $)^{T}$	18.6526	49990.1630
5	$($ 0.8865, 0.3812, 0.3176, 0.5179 -0.3075, 0.3684, 0.4486, 0.2840 $)^{T}$	10.0354	20912.3905
6	$($ 0.8742, 0.2928, 0.3744, 0.5308 -0.3895, 0.5269, 0.2838, 0.3997 $)^{T}$	2.9292	3867.9206
7	$($ 0.8798, 0.2888, 0.3406, 0.6301 -0.3722, 0.4799, 0.3198, 0.3293 $)^{T}$	1.3213	1522.0099
8	$($ 0.8664, 0.2829, 0.3003, 0.6746 -0.3890, 0.4936, 0.3325, 0.3355 $)^{T}$	0.7455	1084.3075
9	$($ 0.8684, 0.2850, 0.2737, 0.6914 -0.3985, 0.4960, 0.3394, 0.3411 $)^{T}$	0.1766	482.0095
10	$($ 0.8690, 0.2852, 0.2752, 0.6895 -0.3976, 0.4957, 0.3383, 0.3411 $)^{T}$	0.0144	21.2907
11	$($ 0.8692, 0.2853, 0.2753, 0.6894 -0.3975, 0.4956, 0.3383, 0.3410 $)^{T}$	0.0029	2.4370
12	$($ 0.8692, 0.2853, 0.2754, 0.6893 -0.3975, 0.4956, 0.3383, 0.3410 $)^{T}$	0.0002	0.1396
13	$($ 0.8692, 0.2853, 0.2754, 0.6892 -0.3975, 0.4956, 0.3383, 0.3410 $)^{T}$	0.0001	0.0008
14	$($ 0.8692, 0.2853, 0.2754, 0.6892 -0.3975, 0.4956, 0.3383, 0.3410 $)^{T}$	0.0000	0.0000

Table 2. Table 2: Diagonal elements of 𝒟 k subscript 𝒟 𝑘 \mathcal{D}_{k}

$𝐤$	diag of $𝒟_{k}$
1	$($ -1,-1,-1,-1,-1,-1,-1,-1,-1,-1 $)^{T}$
2	$($ -1, 1,-1, 1,-1, 1,-1,-1,-1, 1 $)^{T}$
3	$($ 1, 1,-1,-1, 1, 1,-1,-1,-1,-1 $)^{T}$
4	$($ -1, 1,-1, 1,-1, 1,-1,-1, 1, 1 $)^{T}$
5	$($ 1,-1, 1, 1, 1, 1,-1, 1,-1, 1 $)^{T}$

Table 3. Table 3: Numerical results for tensors 𝒜 k subscript 𝒜 𝑘 \mathcal{A}_{k} with type-I initial points

$𝐤$	$𝐱_{𝐤}$	$‖ H (𝐱_{𝐤}) ‖$	Iter.	Time
1	$($ -0.3485,-0.0971,-0.7753,-1.2447,-0.7739,	0.00000012	15	0.2135
1	-0.5628,-0.4868, 0.4480, 0.2925,-0.9003 $)^{T}$	0.00000012	15	0.2135
2	$($ 0.4184,-0.0423,-0.2989, 1.0357,-1.0340,	0.00000022	15	0.2060
2	0.3109,-0.3686,-0.2755,-0.6852, 0.9528 $)^{T}$	0.00000022	15	0.2060
3	$($ 0.7454, 0.5055,-0.6641, 0.3093,-0.1769,	0.00000003	18	0.2673
3	1.1273,-0.4514,-1.1430,-0.0619,-0.2421 $)^{T}$	0.00000003	18	0.2673
4	$($ -0.9570, 0.5494,-2.1429,-0.1959,-1.8247,	0.00000000	11	0.1355
4	-0.3996, 0.8803,-0.3457, 0.0458, 0.1694 $)^{T}$	0.00000000	11	0.1355
5	$($ 0.3385,-1.1498, 1.0413, 0.3533, 0.7606,	0.00000006	10	0.1265
5	-0.1214,-0.3290,-0.0458,-0.2049, 0.4027 $)^{T}$	0.00000006	10	0.1265

Table 4. Table 4: Numerical results for tensors 𝒜 k subscript 𝒜 𝑘 \mathcal{A}_{k} with type-II initial points

$𝐤$	$𝐱_{𝐤}$	$‖ H (𝐱_{𝐤}) ‖$	Iter.	Time
1	$($ -0.1040,-0.7455,-0.7363,-0.5619,-0.1842,	0.00000072	20	0.2523
1	-0.5972,-0.2999,-0.1341,-0.2126,-0.8949 $)^{T}$	0.00000072	20	0.2523
2	$($ -0.1040, 0.7455,-0.7363, 0.5619,-0.1842,	0.00000090	17	0.2050
2	0.5972,-0.2999,-0.1341,-0.2126, 0.8949 $)^{T}$	0.00000090	17	0.2050
3	$($ 0.1040, 0.7455,-0.7363,-0.5619, 0.1842,	0.00000091	24	0.2838
3	0.5972,-0.2999,-0.1341,-0.2126,-0.8949 $)^{T}$	0.00000091	24	0.2838
4	$($ -0.1040, 0.7455,-0.7363, 0.5619,-0.1842,	0.00000064	16	0.1896
4	0.5972,-0.2999,-0.1341, 0.2126, 0.8949 $)^{T}$	0.00000064	16	0.1896
5	$($ 0.1040,-0.7455, 0.7363, 0.5619, 0.1842,	0.00000075	14	0.1638
5	0.5972,-0.2999, 0.1341,-0.2126, 0.8949 $)^{T}$	0.00000075	14	0.1638

Table 5. Table 5: A random symmetric nonnegative tensor ℬ = ( b i 1 i 2 i 3 i 4 ) ∈ S ( 4 , 4 ) ℬ subscript 𝑏 subscript 𝑖 1 subscript 𝑖 2 subscript 𝑖 3 subscript 𝑖 4 𝑆 4 4 \mathcal{B}=(b_{i_{1}i_{2}i_{3}i_{4}})\in S(4,4)

$b_{1111} = 0.8147$	$b_{1112} = 0.9058$	$b_{1113} = 0.1270$	$b_{1114} = 0.9134$	$b_{1122} = 0.6324$
$b_{1123} = 0.0975$	$b_{1124} = 0.2785$	$b_{1133} = 0.5469$	$b_{1134} = 0.9575$	$b_{1144} = 0.9649$
$b_{1222} = 0.1576$	$b_{1223} = 0.9706$	$b_{1224} = 0.9572$	$b_{1233} = 0.4854$	$b_{1234} = 0.8003$
$b_{1244} = 0.1419$	$b_{1333} = 0.4218$	$b_{1334} = 0.9157$	$b_{1344} = 0.7922$	$b_{1444} = 0.9595$
$b_{2222} = 0.6557$	$b_{2223} = 0.0357$	$b_{2224} = 0.8491$	$b_{2233} = 0.9340$	$b_{2234} = 0.6787$
$b_{2244} = 0.7577$	$b_{2333} = 0.7431$	$b_{2334} = 0.3922$	$b_{2344} = 0.6555$	$b_{2444} = 0.1712$
$b_{3333} = 0.7060$	$b_{3334} = 0.0318$	$b_{3344} = 0.2769$	$b_{3444} = 0.0462$	$b_{4444} = 0.0971$

Table 6. Table 6: The symmetric tensor 𝒜 = ( a i 1 i 2 i 3 i 4 ) ∈ S ( 4 , 4 ) 𝒜 subscript 𝑎 subscript 𝑖 1 subscript 𝑖 2 subscript 𝑖 3 subscript 𝑖 4 𝑆 4 4 \mathcal{A}=(a_{i_{1}i_{2}i_{3}i_{4}})\in S(4,4) based on ℬ ℬ \mathcal{B}

$a_{1111} = 40.8037$	$a_{1112} = - 0.9058$	$a_{1113} = - 0.1270$	$a_{1114} = - 0.9134$	$a_{1122} = - 0.6324$
$a_{1123} = - 0.0975$	$a_{1124} = - 0.2785$	$a_{1133} = - 0.5469$	$a_{1134} = - 0.9575$	$a_{1144} = - 0.9649$
$a_{1222} = - 0.1576$	$a_{1223} = - 0.9706$	$a_{1224} = - 0.9572$	$a_{1233} = - 0.4854$	$a_{1234} = - 0.8003$
$a_{1244} = - 0.1419$	$a_{1333} = - 0.4218$	$a_{1334} = - 0.9157$	$a_{1344} = - 0.7922$	$a_{1444} = - 0.9595$
$a_{2222} = 40.9627$	$a_{2223} = - 0.0357$	$a_{2224} = - 0.8491$	$a_{2233} = - 0.9340$	$a_{2234} = - 0.6787$
$a_{2244} = - 0.7577$	$a_{2333} = - 0.7431$	$a_{2334} = - 0.3922$	$a_{2344} = - 0.6555$	$a_{2444} = - 0.1712$
$a_{3333} = 40.9124$	$a_{3334} = - 0.0318$	$a_{3344} = - 0.2769$	$a_{3444} = - 0.0462$	$a_{4444} = 41.5213$

Table 7. Table 7: Numerical results for the third experiment

$𝐱$	$𝐛$	Iter.	Time	$\max ‖ H (𝐱) ‖$	Attempts
${(0.8100, 0.7881, 0.7786, 0.8003)}^{T}$	${(1.4193, 0.2916, 0.1978, 1.5877)}^{T}$	31.00	0.6783	0.00000098	20/100
${(0.7285, 0.7212, 0.7156, 0.7098)}^{T}$	${(0.8045, 0.6966, 0.8351, 0.2437)}^{T}$	19.40	0.3109	0.00000099	20/157
${(0.7219, 0.7313, 0.7230, 0.7098)}^{T}$	${(0.2157, 1.1658, 1.1480, 0.1049)}^{T}$	19.55	0.3456	0.00000099	20/205
${(0.8453, 0.8603, 0.8294, 0.8276)}^{T}$	${(0.7223, 2.5855, 0.6669, 0.1873)}^{T}$	13.65	0.1907	0.00000082	20/244
${(0.8445, 0.8584, 0.8321, 0.8507)}^{T}$	${(0.0825, 1.9330, 0.4390, 1.7947)}^{T}$	14.05	0.2168	0.00000084	20/290
${(0.7104, 0.7055, 0.6849, 0.6957)}^{T}$	${(0.8404, 0.8880, 0.1001, 0.5445)}^{T}$	68.25	1.7492	0.00000051	20/145
${(0.6775, 0.6771, 0.6677, 0.6750)}^{T}$	${(0.3035, 0.6003, 0.4900, 0.7394)}^{T}$	21.75	0.3864	0.00000099	20/216
${(0.9021, 0.8787, 0.8894, 0.8805)}^{T}$	${(1.7119, 0.1941, 2.1384, 0.8396)}^{T}$	15.70	0.2535	0.00000089	20/114
${(0.8104, 0.8007, 0.7908, 0.7841)}^{T}$	${(1.3546, 1.0722, 0.9610, 0.1240)}^{T}$	14.60	0.2121	0.00000071	20/129
${(0.8957, 0.8939, 0.8661, 0.8808)}^{T}$	${(1.4367, 1.9609, 0.1977, 1.2078)}^{T}$	13.60	0.1980	0.00000099	20/114

Table 8. Table 8: Solutions of TAVE when 𝐛 = ( − 1 , 1 , 1 , 1 ) 𝐛 1 1 1 1 {\bf b}=(-1,1,1,1)

$𝐱$	$𝐛$	Iter.	Time	$\max ‖ H (𝐱) ‖$	Attempts
${(0.0800, 0.3629, 0.3543, 0.3505)}^{T}$	${(- 1, 1, 1, 1)}^{T}$	12.67	0.1644	0.00000070	3/20
${(- 0.2593, 0.2948, 0.2891, 0.2903)}^{T}$	${(- 1, 1, 1, 1)}^{T}$	13.67	0.1708	0.00000099	3/20
${(0.6258, 0.6600, 0.6522, 0.6537)}^{T}$	${(- 1, 1, 1, 1)}^{T}$	11.93	0.1516	0.00000075	14/20

Equations156

A x^{m - 1} = b,

A x^{m - 1} = b,

(A x^{m - 1})_{i} = i_{2} = 1 \sum n \dots i_{m} = 1 \sum n a_{i i_{2} \dots i_{m}} x_{i_{2}} \dots x_{i_{m}}, i = 1, \dots, n .

(A x^{m - 1})_{i} = i_{2} = 1 \sum n \dots i_{m} = 1 \sum n a_{i i_{2} \dots i_{m}} x_{i_{2}} \dots x_{i_{m}}, i = 1, \dots, n .

A x^{m - 1} - ∣ x ∣^{[m - 1]} = b,

A x^{m - 1} - ∣ x ∣^{[m - 1]} = b,

∣ x ∣^{[m - 1]} = (∣ x_{1} ∣^{m - 1}, \dots, ∣ x_{n} ∣^{m - 1})^{T} .

∣ x ∣^{[m - 1]} = (∣ x_{1} ∣^{m - 1}, \dots, ∣ x_{n} ∣^{m - 1})^{T} .

A x - ∣ x ∣ = b

A x - ∣ x ∣ = b

A x^{m - 1} = λ x^{[m - 1]},

A x^{m - 1} = λ x^{[m - 1]},

\rho(\mathcal{A})=\max\{|\lambda|:\,\mbox{ $\lambda$ is an eiegnvalue of $\mathcal{A}$}\}.

\rho(\mathcal{A})=\max\{|\lambda|:\,\mbox{ $\lambda$ is an eiegnvalue of $\mathcal{A}$}\}.

A x - ∣ x ∣ = b,

A x - ∣ x ∣ = b,

0 = min {((A + I) x - b)^{T} ((A - I) x - b) ∣ (A + I) x - b \geq 0, (A - I) x - b \geq 0},

0 = min {((A + I) x - b)^{T} ((A - I) x - b) ∣ (A + I) x - b \geq 0, (A - I) x - b \geq 0},

(A + I) x - b \geq 0, (A - I) x - b \geq 0 ((A + I) x - b)^{T} ((A - I) x - b) = 0.

(A + I) x - b \geq 0, (A - I) x - b \geq 0 ((A + I) x - b)^{T} ((A - I) x - b) = 0.

(C-I){\bf z}={\bf b},\quad{\bf z}\geq{\bf 0}\quad\mbox{has a solution ${\bf z}\in R^{n}$}

(C-I){\bf z}={\bf b},\quad{\bf z}\geq{\bf 0}\quad\mbox{has a solution ${\bf z}\in R^{n}$}

A{\bf x}-|{\bf x}|={\bf b}\quad\mbox{has a solution for any $A=CD$ with $D=diag(\pm 1)$}.

A{\bf x}-|{\bf x}|={\bf b}\quad\mbox{has a solution for any $A=CD$ with $D=diag(\pm 1)$}.

A x^{2} - ∣ x ∣^{2} = b

A x^{2} - ∣ x ∣^{2} = b

\left\{\begin{array}[]{l}a_{111}x_{1}^{2}+(a_{112}+a_{121})x_{1}x_{2}+a_{122}x_{2}^{2}-|x_{1}|^{2}=b_{1},\\ a_{211}x^{2}_{1}+(a_{212}+a_{221})x_{1}x_{2}+a_{222}x^{2}_{2}-|x_{2}|^{2}=b_{2}.\end{array}\right.

\left\{\begin{array}[]{l}a_{111}x_{1}^{2}+(a_{112}+a_{121})x_{1}x_{2}+a_{122}x_{2}^{2}-|x_{1}|^{2}=b_{1},\\ a_{211}x^{2}_{1}+(a_{212}+a_{221})x_{1}x_{2}+a_{222}x^{2}_{2}-|x_{2}|^{2}=b_{2}.\end{array}\right.

\left\{\begin{array}[]{rcl}x_{1}^{3}-x_{2}^{3}-{|x_{1}|}^{3}&=&1,\\ -2x_{1}^{3}+x_{2}^{3}-{|x_{2}|}^{3}&=&2.\end{array}\right.

\left\{\begin{array}[]{rcl}x_{1}^{3}-x_{2}^{3}-{|x_{1}|}^{3}&=&1,\\ -2x_{1}^{3}+x_{2}^{3}-{|x_{2}|}^{3}&=&2.\end{array}\right.

x \geq 0, F (x) \geq 0, x^{T} F (x) = 0.

x \geq 0, F (x) \geq 0, x^{T} F (x) = 0.

ϕ (a, b) = 0 \Leftrightarrow a \geq 0, b \geq 0, ab = 0.

ϕ (a, b) = 0 \Leftrightarrow a \geq 0, b \geq 0, ab = 0.

Φ (x) = (ϕ (x_{1}, F_{1} (x)), \dots, ϕ (x_{n}, F_{n} (x)))^{T} .

Φ (x) = (ϕ (x_{1}, F_{1} (x)), \dots, ϕ (x_{n}, F_{n} (x)))^{T} .

Φ (x) = 0 .

Φ (x) = 0 .

\partial_{B}\Theta({\bf x})=\left\{V\in R^{n_{2}\times n_{1}}|\,\mbox{$\exists\{{\bf x}^{k}\}\subseteq D_{\Theta}$ with ${\bf x}^{k}\to{\bf x}$, $J\Theta({\bf x}^{k})\to V$}\right\}.

\partial_{B}\Theta({\bf x})=\left\{V\in R^{n_{2}\times n_{1}}|\,\mbox{$\exists\{{\bf x}^{k}\}\subseteq D_{\Theta}$ with ${\bf x}^{k}\to{\bf x}$, $J\Theta({\bf x}^{k})\to V$}\right\}.

\partial\Theta({\bf x})=\mbox{co($\partial_{B}\Theta({\bf x})$)},

\partial\Theta({\bf x})=\mbox{co($\partial_{B}\Theta({\bf x})$)},

V \in \partial Θ (x + t \tilde{d}) \tilde{d} \to d, t ↓ 0 lim V \tilde{d}

V \in \partial Θ (x + t \tilde{d}) \tilde{d} \to d, t ↓ 0 lim V \tilde{d}

V d - Θ^{'} (x; d) = O (∥ d ∥^{2}), d \to 0,

V d - Θ^{'} (x; d) = O (∥ d ∥^{2}), d \to 0,

Θ^{'} (x; d) = t ↓ 0 lim \frac{Θ ( x + t d ) - Θ ( x )}{t} .

Θ^{'} (x; d) = t ↓ 0 lim \frac{Θ ( x + t d ) - Θ ( x )}{t} .

Θ^{'} (x; d) = V \in \partial Θ (x + t \tilde{d}) \tilde{d} \to d, t ↓ 0 lim V \tilde{d} .

Θ^{'} (x; d) = V \in \partial Θ (x + t \tilde{d}) \tilde{d} \to d, t ↓ 0 lim V \tilde{d} .

ϕ (a, b) = min {a, b} .

ϕ (a, b) = min {a, b} .

ϕ_{F B} (a, b) = a + b - a^{2} + b^{2} .

ϕ_{F B} (a, b) = a + b - a^{2} + b^{2} .

(v_{a},v_{b})=\left\{\begin{array}[]{ll}\left(1-\frac{a}{\sqrt{a^{2}+b^{2}}},1-\frac{b}{\sqrt{a^{2}+b^{2}}}\right)&\quad\mbox{if $(a,b)\neq(0,0)$},\\ (1-\xi,1-\varsigma)&\quad\mbox{if $(a,b)=(0,0)$},\end{array}\right.

(v_{a},v_{b})=\left\{\begin{array}[]{ll}\left(1-\frac{a}{\sqrt{a^{2}+b^{2}}},1-\frac{b}{\sqrt{a^{2}+b^{2}}}\right)&\quad\mbox{if $(a,b)\neq(0,0)$},\\ (1-\xi,1-\varsigma)&\quad\mbox{if $(a,b)=(0,0)$},\end{array}\right.

x \geq 0, A x^{m - 1} + q \geq 0, x^{T} (A x^{m - 1} + q) = 0.

x \geq 0, A x^{m - 1} + q \geq 0, x^{T} (A x^{m - 1} + q) = 0.

x \geq 0, A x + q \geq 0, x^{T} (A x + q) = 0,

x \geq 0, A x + q \geq 0, x^{T} (A x + q) = 0,

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications · Matrix Theory and Algorithms · Power System Optimization and Stability

Full text

Tensor absolute value equations

Shouqiang Du College of Mathematics and statistic, Qingdao University, Qingdao 266071, P. R. China ([email protected]).

Liping Zhang Corresponding author. Department of Mathematical Sciences, Tsinghua University, Beijing 100084, P. R. China ([email protected]).

Chiyu Chen Department of Mathematical Sciences, Tsinghua University, Beijing 100084, P. R. China ([email protected]).

Liqun Qi Department of Applied Mathematics, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong, P. R. China ([email protected]).

Abstract

This paper is concerned with solving some structured multi-linear systems, which are called tensor absolute value equations. This kind of absolute value equations is closely related to tensor complementarity problems and is a generalization of the well-known absolute value equations in the matrix case. We prove that tensor absolute value equations are equivalent to some special structured tensor complementary problems. Some sufficient conditions are given to guarantee the existence of solutions for tensor absolute value equations. We also propose a Levenberg-Marquardt-type algorithm for solving some given tensor absolute value equations and preliminary numerical results are reported to indicate the efficiency of the proposed algorithm.

keywords:

M-tensors, absolute value equations, Levenberg-Marquardt method, tensor complementarity problem

AMS:

15A48, 15A69, 65K05, 90C30, 90C20

1 Introduction

The systems of multi-linear equations can be expressed by tensor-vector products, just as we rewrite linear systems by matrix-vector products. Let $\mathcal{A}$ be an $m$ th-order tensor in $R^{n}\times\cdots\times R^{n}$ and ${\bf b}$ be a vector in $R^{n}$ . Then a multi-linear equation can be expressed as

[TABLE]

where $\mathcal{A}{\bf x}^{m-1}$ is a vector in $R^{n}$ [21] with

[TABLE]

Solving multi-linear systems is always an important problem in engineering and scientific computing [8, 15]. In this paper, we consider the systems of multi-linear absolute value equations, which can be expressed as

[TABLE]

where $|{\bf x}|^{[m-1]}$ is a vector in $R^{n}$ with

[TABLE]

It is easy to see that the system of multi-linear absolute value equations (2) is a generalization of the well-known absolute value equations

[TABLE]

with a matrix $A\in R^{n\times n}$ . The absolute value equations (AVE) has wide applications in applied science and technology such as optimization physical and economic equilibrium problems [17, 18, 19]. As was shown in [19], the general NP-hard linear complementarity problem [7] which subsumes many mathematical programming problems can be formulated as an AVE. This implies that the AVE is NP-hard in its general form. Analogous to AVE, we call (2) tensor absolute value equations (TAVE). Obviously, the TAVE is also NP-hard. Thus, investigating the existence of solutions for the TAVE is a significant problem.

Recently, Song and Qi [25] introduced a class of complementarity problems, called tensor complementarity problems, where the involved function is defined by some homogenous polynomial of degree $n$ with $n>2$ . It is known that the tensor complementarity problem is a generalization of the linear complementarity problem [7]; and a subclass of nonlinear complementarity problems [10]. The tensor complementarity problem was studied recently by many scholars [4, 26]. In [19], it was shown that the AVE is equivalent to a generalized linear complementarity problem. Can we show that the TAVE is equivalent to a generalized tensor complementarity problem? Although some computational methods have been presented for the AVE, it is very difficult to extend these algorithms to solve the TAVE because the TAVE (2) is a nonlinear equation. The Levenberg-Marquardt method is one of the important algorithms for solving nonlinear equations [11]. Can we propose an efficient algorithm such as the Levenberg-Marquardt method for solving the TAVE (2)? To our best knowledge, there is no general answer to these questions. Therefore, we shall focus on some special tensor absolute value equations.

Let $\mathcal{I}$ be an $m$ th-order $n$ -dimensional unit tensor, whose entries are $1$ if and only if $i_{1}=\cdots=i_{m}$ and otherwise zero. A tensor $\mathcal{A}$ is called a nonnegative tensor if all its entries are nonnegative, denoted $\mathcal{A}\geq 0$ . A tensor is called a $Z$ -tensor, if all its diagonal entries are nonnegative and off-diagonal entries are nonpositive. $M$ -tensor is a special class of Z-tensor, which was first introduced and studied in [9, 29]. To define the $M$ -tensors, we need to introduce the tensor eigenvalues first. Let $\mathcal{A}$ be an $m$ th-order $n$ -dimensional tensor. If a scalar $\lambda\in R$ and a nonzero vector ${\bf x}\in R^{n}$ satisfy

[TABLE]

where ${\bf x}^{[m-1]}=(x_{1}^{m-1},\ldots,x_{n}^{m-1})^{T}$ , then we call $\lambda$ an eigenvalue of $\mathcal{A}$ and $x$ a corresponding eigenvector. Qi [21] and Lim [16] first defined the eigenvalues of tensors independently. The spectral radius of a tensor $\mathcal{A}$ is defined by

[TABLE]

A tensor $\mathcal{A}$ is called an $M$ -tensor, if it can be written as $\mathcal{A}=s\mathcal{I}-\mathcal{B}$ with $\mathcal{B}\geq 0$ and $s\geq\rho(\mathcal{B})$ ; furthermore, it is called a strong $M$ -tensor if $s>\rho(\mathcal{B})$ . One can refer to a survey [3] for the spectral theory of nonnegative tensors. In this paper, we first investigate the existence of solutions for the TAVE (2). We show that the TAVE (2) with positive right-hand side $b$ always has a unique solution when $\mathcal{A}-\mathcal{I}$ is strong $M$ -tensor. Another sufficient condition for the existence of solution is also given. Can we compute the solution? We propose an inexact Levenberg-Marquardt method for solving the TAVE (2).

The rest of this paper is organized as follows. In Section 2, we introduce the tensor absolute value equations which is a generalization of absolute value equations with matrix case. In Section 3, some sufficient conditions for the existence of solution to the TAVE are given. In Section 4, we first reformulate the TAVE as a special tensor complementarity problem and then we propose an an inexact Levenberg-Marquardt-type algorithm for solving the TAVE. Some numerical results are reported in Section 5. Finally, some conclusions are given.

Throughout this paper, we assume that $m\geq 2$ . We use small letters x, y,…, for scalars, small bold letters ${\bf x},{\bf y},\ldots,$ for vectors, capital letters $A,B,\ldots,$ for matrixes, calligraphic letters $\mathcal{A},\mathcal{B},\ldots,$ for tensors, calligraphic letters $\mathcal{D}$ for diagonal tensor whose diagonal elements are $1$ or $-1$ . All the tensors discussed in this paper are real. $T(m,n)$ denotes the set of all $m$ th order $n$ -dimensional tensors. Let $\mathcal{A}=(a_{i_{1}i_{2}\ldots i_{m}})\in T(m,n)$ , then $\mathcal{A}$ is called a symmetric tensor if its entries $a_{i_{1}i_{2}\ldots i_{m}}$ are invariant under any permutation of their indices. $S(m,n)$ denotes the set of all symmetric tensors. For such a matrix $A^{T}$ will denote the transpose of $A$ . The identity matrix of arbitrary dimension will be denoted by $I$ .

2 Tensor absolute value equations

In this section, we present some basic definitions and properties in absolute value equations, nonlinear complementarity problems, and nonsmooth analysis, which will be used in the sequel.

We recall the absolute value equations (AVE) of the type

[TABLE]

where $A\in R^{n\times n}$ , ${\bf b}\in R^{n}$ and $|{\bf x}|$ denotes the vector with absolute values of each component of ${\bf x}$ . The AVE (3) has been widely investigated in many literatures such as [17, 18, 19]. In [19], some results about the AVE are given, which we list as follows:

(i)

The AVE (3) is equivalent to the bilinear program

[TABLE]

and the generalized linear complementarity problem

[TABLE]

(ii)

Let $C\in R^{n\times n}$ and ${\bf b}\in R^{n}$ . Then

[TABLE]

implies that

[TABLE]

Clearly, the tensor absolute value equation (2) is a generalization of the AVE (3) from the matrix case to the tensor case. Take an equation with the coefficient tensor $\mathcal{A}\in R^{2\times 2\times 2}$ as an example. The tensor absolute equation

[TABLE]

is a condense form of

[TABLE]

We want to find $x_{1}$ and $x_{2}$ that satisfy the above two equations.

The following example shows a specific tensor absolute value equation.

Example 2.1.

Let a tensor $\mathcal{A}\in T(4,2)$ be defined by $a_{1111}=a_{2111}=a_{2222}=1$ , $a_{1222}=-1$ , and zero otherwise. Let ${\bf b}=(1,2)^{T}$ . Then the corresponding tensor absolute value equation is

[TABLE]

By simplicity computation, we see that the TAVE (4) in Example 2.1 has no solution. In the next section we will discuss the existence of solution for the TAVE (2). We can extend the result (ii) to the TAVE and obtain a similar condition for the existence of solution to (2).

Below, we introduce the classical nonlinear complementarity problem. The tensor complementarity problem recently introduced in [25] is a special kind of nonlinear complementarity problem. It will be shown in Section 4 that the TAVE (2) can be reformulated as a special kind of generalized tensor complementarity problem.

Definition 1.

Given a given mapping $F:R^{n}\to R^{n}$ , the nonlinear complementarity problem, denoted by NCP( $F$ ), is to find a vector ${\bf x}\in R^{n}$ satisfying

[TABLE]

Many solution methods developed for NCP( $F$ ) or related problems are based on reformulating them as a system of equations using so-called NCP-functions [10]. Here a function $\phi:R^{2}\to R$ is called an NCP-function if

[TABLE]

Given an NCP-function $\phi$ , let us define

[TABLE]

It is obvious that ${\bf x}\in R^{n}$ is a solution of NCP( $F$ ) if and only if it solves the system of nonsmooth equations

[TABLE]

For the solution of $\Phi({\bf x})={\bf 0}$ , we recall some definitions in nonsmooth analysis. Suppose that $\Theta:U\subseteq R^{n_{1}}\to R^{n_{2}}$ is a locally Lipschitz function, where $U$ is nonempty and open. By Rademacher’s Theorem, $\Theta$ is differentiable almost everywhere. Let $D_{\Theta}\subseteq R^{n_{1}}$ denote the set of points at which $\Theta$ is differentiable. For any ${\bf x}\in D_{\Theta}$ , we write $J\Theta({\bf x})$ for the usual $n_{2}\times n_{1}$ Jacobian matrix of partial derivatives. The $B$ -subdifferential of $\Theta$ at ${\bf x}\in U$ is the set defined by

[TABLE]

The Clarke’s generalized Jacobian of $\Theta$ at ${\bf x}$ is the set defined by

[TABLE]

where “co” denotes the convex hull. Then, $\partial\Theta({\bf x})$ is a nonempty convex compact subset of $R^{n_{2}\times n_{1}}$ [6]. The function $\Theta$ is semismooth [13, 22] at ${\bf x}\in R^{n_{1}}$ if

[TABLE]

exists for all ${\bf d}\in R^{n_{1}}$ . If $\Theta$ is semismooth at all ${\bf x}\in U$ , we call $\Theta$ semismooth on $U$ . The function $\Theta$ is called strongly semismooth [23] if it is semismooth and for any ${\bf x}\in U$ and $V\in\partial\Theta({\bf x}+t{\bf d})$ ,

[TABLE]

where $\Theta^{\prime}({\bf x};{\bf d})$ denotes the directional derivative [2] of $\Theta$ at ${\bf x}$ in direction ${\bf d}$ , i.e.,

[TABLE]

Note that if the function $\Theta$ is semismooth at ${\bf x}$ , the directional derivative $\Theta^{\prime}({\bf x};{\bf d})$ exists for all ${\bf d}\in R^{n_{1}}$ and

[TABLE]

We now present some NCP-functions which are widely used in nonlinear complementarity problems. For more details about NCP-functions and their smoothing approximations, one can refer to [24, 30] and references therein.

Here we give some well-known NCP-functions as follows:

•

The min function:

[TABLE]

•

The Fischer-Burmeister function:

[TABLE]

It has been shown that all these NCP-functions are globally Lipschitz continuous, directionally differentiable, and strongly semismooth [12, 27]. For example, the generalized gradient $\partial\phi_{FB}(a,b)$ of $\phi_{FB}(a,b)$ is equal to the set of all $(v_{a},v_{b})$ such that

[TABLE]

where $(\xi,\varsigma)$ is any vector satisfying $\xi^{2}+\varsigma^{2}\leq 1$ .

In Section 4, we will use the Fischer-Burmeister function to reformulate the TAVE (2) as a system of equations and then we will propose an algorithm to solve the system of equations.

We now introduce the tensor complementarity problem which first defined by Song and Qi [25].

Definition 2.

Given any given tensor $\mathcal{A}\in T(m,n)$ and vector ${\bf q}\in R^{n}$ , the tensor complementarity problem, denoted by TCP( $\mathcal{A},{\bf q}$ ), is to find a vector ${\bf x}\in R^{n}$ satisfying

[TABLE]

Note that when $n=2$ , the tensor $\mathcal{A}$ reduces to a matrix, denoted by $A$ , and the TCP( $\mathcal{A},{\bf q}$ ) becomes: find a vector ${\bf x}\in R^{n}$ such that

[TABLE]

which is just the linear complementarity problem [7]. Very recently, a class of $n$ -person noncooperative games are in [14], where the utility function of every player is given by a homogeneous polynomial defined by the payoff tensor of that player, which is a natural extension of the bimatrix game where the utility function of every player is given by a quadratic form defined by the payoff matrix of that player. Such a problem is called the multilinear game. The multilinear game is reformulated as a tensor complementarity problem. Some semismooth Newton-type methods are recently proposed for solving the tensor complementarity problems (see, e.g., [5]). In Section 4, we will extend the result (i) to the TAVE (2) and show that the TAVE (2) is equivalent to a bi-multilinear program and a generalized tensor complementarity problem.

3 Existence of solutions

In this section, we give some sufficient conditions for the existence of solutions to the TAVE (2). Specially, we extend the result (ii) about the AVE (3) to the TAVE (2).

We need the following lemmas which are recently established in [8, Theorems 3.2, 3.3, 3.4].

Lemma 3.

Let $\mathcal{A}\in T(m,n)$ . If $\mathcal{A}$ is a strong $M$ -tensor, then for every positive vector ${\bf b}$ the multilinear system of equations $\mathcal{A}{\bf x}^{m-1}={\bf b}$ has a unique positive solution.

Lemma 4.

Let $\mathcal{A}\in T(m,n)$ be a $Z$ -tensor. Then it is a strong $M$ -tensor if and only if the multilinear system of equations $\mathcal{A}{\bf x}^{m-1}={\bf b}$ has a unique positive solution for every positive vector ${\bf b}$ .

Lemma 5.

Let $\mathcal{A}\in T(m,n)$ be an $M$ -tensor and ${\bf b}\geq{\bf 0}$ . If there exists ${\bf v}\geq{\bf 0}$ such that $\mathcal{A}{\bf v}^{m-1}\geq{\bf b}$ , then the multilinear system of equations $\mathcal{A}{\bf x}^{m-1}={\bf b}$ has a nonnegative solution.

By the above lemmas, we have the following theorems.

Theorem 6.

Let $\mathcal{A}\in T(m,n)$ . If $\mathcal{A}$ can be written as $\mathcal{A}=c\mathcal{I}-\mathcal{B}$ with $\mathcal{B}\geq 0$ and $c>\rho(\mathcal{B})+1$ , then for every positive vector ${\bf b}$ the TAVE (2) has a unique positive solution.

Proof.

Let $s=c-1$ . Then $\mathcal{A}=c\mathcal{I}-\mathcal{B}$ yields

[TABLE]

which implies that $\mathcal{A}-\mathcal{I}$ is a strong $M$ -tensor. By Lemma 3, the multilinear system of equations

[TABLE]

has a unique positive solution for every positive vector ${\bf b}$ . Hence, for every positive vector ${\bf b}$ , the TAVE (2) has a unique positive solution. ∎

Combining [9, Theorem 3] and Lemma 4, we can rewrite the above theorem into an equivalent condition for $\mathcal{A}-\mathcal{I}$ being a strong $M$ -tensor.

Theorem 7.

Let $\mathcal{A}\in T(m,n)$ be a $Z$ -tensor. Then $\mathcal{A}$ can be written as the form of

[TABLE]

if and only if for every positive vector ${\bf b}$ the TAVE (2) has a unique positive solution.

Proof.

On one hand, by Theorem 6, we have the existence and uniqueness of the positive solution of the TAVE (2) for every positive vector ${\bf b}$ . On the other hand, if for every positive vector ${\bf b}$ the TAVE (2) has a unique positive solution, then there exists a vector ${\bf x}>{\bf 0}$ such that

[TABLE]

Since $\mathcal{A}$ is a $Z$ -tensor, $\mathcal{A}-\mathcal{I}$ is also a $Z$ -tensor. Thus, by [9, Theorem 3], $\mathcal{A}-\mathcal{I}$ is a strong $M$ -tensor and then the form of (5) holds. ∎

Remark. The sufficient condition in Theorem 7 can be weakened as follows: if the TAVE (2) has a nonnegative solution for every positive vector ${\bf b}$ , then we also have the form (5). In fact, let ${\bf x}\geq{\bf 0}$ be a solution of the TAVE (2). Then there exists ${\bf x}\geq{\bf 0}$ such that $(\mathcal{A}-\mathcal{I}){\bf x}^{m-1}>{\bf 0}$ . By [9, Theorem 3], we can obtain the conclusion.

Theorem 8.

Let ${\bf b}\geq{\bf 0}$ and $\mathcal{A}\in T(m,n)$ be in the form of $\mathcal{A}=c\mathcal{I}-\mathcal{B}$ with $\mathcal{B}\geq 0$ and $c=\rho(\mathcal{B})+1$ . If there exists a vector ${\bf v}\geq{\bf 0}$ such that $(\mathcal{A}-\mathcal{I}){\bf v}^{m-1}\geq{\bf b}$ , then the TAVE (2) has a nonnegative solution.

Proof.

It follows from

[TABLE]

that $\mathcal{A}-\mathcal{I}$ is an $M$ -tensor. By Lemma 5, there is ${\bf x}^{*}\geq{\bf 0}$ such that

[TABLE]

Thus, we have

[TABLE]

This completes the proof. ∎

We next extend the result (i) about the AVE (3) to the TAVE (2). Here, we assume that $m$ is even. We first introduce the product of a tensor and a diagonal tensor.

Definition 9.

Let $\mathcal{C}=(c_{i_{1}i_{2}\ldots i_{m}})\in T(m,n)$ and $\mathcal{B}\in T(m,n)$ be a diagonal tensor with diagonal elements $b_{i\ldots i}$ . We denote $\mathcal{CB}=(a_{i_{1}i_{2}\ldots i_{m}})$ their product, whose elements are defined as

[TABLE]

Obviously, Definition 9 is well-defined due to the assumption that $m$ is even.

By simplicity computation, we have the following proposition.

Proposition 10.

Let $\mathcal{C}=(c_{i_{1}i_{2}\ldots i_{m}})\in T(m,n)$ and ${\bf x}\in R^{n}$ . We have

[TABLE]

Proof.

Let us define a vector ${\bf u}\in R^{n}$ as

[TABLE]

Then by some definitions introduced in Section 1, the $i$ th-component of the vector $\mathcal{C}(\mathcal{D}{\bf x}^{m-1})$ can be written as

[TABLE]

Let $\mathcal{A}=\mathcal{C}\mathcal{D}$ . Then by Definition 9, the $i$ th-component of the vector $(\mathcal{C}\mathcal{D}){\bf x}^{m-1}$ can be written as

[TABLE]

Combining (3) and (7), we have

[TABLE]

∎

It is easy to see that

[TABLE]

holds for any vector ${\bf x}\in R^{n}$ , because the $i$ th-component of the vectors $|{\bf x}|^{m-1}$ and $\mathcal{D}{\bf x}^{m-1}$ are in the form of

[TABLE]

Here the sign of $x_{i}$ is corresponded to the diagonal element $1$ or $-1$ of $\mathcal{D}$ .

The following theorem is a generalization of the result (ii) from AVE to TAVE.

Theorem 11.

Let $\mathcal{C}\in T(m,n)$ , ${\bf b}\in R^{n}$ and $\mathcal{A}=\mathcal{C}\mathcal{D}$ . If the multilinear system of equations

[TABLE]

has a solution, then the tensor absolute value equation

[TABLE]

also has a solution.

Proof.

Let ${\bf z}^{*}$ be the solution of the multilinear system of equations (9). Then we have

[TABLE]

Take

[TABLE]

Then (10) can be rewritten as

[TABLE]

which, together with Proposition 10 and (8), implies that ${\bf x}^{*}$ is a solution of the tensor absolute value equation

[TABLE]

Thus, we complete the proof. ∎

We give an example to verify the above theorem.

Example 3.1.

Let $\mathcal{C}\in T(4,2)$ with $c_{1111}=c_{1222}=c_{2111}=c_{2222}=1$ and zero otherwise, and ${\bf b}=(8,8)^{T}$ . Consider the multilinear system of equations

[TABLE]

It is rewritten as

[TABLE]

This implies that ${\bf z}^{*}=(2,2)^{T}$ is a solution of

[TABLE]

Let $\mathcal{D}\in T(4,2)$ be a diagonal tensor with $d_{1111}=1$ and $d_{2222}=-1$ . Then we have $\mathcal{A}\in T(4,2)$ with $a_{1111}=a_{2111}=1$ , $a_{1222}=a_{2222}=-1$ , and zero otherwise, i.e., $\mathcal{A}=\mathcal{C}\mathcal{D}$ . By Theorem 11, ${\bf x}^{*}=(2,-2)^{T}$ is just a solution of the tensor absolute value equation

[TABLE]

We now verify the conclusion. We rewrite (11) as

[TABLE]

By simplicity computation, the above equation has a solution $x_{1}=2,x_{2}=-2$ .

4 Reformulation and algorithm

In this section, we extend the result (i) from AVE to TAVE. We show that the TAVE (2) is equivalent to a bi-multiliear program and a generalized tensor complementarity problem. We first introduce the following definition.

Definition 12.

Let $\mathcal{A}\in T(m,n)$ and ${\bf x},{\bf b}\in R^{n}$ . Define

[TABLE]

The generalized tensor complementarity problem is to find ${\bf x}\in R^{n}$ satisfying

[TABLE]

We call the following nonlinear program as a bi-multiliear program:

[TABLE]

Theorem 13.

Let $\mathcal{A}\in T(m,n)$ and ${\bf b}\in R^{n}$ . Then the TAVE (2) is equivalent to the generalized tensor complementarity problem (12) and the bi-multilinear program (13).

Proof.

Clearly, the generalized tensor complementarity problem (12) is equivalent to the bi-multilinear program (13). That is, $(\ref{gtave})\Leftrightarrow(\ref{biprog})$ .

We only need to prove $(\ref{TAVE})\Leftrightarrow(\ref{biprog})$ . In fact, $|{\bf x}|^{m-1}=|{\bf x}^{m-1}|$ . Hence, we have

[TABLE]

This implies that ${\bf x}$ is a feasible solution of (13). Since

[TABLE]

we have

[TABLE]

This completes the proof. ∎

By the above theorem, in order to solve the TAVE (2), we propose an algorithm for solving the generalized tensor complementarity problem (12). Using the Fischer-Burmeister function $\phi_{FB}$ , we can reformulate (12) as the following equation:

[TABLE]

Hence, ${\bf x}$ is a solution of (2) if and only if $H({\bf x})={\bf 0}$ . Moreover, $H({\bf x})$ is strongly semismooth since the composition of strongly semismooth function is again strongly semismooth [20], and according to the Jacobian chain rule, we have the following result.

Theorem 14.

Let $\mathcal{A}\in S(m,n)$ . Then the function $H({\bf x})$ is strongly semismooth. Moreover, for any ${\bf x}\in R^{n}$ , we have

[TABLE]

where $D_{a}({\bf x})=diag(a_{i}({\bf x}))$ and $D_{b}({\bf x})=diag(b_{i}({\bf x}))$ are diagonal matrices in $R^{n\times n}$ with entries $(a_{i}({\bf x}),b_{i}({\bf x}))\in\partial\phi_{FB}(F_{i}({\bf x}),G_{i}({\bf x}))$ , where $\partial\phi_{FB}(F_{i}({\bf x}),G_{i}({\bf x}))$ denotes the set $\partial\phi_{FB}(a,b)$ with $(a,b)$ being replaced by $(F_{i}({\bf x}),G_{i}({\bf x}))$ , and $JF({\bf x})$ and $JG({\bf x})$ are given by

[TABLE]

Here, for a tensor $\mathcal{T}=(t_{i_{1}\ldots i_{m}})\in T(m,n)$ and a vector ${\bf x}\in R^{n}$ , let $\mathcal{T}{\bf x}^{m-2}$ be a matrix in $R^{n\times n}$ whose $(i,j)$ -th component is defined by

[TABLE]

In order to propose an algorithm for the solution of $H({\bf x})={\bf 0}$ , we define a merit function as

[TABLE]

We present some properties of the merit function, which can be obtained by [6, Theorem 2.2.4 and Theorem 2.6.6].

Theorem 15.

Let $\mathcal{A}\in S(m,n)$ . Then the merit function $\Psi({\bf x})$ is continuously differentiable with

[TABLE]

for any $Q\in\partial H({\bf x})$ .

We are now in the position to propose a Levenberg-Marquardt-type algorithm to solve the semismooth system of equations $H({\bf x})={\bf 0}$ , which is an extension of the nonsmooth inexact Levenberg-Marquardt-type method in [11]. To ensure global convergence, a line search is performed to minimize the smooth merit function $\Psi$ . Because the problem with data in a structure of tensor is large scale, and the inexact version is more suited to the large-scale case [11], we have the following algorithm.

Algorithm 4.1.

(Inexact Levenberg-Marquardt-type method)**

Step 0.

Given a starting vector ${\bf x}^{0}\in R^{n}$ and some scales $p>2$ , $0<\beta<1/2$ , $\rho>0$ , $\epsilon\geq 0$ . Set $k:=0$ .

Step 1.

If $\|H({\bf x}^{k})\|\leq\epsilon$ , stop. Otherwise, compute $Q^{k}\in\partial H({\bf x}^{k})$ .

Step 2.

Find a solution ${\bf d}^{k}$ satisfying

[TABLE]

where $\mu_{k}\geq 0$ is the Levenberg-Marquardt parameter. If the condition

[TABLE]

is not satisfied, set

[TABLE]

Step 3.

Find the smallest integer $i^{k}\in\{0,1,2,\ldots\}$ such that $t_{k}=2^{-i^{k}}$ and

[TABLE]

Step 4.

Set ${\bf x}^{k+1}={\bf x}^{k}+t_{k}{\bf d}^{k}$ , $k:=k+1$ , and go to Step 1.

In what follows, we analyze the global convergence of Algorithm 1. We shall assume that Algorithm 1 produce an infinite sequence $\{{\bf x}^{k}\}$ . By [11, Theorem 15 and Theorem 16], we immediately obtain the following theorems.

Theorem 16.

Assume that the sequence $\{\mu_{k}\}$ is bounded and that the sequence $\{{\bf r}^{k}\}$ satisfies

[TABLE]

where $\{\alpha_{k}\}$ is a sequence of numbers with $0<\alpha_{k}<1$ and $\alpha_{k}\to 0$ as $k\to\infty$ . Then each accumulation point of $\{{\bf x}^{k}\}$ is a stationary point of $\Psi$ .

Theorem 17.

Let the assumptions of Theorem 16 hold. If one of the accumulation points of $\{{\bf x}^{k}\}$ , denoted ${\bf x}^{*}$ , is an isolated solution of the TAVE (2), then

[TABLE]

In the implementation of Algorithm 4.1, the computational most intensive part is the approximation solution of system (14) with ${\bf r}^{k}={\bf 0}~{}~{}~{}\forall k$ . We note that the system is always solvable. In fact, if $\mu_{k}>0$ , the matrix $(Q^{k})^{T}Q^{k}+\mu_{k}I$ is symmetric positive definite and hence system (14) is surely solvable. If $\mu_{k}=0$ , the matrix $(Q^{k})^{T}Q^{k}+\mu_{k}I$ reduces to $(Q^{k})^{T}Q^{k}$ , which is guaranteed to be only positive semidefinite. However, in this case, (14) reduces to the normal gradient equation $Q^{k}{\bf d}=-H({\bf x}^{k})$ , is therefore solvable. We now have to specify which element $Q^{k}\in\partial H({\bf x}^{k})$ we select at the $k$ -th iteration. By Theorem 14, we have that an element of $\partial H({\bf x}^{k})$ can be obtained in the following way. Let

[TABLE]

be the set of “degenerate indices” and define ${\bf z}\in R^{n}$ to be a vector whose components $z_{i}$ are $1$ if $i\in\Lambda$ and [math] otherwise. Then, the matrix $Q^{k}$ defined by

[TABLE]

where $A$ and $B$ are $n\times n$ diagonal matrices whose $i$ -th diagonal elements are given, respectively, by

[TABLE]

and by

[TABLE]

belongs to $\partial H({\bf x}^{k})$ . In the next section, we compute $Q^{k}$ as the formulation.

5 Numerical results

In this section, we present the numerical performance of Algorithm 4.1 for the TAVE (2). All codes were written by using Matlab Version R2015b and Tensor Toolbox Version 2.6 [1]. The numerical experiments were done on a laptop with an Intel Core i7-4720HQ CPU (2.6GHz) and RAM of 7.89GB.

In the implementation of Algorithm 4.1, we set $\varepsilon=10^{-6},\rho=10^{-10},p=2.1,\beta=10^{-4}$ and the Levenberg-Marquardt parameter $\mu_{k}=0.3\,\,\forall k\in N$ . We also set a maximum iteration steps for the algorithm, $i.e.$ , $N_{max}=300$ .

The first numerical experiment focuses on the behaviour of algorithm’s iteration. We generate a random symmetric nonnegative tensor $\mathcal{A}\in S_{6,8}$ and a random vector ${\bf x}^{*}\in R^{8}$ . All entries of $\mathcal{A}$ and ${\bf x}^{*}$ are uniform random numbers in the interval $[0,1]$ . We calculate ${\bf b}=\mathcal{A}{{\bf x}^{*}}^{m-1}-|{\bf x}^{*}|^{m-1}$ in order to make TAVE have at least one solution. Then we use Algorithm 4.1 to solve TAVE: $\mathcal{A}{\bf x}^{m-1}-|{\bf x}|^{m-1}={\bf b}$ , with a random initial point chosen randomly from $[0,1]^{8}$ which is shown as $x^{0}$ in table 1. The iteration of Algorithm 4.1 is shown in Table 1. From the table, $\|H({\bf x}^{k})\|$ tends to [math] as the number of iteration $k$ increases. And $\|\nabla\Psi({\bf x}^{k})\|$ also tends to 0 except that it increases from ${\bf k}=2$ to ${\bf k}=4$ . This shows that $\|\nabla\Psi({\bf x}^{k})\|$ does converge to 0 but not converge monotonically when the algorithm converges.

The second numerical experiment aims to verify Theorem 11. We first generate a random symmetric nonnegative tensor $\mathcal{C}\in S(4,10)$ and a random vector ${\bf z}^{*}=(0.1040,0.7455,0.7363,0.5619,0.1842,0.5972,0.2999,0.1341,0.2126,$ $0.8949)^{\mathrm{T}}\in R^{10}$ . All entries of $\mathcal{C}$ are uniform random numbers in the interval $[0,1]$ . Let ${\bf b}=(\mathcal{C}-\mathcal{I}){{\bf z}^{*}}^{m-1}$ . Since $\mathcal{D}\in S(4,10)$ is a diagonal tensor whose diagonal elements are $1$ or $-1$ , there are at most $2^{10}=1024$ different $\mathcal{D}$ . The first attempt is to generate all these $1024$ tensors. For each tensor $\mathcal{D}_{k}$ , set $\mathcal{A}_{k}=\mathcal{C}\mathcal{D}_{k}$ (see Definition 9) and ${\bf x_{k}}=(\mathcal{D}_{k}{\bf z^{*}}^{m-1})^{\frac{1}{m-1}}$ . We check whether $\mathcal{A}_{k}{\bf x_{k}}^{m-1}-|{\bf x_{k}}|^{m-1}$ is equals to ${\bf b}$ for all $k\in\{1,2,...,1024\}$ . The result shows that each ${\bf x_{k}}$ is just one of the solution to the corresponding TAVE problem $\mathcal{A}_{k}{\bf x}^{m-1}-|{\bf x}|^{m-1}={\bf b}_{k}$ .

The second attempt of the second numerical experiment is to generate five $\mathcal{D}_{k}$ of all $1024$ tensors randomly and use Algortihm 4.1 to solve the corresponding TAVE. The diagonal elements of the five $\mathcal{D}_{k}$ is shown in Table 2.

We first select the initial points for Algorithm 4.1 by using normal distribution, i.e., entries are from standardized normal distribution $N(0,1)$ independently. Here we call these initial points type-I initial points. The results of corresponding TAVE with type-I initial points is summarized in Table 3. We can easily find out that none of the five ${\bf x_{k}}$ is in the form of $(\mathcal{D}_{k}{\bf z^{*}}^{m-1})^{\frac{1}{m-1}}$ . Because Algorithm 4.1 is based on the thoughts of Newton method, thus its convergence relies heavily on the initial point. In order to detect solution which is mentioned in Theorem 11 by Algorithm 4.1, we should choose the initial points in another way. For each $\mathcal{D}_{k}$ , we generate type-II initial points by adding a random number chosen from uniform distribution over $(-0.3,0.3)$ to $(\mathcal{D}_{k}{\bf z^{*}}^{m-1})^{\frac{1}{m-1}}$ . The results of corresponding TAVE with type-II initial points is shown in Table 4. The solutions are exactly in the form of $(\mathcal{D}_{k}{\bf z^{*}}^{m-1})^{\frac{1}{m-1}}$ .

In Tables 3 and 4, ${\bf k}$ denotes the experiment No. corresponding to Table 2. ${\bf x_{k}}$ denotes the solution vectors returned by Algorithm 4.1. $\|H({\bf x_{k}})\|$ denotes the Euclid norm of $H({\bf x_{k}})$ . If the norm of $H({\bf x_{k}})$ is small enough, we can regard ${\bf x_{k}}$ as an approximate solution of TAVE. Iter. denotes the number of iteration and Time denotes the time of iteration that finds corresponding $x$ by Algorithm 4.1. In the second experiment, we verify Theorem 11 from the instant correctly. Besides, from Table 3 and 4, we find that under the conditions of Theorem 11, the solution $(\mathcal{D}{\bf z^{*}}^{m-1})^{\frac{1}{m-1}}$ may not be the only solution of TAVE $\mathcal{A}{\bf x}^{m-1}-|{\bf x}|^{m-1}={\bf b}$ . There might be some other solutions, such as the solution in Table 3. To discuss the uniqueness of the positive solution, we conduct our third experiment.

Our third numerical experiment focuses on Theorem 7. Here we first generate a random symmetric nonnegative tensor $\mathcal{B}$ whose entries are uniform random numbers in the interval $[0,1]$ . Let $c=1+(1+0.01)\max_{1\leq i\leq n}{(\mathcal{B}e^{3})_{i}}$ , where $e=(1,1,1,1)^{\mathrm{T}}$ . Since $\max_{1\leq i\leq n}{(\mathcal{B}e^{3})_{i}}\geq\rho(\mathcal{B})$ , the choice of $c$ makes sure that $c>\rho(\mathcal{B})+1$ . Then let $\mathcal{A}=c\mathcal{I}-\mathcal{B}$ , and $\mathcal{A}$ satisfies the conditions of Theorem 7, i.e., $\mathcal{A}-\mathcal{I}$ is strong M-tensor. Tensors $\mathcal{B}$ and $\mathcal{A}$ are given in Tables 5 and 6, respectively.

We choose $10$ random positive vectors ${\bf b}_{k}\in R_{+}^{4},k=1,\ldots,10$ . For each ${\bf b}_{k}$ , we find $20$ repeatable solutions of TAVE: $\mathcal{A}{\bf x}^{m-1}-|{\bf x}|^{m-1}={\bf b}_{k}$ with random vectors from $N(0,1)^{4}$ as initial points repeatedly and summarize the results in Table 7.

In Table 7, ${\bf x}$ denotes the solution of TAVE. ${\bf b}$ denotes random generated ${\bf b}_{k}$ mentioned above. Iter. denotes the average number of iteration that finds the corresponding solution successfully. Time denotes the average time of iteration that finds the corresponding solution by Algorithm

$\max{\|H({\bf x})\|}$ is the maximum norm of all $H({\bf x})$ returned by Algorithm 1 whose ${\bf x}$ is the corresponding solution. Attempts has the form N/T, N denotes the number of the corresponding ${\bf x}$ found by Algorithm 4.1 and T denotes the number of initial points in all.

In this experiment, we use “while” loop in Matlab program to guarantee that we can get exact $20$ solutions (might be repeatable) for each $\bf b_{k}\geq 0$ . According to Table 7, for each $\bf b_{k}\geq 0$ , Algorithm 4.1 only returns unique positive solution in all $20$ repeatable solutions. This phenomenon fits Theorem 7 very well. Besides, in order to get $20$ valid solutions, the initial points we attempt is $10$ times more than the valid ones. This means that most of the random initial points fail to find a solution by Algorithm 4.1. The reason might be that the convergence of Newton type method depends on the initial point badly. Theorem 7 shows that under this circumstances, there’s only one unique positive solution of TAVE. If and only if the initial point is in the convergence region of some solution of TAVE, the algorithm will converge. Therefore, it’s harder to find valid solutions if $\bf b\geq 0$ . Table 8 shows the solutions found by Algorithm 4.1 when ${\bf b}=(-1,1,1,1)$ . The initial points attempted in all is much less.

Moreover, under the circumstances that $\bf b\geq 0$ and $\mathcal{A}-\mathcal{I}$ is strong M-tensor, whether the unique positive solution of TAVE is the unique solution of TAVE remains a question. In our experiment we haven’t found other solutions except for the unique positive ones.

6 Conclusion

We have introduced tensor absolute value equations. The simple definition is a natural generalization of the definition of absolute value equations in the matrix case. We have established some basic properties for tensor absolute value equations and we reformulate tensor absolute value equations as a generalized tensor complementarity problem. We have proposed some sufficient conditions for the existence of solution to the multilinear equations. We propose an inexact Levenberg-Marquardt-type method (Algorithm 4.1) to solve the tensor absolute value equations and some numerical results have shown that our algorithm is performing well.

There are some questions which are still in study. For example, we known that “The AVE (3) is uniquely solvable for any ${\bf b}\in R^{n}$ if the singular values of $A$ exceed $1$ ” [19]. Can we extend the conclusion to TVAE (2), i.e., the statement “The TAVE (2) is uniquely solvable for any ${\bf b}\in R^{n}$ if the singular values of tensor $\mathcal{A}$ exceed $1$ ” is correct or not? This is still an open question.

Acknowledgments

Shouqiang Du’s work was supported by the National Natural Science Foundation of China (Grant No. 11671220, 11401331) and the Nature Science Foundation of Shandong Province (ZR2015AQ013, ZR2016AM29). Liping Zhang’s work was supported by the National Natural Science Foundation of China (Grant No. 11271221). Liqun Qi’s work was supported by the Hong Kong Research Grant Council (Grant No. PolyU 501212, 501913, 15302114 and 15300715).

Bibliography30

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] B.W. Bader, T.G. Kolda , et al. , MATLAB Tensor Toolbox Version 2.6 (2012). http://www.sandia.gov/ ∼ similar-to \sim tgkolda/Tensor Toolbox/
2[2] J.F. Bonnans, R. Cominetti, and A. Shapiro , Second order optimality conditions based on parabolic second order tangent sets , SIAM Journal on Optimization, 9 (1999), pp. 466–493.
3[3] K. Chang, L. Qi, and T. Zhang , A survey on the spectral theory of nonnegative tensors , Numerical Linear Algebra with Applications, 20 (2013), pp. 891–912.
4[4] M. Che, L. Qi, and Y. Wei , Positive definite tensors to nonlinear complementarity problems , Journal of Optimization Theory and Applications, 168 (2016), pp. 475–487.
5[5] M. Che and L. Qi , A semismooth Newton method for tensor eigenvalue complementarity problem , Computational Optimization and Applications, 65 (2016), pp. 109–126.
6[6] F.H. Clarke , Optimization and Nonsmooth Analysis , Wiley, New York, 1983.
7[7] R.W. Cottle, J.-S. Pang, and R.E. Stone , The Linear Complementarity Problem , Academic Press, Boston, 1992.
8[8] W. Ding and Y. Wei , Solving multi-linear systems with M 𝑀 M -Tensors , Journal of Scientific Computing, 68 (2016), pp. 683–715.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Tensor absolute value equations

Abstract

keywords:

AMS:

1 Introduction

2 Tensor absolute value equations

Example 2.1**.**

Definition 1**.**

Definition 2**.**

3 Existence of solutions

Lemma 3**.**

Lemma 4**.**

Lemma 5**.**

Theorem 6**.**

Proof.

Theorem 7**.**

Proof.

Theorem 8**.**

Proof.

Definition 9**.**

Proposition 10**.**

Proof.

Theorem 11**.**

Proof.

Example 3.1**.**

4 Reformulation and algorithm

Definition 12**.**

Theorem 13**.**

Proof.

Theorem 14**.**

Theorem 15**.**

Algorithm 4.1**.**

Theorem 16**.**

Theorem 17**.**

5 Numerical results

6 Conclusion

Acknowledgments

Example 2.1.

Definition 1.

Definition 2.

Lemma 3.

Lemma 4.

Lemma 5.

Theorem 6.

Theorem 7.

Theorem 8.

Definition 9.

Proposition 10.

Theorem 11.

Example 3.1.

Definition 12.

Theorem 13.

Theorem 14.

Theorem 15.

Algorithm 4.1.

Theorem 16.

Theorem 17.