On the Equivalence of Semidefinite Relaxations for MIMO Detection with General Constellations

Ya-Feng Liu; Zi Xu; Cheng Lu

arXiv:1902.06381 · cs.IT · February 19, 2019

TL;DR

This paper investigates the theoretical relationship between different semidefinite relaxation methods for MIMO detection, establishing their equivalence and providing insights into their tightness under certain conditions.

Contribution

It proves the equivalence of two distinct SDRs for MIMO detection, enhancing understanding of their theoretical properties and performance.

Findings

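1. Two existing SDRs for MIMO detection, which are derived by different techniques and take quite different forms, are equivalent.

2. As a byproduct, one of the two SDRs is tight under a sufficient condition on the channel matrix, the noise, and the constellation size.

3. The equivalence reveals redundancy in the higher-dimensional SDR, so the lower-dimensional one can be solved more efficiently.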

Abstract

The multiple-input multiple-output (MIMO) detection problem is a fundamental problem in modern digital communications. Semidefinite relaxation (SDR) based algorithms are a popular class of approaches to solving the problem because the algorithms have a polynomial-time worst-case complexity and generally can achieve a good detection error rate performance. In spite of the existence of various different SDRs for the MIMO detection problem in the literature, very little is known about the relationship between these SDRs. This paper aims to fill this theoretical gap. In particular, this paper shows that two existing SDRs for the MIMO detection problem, which take quite different forms and are proposed by using different techniques, are equivalent. As a byproduct of the equivalence result, the tightness of one of the above two SDRs under a sufficient condition can be obtained.

Equations

$$\mathbf{r}=\mathbf{H}\mathbf{x}^{\ast}+\bm{\nu},\tag{1}$$

$$x_{i}^{\ast}\in\left\{\exp(\textbf{i}\theta)\mid\theta=\frac{2(j-1)\pi}{M},~j=1,2,\ldots,M\right\},~i=1,2,\ldots,n,$$

$$\begin{array}{cl}\displaystyle\min_{\mathbf{x}\in\mathbb{C}^{n}}&\left\|\mathbf{H}\mathbf{x}-\mathbf{r}\right\|_{2}^{2}\\[3pt] \text{s.t.}&|x_{i}|^{2}=1,~\arg(x_{i})\in\mathcal{A},~i=1,2,\ldots,n,\end{array}\tag{P}$$

$$s_{j}=\cos\left(\frac{2(j-1)\pi}{M}\right)+\textbf{i}\sin\left(\frac{2(j-1)\pi}{M}\right),~j=1,2,\ldots,M;$$

$$\begin{array}{cl}\displaystyle\min_{\mathbf{x},\mathbf{X}}&\mathbf{Q}\bullet\mathbf{X}+2\mathrm{Re}(\mathbf{c}^{\dagger}\mathbf{x})\\[3pt] \text{s.t.}&X_{i,i}=1,~i=1,2,\ldots,n,\\[3pt] &\arg(x_{i})\in\mathcal{A},~i=1,2,\ldots,n,\\[3pt] &\mathbf{X}=\mathbf{x}\mathbf{x}^{\dagger},\end{array}$$

$$\begin{array}{cl}\displaystyle\min_{\mathbf{x},\mathbf{X}}&\mathbf{Q}\bullet\mathbf{X}+2\mathrm{Re}(\mathbf{c}^{\dagger}\mathbf{x})\\[3pt] \text{s.t.}&X_{i,i}=1,~i=1,2,\ldots,n,\\[3pt] &\mathbf{X}\succeq\mathbf{x}\mathbf{x}^{\dagger},\end{array}\tag{$\mathbb{C}$SDR}$$

$$\mathbf{X}\succeq\mathbf{x}\mathbf{x}^{\dagger}\Longleftrightarrow\left[\begin{array}{cc}1&\mathbf{x}^{\dagger}\\ \mathbf{x}&\mathbf{X}\end{array}\right]\succeq\bm{0}.$$

$$\begin{array}{cl}\displaystyle\min_{\mathbf{y},\mathbf{Y}}&\hat{\mathbf{Q}}\bullet\mathbf{Y}+2\hat{\mathbf{c}}^{T}\mathbf{y}\\[3pt] \text{s.t.}&Y_{i,i}+Y_{n+i,n+i}=1,~i=1,2,\ldots,n,\\[5pt] &\mathbf{Y}\succeq\mathbf{y}\mathbf{y}^{T},\end{array}\tag{$\mathbb{R}$SDR}$$

$$\hat{\mathbf{Q}}=\left[\begin{array}{cc}\mathrm{Re}(\mathbf{Q})&-\mathrm{Im}(\mathbf{Q})\\ \mathrm{Im}(\mathbf{Q})&\mathrm{Re}(\mathbf{Q})\end{array}\right],~\hat{\mathbf{c}}=\left[\begin{array}{c}\mathrm{Re}(\mathbf{c})\\ \mathrm{Im}(\mathbf{c})\end{array}\right],~\mathbf{y}=\left[\begin{array}{c}\mathrm{Re}(\mathbf{x})\\ \mathrm{Im}(\mathbf{x})\end{array}\right].\tag{2}$$

$$\mathbf{Y}_{i}=\left[\begin{array}{ccc}1&y_{i}&y_{n+i}\\ y_{i}&Y_{i,i}&Y_{i,n+i}\\ y_{n+i}&Y_{n+i,i}&Y_{n+i,n+i}\end{array}\right],~i=1,2,\ldots,n\tag{3}$$

$$\mathbf{P}_{j}=\left[\begin{array}{c}1\\ \mathrm{Re}(s_{j})\\ \mathrm{Im}(s_{j})\end{array}\right]\left[\begin{array}{ccc}1&\mathrm{Re}(s_{j})&\mathrm{Im}(s_{j})\end{array}\right],~j=1,2,\ldots,M.\tag{4}$$

$$\mathbf{Y}_{i}\in\left\{\mathbf{P}_{1},\mathbf{P}_{2},\ldots,\mathbf{P}_{M}\right\},~i=1,2,\ldots,n.$$

$$\begin{array}{cl}\displaystyle\min_{\mathbf{y},\mathbf{Y},\mathbf{t}}&\hat{\mathbf{Q}}\bullet\mathbf{Y}+2\hat{\mathbf{c}}^{T}\mathbf{y}\\[3pt] \text{s.t.}&\displaystyle\mathbf{Y}_{i}=\sum_{j=1}^{M}t_{i,j}\mathbf{P}_{j},~i=1,2,\ldots,n,\\[3pt] &\mathbf{A}\mathbf{t}=\mathbf{e}_{n},~\mathbf{t}\geq\bm{0},\\[3pt] &\mathbf{Y}\succeq\mathbf{y}\mathbf{y}^{T},\end{array}\tag{E$\mathbb{R}$SDR1}$$

$$\mathbf{S}=\mathbf{I}_{n}\otimes\mathbf{s}^{T},~\mathbf{A}=\mathbf{I}_{n}\otimes\mathbf{e}_{M}^{T}.$$

$$\begin{array}{l}y_{i}=\mathbf{t}_{i}^{T}\mathbf{s}_{R},~y_{n+i}=\mathbf{t}_{i}^{T}\mathbf{s}_{I},~Y_{i,i}=\mathbf{s}_{R}^{T}\text{Diag}(\mathbf{t}_{i})\mathbf{s}_{R},\\[12pt] Y_{n+i,n+i}=\mathbf{s}_{I}^{T}\text{Diag}(\mathbf{t}_{i})\mathbf{s}_{I},~Y_{i,n+i}=\mathbf{s}_{R}^{T}\text{Diag}(\mathbf{t}_{i})\mathbf{s}_{I}.\end{array}\tag{5}$$

$$\begin{array}{cl}\displaystyle\min_{\mathbf{t}}&\mathbf{t}^{T}\bar{\mathbf{Q}}\mathbf{t}+2\bar{\mathbf{c}}^{T}\mathbf{t}\\[3pt] \text{s.t.}&\mathbf{A}\mathbf{t}=\mathbf{e}_{n},~\mathbf{t}\geq\bm{0},\\[3pt] &\mathbf{t}\in\{0,1\}^{Mn},\end{array}$$

$$\bar{\mathbf{Q}}=\hat{\mathbf{S}}^{T}\hat{\mathbf{Q}}\hat{\mathbf{S}},~\bar{\mathbf{c}}=\hat{\mathbf{S}}^{T}\hat{\mathbf{c}},~\text{and}~\hat{\mathbf{S}}=\left[\begin{array}{c}\mathrm{Re}(\mathbf{S})\\ \mathrm{Im}(\mathbf{S})\end{array}\right].\tag{6}$$

$$\begin{array}{cl}\displaystyle\min_{\mathbf{t},\mathbf{T}}&\bar{\mathbf{Q}}\bullet\mathbf{T}+2\bar{\mathbf{c}}^{T}\mathbf{t}\\[3pt] \text{s.t.}&\mathbf{A}\mathbf{t}=\mathbf{e}_{n},~\mathbf{t}\geq\bm{0},\\[3pt] &\mathbf{T}\succeq\mathbf{t}\mathbf{t}^{T},\\[3pt] &\mathbf{T}_{i,i}=\text{Diag}(\mathbf{t}_{i}),~i=1,2,\ldots,n,\end{array}\tag{E$\mathbb{R}$SDR2}$$

$$\hat{\mathbf{S}}\mathbf{T}\hat{\mathbf{S}}^{T}=\mathbf{Y}~\text{and}~\hat{\mathbf{S}}\mathbf{t}=\mathbf{y},\tag{7}$$

$$\bm{\eta}_{i}=\left\{\begin{array}{ll}\bm{\Lambda}_{i}^{1/2}\mathbf{U}_{i}^{T}\mathbf{s}_{R},&\text{for}~i=1,2,\ldots,n;\\[3pt] \bm{\Lambda}_{i-n}^{1/2}\mathbf{U}_{i-n}^{T}\mathbf{s}_{I},&\text{for}~i=n+1,n+2,\ldots,2n.\end{array}\right.$$

$$\begin{array}{l}\left\|\bm{\eta}_{i}\right\|^{2}=Y_{i,i}-y_{i}^{2},~i=1,2,\ldots,2n,\\[5pt] \bm{\eta}_{i}^{T}\bm{\eta}_{n+i}=Y_{i,n+i}-y_{i}y_{n+i},~i=1,2,\ldots,n.\end{array}\tag{8}$$

$$\bm{\xi}_{i}^{T}\bm{\xi}_{j}=Y_{i,j}-y_{i}y_{j},~i,j=1,2,\ldots,2n.\tag{9}$$

$$\mathbf{Z}_{i}^{T}\mathbf{Z}_{i}\preceq\mathbf{I}_{M},~\mathbf{Z}_{i}\bm{\eta}_{i}=\bm{\xi}_{i},~\mathbf{Z}_{i}\bm{\eta}_{n+i}=\bm{\xi}_{n+i},~i=1,2,\ldots,n.\tag{10}$$

$$\mathbf{T}_{i,j}=\left\{\begin{array}{ll}\mathbf{t}_{i}\mathbf{t}_{j}^{T}+\mathbf{X}_{i}^{T}(\mathbf{Y}-\mathbf{y}\mathbf{y}^{T})\mathbf{X}_{j},&\text{if}~i\neq j;\\[3pt] \text{Diag}(\mathbf{t}_{i}),&\text{if}~i=j,\end{array}\right.$$

$$\lambda_{\min}\left(\mathbf{H}^{\dagger}\mathbf{H}\right)\sin\left(\frac{\pi}{M}\right)>\left\|\mathbf{H}^{\dagger}\bm{\nu}\right\|_{\infty},\tag{11}$$

Taxonomy

Topics: Advanced Wireless Communication Techniques · Advanced MIMO Systems Optimization · Advanced Wireless Network Optimization

Full text

**Index Terms:** Complex quadratic optimization, equivalent relaxation, MIMO detection, semidefinite relaxation, tight relaxation.

1 Introduction

The MIMO detection problem is a fundamental problem in modern digital communications and has been studied extensively for several decades [1]. Recently, it has received renewed interest due to its potential applications in massive MIMO technology in 5G [1, 2]. The MIMO detection problem is generally modeled as a complex quadratic optimization problem, and various algorithms have been proposed to solve it. One of the most celebrated is the sphere decoder algorithm [3, 4], a special branch-and-bound based enumeration algorithm that is guaranteed to find the globally optimal solution of the problem. However, both the worst-case and the expected complexity of the sphere decoder algorithm are exponential [5, 6]. Motivated by real-time applications, some efficient sub-optimal algorithms have also been proposed. For instance, the zero-forcing detector [7], the minimum mean-squared error detector [8], and the decision feedback detector [9] are all low-complexity sub-optimal algorithms. The performance of these algorithms is generally not good, in the sense that their detection error rates are very high.

In the past two decades, semidefinite relaxation (SDR) detectors have received great attention [10]–[19]. The SDR detector was first proposed for the BPSK constellation [10, 11] and then extended to the QPSK constellation [12, 13]. It has been shown that the SDR detector achieves a considerably lower detection error rate than all of the sub-optimal algorithms mentioned above. Moreover, the SDR detector has a guaranteed polynomial-time worst-case complexity. To understand why the SDR detector performs remarkably well in practice, the approximation ratios of some SDR based algorithms have been studied in [14, 15, 16]. In particular, for the BPSK case, it has been shown in [17] that the SDR based algorithm can achieve the maximum possible diversity order. In addition to the above analysis results, some sufficient conditions under which the SDRs are tight have also been identified in [18, 19].

Besides the BPSK and QPSK cases, SDR based algorithms have also been extended to more general constellations, especially the high-order QAM and $M$-PSK constellations [20, 21]. Various SDR models have been proposed. For example, in [22], the detection problem is first reformulated as a quadratic integer optimization problem, and then some SDRs are designed by exploiting the special structure of this problem. In [23], the number of design variables in the above quadratic integer optimization reformulation is further reduced, and a more compact SDR is proposed. Very recently, an SDR for the general $M$-PSK constellation was proposed in [24]; it enhances the classical complex SDR by adding valid linear cuts to an equivalent real reformulation of the classical complex SDR. Numerical results in [24] show that the enhanced SDR is much tighter than the classical complex SDR.

While various SDRs have been proposed for the MIMO detection problem, with different motivations and/or by using different techniques, very few works study the relationship between these SDRs. To the best of our knowledge, the only work along this line is [25], where some SDRs for the QAM constellation are compared. The goal of this work is to provide a comprehensive comparison of existing SDRs for the MIMO detection problem with a general constellation. Due to the space limitation, we present only one of our main results here (more results will be presented in the journal extension). In particular, we show that the enhanced SDR proposed in [24] and a well-known SDR proposed in [22] are equivalent; see (E$\mathbb{R}$SDR1) and (E$\mathbb{R}$SDR2) further ahead. As a byproduct of this equivalence result, we show the tightness of (E$\mathbb{R}$SDR2) proposed in [22] under a sufficient condition. This tightness result was not known prior to this paper.

We adopt the following standard notation in this paper. We use $\mathbb{C}^{m\times n}$ ($\mathbb{R}^{m\times n}$) and $\mathbb{C}^{m}$ ($\mathbb{R}^{m}$) to denote the sets of $(m\times n)$-dimensional complex (real) matrices and $m$-dimensional complex (real) vectors, respectively. We use $(\cdot)^{T}$ and $(\cdot)^{\dagger}$ to denote the transpose and Hermitian transpose of a matrix/vector, respectively. We use $\mathrm{Re}(\cdot)$ and $\mathrm{Im}(\cdot)$ to denote the element-wise real and imaginary parts of a complex matrix/vector/number, respectively. We use $\|\cdot\|_{2}$ and $\|\cdot\|_{\infty}$ to denote the $2$-norm and $\infty$-norm of a matrix/vector. The notations $\mathbf{e},$ $\bm{0},$ and $\mathbf{I}$ represent the all-one vector, the all-zero matrix/vector, and the identity matrix of appropriate sizes, respectively. For a given complex number $x,$ $\arg(x)$ denotes its argument. For a given vector $\mathbf{t},$ $\text{Diag}(\mathbf{t})$ denotes the diagonal matrix whose diagonal entries are the entries of $\mathbf{t}.$ Finally, for two given matrices $\mathbf{A}$ and $\mathbf{B}$ (of appropriate sizes), $\mathbf{A}\succeq\bm{0}$ means that $\mathbf{A}$ is a Hermitian positive semidefinite (PSD) matrix, and $\mathbf{A}\succeq\mathbf{B}$ means that $\mathbf{A}-\mathbf{B}\succeq\bm{0}$; $\mathbf{A}\bullet\mathbf{B}$ denotes the trace of their product $\mathbf{A}\mathbf{B}$, i.e., $\sum_{i}\sum_{j}A_{i,j}B_{j,i}$; and $\mathbf{A}\otimes\mathbf{B}$ denotes their Kronecker product.

2 MIMO Detection Problem Formulation

Consider a complex-valued MIMO channel model

$$\mathbf{r}=\mathbf{H}\mathbf{x}^{\ast}+\bm{\nu},\tag{1}$$

where $\mathbf{r}\in\mathbb{C}^{m}$ is the vector of received signals, $\mathbf{H}\in\mathbb{C}^{m\times n}$ is an $m\times n$ complex channel matrix (for $n$ inputs and $m$ outputs with $m\geq n$), $\mathbf{x}^{\ast}\in\mathbb{C}^{n}$ is the vector of transmitted symbols, and $\bm{\nu}\in\mathbb{C}^{m}$ is additive white circularly symmetric Gaussian noise with zero mean. Throughout the paper, we assume that the $M$-PSK modulation scheme with $M\geq 2$ is adopted (the main results in this paper can also be extended to the QAM case). Then, each entry $x_{i}^{\ast}$ of $\mathbf{x}^{\ast}$ belongs to a finite set of symbols

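$$x_{i}^{\ast}\in\left\{\exp(\textbf{i}\theta)\mid\theta=\frac{2(j-1)\pi}{M},~j=1,2,\ldots,M\right\},~i=1,2,\ldots,n,$$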

where i is the imaginary unit (which satisfies $\textbf{i}^{2}=-1$ ). The MIMO detection problem is to recover the vector of transmitted symbols $\mathbf{x}^{\ast}$ from the vector of received signals $\mathbf{r}$ based on the knowledge of the channel matrix $\mathbf{H}$ . The mathematical formulation of the problem is

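$$\begin{array}{cl}\displaystyle\min_{\mathbf{x}\in\mathbb{C}^{n}}&\left\|\mathbf{H}\mathbf{x}-\mathbf{r}\right\|_{2}^{2}\\[3pt] \text{s.t.}&|x_{i}|^{2}=1,~\arg(x_{i})\in\mathcal{A},~i=1,2,\ldots,n,\end{array}\tag{P}$$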

where $\mathcal{A}=\left\{0,2\pi/M,\ldots,2(M-1)\pi/M\right\}.$
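To make problem (P) concrete, the following minimal sketch builds the $M$-PSK symbol set and solves a tiny instance by exhaustive enumeration. It is only an illustration: the function names, the toy sizes, and the noiseless setup are assumptions made here, and enumeration over $M^{n}$ candidates is feasible only for very small $n$.

```python
import cmath
import itertools
import random

def psk_constellation(M):
    """Return the M-PSK symbols exp(i*2*(j-1)*pi/M), j = 1, ..., M."""
    return [cmath.exp(2j * cmath.pi * k / M) for k in range(M)]

def residual_norm_sq(H, x, r):
    """Compute ||H x - r||_2^2 for H given as a list of rows."""
    total = 0.0
    for row, r_k in zip(H, r):
        diff = sum(h * x_i for h, x_i in zip(row, x)) - r_k
        total += abs(diff) ** 2
    return total

def brute_force_detect(H, r, M):
    """Solve (P) by enumerating all M**n candidate symbol vectors."""
    n = len(H[0])
    symbols = psk_constellation(M)
    return min(itertools.product(symbols, repeat=n),
               key=lambda x: residual_norm_sq(H, x, r))

# Hypothetical toy instance: m = n = 3, 4-PSK, no noise.
rng = random.Random(0)
m, n, M = 3, 3, 4
H = [[complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(n)]
     for _ in range(m)]
x_true = [rng.choice(psk_constellation(M)) for _ in range(n)]
r = [sum(h * x for h, x in zip(row, x_true)) for row in H]
x_hat = brute_force_detect(H, r, M)
```

In this noiseless toy case the enumeration recovers the transmitted vector; the exponential number of candidates is what motivates the relaxations reviewed next.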

3 Review of Some Existing SDRs for (P)

The MIMO detection problem (P) is NP-hard [5]. Therefore, there is no polynomial-time algorithm that can solve it to global optimality in general (unless P=NP). In the last two decades, SDR based algorithms have been widely studied in the signal processing and wireless communication communities [26, 27] and, in particular, have been designed for solving problem (P). These algorithms not only enjoy a polynomial-time worst-case complexity but also generally achieve a very good detection error rate performance. In this section, we briefly review some existing SDRs for problem (P).

For notational simplicity, let $\mathbf{Q}=\mathbf{H}^{{\dagger}}\mathbf{H}$ and $\mathbf{c}=-\mathbf{H}^{{\dagger}}\mathbf{r};$ let $\mathbf{s}=[s_{1},s_{2},\ldots,s_{M}]^{T}\in\mathbb{C}^{M}$ be the vector of all constellation symbols, where

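$$s_{j}=\cos\left(\frac{2(j-1)\pi}{M}\right)+\textbf{i}\sin\left(\frac{2(j-1)\pi}{M}\right),~j=1,2,\ldots,M;$$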

and finally let $\mathbf{s}_{R}=\mathrm{Re}(\mathbf{s})$ and $\mathbf{s}_{I}=\mathrm{Im}(\mathbf{s}).$

By introducing an $n\times n$ complex matrix $\mathbf{X}=\mathbf{x}\mathbf{x}^{\dagger}$ , problem (P) can be equivalently reformulated as

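$$\begin{array}{cl}\displaystyle\min_{\mathbf{x},\mathbf{X}}&\mathbf{Q}\bullet\mathbf{X}+2\mathrm{Re}(\mathbf{c}^{\dagger}\mathbf{x})\\[3pt] \text{s.t.}&X_{i,i}=1,~i=1,2,\ldots,n,\\[3pt] &\arg(x_{i})\in\mathcal{A},~i=1,2,\ldots,n,\\[3pt] &\mathbf{X}=\mathbf{x}\mathbf{x}^{\dagger},\end{array}$$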

where the variables are $\mathbf{x}\in\mathbb{C}^{n}$ and $\mathbf{X}\in\mathbb{C}^{n\times n}$, and $X_{i,i}$ is the $i$-th diagonal entry of $\mathbf{X}.$ A straightforward (but loose) SDR of problem (P) is

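$$\begin{array}{cl}\displaystyle\min_{\mathbf{x},\mathbf{X}}&\mathbf{Q}\bullet\mathbf{X}+2\mathrm{Re}(\mathbf{c}^{\dagger}\mathbf{x})\\[3pt] \text{s.t.}&X_{i,i}=1,~i=1,2,\ldots,n,\\[3pt] &\mathbf{X}\succeq\mathbf{x}\mathbf{x}^{\dagger},\end{array}\tag{$\mathbb{C}$SDR}$$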

which drops the argument constraints $\arg\left(x_{i}\right)\in\mathcal{A}$ for all $i=1,2,\ldots,n$ and relaxes the nonconvex constraint $\mathbf{X}=\mathbf{x}\mathbf{x}^{\dagger}$ to

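$$\mathbf{X}\succeq\mathbf{x}\mathbf{x}^{\dagger}\Longleftrightarrow\left[\begin{array}{cc}1&\mathbf{x}^{\dagger}\\ \mathbf{x}&\mathbf{X}\end{array}\right]\succeq\bm{0}.$$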

It has been shown in [24] that ( $\mathbb{C}$ SDR) is equivalent to the following real SDR

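$$\begin{array}{cl}\displaystyle\min_{\mathbf{y},\mathbf{Y}}&\hat{\mathbf{Q}}\bullet\mathbf{Y}+2\hat{\mathbf{c}}^{T}\mathbf{y}\\[3pt] \text{s.t.}&Y_{i,i}+Y_{n+i,n+i}=1,~i=1,2,\ldots,n,\\[5pt] &\mathbf{Y}\succeq\mathbf{y}\mathbf{y}^{T},\end{array}\tag{$\mathbb{R}$SDR}$$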

where the variables $\mathbf{y}\in\mathbb{R}^{2n}$ and $\mathbf{Y}\in\mathbb{R}^{2n\times 2n}$ and

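$$\hat{\mathbf{Q}}=\left[\begin{array}{cc}\mathrm{Re}(\mathbf{Q})&-\mathrm{Im}(\mathbf{Q})\\ \mathrm{Im}(\mathbf{Q})&\mathrm{Re}(\mathbf{Q})\end{array}\right],~\hat{\mathbf{c}}=\left[\begin{array}{c}\mathrm{Re}(\mathbf{c})\\ \mathrm{Im}(\mathbf{c})\end{array}\right],~\mathbf{y}=\left[\begin{array}{c}\mathrm{Re}(\mathbf{x})\\ \mathrm{Im}(\mathbf{x})\end{array}\right].\tag{2}$$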
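The real reformulation can be sanity-checked numerically. The sketch below assumes the block convention $\hat{\mathbf{Q}}=[\mathrm{Re}(\mathbf{Q}),-\mathrm{Im}(\mathbf{Q});\,\mathrm{Im}(\mathbf{Q}),\mathrm{Re}(\mathbf{Q})]$ with $\hat{\mathbf{c}}$ and $\mathbf{y}$ stacking real over imaginary parts, and verifies on a random instance (sizes and seed are arbitrary choices made here) that the real objective matches the complex one and the least-squares objective of (P) up to the constant $\|\mathbf{r}\|_{2}^{2}$.

```python
import numpy as np

rng = np.random.default_rng(0)
m, n = 4, 3  # hypothetical sizes
H = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))
r = rng.standard_normal(m) + 1j * rng.standard_normal(m)
x = np.exp(1j * rng.uniform(0, 2 * np.pi, n))  # any unit-modulus vector

Q = H.conj().T @ H          # Q = H^dagger H
c = -H.conj().T @ r         # c = -H^dagger r

# Real counterparts (assumed block convention, see the lead-in).
Q_hat = np.block([[Q.real, -Q.imag], [Q.imag, Q.real]])
c_hat = np.concatenate([c.real, c.imag])
y = np.concatenate([x.real, x.imag])

complex_obj = (x.conj() @ Q @ x).real + 2 * (c.conj() @ x).real
real_obj = y @ Q_hat @ y + 2 * c_hat @ y
least_squares = np.linalg.norm(H @ x - r) ** 2 - np.linalg.norm(r) ** 2
```

All three quantities agree up to floating-point error, which is why the real and complex relaxations share the same objective value at corresponding points.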

Based on ( $\mathbb{R}$ SDR), an enhanced SDR for (P) has recently been proposed in [24]. Define the following $3\times 3$ matrices

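$$\mathbf{Y}_{i}=\left[\begin{array}{ccc}1&y_{i}&y_{n+i}\\ y_{i}&Y_{i,i}&Y_{i,n+i}\\ y_{n+i}&Y_{n+i,i}&Y_{n+i,n+i}\end{array}\right],~i=1,2,\ldots,n\tag{3}$$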

and

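$$\mathbf{P}_{j}=\left[\begin{array}{c}1\\ \mathrm{Re}(s_{j})\\ \mathrm{Im}(s_{j})\end{array}\right]\left[\begin{array}{ccc}1&\mathrm{Re}(s_{j})&\mathrm{Im}(s_{j})\end{array}\right],~j=1,2,\ldots,M.\tag{4}$$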

By the definition of $\mathbf{y}$ in (2), ideally each $\mathbf{Y}_{i}$ in (3) must be one of the matrices $\mathbf{P}_{j}$ with $j=1,2,\ldots,M,$ i.e.,

$$\mathbf{Y}_{i}\in\left\{\mathbf{P}_{1},\mathbf{P}_{2},\ldots,\mathbf{P}_{M}\right\},~i=1,2,\ldots,n.$$

By relaxing the above combinatorial constraints and dropping some redundant constraints, reference [24] proposes the following enhanced SDR for (P):

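$$\begin{array}{cl}\displaystyle\min_{\mathbf{y},\mathbf{Y},\mathbf{t}}&\hat{\mathbf{Q}}\bullet\mathbf{Y}+2\hat{\mathbf{c}}^{T}\mathbf{y}\\[3pt] \text{s.t.}&\displaystyle\mathbf{Y}_{i}=\sum_{j=1}^{M}t_{i,j}\mathbf{P}_{j},~i=1,2,\ldots,n,\\[3pt] &\mathbf{A}\mathbf{t}=\mathbf{e}_{n},~\mathbf{t}\geq\bm{0},\\[3pt] &\mathbf{Y}\succeq\mathbf{y}\mathbf{y}^{T},\end{array}\tag{E$\mathbb{R}$SDR1}$$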

where the variables $\mathbf{y}\in\mathbb{R}^{2n},$ $\mathbf{Y}\in\mathbb{R}^{2n\times 2n},$ $\mathbf{t}\in\mathbb{R}^{Mn},$ $\hat{\mathbf{Q}}$ and $\hat{\mathbf{c}}$ are defined in (2), ${\mathbf{Y}}_{i}$ is defined in (3), $\mathbf{P}_{j}$ is defined in (4), and

$$\mathbf{S}=\mathbf{I}_{n}\otimes\mathbf{s}^{T},~\mathbf{A}=\mathbf{I}_{n}\otimes\mathbf{e}_{M}^{T}.$$

In (E $\mathbb{R}$ SDR1), $\mathbf{t}=[\mathbf{t}_{1}^{{{T}}},\mathbf{t}_{2}^{{{T}}},\ldots,\mathbf{t}_{n}^{{{T}}}]^{T}$ and $\mathbf{t}_{i}=[t_{i,1},t_{i,2},\ldots,t_{i,M}]^{{{T}}}\in\mathbb{R}^{M}.$ Due to the symmetry of ${\mathbf{Y}}_{i},$ the constraint ${\mathbf{Y}}_{i}=\sum_{j=1}^{M}t_{i,j}\mathbf{P}_{j}$ can be explicitly expressed as the following $5$ linear constraints:

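$$\begin{array}{l}y_{i}=\mathbf{t}_{i}^{T}\mathbf{s}_{R},~y_{n+i}=\mathbf{t}_{i}^{T}\mathbf{s}_{I},~Y_{i,i}=\mathbf{s}_{R}^{T}\text{Diag}(\mathbf{t}_{i})\mathbf{s}_{R},\\[12pt] Y_{n+i,n+i}=\mathbf{s}_{I}^{T}\text{Diag}(\mathbf{t}_{i})\mathbf{s}_{I},~Y_{i,n+i}=\mathbf{s}_{R}^{T}\text{Diag}(\mathbf{t}_{i})\mathbf{s}_{I}.\end{array}\tag{5}$$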
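As a quick numerical illustration of how the matrix constraint $\mathbf{Y}_{i}=\sum_{j}t_{i,j}\mathbf{P}_{j}$ unfolds into scalar constraints, the sketch below forms the $3\times 3$ matrices $\mathbf{P}_{j}$ for 8-PSK, mixes them with random convex weights (the Dirichlet sampling is an arbitrary choice made here), and compares the distinct entries of the result with the expressions $\mathbf{t}_{i}^{T}\mathbf{s}_{R}$, $\mathbf{t}_{i}^{T}\mathbf{s}_{I}$, $\mathbf{s}_{R}^{T}\text{Diag}(\mathbf{t}_{i})\mathbf{s}_{R}$, $\mathbf{s}_{I}^{T}\text{Diag}(\mathbf{t}_{i})\mathbf{s}_{I}$, and $\mathbf{s}_{R}^{T}\text{Diag}(\mathbf{t}_{i})\mathbf{s}_{I}$.

```python
import numpy as np

M = 8
angles = 2 * np.pi * np.arange(M) / M
s_R, s_I = np.cos(angles), np.sin(angles)

# P_j = [1, Re(s_j), Im(s_j)]^T [1, Re(s_j), Im(s_j)] for each symbol s_j.
P = [np.outer(v, v) for v in np.column_stack([np.ones(M), s_R, s_I])]

rng = np.random.default_rng(1)
t_i = rng.dirichlet(np.ones(M))  # nonnegative weights summing to one

Y_i = sum(w * P_j for w, P_j in zip(t_i, P))

# Entries predicted by the five scalar linear constraints.
y_re = t_i @ s_R
y_im = t_i @ s_I
Y_rr = s_R @ np.diag(t_i) @ s_R
Y_ii = s_I @ np.diag(t_i) @ s_I
Y_ri = s_R @ np.diag(t_i) @ s_I
```

Because $\cos^{2}+\sin^{2}=1$ and the weights sum to one, the two diagonal entries `Y_rr` and `Y_ii` also add up to one, so the constraint $Y_{i,i}+Y_{n+i,n+i}=1$ of the real SDR is implied.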

Another interesting SDR for problem (P) is proposed in [22] based on the following observation: for each $x_{i}^{\ast}$ of $\mathbf{x}^{\ast},$ there holds $x_{i}^{\ast}=\mathbf{t}_{i}^{{{T}}}\mathbf{s},$ where only one entry of $\mathbf{t}_{i}\in\mathbb{R}^{M}$ is one and all the others are zero. Then, problem (P) is reformulated in [22] as follows:

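$$\begin{array}{cl}\displaystyle\min_{\mathbf{t}}&\mathbf{t}^{T}\bar{\mathbf{Q}}\mathbf{t}+2\bar{\mathbf{c}}^{T}\mathbf{t}\\[3pt] \text{s.t.}&\mathbf{A}\mathbf{t}=\mathbf{e}_{n},~\mathbf{t}\geq\bm{0},\\[3pt] &\mathbf{t}\in\{0,1\}^{Mn},\end{array}$$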

where

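$$\bar{\mathbf{Q}}=\hat{\mathbf{S}}^{T}\hat{\mathbf{Q}}\hat{\mathbf{S}},~\bar{\mathbf{c}}=\hat{\mathbf{S}}^{T}\hat{\mathbf{c}},~\text{and}~\hat{\mathbf{S}}=\left[\begin{array}{c}\mathrm{Re}(\mathbf{S})\\ \mathrm{Im}(\mathbf{S})\end{array}\right].\tag{6}$$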
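The binary reformulation can be checked on a small random instance. The sketch below assumes $\mathbf{S}=\mathbf{I}_{n}\otimes\mathbf{s}^{T}$, $\mathbf{A}=\mathbf{I}_{n}\otimes\mathbf{e}_{M}^{T}$, $\hat{\mathbf{S}}$ stacking the real and imaginary parts of $\mathbf{S}$, and a quadratic objective $\mathbf{t}^{T}\bar{\mathbf{Q}}\mathbf{t}+2\bar{\mathbf{c}}^{T}\mathbf{t}$; the sizes, seed, and variable names are illustrative choices. It confirms that a one-hot $\mathbf{t}$ reproduces the symbol vector via $\mathbf{x}=\mathbf{S}\mathbf{t}$ and the objective of (P) up to the constant $\|\mathbf{r}\|_{2}^{2}$.

```python
import numpy as np

rng = np.random.default_rng(2)
m, n, M = 4, 3, 8  # hypothetical sizes
s = np.exp(2j * np.pi * np.arange(M) / M)
H = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))
r = rng.standard_normal(m) + 1j * rng.standard_normal(m)

S = np.kron(np.eye(n), s.reshape(1, M))     # I_n (x) s^T, size n x Mn
A = np.kron(np.eye(n), np.ones((1, M)))     # I_n (x) e_M^T, size n x Mn
S_hat = np.vstack([S.real, S.imag])         # size 2n x Mn

Q = H.conj().T @ H
c = -H.conj().T @ r
Q_hat = np.block([[Q.real, -Q.imag], [Q.imag, Q.real]])
c_hat = np.concatenate([c.real, c.imag])
Q_bar = S_hat.T @ Q_hat @ S_hat
c_bar = S_hat.T @ c_hat

# One-hot t: entry i of x selects the symbol with index picked[i].
picked = rng.integers(0, M, size=n)
t = np.zeros(M * n)
t[np.arange(n) * M + picked] = 1.0

x = S @ t
binary_obj = t @ Q_bar @ t + 2 * c_bar @ t
ls_obj = np.linalg.norm(H @ x - r) ** 2 - np.linalg.norm(r) ** 2
```

Here `A @ t` equals the all-one vector, `x` coincides with the selected symbols `s[picked]`, and `binary_obj` matches `ls_obj` up to rounding.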

Based on the above reformulation and by exploiting the special structure of the vector $\mathbf{t},$ reference [22] proposes the SDR stated below. (A slight difference between (E$\mathbb{R}$SDR2) presented here and (Model III) in [22] lies in the elimination of one variable in each $\mathbf{t}_{i}$ by using the property that the entries of $\mathbf{t}_{i}$ sum to one for $i=1,2,\ldots,n.$)

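$$\begin{array}{cl}\displaystyle\min_{\mathbf{t},\mathbf{T}}&\bar{\mathbf{Q}}\bullet\mathbf{T}+2\bar{\mathbf{c}}^{T}\mathbf{t}\\[3pt] \text{s.t.}&\mathbf{A}\mathbf{t}=\mathbf{e}_{n},~\mathbf{t}\geq\bm{0},\\[3pt] &\mathbf{T}\succeq\mathbf{t}\mathbf{t}^{T},\\[3pt] &\mathbf{T}_{i,i}=\text{Diag}(\mathbf{t}_{i}),~i=1,2,\ldots,n,\end{array}\tag{E$\mathbb{R}$SDR2}$$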

where the variables are $\mathbf{t}\in\mathbb{R}^{Mn}$ and $\mathbf{T}\in\mathbb{R}^{Mn\times Mn}$, and $\mathbf{T}_{i,i}\in\mathbb{R}^{M\times M}$ denotes the $i$-th diagonal block of the matrix $\mathbf{T}.$ The last constraint $\mathbf{T}_{i,i}=\text{Diag}(\mathbf{t}_{i})$ requires that $\mathbf{T}_{i,i}$ be a diagonal matrix whose diagonal entries are the entries of $\mathbf{t}_{i}.$

4 Main Results

In this section, we present the main result of this paper. We show, somewhat surprisingly, that (E $\mathbb{R}$ SDR1) and (E $\mathbb{R}$ SDR2) are equivalent, although they are derived by using different techniques and due to different motivations and they take quite different forms. The equivalence here means that, for any given feasible point $(\mathbf{T},\mathbf{t})$ of (E $\mathbb{R}$ SDR2), there exists a feasible point $(\mathbf{y},\mathbf{Y},\mathbf{t})$ of (E $\mathbb{R}$ SDR1) such that the two problems have the same objective value at the corresponding points; and for any given feasible point $(\mathbf{y},\mathbf{Y},\mathbf{t})$ of (E $\mathbb{R}$ SDR1), there exists a feasible point $(\mathbf{T},\mathbf{t})$ of (E $\mathbb{R}$ SDR2) such that the two problems also have the same objective value.

Theorem 1

(E $\mathbb{R}$ SDR1) and (E $\mathbb{R}$ SDR2) are equivalent.

Proof: Due to space limitations, we give only a proof outline here. To prove the theorem, it suffices to show that feasible points of (E$\mathbb{R}$SDR1) and (E$\mathbb{R}$SDR2) can be paired so that they satisfy the following relationship

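$$\hat{\mathbf{S}}\mathbf{T}\hat{\mathbf{S}}^{T}=\mathbf{Y}~\text{and}~\hat{\mathbf{S}}\mathbf{t}=\mathbf{y},\tag{7}$$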

where $\hat{\mathbf{S}}$ is given in (6). The conditions in (7) guarantee that the two SDRs have the same objective value. Now, given any feasible point $(\mathbf{T},\mathbf{t})$ of (E $\mathbb{R}$ SDR2), one can easily check that the same $\mathbf{t}$ jointly with $\mathbf{y}$ and $\mathbf{Y}$ given in (7) is a feasible point of (E $\mathbb{R}$ SDR1) and they achieve the same objective value as that of (E $\mathbb{R}$ SDR2) at $(\mathbf{T},\mathbf{t})$ . Next, given any feasible point $(\mathbf{y},\mathbf{Y},\mathbf{t})$ of (E $\mathbb{R}$ SDR1), we shall construct a matrix $\mathbf{T}$ such that it, jointly with the given $\mathbf{t},$ is a feasible point of (E $\mathbb{R}$ SDR2) and the two problems have the same objective value at these two points.

Without loss of generality, suppose that the PSD matrix $\mathbf{Y}-\mathbf{y}\mathbf{y}^{{{T}}}$ is not zero. Let $r\geq 1$ denote the rank of $\mathbf{Y}-\mathbf{y}\mathbf{y}^{{{T}}}.$ Furthermore, suppose $\mathbf{Y}-\mathbf{y}\mathbf{y}^{{{T}}}$ has the following eigenvalue decomposition $\mathbf{Y}-\mathbf{y}\mathbf{y}^{{{T}}}=\mathbf{U}\bm{\Lambda}\mathbf{U}^{{{T}}},$ where $\mathbf{U}\in\mathbb{R}^{2n\times r}$ and $\bm{\Lambda}\succ\bm{0}.$ Similarly, for each $i=1,2,\ldots,n,$ one can easily show that $\text{Diag}(\mathbf{t}_{i})-\mathbf{t}_{i}\mathbf{t}_{i}^{{{T}}}$ is PSD due to the fact that $\mathbf{e}_{M}^{{{T}}}\mathbf{t}_{i}=1$ and $\mathbf{t}_{i}\geq\bm{0}.$ Suppose that $\text{Diag}(\mathbf{t}_{i})-\mathbf{t}_{i}\mathbf{t}_{i}^{{{T}}}=\mathbf{U}_{i}\bm{\Lambda}_{i}\mathbf{U}_{i}^{{{T}}},$ where $\mathbf{U}_{i}\in\mathbb{R}^{M\times M}$ and $\bm{\Lambda}_{i}\succeq\bm{0}.$ Construct the vectors

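$$\bm{\eta}_{i}=\left\{\begin{array}{ll}\bm{\Lambda}_{i}^{1/2}\mathbf{U}_{i}^{T}\mathbf{s}_{R},&\text{for}~i=1,2,\ldots,n;\\[3pt] \bm{\Lambda}_{i-n}^{1/2}\mathbf{U}_{i-n}^{T}\mathbf{s}_{I},&\text{for}~i=n+1,n+2,\ldots,2n.\end{array}\right.$$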

By using (5), one can check that the above $\left\{\bm{\eta}_{i}\in\mathbb{R}^{M\times 1}\right\}$ satisfy

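$$\begin{array}{l}\left\|\bm{\eta}_{i}\right\|^{2}=Y_{i,i}-y_{i}^{2},~i=1,2,\ldots,2n,\\[5pt] \bm{\eta}_{i}^{T}\bm{\eta}_{n+i}=Y_{i,n+i}-y_{i}y_{n+i},~i=1,2,\ldots,n.\end{array}\tag{8}$$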
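The relations in (8) follow from (5) by a short computation, sketched here for convenience. The sketch reads $\bm{\eta}_{n+i}$ as built from the same factors $\bm{\Lambda}_{i}$ and $\mathbf{U}_{i}$ as $\bm{\eta}_{i}$ (an interpretation of the construction, since both vectors belong to block $i$). First, $\text{Diag}(\mathbf{t}_{i})-\mathbf{t}_{i}\mathbf{t}_{i}^{T}$ is PSD because, for any $\mathbf{v}\in\mathbb{R}^{M}$,

$$\mathbf{v}^{T}\left(\text{Diag}(\mathbf{t}_{i})-\mathbf{t}_{i}\mathbf{t}_{i}^{T}\right)\mathbf{v}=\sum_{j=1}^{M}t_{i,j}v_{j}^{2}-\Big(\sum_{j=1}^{M}t_{i,j}v_{j}\Big)^{2}\geq 0,$$

where the inequality is the Cauchy-Schwarz inequality combined with $\mathbf{t}_{i}\geq\bm{0}$ and $\mathbf{e}_{M}^{T}\mathbf{t}_{i}=1.$ Second, for $i=1,2,\ldots,n,$

$$\bm{\eta}_{i}^{T}\bm{\eta}_{n+i}=\mathbf{s}_{R}^{T}\mathbf{U}_{i}\bm{\Lambda}_{i}\mathbf{U}_{i}^{T}\mathbf{s}_{I}=\mathbf{s}_{R}^{T}\text{Diag}(\mathbf{t}_{i})\mathbf{s}_{I}-(\mathbf{t}_{i}^{T}\mathbf{s}_{R})(\mathbf{t}_{i}^{T}\mathbf{s}_{I})=Y_{i,n+i}-y_{i}y_{n+i},$$

and the two norm identities follow in the same way by taking both factors equal to $\mathbf{s}_{R}$ or both equal to $\mathbf{s}_{I}.$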

Suppose $\mathbf{Y}-\mathbf{y}\mathbf{y}^{T}=\left[\begin{array}[]{ccc}\bm{\xi}_{1}&\ldots&\bm{\xi}_{2n}\\ \end{array}\right]^{T}\left[\begin{array}[]{cccc}\bm{\xi}_{1}&\ldots&\bm{\xi}_{2n}\\ \end{array}\right],$ where $\bm{\xi}_{i}\in\mathbb{R}^{r\times 1}$ for all $i.$ Obviously,

$$\bm{\xi}_{i}^{T}\bm{\xi}_{j}=Y_{i,j}-y_{i}y_{j},~i,j=1,2,\ldots,2n.\tag{9}$$

One can show from (8) and (9) that there exist $\left\{\mathbf{Z}_{i}\in\mathbb{R}^{r\times M}\right\}$ such that

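$$\mathbf{Z}_{i}^{T}\mathbf{Z}_{i}\preceq\mathbf{I}_{M},~\mathbf{Z}_{i}\bm{\eta}_{i}=\bm{\xi}_{i},~\mathbf{Z}_{i}\bm{\eta}_{n+i}=\bm{\xi}_{n+i},~i=1,2,\ldots,n.\tag{10}$$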

Now, we can construct the desired matrix $\mathbf{T}\in\mathbb{R}^{nM\times nM}.$ Let the $(i,j)$ -th block of $\mathbf{T}$ be

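$$\mathbf{T}_{i,j}=\left\{\begin{array}{ll}\mathbf{t}_{i}\mathbf{t}_{j}^{T}+\mathbf{X}_{i}^{T}(\mathbf{Y}-\mathbf{y}\mathbf{y}^{T})\mathbf{X}_{j},&\text{if}~i\neq j;\\[3pt] \text{Diag}(\mathbf{t}_{i}),&\text{if}~i=j,\end{array}\right.$$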

where $\mathbf{X}_{i}=\mathbf{U}\bm{\Lambda}^{-1/2}\mathbf{Z}_{i}\bm{\Lambda}_{i}^{1/2}\mathbf{U}_{i}^{{{T}}}\in\mathbb{R}^{2n\times M},~{}i=1,2,\ldots,n$ and $\left\{\mathbf{Z}_{i}\right\}_{i=1}^{n}$ are given in (10). One can check that the above constructed $\mathbf{T}$ and the given $\mathbf{t}$ jointly satisfy all constraints in (E $\mathbb{R}$ SDR2) and equations in (7) (and thus they achieve the same objective value as that of (E $\mathbb{R}$ SDR1) at $(\mathbf{y},\mathbf{Y},\mathbf{t})$ ). This completes the proof. Q.E.D.
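The easy direction of the proof, from (E$\mathbb{R}$SDR2) to (E$\mathbb{R}$SDR1), can be illustrated numerically. The sketch below is not the authors' code: it samples a feasible $(\mathbf{T},\mathbf{t})$ as a convex combination of lifted one-hot vertices (a sampling scheme assumed here for convenience), applies the mapping $\mathbf{y}=\hat{\mathbf{S}}\mathbf{t}$, $\mathbf{Y}=\hat{\mathbf{S}}\mathbf{T}\hat{\mathbf{S}}^{T}$, and checks the linear constraints, the PSD constraint, and the equality of objective values for a random symmetric stand-in for $\hat{\mathbf{Q}}$.

```python
import numpy as np

rng = np.random.default_rng(3)
n, M, K = 3, 4, 6  # hypothetical sizes; K lifted vertices are mixed
s = np.exp(2j * np.pi * np.arange(M) / M)
s_R, s_I = s.real, s.imag
S = np.kron(np.eye(n), s.reshape(1, M))   # S = I_n (x) s^T
S_hat = np.vstack([S.real, S.imag])       # stacked real and imaginary parts

def one_hot_vertex():
    """Return a binary t with exactly one nonzero entry in each block t_i."""
    t = np.zeros(M * n)
    t[np.arange(n) * M + rng.integers(0, M, size=n)] = 1.0
    return t

# Feasible (T, t): convex combination of lifted vertices (t, t t^T).
weights = rng.dirichlet(np.ones(K))
vertices = [one_hot_vertex() for _ in range(K)]
t = sum(w * v for w, v in zip(weights, vertices))
T = sum(w * np.outer(v, v) for w, v in zip(weights, vertices))

# Mapping between the two feasible sets.
y = S_hat @ t
Y = S_hat @ T @ S_hat.T

def satisfies_linear_constraints(y, Y, t, tol=1e-10):
    """Check the five scalar linear constraints for every block i."""
    for i in range(n):
        t_i = t[i * M:(i + 1) * M]
        expected = [t_i @ s_R, t_i @ s_I, s_R @ (t_i * s_R),
                    s_I @ (t_i * s_I), s_R @ (t_i * s_I)]
        actual = [y[i], y[n + i], Y[i, i], Y[n + i, n + i], Y[i, n + i]]
        if not np.allclose(actual, expected, atol=tol):
            return False
    return True

min_eig = np.linalg.eigvalsh(Y - np.outer(y, y)).min()

# Objective values agree for any symmetric Q_hat (random stand-in here).
G = rng.standard_normal((2 * n, 2 * n))
Q_hat, c_hat = G + G.T, rng.standard_normal(2 * n)
obj_sdr1 = np.trace(Q_hat @ Y) + 2 * c_hat @ y
obj_sdr2 = np.trace(S_hat.T @ Q_hat @ S_hat @ T) + 2 * (S_hat.T @ c_hat) @ t
```

The reverse direction requires the explicit construction of $\mathbf{T}$ given in the proof and is not covered by this check.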

Two remarks on Theorem 1 are in order. First, combining Theorem 1 and [24, Theorem 4.4], we can immediately obtain the following tightness result of (E $\mathbb{R}$ SDR2).

Theorem 2

Suppose that $M\geq 2.$ If the inputs $\mathbf{H}$ and $\bm{\nu}$ in (1) satisfy

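$$\lambda_{\min}\left(\mathbf{H}^{\dagger}\mathbf{H}\right)\sin\left(\frac{\pi}{M}\right)>\left\|\mathbf{H}^{\dagger}\bm{\nu}\right\|_{\infty},\tag{11}$$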

where $\lambda_{\min}\left(\mathbf{H}^{{\dagger}}\mathbf{H}\right)$ denotes the smallest eigenvalue of $\mathbf{H}^{{\dagger}}\mathbf{H},$ then (E $\mathbb{R}$ SDR2) is tight for (P).
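For a given instance, the sufficient condition of Theorem 2 is cheap to evaluate. The helper below is a minimal sketch (the function name is hypothetical) that tests whether $\lambda_{\min}(\mathbf{H}^{\dagger}\mathbf{H})\sin(\pi/M)$ exceeds the largest modulus among the entries of $\mathbf{H}^{\dagger}\bm{\nu}$.

```python
import numpy as np

def sdr_tightness_condition(H, nu, M):
    """Return True if lambda_min(H^dagger H) * sin(pi/M) > ||H^dagger nu||_inf."""
    gram = H.conj().T @ H
    lam_min = np.linalg.eigvalsh(gram).min()   # smallest eigenvalue (real)
    rhs = np.max(np.abs(H.conj().T @ nu))      # infinity norm of H^dagger nu
    return bool(lam_min * np.sin(np.pi / M) > rhs)

# Toy example with an identity channel, so lambda_min = 1.
H = np.eye(2, dtype=complex)
small_noise = np.array([0.1 + 0.1j, 0.0])
large_noise = np.array([0.5, 0.5], dtype=complex)
holds_8psk_small = sdr_tightness_condition(H, small_noise, M=8)
holds_8psk_large = sdr_tightness_condition(H, large_noise, M=8)
holds_bpsk_large = sdr_tightness_condition(H, large_noise, M=2)
```

In this toy example the condition holds for the small noise vector with 8-PSK, fails for the larger one, and holds again for the larger one when $M=2$, which reflects the dependence on the constellation size through $\sin(\pi/M).$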

The sufficient condition in (11) is intuitive: it roughly says that problem (P) is an “easy” problem (polynomial-time solvable) if the channel matrix is well conditioned and the number of constellation points and the noise level are below a certain threshold. Second, Theorem 1 reveals that there is some “redundancy” in (E$\mathbb{R}$SDR2). In particular, we can see from (7) that there is a correspondence between the feasible sets of (E$\mathbb{R}$SDR2) and (E$\mathbb{R}$SDR1), and that all information contained in the high-dimensional space $\left(\mathbf{T},\mathbf{t}\right)$ in (E$\mathbb{R}$SDR2) is kept in the low-dimensional space $\left(\mathbf{y},\mathbf{Y},\mathbf{t}\right)$ in (E$\mathbb{R}$SDR1) under the mapping in (7). More specifically, the matrix variable in (E$\mathbb{R}$SDR1) is of dimension $2n\times 2n$, whereas the matrix variable in (E$\mathbb{R}$SDR2) is of dimension $Mn\times Mn.$ For example, with $n=10$ and $M=8$ these are $20\times 20$ and $80\times 80,$ respectively. Hence, (E$\mathbb{R}$SDR1) should be more efficiently solvable than (E$\mathbb{R}$SDR2), especially when $M$ is much larger than $2.$ The equivalence shown in Theorem 1 provides useful insight into possibly reducing the “redundancy” in existing SDRs for more general combinatorial optimization problems and into designing new, computationally more efficient SDRs.

5 Simulation Results

In this section, we present some preliminary simulation results to verify the equivalence between (E$\mathbb{R}$SDR1) and (E$\mathbb{R}$SDR2). In our simulations, we focus on the $8$-PSK constellation with $(m,n)=(10,10)$: all entries of the channel matrix $\mathbf{H}\in\mathbb{C}^{m\times n}$ are generated independently and identically according to the standard complex Gaussian distribution, and all entries of the transmitted symbol vector $\mathbf{x}^{\ast}\in\mathbb{C}^{n}$ are drawn independently and uniformly from the $8$-PSK constellation. We define the SNR as follows:

[TABLE]

where $\sigma_{\mathbf{x}}^{2}=\mathbb{E}[\|\mathbf{x}^{\ast}\|_{2}^{2}],$ $\sigma_{\bm{\nu}}^{2}=\mathbb{E}[\|\bm{\nu}\|_{2}^{2}],$ and $\mathbb{E}[\cdot]$ is the expectation operator. For each SNR value, we randomly generate $100$ problem instances $(\mathbf{H},\mathbf{x}^{\ast},\bm{\nu})$ and the results presented below are obtained by averaging over all generated instances. We use the solver SeDuMi in CVX [28] to solve the two SDRs, i.e., (E $\mathbb{R}$ SDR1) and (E $\mathbb{R}$ SDR2).

Fig. 1 shows the average difference between the optimal objective values of (E$\mathbb{R}$SDR1) and (E$\mathbb{R}$SDR2), as well as the average residuals of the first and second equations in (7) at the optimal solutions, i.e., $\|\hat{\mathbf{S}}\mathbf{T}\hat{\mathbf{S}}^{T}-\mathbf{Y}\|_{2}$ and $\|\hat{\mathbf{S}}\mathbf{t}-\mathbf{y}\|_{2}$, versus the SNR. As can be observed from Fig. 1, the difference under all three measures is very small (on the order of $10^{-4}$) over the whole range of tested SNRs, which numerically corroborates that (E$\mathbb{R}$SDR1) and (E$\mathbb{R}$SDR2) are equivalent. Fig. 2 shows the average CPU time taken to solve (E$\mathbb{R}$SDR1) and (E$\mathbb{R}$SDR2) versus the SNR. We can see clearly from Fig. 2 that solving (E$\mathbb{R}$SDR1) is much more efficient than solving (E$\mathbb{R}$SDR2). The time difference between solving the two SDRs is expected to grow as the dimension of the problem (especially the number of constellation points) increases. All of the above simulation results are consistent with our analysis.

Bibliography

[1] S. Yang and L. Hanzo, “Fifty years of MIMO detection: The road to large-scale MIMOs,” IEEE Commun. Surveys Tuts., vol. 17, no. 4, pp. 1941–1988, 2015.
[2] H. Liu, M.-C. Yue, A. M.-C. So, and W.-K. Ma, “A discrete first-order method for large-scale MIMO detection with provable guarantees,” in Proc. IEEE Int. Workshop Signal Process. Adv. Wireless Commun. (SPAWC), Jul. 2017, pp. 669–673.
[3] M. O. Damen, H. E. Gamal, and G. Caire, “On maximum-likelihood detection and the search for the closest lattice point,” IEEE Trans. Inf. Theory, vol. 49, no. 10, pp. 2389–2402, Oct. 2003.
[4] A. D. Murugan, H. E. Gamal, M. O. Damen, and G. Caire, “A unified framework for tree search decoding: Rediscovering the sequential decoder,” IEEE Trans. Inf. Theory, vol. 52, no. 3, pp. 933–953, Mar. 2006.
[5] S. Verdú, “Computational complexity of optimum multiuser detection,” Algorithmica, vol. 4, no. 1–4, pp. 303–312, Jun. 1989.
[6] J. Jaldén and B. Ottersten, “On the complexity of sphere decoding in digital communications,” IEEE Trans. Signal Process., vol. 53, no. 4, pp. 1474–1484, Apr. 2005.
[7] K. S. Schneider, “Optimum detection of code division multiplexed signals,” IEEE Trans. Aerosp. Electron. Syst., vol. AES-15, no. 1, pp. 181–185, Jan. 1979.
[8] M. Honig, U. Madhow, and S. Verdú, “Blind adaptive multiuser detection,” IEEE Trans. Inf. Theory, vol. 41, no. 4, pp. 944–960, Jul. 1995.