On the Equivalence of Semidifinite Relaxations for MIMO Detection with General Constellations
Ya-Feng Liu, Zi Xu, Cheng Lu

TL;DR
This paper investigates the theoretical relationship between different semidefinite relaxation methods for MIMO detection, establishing their equivalence and providing insights into their tightness under certain conditions.
Contribution
It proves the equivalence of two distinct SDRs for MIMO detection, enhancing understanding of their theoretical properties and performance.
Findings
Two SDRs for MIMO detection are equivalent.
The tightness of one SDR can be guaranteed under a specific condition.
Theoretical insight into the relationship between different relaxation techniques.
Abstract
The multiple-input multiple-output (MIMO) detection problem is a fundamental problem in modern digital communications. Semidefinite relaxation (SDR) based algorithms are a popular class of approaches to solving the problem because the algorithms have a polynomial-time worst-case complexity and generally can achieve a good detection error rate performance. In spite of the existence of various different SDRs for the MIMO detection problem in the literature, very little is known about the relationship between these SDRs. This paper aims to fill this theoretical gap. In particular, this paper shows that two existing SDRs for the MIMO detection problem, which take quite different forms and are proposed by using different techniques, are equivalent. As a byproduct of the equivalence result, the tightness of one of the above two SDRs under a sufficient condition can be obtained.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Wireless Communication Techniques · Advanced MIMO Systems Optimization · Advanced Wireless Network Optimization
On the Equivalence of Semidifinite Relaxations for MIMO Detection with General Constellations
Abstract
The multiple-input multiple-output (MIMO) detection problem is a fundamental problem in modern digital communications. Semidefinite relaxation (SDR) based algorithms are a popular class of approaches to solving the problem because the algorithms have a polynomial-time worst-case complexity and generally can achieve a good detection error rate performance. In spite of the existence of various different SDRs for the MIMO detection problem in the literature, very little is known about the relationship between these SDRs. This paper aims to fill this theoretical gap. In particular, this paper shows that two existing SDRs for the MIMO detection problem, which take quite different forms and are proposed by using different techniques, are equivalent. As a byproduct of the equivalence result, the tightness of one of the above two SDRs under a sufficient condition can be obtained.
**Index Terms— ** Complex quadratic optimization, equivalent relaxation, MIMO detection, semidefinite relaxation, tight relaxation.
1 Introduction
The MIMO detection problem is a fundamental problem in modern digital communications, which has been extensively studied for several decades [1]. Recently, it has received renewed interest, due to its potential applications in massive MIMO technology in 5G [1, 2]. The MIMO detection problem is generally modeled as a complex quadratic optimization problem. Various algorithms have been proposed to solve the problem. One of the most celebrated algorithms is the sphere decoder algorithm [3, 4]. The sphere decoder algorithm is a special branch-and-bound based enumeration algorithm, which is guaranteed to find the globally optimal solution of the problem. However, the worst-case and expected complexity of the sphere decoder algorithm are exponential [5, 6]. Motivated by some real-time applications, some efficient sub-optimal algorithms have also been proposed. For instance, the zero-forcing detector algorithm [7], the minimum mean-squared error detector algorithm [8], and the decision feedback detector algorithm [9], are all low-complexity sub-optimal algorithms. The performance of these algorithms are generally not good in the sense that their detection error rates are very high.
In the past two decades, the semidefinite relaxation (SDR) detector algorithms have received great attention [10]–[19]. The SDR detector has been proposed first for the BPSK constellation [10, 11] and then extended to the QPSK constellation [12, 13]. It has been shown that the SDR detector achieves a considerably lower detection error rate than all previously mentioned sub-optimal algorithms. Moreover, the SDR detector has a guaranteed polynomial-time worst-case complexity. To understand why the SDR detector performs remarkably well in practice, the approximation ratios of some SDR based algorithms have been studied in [14, 15, 16]. In particular, for the BPSK case, it has been shown in [17] that the SDR based algorithm can achieve the maximum possible diversity order. In addition to the above analysis results, some sufficient conditions, under which the SDRs are tight, have been also identified in [18, 19].
Besides the BPSK and QPSK cases, the SDR based algorithms have also been extended to other general constellation cases, especially the high-order QAM and -PSK constellations [20, 21]. Various SDR models have been proposed. For example, in [22], the detection problem is first reformulated as a quadratic integer optimization problem, and then some SDRs are designed by exploiting the special structure of the quadratic integer optimization problem. In [23], the number of design variables in the above quadratic integer optimization reformulation is further reduced, and a more compact SDR is proposed. Very recently, an SDR for the general -PSK constellation is proposed in [24], which is an enhanced SDR over the classical complex SDR and is obtained by adding valid linear cuts into an equivalent real reformulation of the classical complex SDR. Numerical results in [24] show that the enhanced SDR is much tighter than the classical complex SDR.
While various SDRs have been proposed for the MIMO detection problem due to different motivations and/or by using different techniques, there are very few works studying the relationship between these SDRs. To the best of our knowledge, the only work along this line is [25], where some SDRs for the QAM constellation have been compared. The goal of this work is to provide a comprehensive comparison of existing SDRs for the MIMO detection problem with a general constellation. Due to the space limitation, we only present one of our main results here (and more results will be presented in the journal extension). In particular, we show that an enhanced SDR proposed in [24] and a famous SDR proposed in [22] are equivalent; see (ESDR1) and (ESDR2) further ahead. As a byproduct of the above equivalence result, we can show the tightness of (ESDR2) proposed in [22] under a sufficient condition. This tightness result remains unknown until this paper.
We adopt the following standard notations in this paper. We use () and () to denote the set of -dimensional complex (real) matrices and -dimensional complex (real) vectors, respectively. We use and to denote the transpose and Hermitian transpose of a matrix/vector, respectively. We use and to denote the element-wise real and imaginary parts of a complex matrix/vector/number, respectively. We use and to denote the -norm and -norm of a matrix/vector. The notations and represent the all-one vector, the all-zero matrix/vector, and the identity matrix of appropriate sizes, respectively. For a given complex number denotes its argument. For a given vector denotes the diagonal matrix formed by it. Finally, for two given matrices and (of appropriate sizes), means that is a Hermitian positive semidefinite (PSD) matrix; denotes the trace of their product , i.e., ; and denotes their Kronecker product.
2 MIMO Detection Problem Formulation
Consider a complex-valued MIMO channel model
[TABLE]
where is the vector of received signals, is an complex channel matrix (for inputs and outputs with ), is the vector of transmitted symbols, and is an additive white circularly symmetric Gaussian noise with zero mean. Throughout the paper, we assume that the -PSK modulation scheme with is adopted111The main results in this paper can also be extended to the QAM case.. Then, each entry of belongs to a finite set of symbols
[TABLE]
where i is the imaginary unit (which satisfies ). The MIMO detection problem is to recover the vector of transmitted symbols from the vector of received signals based on the knowledge of the channel matrix . The mathematical formulation of the problem is
[TABLE]
where
3 Review of Some Existing SDRs for (P)
The MIMO detection problem (P) is NP-hard [5]. Therefore, there is no polynomial-time algorithms which can solve it to global optimality in general (unless P=NP). In the last two decades, the SDR based algorithms have been widely studied in the signal processing and wireless communication community [26, 27] and particularly have been designed for solving problem (P). The SDR based algorithms for solving problem (P) not only enjoy a polynomial-time worst-case complexity but also generally achieve a very good detection error rate performance. In this section, we briefly review some existing SDRs for problem (P).
For notational simplicity, let and let be the vector of all constellation symbols, where
[TABLE]
and finally let and
By introducing an complex matrix , problem (P) can be equivalently reformulated as
[TABLE]
where the variables and and is the -th diagonal entry of A straightforward (but loose) SDR of problem (P) is
[TABLE]
which drops the argument constraints for all and relaxes the nonconvex constraint to
[TABLE]
It has been shown in [24] that (SDR) is equivalent to the following real SDR
[TABLE]
where the variables and and
[TABLE]
Based on (SDR), an enhanced SDR for (P) has recently been proposed in [24]. Define the following matrices
[TABLE]
and
[TABLE]
By the definition of in (2), ideally each in (3) must be one of matrices with i.e.,
[TABLE]
By relaxing the above combinatorial constraints and dropping some redundant constraints, reference [24] proposes the following enhanced SDR for (P):
[TABLE]
where the variables and are defined in (2), is defined in (3), is defined in (4), and
[TABLE]
In (ESDR1), and Due to the symmetry of the constraint can be explicitly expressed as the following linear constraints:
[TABLE]
Another interesting SDR for problem (P) is proposed in [22] based on the following observation: for each of there holds where only one entry of is one and all the others are zero. Then, problem (P) is reformulated in [22] as follows:
[TABLE]
where
[TABLE]
Based on the above reformulation and by exploiting the special structure of vector reference [22] proposes the following SDR222A slight difference between (ESDR2) presented here and (Model III) in [22] lies in the elimination of one variable in each by using the property that the summation of is equal to one for
[TABLE]
where the variables and and denotes the -th diagonal block of matrix The last constraint requires that is a diagonal matrix and all its diagonal entries are equal to
4 Main Results
In this section, we present the main result of this paper. We show, somewhat surprisingly, that (ESDR1) and (ESDR2) are equivalent, although they are derived by using different techniques and due to different motivations and they take quite different forms. The equivalence here means that, for any given feasible point of (ESDR2), there exists a feasible point of (ESDR1) such that the two problems have the same objective value at the corresponding points; and for any given feasible point of (ESDR1), there exists a feasible point of (ESDR2) such that the two problems also have the same objective value.
Theorem 1
(ESDR1) and (ESDR2) are equivalent.
Proof: Due to the space reason, we only give a proof outline here. To show the theorem, it suffices to show that a pair of the feasible points of (ESDR1) and (ESDR2) satisfies the following relationship
[TABLE]
where is given in (6). The conditions in (7) guarantee that the two SDRs have the same objective value. Now, given any feasible point of (ESDR2), one can easily check that the same jointly with and given in (7) is a feasible point of (ESDR1) and they achieve the same objective value as that of (ESDR2) at . Next, given any feasible point of (ESDR1), we shall construct a matrix such that it, jointly with the given is a feasible point of (ESDR2) and the two problems have the same objective value at these two points.
Without loss of generality, suppose that the PSD matrix is not zero. Let denote the rank of Furthermore, suppose has the following eigenvalue decomposition where and Similarly, for each one can easily show that is PSD due to the fact that and Suppose that where and Construct the vectors
[TABLE]
By using (5), one can check that the above satisfy
[TABLE]
Suppose \mathbf{Y}-\mathbf{y}\mathbf{y}^{T}=\left[\begin{array}[]{ccc}\bm{\xi}_{1}&\ldots&\bm{\xi}_{2n}\\ \end{array}\right]^{T}\left[\begin{array}[]{cccc}\bm{\xi}_{1}&\ldots&\bm{\xi}_{2n}\\ \end{array}\right], where for all Obviously,
[TABLE]
One can show from (8) and (9) that there exist such that
[TABLE]
Now, we can construct the desired matrix Let the -th block of be
[TABLE]
where and are given in (10). One can check that the above constructed and the given jointly satisfy all constraints in (ESDR2) and equations in (7) (and thus they achieve the same objective value as that of (ESDR1) at ). This completes the proof. Q.E.D.
Two remarks on Theorem 1 are in order. First, combining Theorem 1 and [24, Theorem 4.4], we can immediately obtain the following tightness result of (ESDR2).
Theorem 2
Suppose that If the inputs and in (1) satisfy
[TABLE]
where denotes the smallest eigenvalue of then (ESDR2) is tight for (P).
The sufficient condition in (11) is intuitive, which roughly says that problem (P) is an “easy” problem (polynomial-time solvable) if the channel matrix is well conditioned and the number of constellation points and the noise level are below a certain threshold. Second, Theorem 1 reveals that there is some “redundancy” in (ESDR2). In particular, we can see from (7) that there is a correspondence between the feasible sets of (ESDR2) and (ESDR1) and all information contained in the high-dimensional space in (ESDR2) is kept in the low-dimensional space in (ESDR1) under the mapping in (7). To be more specific, the matrix variable in (ESDR1) is of dimension but the matrix variable in (ESDR2) is of dimension Hence, (ESDR1) should be more efficiently solvable than (ESDR2) especially when is much larger than The equivalence shown in Theorem 1 provides useful insight into possibly reducing the “redundancy” in existing SDRs for more general combinatorial optimization problems and designing new computationally more efficient SDRs.
5 Simulation Results
In this section, we present some preliminary simulation results to verify the equivalence between (ESDR1) and (ESDR2). In our simulations, all entries of the channel matrix are generated independently and identically according to the standard complex Gaussian distribution, and all entries of the transmitted symbol vector are drawn independently and uniformly from the -PSK constellation. In our simulation, we focus on the 8-PSK constellation with . We define the SNR as follows:
[TABLE]
where and is the expectation operator. For each SNR value, we randomly generate problem instances and the results presented below are obtained by averaging over all generated instances. We use the solver SeDuMi in CVX [28] to solve the two SDRs, i.e., (ESDR1) and (ESDR2).
Fig. 1 shows the average difference of the optimal objective values of (ESDR1) and (ESDR2) and the average difference of the first and second equations in (7) at the optimal solutions, i.e., and , versus different SNRs. As can be observed from Fig. 1, the difference under all these three measures is very small (in the order of e) over the whole range of tested SNRs, and this shows that (ESDR1) and (ESDR2) are indeed equivalent. Fig. 2 shows the average CPU time taken to solve (ESDR1) and (ESDR2) versus different SNRs. We can see clearly from Fig. 2 that solving (ESDR1) is much more efficient than solving (ESDR2). It is expected that the time difference of solving the two SDRs will become larger as the dimension of the problem (especially the number of constellation points) increases. All the above simulation results are consistent with our analysis.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Yang and L. Hanzo, “Fifty years of MIMO detection: The road to large-scale MIM Os,” IEEE Commun. Surveys Tuts. , vol. 17, no. 4, pp. 1941–1988, 2015.
- 2[2] H. Liu, M.-C. Yue, A. M.-C. So, and W.-K. Ma, “A discrete first-order method for large-scale MIMO detection with provable guarantees,” in Proc. IEEE Int. Workshop Signal Process. Adv. Wireless Commun. (SPAWC) , Jul. 2017, pp. 669–673.
- 3[3] M. O. Damen, H. E. Gamal, and G. Caire, “On maximum-likelihood detection and the search for the closest lattice point,” IEEE Trans. Inf. Theory , vol. 49, no. 10, pp. 2389–2402, Oct. 2003.
- 4[4] A. D. Murugan, H. E. Gamal, M. O. Damen, and G. Caire, “A unified framework for tree search decoding: Rediscovering the sequential decoder,” IEEE Trans. Inf. Theory , vol. 52, no. 3, pp. 933–953, Mar. 2006.
- 5[5] S. Verdú, “Computational complexity of optimum multiuser detection,” Algorithmica , vol. 4, no. 1–4, pp. 303–312, Jun. 1989.
- 6[6] J. Jaldén and B. Ottersten, “On the complexity of sphere decoding in digital communications,” IEEE Trans. Signal Process. , vol. 53, no. 4, pp. 1474–1484, Apr. 2005.
- 7[7] K. S. Schneider, “Optimum detection of code division multiplexed signals,” IEEE Trans. Aerosp. Electron. Syst. , vol. AES-15, no. 1, pp. 181–185, Jan. 1979.
- 8[8] M. Honig, U. Madhow, and S. Verdú, “Blind adaptive multiuser detection,” IEEE Trans. Inf. Theory , vol. 41, no. 4, pp. 944–960, Jul. 1995.
