Noncoherent Multiuser Massive SIMO for Low-Latency Industrial IoT Communications
Zheng Dong, He Chen, Jian-Kang Zhang, Branka Vucetic

TL;DR
This paper introduces a noncoherent detection framework for multiuser massive SIMO systems in industrial IoT, using a new constellation design and power optimization to enhance low-latency, high-reliability communication without channel estimation.
Contribution
It proposes a novel UDCG-based constellation design and a max-min KL distance optimization for noncoherent massive SIMO systems, improving error performance in low-latency IIoT applications.
Findings
Outperforms existing max-min Euclidean distance methods in error performance.
Better error performance than conventional coherent ZF receivers for cell edge users.
Effective in large antenna array scenarios with reduced latency and overhead.
Abstract
In this paper, we consider a multiuser massive single-input multiple-output (SIMO) enabled Industrial Internet of Things (IIoT) communication system. To reduce the latency and overhead caused by channel estimation, we assume that only the large-scale fading coefficients are available. We employ a noncoherent maximum-likelihood (ML) detector at the receiver side which does not need the instantaneous channel state information (CSI). For such a massive SIMO system, we present a new design framework to assure that each transmitted signal matrix can be uniquely determined in the noise-free case and be reliably estimated in noisy cases. The key idea is to utilize a new concept called the uniquely decomposable constellation group (UDCG) based on the practically used quadrature amplitude modulation~(QAM) constellation. To improve the average error performance when the antenna array size is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced MIMO Systems Optimization · Advanced Wireless Communication Techniques · Cooperative Communication and Network Coding
Noncoherent Multiuser Massive SIMO for Low-Latency Industrial IoT Communications
Zheng Dong∗, He Chen∗, Jian-Kang Zhang*†*, and Branka Vucetic∗
∗The University of Sydney, Australia, emails:{zheng.dong, he.chen, branka.vucetic}@sydney.edu.au
*†*McMaster University, Canada, email:[email protected]
Abstract
In this paper, we consider a multiuser massive single-input multiple-output (SIMO) enabled Industrial Internet of Things (IIoT) communication system. To reduce the latency and overhead caused by channel estimation, we assume that only the large-scale fading coefficients are available. We employ a noncoherent maximum-likelihood (ML) detector at the receiver side which does not need the instantaneous channel state information (CSI). For such a massive SIMO system, we present a new design framework to assure that each transmitted signal matrix can be uniquely determined in the noise-free case and be reliably estimated in noisy cases. The key idea is to utilize a new concept called the uniquely decomposable constellation group (UDCG) based on the practically used quadrature amplitude modulation (QAM) constellation. To improve the average error performance when the antenna array size is scaled up, we propose a max-min Kullback-Leibler (KL) distance design by carrying out optimization over the transmitted power and the sub-constellation assignment. Finally, simulation results show that the proposed design outperforms significantly the existing max-min Euclidean distance based method in terms of error performance. Moreover, our proposed approach also has a better error performance than the conventional coherent zero-forcing (ZF) receiver with orthogonal training for cell edge users.
I Introduction
The fourth stage of industrial revolution, also termed Industry 4.0 or Industrial Internet of Things (IIoT), is a new vision that in general refers to the introduction of advanced Internet technologies in industrial control and manufacturing, with the aim of significantly boosting the flexibility, versatility, usability and efficiency of future smart factories [1]. There is a general consensus that providing powerful and pervasive connectivity between machines, people and objects in industrial environments will be essential to realize this new vision.
Connectivity in factories has until now been dominated by wired systems, which has been preferred for its real-time capability and reliability. Wireless systems are mainly applied in noncritical use cases such as monitoring of conditions, which poses comparatively low communication performance requirements since communication failure will not lead to serious accidents (e.g., economic losses and safety problems) [2]. The main reason behind this is that stringent performance in terms of relibility and latency required by critical use cases are several orders of magnitude better than what is achievable by today’s wireless technologies [3]. On the other hand, there are increasing necessity for the use of wireless solutions in critical use cases because it reduces cost for large-scale deployment, enables flexible communication needed in smart production, and opens new fields of application, such as the control of moving objects together with their simultaneous localization and allocation. As such, there is an urgent need to develop ultra-reliable low-latency wireless communications for critical IIoT use cases. Recently, massive multiple-input multiple output (MIMO) technology, which refers to the usage of a large number of antennas in wireless systems to harness the spatial diversity, has been regarded indispensable for providing ultrahigh reliability [4]. In light of this, how to develop low-latency massive MIMO systems has become one of the most important research problems in the field.
Motivated by the above facts, we consider an uplink multiuser massive single-input multiple-output (SIMO) enabled IIoT communication system. We note that the channel estimation is a major obstacle to reduce the transmission latency due to the following two reasons: 1) the estimation of channel state information (CSI) is commonly done by transmitting some known orthogonal pilot symbols, the minimum number of which is no less than the number of transmitter antennas [5]. This will cause significant delay when the user number is large; 2) different from most conventional communication systems which commonly have very long data packets, the control/data packets in IIoT applications are typically very short. Thus, the overhead caused by channel estimation is non-negligible and will reduce the information rate significantly [6]. In our design, to avoid the overhead and latency caused by channel estimation, we adopt a noncoherent maximum-likelihood (ML) receiver, where no instantaneous CSI is needed at all the nodes. The proposed new transmission framework is based on the concept called uniquely decomposable constellation group (UDCG) with QAM constellations. Simulation results show that our proposed design outperforms the currently available max-min Euclidean distance based method. Our contributions can be summarized as follows:
- •
In our design, by jointly considering two time slots, constellations with phase information (e.g., QAM) can be used by all the users [7, 8], while existing works on single-user or multiuser noncoherent massive SIMO considered only one time slot, the information can only be modulated on the amplitudes of the input signals, resulting in a low spectral efficiency [9, 10, 11].
- •
We present an explicit construction of UDCG for any number of users. Unlike design in [12] which is based on the trellis coding structure, whose received constellation is very complicated and hard to decode, the received constellation in our design has a simple geometric structure.
II The System Model and Noncoherent Maximum-Likelihood (ML) Detector
II-A The System Model and Noncoherent ML Detector
We consider a massive SIMO system consisting of single-antenna users transmitting simultaneously to the BS with () receiving antennas on the same time-frequency grid. By using a discrete-time complex baseband-equivalent model, the received signal at the antenna array of BS in the -th time slot can be expressed by
[TABLE]
where represents the transmitted signals from all the users, is an additive circularly-symmetric complex Gaussian (CSCG) noise vector with covariance . We let denote the complex channel matrix between the receiver antenna array and all the users, where characterizes the small-scale fading caused by local scattering while , captures the propagation loss due to distance and shadowing. All the entries of are assumed to be i.i.d. complex Gaussian distributed with zero mean and unit variance. The channel coefficients are assumed to be in block fading which are quasi-static in the current block and change to other independent values in the next block with a channel coherence time . We consider a space-time block modulation (STBM) [8] scheme over time slots and the received signal vectors can be stacked together into a matrix given by
[TABLE]
where , and .
Assumption** 1**
Throughout this paper, we adopt the following assumptions:
The small scale channel fading matrix is completely unknown to the BS and all the users, while the large scale fading matrix is available to all the nodes; 2. 2.
The transmitted signals are subject to an instantaneous average power constraint111Note that our design can be directly extended to the case with peak power constraint.: , , . For convenience, we assume that the users are labeled such that .
We consider a noncoherent ML detector which is optimal for uniformly distributed discrete input signals in terms of error probability. First, we note that (1) can be reformulated by . Then, the vectorized version of the received signal can be written as
[TABLE]
As all the entries of and are i.i.d. CSCG distributed, we immediately have , and the covariance matrix of can be calculated by . Then, the conditional distribution of the received signal at BS for any transmitted signal matrix is given by
[TABLE]
The noncoherent ML detector aims to estimate the transmitted information carrying matrix from the received signal vector by solving the optimization problem: , or equivalently,
[TABLE]
From (2), we can observe that the detector relies on the sufficient statistic of the transmitted signal matrix: . The detailed discussion is given in the following subsection.
II-B Unique Identification of the Transmitted Signal Matrix
In this section, we first identify what conditions the transmitted signal matrix must satisfy to enable the unique identification of the transmitted signal matrix . We can observe from (2) that, to achieve reliable communication between all the users and BS in the considered massive SIMO system, the receiver must be able to uniquely determine each transmitted signal matrix once has been received, which can be formally stated as follows:
Proposition** 1**
Any reliable communications for the multiuser massive SIMO system described in (1) require that, for the transmitted signal matrix selected from , if and only if there exist any two signal matrices satisfying , then we have . ∎
The proof is omitted due to space limitation. Inspired by Proposition 1, to aid our system design, we introduce the following concept called uniquely-factorable multiuser space-time modulation (UF-MUSTM) as follows:
Definition** 1**
A multiuser space-time modulation codebook is said to form a UF-MUSTM codebook if for any pair of codewords satisfying , we have . ∎
Definition 1 motivates us to design a UF-MUSTM codebook for the considered noncoherent massive SIMO, which will be given in Sec. III. Therefore, our primary task in the rest of this paper is to propose a new method for the systematic design of such UF-MUSTM .
We note that, most of the existing space-time code designs consider point-to-point MIMO systems, where all the transmitting antennas are connected to the same transmitter. As a result, the transmitted information carrying signals are accessible by all the antennas where unitary space-time code design can be employed [7, 8]. However, in our considered MUSTM noncoherent massive SIMO system, the signals transmitted from different users are not allowed to fully collaborate, which greatly limits the codebook design. Moreover, the performance analysis for non-unitary codeword of MUSTM is very challenging as shown in [7].
III The Multiuser Massive SIMO System with Noncoherent ML Receiver
In this section, we present a UF-MUSTM framework with a slot-by-slot ML detection receiver. We find that when the number of receiver antennas increases, the pairwise error probability (PEP) between the two codewords will be dominated by the Kullback-Leibler (KL) distance between them. Motivated by this fact, a max-min KL distance design is proposed by performing optimization on sub-constellation assignment and power allocation among all the users.
III-A The KL Distance between the Transmitted Space-Time Modulation Codewords
In practice, the computational complexity of the optimal noncoherent ML detector described in (2) can be prohibitively high when the size of is large. To reduce the receiver complexity, our main idea is to use a small block size for the ML receiver. If only one time slot is involved in the ML detector in (2), i.e., we consider , the correlation matrix degenerate into a real scalar , where the phase information of the transmitted symbols is lost and information bits from all the users can only be modulated on the amplitudes of the transmitted symbols. However, such a design typically has a very low spectral efficiency [9, 10, 11]. To improve the spectrum efficiency by allowing constellation with phase information be transmitted by all the users, we need at least two time slots [7, 8]. Motivated by the above observation, we consider the case with , where the transmitted signals from the first and second time slots are represented by . We also denote , in which
[TABLE]
where , , and , with . By (3), we immediately have
[TABLE]
By inserting (3), (4) into~(2), the ML receiver can be given by:
[TABLE]
where and are the received signal vectors in the first and second time slots, respectively. It can be observed that the diagonal entries in (3) are and , in which the phase information is lost, while the off-diagonal term is c=\mathbf{x}_{1}^{H}\mathbf{D}\mathbf{x}_{2}=\sum_{k=1}^{K}\beta_{k}x_{k,1}^{*}x_{k,2}=\sum_{k=1}^{K}\beta_{k}|x_{k,1}||x_{k,2}|\exp\big{(}j\arg(x_{k,2})-j\arg(x_{k,1})\big{)}, suggesting that we can transmit a known reference signal vector in the first time slot and then transmit the information bearing signal vector to enable a “differential-like” transmission. The exact PEP is extremely hard to evaluate for the matrix given above [7]. Moreover, the exact expression for the pairwise error probability (PEP) do not seem to be tractable for optimization. Inspired by the Chernoff-Stein Lemma, when the number of receiver antennas goes to infinity, the PEP will goes to zero exponentially where the exponent determined by the KL distance [8]. Hence, in this paper, we propose to maximize the minimum KL distance between the conditional distributions of the received signals corresponding to different input signals.
We now calculate the KL distance between the received signals induced by the transmitted signals matrices and , which is also the expectation of the likelihood function between two received signals vectors. In essence, the likelihood function between the received signal vectors corresponding to the two transmitted signals convergence in probability to the KL-distance with the increase of the number of receiver antennas. More specifically, the KL-distance between the received signal corresponding to transmitted matrix and is given by:
[TABLE]
in which
[TABLE]
We can observe that is actually the KL-distance if there is only one receiver antenna. Due to the assumption of the independence of channel coefficients, and the KL distance with antennas is times of .
III-B Multiuser Space-Time Modulation witn QAM Division
The main objective of this subsection is to propose a new QAM division based MUSTM design framework for the considered massive SIMO system. This design is based on the uniquely decomposable constellation group (UDCG) originally proposed in [13], building upon the commonly used spectrally efficient QAM signaling. Now, we introduce the definition of UDCG as follows:
Definition** 2**
A group of constellations form a UDCG, denoted by \big{\{}\sum_{k=1}^{K}x_{k}:x_{k}\in\mathcal{X}_{k}\big{\}}=\uplus_{k=1}^{K}\mathcal{X}_{k}=\mathcal{X}_{1}\uplus\ldots\uplus\mathcal{X}_{K}, if there exist two groups of for such that , then we have for . ∎
As QAM constellation is commonly used in modern digital communications, which has a simple geometric structure, we now give the following construction of UDCG based on QAM constellation. For simplicity, we consider that each user is using the 4-QAM constellation 222The case with general QAM constellation is much more complicated and will be left as a future work..
Lemma** 1**
The UDCG with multilevel 4-QAM constellations*: The -ary square QAM constellation \mathcal{Q}=\big{\{}[\pm(m-\frac{1}{2})\pm j(n-\frac{1}{2})]d:m,n=1,\ldots,2^{K-1}\big{\}}, with being the minimum Euclidean distance between the constellation points, can be uniquely decomposed into the sum of multilevel 4-QAM sub-constellations denoted by , where \mathcal{X}_{k}=\big{\{}(\pm\frac{1}{2}\pm\frac{j}{2})\times 2^{k-1}d\big{\}} for . ∎*
With the help of UDCG, we are now ready to propose a QAM division based UF-MUSTM for the considered massive SIMO system with a noncoherent ML receiver given in (III-A). The structure of each transmitted signal matrix is given by , in which
[TABLE]
In our design, the diagonal matrix is used to compensate for the large scale fading between different users. The vector is introduced to adjust the relative transmitting power between all the users and is the information carrying vector. We let where constitute a UDCG with sum-QAM constellation such that as defined in Lemma 1 and , with , . The matrix is a permutation matrix, where denotes a standard basis column vector of length with 1 in the -th position and 0 in other positions. is a permutation over elements characterized by . We also let be a permutation such that for . From the above definition, we immediately have .
For transmitted signal matrices , we have the following desired properties:
Proposition** 2**
Consider and , where and belong to as described in Definition 1. If , then we have . ∎
III-C User-constellation Assignment and Power Allocation for the Noncoherent ML Detector
We consider the user-constellation assignment and power allocation for the noncoherent ML detector of design. For our design given in (8), we have
[TABLE]
where
[TABLE]
in which , , and . From (III-C), we can find that and are independent of the permutation function , but are determined the power allocation vector , and the information carrying vectors and . Now, with the help of (III-C), we have
[TABLE]
As a consequence, (7) can be reformulated by
[TABLE]
Recall that, the power constraint in Assumption 1 is , . That is, for the first and second time slots, we have , and . The above power constraints are equivalent to
[TABLE]
For the considered massive SIMO when is large, the PEP will goes to zero exponentially where the exponent determined by the KL distance [8]. Since we have one-to-one correspondence between , , and , , we now aim to solve the following optimization problem:
Problem** 1**
Find the optimal power control coefficients and permutation , such that:
[TABLE]
∎
We can observe that (11) is a max-min optimization problem where the objective function can be divided into two parts: and .
We first consider the inner optimization problem on . It can be observed that the minimum of is attained when , or equivalently . Also, the minimum value of is attained when and are the nearest neighboring points and is minimized, e.g., the minimal value of can be obtained simultaneously when and . In this case, we have and hence and , where the objective function in (11) is a monotonically decreasing against . We note that , and hence problem (11) can be reformulated by:
[TABLE]
The optimization on Problem (12) can be carried out by first fixing to find the optimal value of , and then perform further optimization on . To that end, we know from (12b) that, for any given , the feasible range of is given by for , or equivalently d^{2}\leq\min\,\Big{\{}\frac{P_{\pi^{-1}(k)}^{2}\beta_{\pi^{-1}(k)}^{2}}{E_{k}}\Big{\}}_{k=1}^{K}. By the Cauchy-Swartz inequality, we have
[TABLE]
where the inequality in holds if and only if , for . Or equivalently, the optimal power allocation is where for . Our next task is to check the power constraint on given in (12b) is violated or not. For d^{2}\leq\min\,\Big{\{}\frac{P_{\pi^{-1}(k)}^{2}\beta_{\pi^{-1}(k)}^{2}}{E_{k}}\Big{\}}_{k=1}^{K}, we have , and , where no power constraints on are violated for . Finally, the optimization problem on can be given by
[TABLE]
Or equivalently, we aim to solve
[TABLE]
Before proceeding on, we establish the following lemma.
Lemma** 2**
Suppose that two positive sequences and are arranged both in a nondecreasing order. If we let denote the set containing all the possible permutations of , then, the solution to the optimization problem, \max_{\pi\in\Pi}\min\Big{\{}\frac{a_{k}}{b_{\pi(k)}}\Big{\}}_{k=1}^{K}, is given by for . ∎
The proof is omitted due to space constraint. By Lemma 2, and note that , to maximize , we should let , i.e., the average power of the sub-constellations should be in ascending order. All the above discussions can be summarized into the following theorem:
Theorem** 1**
The users are ordered such that , and we denote d^{\star}=\min_{k}\big{\{}\frac{P_{k}\beta_{k}}{\sqrt{E_{k}}}\big{\}}_{k=1}^{K}, the the optimal power for each user can be given by . And the optimal permutation matrix is the identity matrix, i.e., . ∎
IV Simulation Results and Discussion
In this section, computer simulations are performed to demonstrate the effectiveness of our proposed design in comparison with other existing benchmarks. The small-scale fading is assumed to be the normalized Rayleigh fading. The path-loss as a function of transmission distance at antenna far-field can be approximated by
[TABLE]
where m is the reference distance, (GHz) is the wavelength of carrier, is the path-loss exponent [14]. In the above model, () is the Gaussian random shadowing attenuation resulting from blockage of objects. For the receiver, we assume that the noise power is , where the channel bandwidth MHz and is the power spectral density of noise with J/K being the Boltzman’s constant, reference temperature K (“room temperature”), and noise figure dB.
We first examine the error performance of the proposed design under the instantaneous average power constraint for different number of users as illustrated in Fig. LABEL:fig:avrbervsdist. It is assumed that the average power upper bound is =316 mW (25 dBm), . All the users are assumed to be uniformly distributed within the cell of radius . It can be observed that, with the increased number of users, the error performance deteriorates quickly caused by the mutual interference between users. Then, more BS antennas are needed to achieve the same average BER. We also compare our design with MED based method proposed in [9, 15]. Since we are using two time slots, while the MED methods only need one time slot, we assume 2-PAM constellations are used by all the users for the MED based design. We can also find that the proposed approach outperforms the MED based method significantly in terms of BER in all the schemes.
Next, we compare the error performance of the proposed method with the conventional zero-forcing (ZF) receiver using orthogonal training sequence in Fig. LABEL:fig:trainingbsproposed. Without loss of generality, we consider a system with users. For the orthogonal training based method, at least 4 time slots are needed and we assume that the channel coefficients are quasi-constant in these times slots. As 4-QAM are used by each user for the proposed scheme, we assume that 64-QAM are assumed for the training based approach in order to achieve a fair comparison. For the training prcocess, we assume that, a popular least-square (LS) channel estimator is employed [5]. It can be observed from Fig. LABEL:fig:trainingbsproposed that, when the antenna number is small and the channel gain is large (i.e., the distance is small), the training based method outperforms the proposed design in term of BER. However, when the antenna number is relatively large, the proposed design has a better error performance, especially at the cell edge.
V Conclusion
In this paper, we have proposed a new noncoherent multiuser massive SIMO design for the low-latency IIoT wireless communication applications. Assuming that the large-scale fading coefficients are known, we presented a simple and systematic construction of the transmitted signal matrix based on the concept of UDCG. In our design, we have used a noncoherent ML receiver over two time slots where no instantaneous CSI is required. To improve the error performance, the minimum KL distance between the received signals corresponding to different transmitted signal matrices was maximized by proper power allocation and sub-constellation assignment. Computer simulations reveal that, our method outperforms significantly the MED based design in terms of average BER. Our approach can also have a better error performance than the orthogonal training design for cell edge users when the array size is large.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Drath and A. Horch, “Industrie 4.0: Hit or hype?,” IEEE Ind. Electron. Mag. , vol. 8, pp. 56–58, June 2014.
- 2[2] V. K. L. Huang, Z. Pang, C. A. Chen, and K. F. Tsang, “New trends in the practical deployment of industrial wireless: From noncritical to critical use cases,” IEEE Ind. Electron. Mag. , vol. 12, pp. 50–58, June 2018.
- 3[3] H. Chen, R. Abbas, P. Cheng, M. Shirvanimoghaddam, W. Hardjawana, W. Bao, Y. Li, and B. Vucetic, “Ultra-reliable low latency cellular networks: Use cases, challenges and approaches,” to appear in IEEE Commun. Mag. , 2018.
- 4[4] P. Popovski, J. J. Nielsen, C. Stefanovic, E. d. Carvalho, E. Strom, K. F. Trillingsgaard, A. Bana, D. M. Kim, R. Kotaba, J. Park, and R. B. Sorensen, “Wireless access for ultra-reliable low-latency communication: Principles and building blocks,” IEEE Netw. , vol. 32, pp. 16–23, March 2018.
- 5[5] M. Biguesh and A. B. Gershman, “Training-based MIMO channel estimation: a study of estimator tradeoffs and optimal training signals,” IEEE Trans. Signal Process. , vol. 54, pp. 884–893, Mar. 2006.
- 6[6] G. Durisi, T. Koch, and P. Popovski, “Toward massive, ultrareliable, and low-latency wireless communication with short packets,” Proc. IEEE , vol. 104, pp. 1711–1726, Sept. 2016.
- 7[7] M. Brehler and M. K. Varanasi, “Asymptotic error probability analysis of quadratic receivers in rayleigh-fading channels with applications to a unified analysis of coherent and noncoherent space-time receivers,” IEEE Trans. Inf. Theory , vol. 47, pp. 2383–2399, Sep 2001.
- 8[8] M. J. Borran, A. Sabharwal, and B. Aazhang, “On design criteria and construction of noncoherent space-time constellations,” IEEE Trans. Inf. Theory , vol. 49, pp. 2332–2351, Oct 2003.
