Downlink Precoding for Massive MIMO Systems Exploiting Virtual Channel   Model Sparsity

Thomas Ketseoglou; Ender Ayanoglu

arXiv:1706.03294·cs.IT·July 14, 2017

Downlink Precoding for Massive MIMO Systems Exploiting Virtual Channel Model Sparsity

Thomas Ketseoglou, Ender Ayanoglu

PDF

TL;DR

This paper introduces a novel precoding method for Massive MIMO systems that exploits channel sparsity and extends to OFDM, achieving improved performance with high-order QAM constellations.

Contribution

It proposes a new sparse representation-based precoding technique combined with OFDM and PGP, enhancing Massive MIMO performance for large constellation sizes.

Findings

01

Significantly better performance than no precoder or simple beamforming.

02

Effective for constellation sizes up to M=64.

03

Applicable to frequency-selective channels with high flexibility.

Abstract

In this paper, the problem of designing a forward link linear precoder for Massive Multiple-Input Multiple-Output (MIMO) systems in conjunction with Quadrature Amplitude Modulation (QAM) is addressed. First, we employ a novel and efficient methodology that allows for a sparse representation of multiple users and groups in a fashion similar to Joint Spatial Division and Multiplexing. Then, the method is generalized to include Orthogonal Frequency Division Multiplexing (OFDM) for frequency selective channels, resulting in Combined Frequency and Spatial Division and Multiplexing, a configuration that offers high flexibility in Massive MIMO systems. A challenge in such system design is to consider finite alphabet inputs, especially with larger constellation sizes such as $M \geq 16$ . The proposed methodology is next applied jointly with the complexity-reducing Per-Group Processing (PGP)…

Equations78

y_{u} = H_{u} x_{u} + n_{u},

y_{u} = H_{u} x_{u} + n_{u},

y_{d} = H_{u}^{h} x_{d} + n_{d},

y_{d} = H_{u}^{h} x_{d} + n_{d},

P maximize subject to I (x_{d}; y_{d}) tr (P P^{h}) = N_{u},

P maximize subject to I (x_{d}; y_{d}) tr (P P^{h}) = N_{u},

\tilde{H}_{g} = F_{N_{u}}^{h} H_{g},

\tilde{H}_{g} = F_{N_{u}}^{h} H_{g},

∣ cos (θ_{l g k n}) - \frac{p}{D N _{u}} ∣ \leq \frac{1}{D N _{u}},

∣ cos (θ_{l g k n}) - \frac{p}{D N _{u}} ∣ \leq \frac{1}{D N _{u}},

D N_{u} cos (θ_{l g k n}) - 1 \leq p \leq D N_{u} cos (θ_{l g k n}) + 1,

D N_{u} cos (θ_{l g k n}) - 1 \leq p \leq D N_{u} cos (θ_{l g k n}) + 1,

cos (θ_{g} - Δ θ) + \frac{1}{D N _{u}} < cos (θ_{g^{'}} + Δ θ) - \frac{1}{D N _{u}},

cos (θ_{g} - Δ θ) + \frac{1}{D N _{u}} < cos (θ_{g^{'}} + Δ θ) - \frac{1}{D N _{u}},

H_{g, v} = S_{g}^{t} \tilde{H}_{g} = S_{g}^{t} F_{N_{u}}^{h} H_{g},

H_{g, v} = S_{g}^{t} \tilde{H}_{g} = S_{g}^{t} F_{N_{u}}^{h} H_{g},

H_{g} = F_{N_{u}} S_{g} S_{g}^{t} F_{N_{u}}^{h} H_{g} = F_{N_{u}, S_{g}} H_{g, v},

H_{g} = F_{N_{u}} S_{g} S_{g}^{t} F_{N_{u}}^{h} H_{g} = F_{N_{u}, S_{g}} H_{g, v},

H_{g}^{h} F_{N_{u}, S_{g}^{'}} = 0,

H_{g}^{h} F_{N_{u}, S_{g}^{'}} = 0,

H_{d, g} = H_{g}^{h} = H_{g, v}^{h} F_{N_{u}, S_{g}^{'}}^{h} .

H_{d, g} = H_{g}^{h} = H_{g, v}^{h} F_{N_{u}, S_{g}^{'}}^{h} .

\begin{split}{\mathbf{y}}_{d}=\left[\begin{array}[]{c}{{\mathbf{H}}_{1,v}}^{h}{\mathbf{F}}_{N_{u},{\cal S}_{1}}^{h}\\ {{\mathbf{H}}_{2,v}}^{h}{\mathbf{F}}_{N_{u},{\cal S}_{2}}^{h}\\ \vdots\\ {{\mathbf{H}}_{G,v}}^{h}{\mathbf{F}}_{N_{u},{\cal S}_{G}}^{h}\end{array}\right]\left[\begin{array}[]{c c c c}{\mathbf{F}}_{N_{u},{\cal S}_{1}}&{\mathbf{F}}_{N_{u},{\cal S}_{2}}&\cdots&{\mathbf{F}}_{N_{u},{\cal S}_{G}}\end{array}\right]\\ \left[\begin{array}[]{cccccc}{\bf P}_{1}&{\bf 0}&{\bf 0}&\cdots&{\bf 0}&{\bf 0}\\ {\bf 0}&{\bf P}_{2}&{\bf 0}&\cdots&{\bf 0}&{\bf 0}\\ {\bf 0}&{\bf 0}&{\bf P}_{3}&\cdots&{\bf 0}&{\bf 0}\\ \vdots&\vdots&\vdots&\ddots&\vdots&\vdots\\ {\bf 0}&{\bf 0}&{\bf 0}&\cdots&{\bf P}_{G-1}&{\bf 0}\\ {\bf 0}&{\bf 0}&{\bf 0}&\cdots&{\bf 0}&{\bf P}_{G}\\ \end{array}\right]\left[\begin{array}[]{c}{{\mathbf{x}}_{1}}\\ {{\mathbf{x}}_{2}}\\ \vdots\\ {{\mathbf{x}}_{G}}\end{array}\right]+{\mathbf{n}},\end{split}

\begin{split}{\mathbf{y}}_{d}=\left[\begin{array}[]{c}{{\mathbf{H}}_{1,v}}^{h}{\mathbf{F}}_{N_{u},{\cal S}_{1}}^{h}\\ {{\mathbf{H}}_{2,v}}^{h}{\mathbf{F}}_{N_{u},{\cal S}_{2}}^{h}\\ \vdots\\ {{\mathbf{H}}_{G,v}}^{h}{\mathbf{F}}_{N_{u},{\cal S}_{G}}^{h}\end{array}\right]\left[\begin{array}[]{c c c c}{\mathbf{F}}_{N_{u},{\cal S}_{1}}&{\mathbf{F}}_{N_{u},{\cal S}_{2}}&\cdots&{\mathbf{F}}_{N_{u},{\cal S}_{G}}\end{array}\right]\\ \left[\begin{array}[]{cccccc}{\bf P}_{1}&{\bf 0}&{\bf 0}&\cdots&{\bf 0}&{\bf 0}\\ {\bf 0}&{\bf P}_{2}&{\bf 0}&\cdots&{\bf 0}&{\bf 0}\\ {\bf 0}&{\bf 0}&{\bf P}_{3}&\cdots&{\bf 0}&{\bf 0}\\ \vdots&\vdots&\vdots&\ddots&\vdots&\vdots\\ {\bf 0}&{\bf 0}&{\bf 0}&\cdots&{\bf P}_{G-1}&{\bf 0}\\ {\bf 0}&{\bf 0}&{\bf 0}&\cdots&{\bf 0}&{\bf P}_{G}\\ \end{array}\right]\left[\begin{array}[]{c}{{\mathbf{x}}_{1}}\\ {{\mathbf{x}}_{2}}\\ \vdots\\ {{\mathbf{x}}_{G}}\end{array}\right]+{\mathbf{n}},\end{split}

\begin{split}{\mathbf{y}}_{d}&=\left[\begin{array}[]{c}{{\mathbf{H}}_{1,v}}^{h}\\ {{\mathbf{H}}_{2,v}}^{h}\\ \vdots\\ {{\mathbf{H}}_{G,v}}^{h}\end{array}\right]\left[\begin{array}[]{cccccc}{\bf P}_{1}&{\bf 0}&{\bf 0}&\cdots&{\bf 0}&{\bf 0}\\ {\bf 0}&{\bf P}_{2}&{\bf 0}&\cdots&{\bf 0}&{\bf 0}\\ {\bf 0}&{\bf 0}&{\bf P}_{3}&\cdots&{\bf 0}&{\bf 0}\\ \vdots&\vdots&\vdots&\ddots&\vdots&\vdots\\ {\bf 0}&{\bf 0}&{\bf 0}&\cdots&{\bf P}_{G-1}&{\bf 0}\\ {\bf 0}&{\bf 0}&{\bf 0}&\cdots&{\bf 0}&{\bf P}_{G}\\ \end{array}\right]\left[\begin{array}[]{c}{{\mathbf{x}}_{1}}\\ {{\mathbf{x}}_{2}}\\ \vdots\\ {{\mathbf{x}}_{G}}\end{array}\right]+{\mathbf{n}}\\ &=\left[\begin{array}[]{c}{{\mathbf{H}}_{1,v}}^{h}{\mathbf{P_{1}}}{{\mathbf{x}}_{1}}\\ {{\mathbf{H}}_{2,v}}^{h}{\mathbf{P_{2}}}{{\mathbf{x}}_{2}}\\ \vdots\\ {{\mathbf{H}}_{G,v}}^{h}{\mathbf{P_{G}}}{{\mathbf{x}}_{G}}\end{array}\right]+{\mathbf{n}}.\end{split}

\begin{split}{\mathbf{y}}_{d}&=\left[\begin{array}[]{c}{{\mathbf{H}}_{1,v}}^{h}\\ {{\mathbf{H}}_{2,v}}^{h}\\ \vdots\\ {{\mathbf{H}}_{G,v}}^{h}\end{array}\right]\left[\begin{array}[]{cccccc}{\bf P}_{1}&{\bf 0}&{\bf 0}&\cdots&{\bf 0}&{\bf 0}\\ {\bf 0}&{\bf P}_{2}&{\bf 0}&\cdots&{\bf 0}&{\bf 0}\\ {\bf 0}&{\bf 0}&{\bf P}_{3}&\cdots&{\bf 0}&{\bf 0}\\ \vdots&\vdots&\vdots&\ddots&\vdots&\vdots\\ {\bf 0}&{\bf 0}&{\bf 0}&\cdots&{\bf P}_{G-1}&{\bf 0}\\ {\bf 0}&{\bf 0}&{\bf 0}&\cdots&{\bf 0}&{\bf P}_{G}\\ \end{array}\right]\left[\begin{array}[]{c}{{\mathbf{x}}_{1}}\\ {{\mathbf{x}}_{2}}\\ \vdots\\ {{\mathbf{x}}_{G}}\end{array}\right]+{\mathbf{n}}\\ &=\left[\begin{array}[]{c}{{\mathbf{H}}_{1,v}}^{h}{\mathbf{P_{1}}}{{\mathbf{x}}_{1}}\\ {{\mathbf{H}}_{2,v}}^{h}{\mathbf{P_{2}}}{{\mathbf{x}}_{2}}\\ \vdots\\ {{\mathbf{H}}_{G,v}}^{h}{\mathbf{P_{G}}}{{\mathbf{x}}_{G}}\end{array}\right]+{\mathbf{n}}.\end{split}

P_{g} maximize subject to I (x_{d, g}; y_{d, g}) tr (P_{g} P_{g}^{h}) = N_{d, g},

P_{g} maximize subject to I (x_{d, g}; y_{d, g}) tr (P_{g} P_{g}^{h}) = N_{d, g},

y_{d, g} = H_{g, v}^{h} P_{g} x_{g} + n_{g},

y_{d, g} = H_{g, v}^{h} P_{g} x_{g} + n_{g},

H_{u, g, k, n} = l = 1 \sum L β_{l g k n} a_{x} (θ_{l g k n}, ϕ_{l g k n}) a_{z}^{t} (θ_{l g k n}),

H_{u, g, k, n} = l = 1 \sum L β_{l g k n} a_{x} (θ_{l g k n}, ϕ_{l g k n}) a_{z}^{t} (θ_{l g k n}),

a_{z} (θ_{l g k n}) = [1, exp (- j 2 π D cos (θ_{l g k n})), \dots, exp (- j 2 π D (N_{u, z} - 1) cos (θ_{l g k n}))]^{t},

a_{z} (θ_{l g k n}) = [1, exp (- j 2 π D cos (θ_{l g k n})), \dots, exp (- j 2 π D (N_{u, z} - 1) cos (θ_{l g k n}))]^{t},

a_{x} (θ_{l g k n}, ϕ_{l g k n}) = [1, exp (- j 2 π D sin (θ_{l g k n}) cos (ϕ_{l g k n})), \dots, exp (- j 2 π D (N_{u, x} - 1) sin (θ_{l g k n}) cos (ϕ_{l g k n}))]^{t},

a_{x} (θ_{l g k n}, ϕ_{l g k n}) = [1, exp (- j 2 π D sin (θ_{l g k n}) cos (ϕ_{l g k n})), \dots, exp (- j 2 π D (N_{u, x} - 1) sin (θ_{l g k n}) cos (ϕ_{l g k n}))]^{t},

\tilde{H}_{u, g, k, n} = l = 1 \sum L β_{l g k n} F_{N_{u, x}}^{h} a_{x} (θ_{l g k n}, ϕ_{l g k n}) a_{z}^{t} (θ_{l g k n}) F_{N_{u, z}}^{*} .

\tilde{H}_{u, g, k, n} = l = 1 \sum L β_{l g k n} F_{N_{u, x}}^{h} a_{x} (θ_{l g k n}, ϕ_{l g k n}) a_{z}^{t} (θ_{l g k n}) F_{N_{u, z}}^{*} .

\tilde{h}_{u, g, k, n} = l = 1 \sum L β_{l g k n} \tilde{a}_{z, x} (θ_{l g k n}, ϕ_{l g k n}),

\tilde{h}_{u, g, k, n} = l = 1 \sum L β_{l g k n} \tilde{a}_{z, x} (θ_{l g k n}, ϕ_{l g k n}),

\tilde{a}_{z, x} (θ_{l g k n}, ϕ_{l g k n}) = (F_{N_{u}, z} \otimes F_{N_{u}, x})^{h} (a_{z} (θ_{l g k n}) \otimes a_{x} (θ_{l g k n}, ϕ_{l g k n})) .

\tilde{a}_{z, x} (θ_{l g k n}, ϕ_{l g k n}) = (F_{N_{u}, z} \otimes F_{N_{u}, x})^{h} (a_{z} (θ_{l g k n}) \otimes a_{x} (θ_{l g k n}, ϕ_{l g k n})) .

\begin{split}{\mathbf{y}}_{d}&=\left[\begin{array}[]{c}{{\mathbf{H}}_{1,v}}^{h}\left({\mathbf{F}}_{N_{u,z}}\otimes{\mathbf{F}}_{N_{u,x}}\right)_{{\cal S}_{1}}^{h}\\ {{\mathbf{H}}_{2,v}}^{h}\left({\mathbf{F}}_{N_{u,z}}\otimes{\mathbf{F}}_{N_{u,x}}\right)_{{\cal S}_{2}}^{h}\\ \vdots\\ {{\mathbf{H}}_{G,v}}^{h}\left({\mathbf{F}}_{N_{u},z}\otimes{\mathbf{F}}_{N_{u},x}\right)_{{\cal S}_{G}}^{h}\end{array}\right]\\ &\times\left[\begin{array}[]{c c c c}\left({\mathbf{F}}_{N_{u,z}}\otimes{\mathbf{F}}_{N_{u,x}}\right)_{{\cal S}_{1}}&\left({\mathbf{F}}_{N_{u,z}}\otimes{\mathbf{F}}_{N_{u,x}}\right)_{{\cal S}_{2}}&\cdots&\left({\mathbf{F}}_{N_{u,z}}\otimes{\mathbf{F}}_{N_{u,x}}\right)_{{\cal S}_{G}}\end{array}\right]\\ &\times\left[\begin{array}[]{cccccc}{\bf P}_{1}&{\bf 0}&{\bf 0}&\cdots&{\bf 0}&{\bf 0}\\ {\bf 0}&{\bf P}_{2}&{\bf 0}&\cdots&{\bf 0}&{\bf 0}\\ {\bf 0}&{\bf 0}&{\bf P}_{3}&\cdots&{\bf 0}&{\bf 0}\\ \vdots&\vdots&\vdots&\ddots&\vdots&\vdots\\ {\bf 0}&{\bf 0}&{\bf 0}&\cdots&{\bf P}_{G-1}&{\bf 0}\\ {\bf 0}&{\bf 0}&{\bf 0}&\cdots&{\bf 0}&{\bf P}_{G}\\ \end{array}\right]+{\mathbf{n}},\end{split}

\begin{split}{\mathbf{y}}_{d}&=\left[\begin{array}[]{c}{{\mathbf{H}}_{1,v}}^{h}\left({\mathbf{F}}_{N_{u,z}}\otimes{\mathbf{F}}_{N_{u,x}}\right)_{{\cal S}_{1}}^{h}\\ {{\mathbf{H}}_{2,v}}^{h}\left({\mathbf{F}}_{N_{u,z}}\otimes{\mathbf{F}}_{N_{u,x}}\right)_{{\cal S}_{2}}^{h}\\ \vdots\\ {{\mathbf{H}}_{G,v}}^{h}\left({\mathbf{F}}_{N_{u},z}\otimes{\mathbf{F}}_{N_{u},x}\right)_{{\cal S}_{G}}^{h}\end{array}\right]\\ &\times\left[\begin{array}[]{c c c c}\left({\mathbf{F}}_{N_{u,z}}\otimes{\mathbf{F}}_{N_{u,x}}\right)_{{\cal S}_{1}}&\left({\mathbf{F}}_{N_{u,z}}\otimes{\mathbf{F}}_{N_{u,x}}\right)_{{\cal S}_{2}}&\cdots&\left({\mathbf{F}}_{N_{u,z}}\otimes{\mathbf{F}}_{N_{u,x}}\right)_{{\cal S}_{G}}\end{array}\right]\\ &\times\left[\begin{array}[]{cccccc}{\bf P}_{1}&{\bf 0}&{\bf 0}&\cdots&{\bf 0}&{\bf 0}\\ {\bf 0}&{\bf P}_{2}&{\bf 0}&\cdots&{\bf 0}&{\bf 0}\\ {\bf 0}&{\bf 0}&{\bf P}_{3}&\cdots&{\bf 0}&{\bf 0}\\ \vdots&\vdots&\vdots&\ddots&\vdots&\vdots\\ {\bf 0}&{\bf 0}&{\bf 0}&\cdots&{\bf P}_{G-1}&{\bf 0}\\ {\bf 0}&{\bf 0}&{\bf 0}&\cdots&{\bf 0}&{\bf P}_{G}\\ \end{array}\right]+{\mathbf{n}},\end{split}

\tilde{a}_{y, x} (θ_{l g k n}, ϕ_{l g k n}) = (F_{N_{u, y}} \otimes F_{N_{u, x}})^{h} (a_{y} (θ_{l g k n}, ϕ_{l g k n}) \otimes a_{x} (θ_{l g k n}, ϕ_{l g k n})),

\tilde{a}_{y, x} (θ_{l g k n}, ϕ_{l g k n}) = (F_{N_{u, y}} \otimes F_{N_{u, x}})^{h} (a_{y} (θ_{l g k n}, ϕ_{l g k n}) \otimes a_{x} (θ_{l g k n}, ϕ_{l g k n})),

a_{y} (θ_{l g k n}) = [1, exp (- j 2 π D sin (θ_{l g k n}) sin (ϕ_{l g k n})), \dots, exp (- j 2 π D (N_{u, y} - 1)) sin (θ_{l g k n}) sin (ϕ_{l g k n}))]^{t},

a_{y} (θ_{l g k n}) = [1, exp (- j 2 π D sin (θ_{l g k n}) sin (ϕ_{l g k n})), \dots, exp (- j 2 π D (N_{u, y} - 1)) sin (θ_{l g k n}) sin (ϕ_{l g k n}))]^{t},

a_{x} (θ_{l g k n}, ϕ_{l g k n}) = [1, exp (- j 2 π D sin (θ_{l g k n}) cos (ϕ_{l g k n})), \dots, exp (- j 2 π D (N_{u, x} - 1) sin (θ_{l g k n}) cos (ϕ_{l g k n}))]^{t} .

a_{x} (θ_{l g k n}, ϕ_{l g k n}) = [1, exp (- j 2 π D sin (θ_{l g k n}) cos (ϕ_{l g k n})), \dots, exp (- j 2 π D (N_{u, x} - 1) sin (θ_{l g k n}) cos (ϕ_{l g k n}))]^{t} .

∣ cos (θ_{l g k n}) - \frac{p}{D N _{u, z}} ∣ < \frac{1}{D N _{u, z}},

∣ cos (θ_{l g k n}) - \frac{p}{D N _{u, z}} ∣ < \frac{1}{D N _{u, z}},

∣ sin (θ_{l g k n}) cos (ϕ_{l g k n}) - \frac{p}{D N _{u, x}} ∣ < \frac{1}{D N _{u, x}} .

∣ sin (θ_{l g k n}) cos (ϕ_{l g k n}) - \frac{p}{D N _{u, x}} ∣ < \frac{1}{D N _{u, x}} .

\tilde{h}_{u, g, k, n} = \frac{1}{L} l = 1 \sum L β_{l g k n} (F_{N_{u}, y} \otimes F_{N_{u}, x})^{h} \times (a_{y} (θ_{l g k n}, ϕ_{l g k n}) \otimes a_{x} (θ_{l g k n}, ϕ_{l g k n})) δ (τ - τ_{l}) .

\tilde{h}_{u, g, k, n} = \frac{1}{L} l = 1 \sum L β_{l g k n} (F_{N_{u}, y} \otimes F_{N_{u}, x})^{h} \times (a_{y} (θ_{l g k n}, ϕ_{l g k n}) \otimes a_{x} (θ_{l g k n}, ϕ_{l g k n})) δ (τ - τ_{l}) .

\tilde{h}_{u, g, k, n}^{(f)} = \frac{1}{L} l = 1 \sum L β_{l g k n} (F_{N_{u}, y} \otimes F_{N_{u}, x})^{h} \times (a_{y} (θ_{l g k n}, ϕ_{l g k n}) \otimes a_{x} (θ_{l g k n}, ϕ_{l g k n})) w_{f},

\tilde{h}_{u, g, k, n}^{(f)} = \frac{1}{L} l = 1 \sum L β_{l g k n} (F_{N_{u}, y} \otimes F_{N_{u}, x})^{h} \times (a_{y} (θ_{l g k n}, ϕ_{l g k n}) \otimes a_{x} (θ_{l g k n}, ϕ_{l g k n})) w_{f},

\tilde{h}_{u, g, k, n}^{(f)} = \frac{1}{L} l = 1 \sum L β_{l g k n} (F_{N_{u}, z} \otimes F_{N_{u}, x})^{h} \times (a_{y} (θ_{l g k n}, ϕ_{l g k n}) \otimes a_{x} (θ_{l g k n}, ϕ_{l g k n})) w_{f, l},

\tilde{h}_{u, g, k, n}^{(f)} = \frac{1}{L} l = 1 \sum L β_{l g k n} (F_{N_{u}, z} \otimes F_{N_{u}, x})^{h} \times (a_{y} (θ_{l g k n}, ϕ_{l g k n}) \otimes a_{x} (θ_{l g k n}, ϕ_{l g k n})) w_{f, l},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Downlink Precoding for Massive MIMO Systems Exploiting Virtual Channel Model Sparsity

Thomas Ketseoglou

and Ender Ayanoglu

T. Ketseoglou is with the Electrical and Computer Engineering Department, California State Polytechnic University, Pomona, California (e-mail: [email protected]). E. Ayanoglu is with the Center for Pervasive Communications and Computing, Department of Electrical Engineering and Computer Science, University of California, Irvine (e-mail: [email protected]). This work was partially supported by NSF grant 1547155.

Abstract

In this paper, the problem of designing a forward link linear precoder for Massive Multiple-Input Multiple-Output (MIMO) systems in conjunction with Quadrature Amplitude Modulation (QAM) is addressed. First, we employ a novel and efficient methodology that allows for a sparse representation of multiple users and groups in a fashion similar to Joint Spatial Division and Multiplexing. Then, the method is generalized to include Orthogonal Frequency Division Multiplexing (OFDM) for frequency selective channels, resulting in Combined Frequency and Spatial Division and Multiplexing, a configuration that offers high flexibility in Massive MIMO systems. A challenge in such system design is to consider finite alphabet inputs, especially with larger constellation sizes such as $M\geq 16$ . The proposed methodology is next applied jointly with the complexity-reducing Per-Group Processing (PGP) technique, on a per user group basis, in conjunction with QAM modulation and in simulations, for constellation size up to $M=64$ . We show by numerical results that the precoders developed offer significantly better performance than the configuration with no precoder or the plain beamformer and with $M\geq 16$ .

I INTRODUCTION

Massive MIMO employs a very large number of antennas and enables very high spectral efficiency [1, 2, 3]. For Massive MIMO to be capable of offering its full benefits, accurate and instantaneous channel state information is required at the base station (BS). Within Massive MIMO research, the problem of designing an optimal linear precoder toward maximizing the mutual information between the input and output on the downlink in conjunction with a finite input alphabet modulation and multiple antennas per user has not been considered in the literature, due to its complexity. There are techniques proposed for downlink linear precoding in a multi-user MIMO scenario, e.g., Joint Spatial Division and Multiplexing (JSDM) [4], but their implementation has been challenging so far. In addition, there has been a lack of publications on how to realistically integrate OFDM in Massive MIMO with success and without sacrificing the spectral efficiency of the system. On the other hand, the problem of finite-alphabet input MIMO linear precoding has been extensively studied in the literature. Globally optimal linear precoding techniques were presented [5, 6] for scenarios employing channel state information available at the transmitter (CSIT)111Under CSIT the transmitter has perfect knowledge of the MIMO channel realization at each transmission. with finite-alphabet inputs, capable of achieving mutual information rates much higher than the previously presented Mercury Waterfilling (MWF) [7] techniques by introducing input symbol correlation through a unitary input transformation matrix in conjunction with channel weight adjustment (power allocation). In addition, more recently, [8] has presented an iterative algorithm for precoder optimization for sum rate maximization of Multiple Access Channels (MAC) with Kronecker MIMO channels. Furthermore, more recent work has shown that when only Statistical Channel State Information (SCSI)222SCSI pertains to the case in which the transmitter has knowledge of only the MIMO channel correlation matrices [9, 10] and the thermal noise variance. is available at the transmitter, in asymptotic conditions when the number of transmitting and receiving antennas grows large, but with a constant transmitting to receiving antenna number ratio, one can design the optimal precoder by looking at an equivalent constant channel and its corresponding adjustments as per the pertinent theory [11], and applying a modified expression for the corresponding ergodic mutual information evaluation over all channel realizations. This development allows for a precoder optimization under SCSI in a much easier way [11]. Finally, [12, 13] present for the first time results for mutual information maximizing linear precoding with large size MIMO configurations and QAM constellations. Such systems are particularly difficult to analyze and design when the inputs are from a finite alphabet, especially with QAM constellation sizes, $M\geq 16$ .

In this paper, we present optimal linear precoding techniques for Massive MIMO, suitable for QAM with constellation size $M\geq 16$ and CSIT. Two types of antenna arrays are considered for the Base Station (BS), Uniform Linear Arrays (ULA) and Uniform Planar Arrays (UPA). In the UPA case, we consider arrays deployed either over the $x,~{}y$ direction or the $z,~{}x$ one. We show that by projecting the per user antenna uplink channels on the DFT based angular domain, called virtual channel model (VCM) herein, a sparse representation is possible for the channels. Then, by dividing spatially “distant” users into separate spatial sectors, we show that the spatial virtual channel representations between these users become approximately orthogonal. We then show that the concept of JSDM [4] can be easily applied in the sparse virtual channel model domain representation and show that linear precoding on the downlink using Per-Group Precoding (PGP) in conjunction with the Gauss-Hermite approximation in MIMO [14, 13]. However, the issue of group-based decoding is still present at the destination. We employ two different methods toward mitigating this problem. Then, by generalizing the presented approach to the frequency-selective (FS) channel case and applying OFDM, we show that much more flexibility and gains are available by the techniques presented. We show that when OFDM is integrated in the VCM-based JSDM system developed, resulting in Combined Frequency and Spatial Division and Multiplexing (CFSDM), the system can offer much higher rates, overcome issues of spatial overlapping by employing different carriers between spatially overlapping groups, and also easily decode different users’ data within a group. In all examples presented, we show high gains achievable by the proposed downlink precoding approach. More specifically, the paper makes the following contributions in Massive MIMO:

It provides an analytical framework that allows spatially separated user groups to be approximately orthogonal and thus to require independent per-group precoding beams from the base station. 2. 2.

It proves that the presented semi-orthogonal decomposition fits the JSDM [4] model. 3. 3.

It shows that the selected pre-beamforming matrices for the JSDM decomposition are optimal. 4. 4.

It employs both linear and planar arrays. 5. 5.

It generalizes the approach to include OFDM under frequency-selective channel conditions in a very flexible way. 6. 6.

It shows very significant gains in conjunction with PGP [13] and QAM modulation.

The paper is organized as follows: Section II presents the system model and problem statement. Then, in Section III, we present a novel virtual channel approach which allows for efficient downlink precoding in a JSDM fashion for ULA and UPA for narrowband channels. In Section IV we focus on FS channels, which naturally leads to OFDM type of systems imposed to the presented JSDM approach. In Section IV, we present numerical results for optimal downlink precoding on the system proposed that implements the Gauss-Hermite approximation in the block coordinate gradient ascent method in conjunction with the complexity reducing PGP methodology[13]. Finally, our conclusions are presented in Section V.

II SYSTEM MODEL AND PROBLEM STATEMENT

Consider the following uplink equation on a narrowband (flat-fading) Massive MIMO system with a single cell

[TABLE]

where ${\mathbf{y}_{u}}$ is the $N_{u}\times 1$ received vector at the base, ${\mathbf{H}}_{u}=[{\mathbf{H}}_{1},\cdots,{\mathbf{H}}_{G}]$ is the $N_{u}\times K_{eff}$ channel matrix of the received data from all $K$ users, employing $N_{u}$ receiving antennas at the base, with $K_{eff}=\sum_{g}N_{d,g}$ , where $N_{d,g}$ is defined below, and where users have been divided into $G$ groups with $K_{g}$ users in group $g$ ( $1\leq g\leq G$ ), with user $k$ of group $g$ denoted as $k^{(g)}$ and employing $N_{d,k^{(g)}}$ transmitting antennas, with ( $\sum_{g=1}^{G}K_{g}=K$ ), ${\mathbf{H}}_{g}=[{\mathbf{H}}_{g^{(1)}}\cdots{\mathbf{H}}_{g^{(K_{g})}}]$ is group $g$ ’s uplink channel matrix of size $N_{u}\times N_{d,g}$ , with $N_{d,g}$ comprising the total number of antennas in the group, i.e., $N_{d,g}=\sum_{k^{(g)}}N_{d,k^{(g)}}$ , where ${\mathbf{n}}_{u}$ represents the independent, identically distributed (i.i.d.) complex circularly symmetric Gaussian noise of variance per component $\sigma_{u}^{2}=\frac{1}{\mathrm{SNR}_{s,u}}$ , where ${\mathrm{SNR}_{s,u}}$ is the channel symbol signal-to-noise ratio ( $\mathrm{SNR}$ ). The uplink symbol vector of size ${\mathbf{x}}_{u}$ of size $\sum_{g}^{G}{N_{d,g}}\times 1$ has i.i.d. components drawn from a QAM constellation of order $M$ . The corresponding downlink equation can then be derived by using for the downlink channel matrix ${\mathbf{H}}_{d}={\mathbf{H}}_{u}^{h}$ , assuming that Time Division Duplexing (TDD) is employed in the system, to be

[TABLE]

where ${\mathbf{y}}_{d}$ is the downlink received vector of size $\sum_{g=1}^{G}N_{d,g}\times 1$ , ${\mathbf{x}}_{d}$ is the $N_{u}\times 1$ vector of transmitted symbols drawn independently from a QAM constellation, and the vector ${\mathbf{n}}_{d}$ of size $\sum_{g=1}^{G}N_{d,g}\times 1$ is the downlink circularly symmetric complex Gaussian noise with independent components. The optimal CSIT precoder ${\mathbf{P}}$ needs to satisfy

[TABLE]

where the constraint is due to keeping the total power emitted from the $N_{u}$ antennas constant.

The problem in (LABEL:eq_MMIMO) results in exponential complexity at both transmitter and receiver, and it becomes especially difficult for QAM with constellation size $M\geq 16$ or large MIMO configurations. There are two major difficulties in (LABEL:eq_MMIMO): a) There are $N_{u}$ input symbols in (LABEL:eq_MMIMO) where $N_{u}$ is very large, thus making the design of the precoder and its optimization practically impossible, and b) The decoding operation at the receiver needs to be performed by employing all elements of ${\mathbf{y}}_{d}$ simultaneously, another impossible demand due to the users being distributed over the entire cell. In order to circumvent these difficulties, the JSDM concept was proposed in [4] which divides users into groups based on channel similarity. However, a major impediment to deploying JSDM in practice has been the lack of a simple way that identifies the different groups of users with ease. Furthermore, [4] has employed Gaussian input symbols, an assumption that can lead to discrepancies as far as the precoder performance is concerned, especially in high $\mathrm{SNR}$ [5, 15]. In this paper, a methodology that employs the virtual channel model, based on the DFT channel angular domain, is employed in order to facilitate the group selection problem in JSDM and then the methodology of [13] is employed in order to allow for the design of an optimal overall precoder on a per group basis.

III THE NARROWBAND SYSTEM DESCRIPTION UNDER THE VIRTUAL CHANNEL MODEL

III-A Uniform Linear Array (ULA) at the Base with Flat Fading

We begin with a ULA deployed at the BS along the $z$ direction as depicted in Fig. 1 and for flat fading, i.e., $B<B_{COH}$ , where $B,~{}B_{COH}$ are the RF signal bandwidth and the coherence bandwidth of the channel, respectively. Each user group on the uplink transmits from the same “cluster” of elevation angles $\theta_{g}\in[\theta_{g}-\Delta\theta,\theta_{g}+\Delta\theta]$ , distributed uniformly in the support interval, thus each user’s $k^{(g)}$ of group $g$ , ( $1\leq k^{(g)}\leq K_{g}$ and $1\leq g\leq G$ ) transmitting antenna $n$ channel, ${\mathbf{h}}_{u,g,k,n}=\frac{1}{{\sqrt{L}}}\sum_{l=1}^{L}\beta_{lgkn}{\mathbf{a}}(\theta_{lgkn})$ , where ${\mathbf{a}}(\theta_{lgkn})=[1,\exp(-j{2\pi}D\cos(\theta_{lgkn})),\cdots,\exp(-j{2\pi D(N_{u}-1)}\cos(\theta_{lgkn}))]^{t}$ is the array response vector, with $D=d/\lambda$ representing the normalized distance of successive array elements, $\lambda$ being the wavelegth, $\theta_{lgkn}$ is the elevation (incidence) angle of the $l$ path of group $g$ $k$ user’s $n$ receiving antenna, and the path gains $\beta_{lgkn}$ are independent complex Gaussian random variables with zero mean and variance $1$ , same for all users in the group. The VCM representation, presented in [16], is formed by projecting the original channel ${\mathbf{H}_{u}}$ to the $N_{u}$ dimensional space formed by the $N_{u}\times N_{u}$ DFT matrix $F_{N_{u}}$ , with row $k$ , column $l$ ( $1\leq k,~{}l\leq N_{u}$ ) element equal to $\exp(-j\frac{2\pi}{N_{u}}(k-1)(l-1))$ . For Massive MIMO systems, i.e., when $N_{u}\gg 1$ , the following Lemma 1 and 2 as well as Theorem 1 are true.

Lemma 1.

By employing the VCM for a ULA at the BS and under flat fading, the number of non-zero components of the VCM representation is small, i.e., the number of non-zero or significant elements in the channels of each group $g$ VCM representation, $|{\cal S}_{g}|,$ satisfy $|{\cal S}_{g}|\ll N_{u}$ . Thus, in the VCM domain, a sparse overall group channel representation results.

Proof.

By projecting each group channel ${\mathbf{H}}_{g}$ on the DFT virtual channel space [16], we get

[TABLE]

where ${\mathbf{F}}_{N_{u}}$ is the DFT matrix of order $N_{u}$ . Since each group attains the same angular behavior, over all users and antennas in the group, only a few, consecutive elements of ${\tilde{\mathbf{H}}}_{g}$ will be significant [17]. This comes as a result of the fact that significant angular components need to be in the main lobe of the response vector, i.e., the condition

[TABLE]

with $D=\frac{d}{\lambda}$ , needs to be satisfied for angular component in the VCM $p$ ( $1\leq p\leq N_{u}$ ) to be significant, i.e., with power $>1$ . From (5), we can easily see that the corresponding condition over the significant components becomes

[TABLE]

i.e., there are 3 significant non-zero components in the VCM representation for each channel’s path. Since each path contains a different angle, due to the ULA model presented above, this number will be increased, but will be upper-bounded by $DN_{u}|\cos(\theta_{g}+\Delta\theta)-\cos(\theta_{g}-\Delta\theta)|+3=3+2DN_{u}|\sin(\theta_{g})\sin(\Delta\theta)|\approx 3+2DN_{u}|\sin(\theta_{g})|(\Delta\theta)$ , where $\Delta\theta$ is in radians. For a typical scenario, $N_{u}=100$ , $D=1/2$ , $\theta_{g}=30^{\circ}$ , and $\Delta\theta=4^{\circ}=0.0698~{}\mathrm{radian}$ , then the maximum number of non-zero (significant) paths is upper-bounded by 7. ∎

Lemma 2.

Within the premise of the previous Lemma, if $\cos(\theta_{g}-\Delta\theta)<\cos(\theta_{g^{\prime}}+\Delta\theta)-\frac{2}{DN_{u}}$ , where $g$ and $g^{\prime}$ represent two different groups ( $g\neq g^{\prime}$ ) and with $\theta_{g}>\theta_{g^{\prime}}~{}\text{and}~{}0\leq\theta_{g},~{}\theta_{g^{\prime}}\leq 90^{\circ}$ , then their support sets for each group are mutually exclusive, thus their corresponding virtual channel model beams (VCMB) become orthogonal. A similar relationship holds in the remaining quadrants.

Proof.

When $\theta_{g}>\theta_{g^{\prime}}~{}\text{and}~{}0\leq\theta_{g},~{}\theta_{g^{\prime}}\leq 90^{\circ}$ , since the $\cos(\cdot)$ function is decreasing in this quadrant, we can easily see that the two support sets for the two groups, ${\cal S}_{g},~{}{\cal S}_{g^{\prime}},$ will be disjoint. This comes from the fact that the assumed condition is equivalent to

[TABLE]

which means that the two support sets are not overlapping, by virtue of (5). We can develop similar conditions for all remaining quadrants. Thus, by assuming adequate spatial separation between groups, we can ensure that the support sets of each group in the virtual channel representation do not overlap. Then, due to the non-overlapping of the support sets, there exists orthogonality between the components of each group in the virtual channel model, as it is next shown. ∎

Theorem 1.

By employing the VCM for a ULA at the BS and under flat fading, provided user groups are sufficiently geographically apart, as per previous lemma, the channel model of the entire downlink channel can be expressed in a fashion that is fully suitable for JSDM type of processing where different groups become orthogonal and the downlink precoder is designed on a per group basis employing the virtual channel model representation alone. In the resulting JSDM type of decomposition, the corresponding group channel matrices are the virtual channel matrices of the group VCM projections and the group pre-beamforming matrices are the group’s non-zero (significant) VCM beamforming directions.

Proof.

By employing a size $|{\cal S}_{g}|\times N_{u}$ selection matrix333A selection matrix ${\mathbf{S}}^{t}$ of size $k\times n$ with $k<n$ consists of rows equal to different unit row vectors ${\mathbf{e}}_{i}$ where the row vector element $i$ is equal to $1$ in the $i$ th position and is equal to [math] in all other positions. Such a matrix has the property that ${\mathbf{S}}^{t}{\mathbf{S}}={\mathbf{I}}$ .

[TABLE]

where the group $g$ virtual channel matrix is a reduced size, $r_{g}\times N_{d,g}$ , matrix, with $r_{g}=|{\cal S}_{g}|$ the number of significant angular components in group $g$ , due to the sparsity available in the angular domain. We can then write for the uplink group $g$ channel matrix ${\mathbf{H}}_{g}$ ,

[TABLE]

where ${\mathbf{F}}_{N_{u},{\cal S}_{g}}$ represents the selected columns of ${\mathbf{F}}_{N_{u}}$ due to its sparse representation in the angular domain. We can then write that due to non-overlapping supports in groups $g$ , $g^{\prime}$ , ${\cal S}_{g}\cap_{g\neq g^{\prime}}{\cal S}_{g^{\prime}}=\emptyset$ , that

[TABLE]

for $g\neq g^{\prime}$ . By TDD channel reciprocity, the group $g$ downlink channel matrix is given as

[TABLE]

Since each group attains its non-zero virtual channel representation at non-overlapping positions, we can then use pre-beamformers provided by the matrix ${\mathbf{B}}=[{\mathbf{F}}_{N_{u},~{}{\cal S}_{1}}\cdots{\mathbf{F}}_{N_{u},~{}{\cal S}_{G}}]$ . As we show below these pre-beamformers are optimal for the type of JSDM presented here. Then, due to non-overlapping of the support sets, i.e., ${\cal S}_{n}\cap_{m\neq n}{\cal S}_{m}=\emptyset$ , we see that the system becomes approximately orthogonal inter-group wise, i.e., $\sum_{m\neq g}{\mathbf{H}}_{\cal N}^{h}\cdot{\mathbf{W}}_{{\cal S}_{g}}^{t}{\mathbf{W}}_{{\cal S}_{m}}^{*}\approx 0$ . Then,

[TABLE]

where for $1\leq g\leq G$ , ${\mathbf{H}}_{g,v}^{h}$ is a size $N_{d,g}\times|{\cal S}_{g}|$ matrix, ${\mathbf{F}}_{N_{u},{\cal S}_{G}}$ is a size $|{\cal S}_{g}|\times N_{u}$ matrix, ${\mathbf{P}}_{g}$ is a size $|{\cal S}_{g}|\times|{\cal S}_{g}|$ matrix, and ${\mathbf{x}}_{g}$ is the group $g$ downlink symbol vector of size $|{\cal S}_{g}|\times 1$ . Now due to orthogonality, we can write equivalently

[TABLE]

∎

Since each group’s precoding becomes independent of other groups, the overall downlink precoding becomes much easier and less complex for both the transmitter and the receiver. In addition, the introduction of the pre-beamformers in the form of VCM beamforming directions also simplifies the RF chains [4]. The individual precoding of each group becomes now the optimization of a $|{\cal S}_{g}|\times|{\cal S}_{g}|$ precoding matrix ${\mathbf{P}}_{g}$ , as per the next theorem.

Theorem 2.

For each group $g$ in the VCM representation, the equivalent optimum precoder, ${\mathbf{P}}_{g}$ needs to satisfy

[TABLE]

where the group $g$ reception model becomes

[TABLE]

where ${{\mathbf{H}}_{g,v}}^{h}$ is the VCM group’s downlink matrix of size $N_{d,g}\times|{\cal S}_{g}|$ , ${\mathbf{y}}_{d,g}$ is the group’s size $N_{d,g}$ reception vector, and ${\mathbf{n}}_{g}$ is the corresponding noise. This per group precoding problem is equivalent to a precoding problem within the original group channel model, i.e., the VCM transformation does not result in mutual information gain loss in the precoding process. In other words, employing ${\mathbf{F}}_{N_{u},{\cal S}_{g}}$ as beamforming matrix per each group $g$ ( $1\leq g\leq G$ ), is optimal from a maximization of input-output mutual information standpoint.

Proof.

The only part of the theorem that needs proof is the one relating to no information loss. This is easy to prove, since the model in (12) relies equivalently on a ${\mathbf{F}}_{N_{u},{\cal S}_{g}}{\mathbf{P}}_{g}$ precoder and the channel is ${{\mathbf{H}}_{g,v}}^{h}{\mathbf{F}}_{N_{u},{\cal S}_{g}}^{h}$ , the optimal precoder’s left singular vector matrix has to be equal to the Hermitian matrix of the right singular vector matrix of ${{\mathbf{H}}_{g,v}}^{h}{\mathbf{F}}_{N_{u},{\cal S}_{g}}^{h}$ [5]. Assume that the Singular Value Decomposition (SVD) of ${\mathbf{H}}_{g,v}^{h}={\mathbf{U}}{\mathbf{S}}{\mathbf{V}}^{h}.$ Then, it is easy to show that the right singular vector matrix of ${{\mathbf{H}}_{g,v}}^{h}{\mathbf{F}}_{N_{u},{\cal S}_{g}}^{h}$ is equal to ${\mathbf{F}}_{N_{u},{\cal S}_{g}}{\mathbf{V}}$ , under the condition that $N_{u}>|{\cal S}_{g}|,~{}N_{u}>N_{d,g}$ , which is true in Massive MIMO. Thus, based on this theorem, the pre-beamformers applied herein are optimal for the JSDM system presented. ∎

Having put forth the premise for the model, after the basic theorems of our model are proven, we can proceed to precode with the ${\mathbf{P}}_{g}$ precoders for each group and apply PGP [14] which divides each group further into subgroups in order to simplify both the transmitter as well as the receiver complexity exponentially with small gain loss [14]. Further, by combining PGP with the Gauss-Hermite approximation, we are able to derive the PGP optimal precoder for finite QAM constellations with constellation size $M$ easily [13].

III-B Uniform Planar Array (UPA) at the Base

The concept generalizes easily to Uniform Planar Arrays (UPA), both for arrays formed in the $z,~{}x$ plane as well as in the $x,~{}y$ plane, a UPA deployed along the $x,~{}y$ plane is shown in Fig. 2. The theory behind planar arrays results in a Kronecker product of two virtual channels, one channel per array dimension, as shown below. UPAs result in a three-dimensional spatial representation, thus offering higher user capacity per cell. Among the two UPA possibilities, we present the analysis for an $x,~{}z$ direction deployed UPA, as the analysis for an $x,~{}y$ direction deployed UPA is very similar.

For a UPA formed on $x,~{}z$ directions, each group $g$ ’s uplink channel, ${\mathbf{H}}_{g}$ , corresponds to the combination of $N_{u,x}$ uniform linear arrays deployed along the $x$ direction with $N_{u,z}$ uniform linear elements deployed in the $z$ direction. Without loss in generality, we assume that the normalized distances are the same for each direction and equal to $D$ . UPAs results in a two-dimensional (matrix) antenna response matrix per user, group, and antenna expressed as

[TABLE]

where the path gain $\beta_{lgkn}$ is as in the ULA case, $\theta_{lgkn}$ is the elevation angle for the $z$ -element, same as in the ULA case, and $\phi_{lgkn}$ is the azimuth angle of user $k$ ’s $n$ antenna for group $g$ , assumed to be a uniform r.v. in the interval $[\phi_{g}-\Delta\phi,\phi_{g}+\Delta\phi]$ . In (16), the two spatial vectors ${\mathbf{a}}_{x}(\theta_{lgkn},\phi_{lgkn}),~{}{\mathbf{a}}_{z}(\theta_{lgkn})$ are given as

[TABLE]

and

[TABLE]

respectively.

By projecting the channel matrix ${\mathbf{H}}_{u,g,k,n}$ to both angular directions, i.e., on VCM for $z,~{}x$ directions, we get

[TABLE]

By taking the vector form of both sides in (19) and using identities from [18], e.g., $\mathrm{vec}({\mathbf{a}}\otimes{\mathbf{b}})={\mathbf{b}}{\mathbf{a}}^{t}$ and $({\mathbf{A}}\otimes{\mathbf{B}})({\mathbf{C}}\otimes{\mathbf{D}})=({\mathbf{A}}{\mathbf{C}})\otimes({\mathbf{B}}{\mathbf{D}})$ , we can write for the vector of ${\tilde{\mathbf{H}}}_{u,g,k,n}$ , ${\tilde{\mathbf{h}}}_{u,g,k,n}\doteq\mathrm{vec}({\tilde{\mathbf{H}}}_{u,g,k,n}),$ the following equation

[TABLE]

where

[TABLE]

The behavior in (21) is similar with the ULA case, i.e., sparsity is achieved and different groups occupy different support sets in the angular domain. The expansion basis matrix now for the VCM becomes the Kronecker product of the two DFT matrices ${\mathbf{F}}_{N_{u,x}}$ and ${\mathbf{F}}_{N_{u,z}}$ . The downlink reception model stays within the same premise, but the new Kronecker product basis is employed. Due to the Kronecker product, the group sparsity presents some periodicity with period equal to $N_{u,z}$ . In other words, the reception model now becomes

[TABLE]

where the notation $({\mathbf{A}})_{{\cal S}_{g}}$ means the matrix resulting from selecting the columns of ${\mathbf{A}}$ that belong to ${\cal S}_{g}$ . The case of a UPA over $x,~{}y$ dimensions can be treated in a similar way by invoking

[TABLE]

with

[TABLE]

and

[TABLE]

The sparsity in the UPA case is due to the behavior of both angles, i.e., the elevation and the azimuth ones. The corresponding conditions to Lemma 1 are posted in the next lemma.

Lemma 3.

In the UPA over $z,~{}x$ dimensions, when $N_{u}\doteq N_{u,z}N_{u,x}\gg 1$ , then the significant components of the channel for group $g$ , i.e., the support set ${\cal S}_{g}$ , are found through the following two conditions

[TABLE]

and

[TABLE]

Proof.

The proof stems from generalizing the condition in (5) to the geometries of the UPA array. For the $z$ direction the equation remains unchanged, while for the $x$ direction the factor $\cos(\theta_{lgkn})$ needs to be substituted by $\sin(\theta_{lgkn})\cos(\phi_{lgkn})-\frac{p}{DN_{u,x}}$ . For significant factors to exist, both conditions need to be satisfied simultaneously, because the composite array factor is the product of the two individual ones. This completes the proof of the lemma. ∎

In comparison to the ULA channel case sparsity behavior though, it is important to stress that UPA channels present a repetitive, semi-periodic sparsity structure, due to the Kronecker product that exists in the vectorized form of the channel vectors. This behavior is further contrasted to the ULA one in Section V where numerical results are used to depict differences between ULA and UPA behavior with regards to sparsity in the VCM representation.

IV THE WIDEBAND SYSTEM DESCRIPTION UNDER THE VIRTUAL CHANNEL MODEL

For the wideband case, we look at two possibilities: a) flat fading, and b) frequency-selective fading with OFDM. We treat both in order to facilitate a general understanding of the possibilities and different scenarios available. However, we only study the frequency-selective fading with OFDM case in our results. The presentation looks at a UPA deployed over the $z,~{}x$ directions. However, similar descriptions can be found for ULA and for different directions of deploying the array.

Generalizing the previously presented scenario to wideband channels under slow fading and looking at it from $Q$ distinct frequencies adds one more dimension to the problem, i.e., one can project the frequencies to a discrete number of time components, $b$ [19]. The resulting channels are decomposed as a triple Kronecker product, i.e., a tensor type of product. The channel for user $k$ of group $g$ will comprise the sum of $P$ paths, each of a different delay, $\tau_{l}$ . Assume there are $b$ frequency slots available, starting at $0~{}\mathrm{Hz}$ and increasing up to $(Q-1)\Delta f$ , with $\Delta f$ being the frequency bin bandwidth. We can then write the following equation for the wideband model virtual angular channel of an $x,~{}y$ UPA scenario

[TABLE]

Upon taking the Fourier transform with respect to $\tau$ in (28) we get

[TABLE]

where ${\mathbf{w}}_{f}=[\exp(-j2\pi f\tau_{1}),\exp(-j2\pi f\tau_{2}),\cdots,\exp(-j2\pi f\tau_{L})]^{t}$ . Localizing the spectrum of the channel on the $Q$ frequency bins, starting at [math] and with each bin having width $\Delta f$ , we get a frequency angular domain matrix for ${\tilde{\mathbf{h}}}_{u,g,k,n}$ as follows

[TABLE]

where

[TABLE]

is an $L\times Q$ Fourier transform matrix, and ${\mathbf{w}}_{f,l}$ is its $l$ th ( $1\leq l\leq L$ ) row. Projecting the rows of matrix ${\mathbf{W}}_{f}$ to the virtual time domain, by employing the DFT matrix of order $Q$ , ${\mathbf{F}}_{Q}$ , results in a virtual time decomposition, given as ${\mathbf{w}}_{f,l}{\mathbf{F}}_{Q}^{*}$ , where ${\mathbf{w}}_{f,l}$ is the $l$ th row of ${\mathbf{W}}_{f}$ . We can then write for the entire channel representation in the virtual domain, both angular and time, the following equation

[TABLE]

Transforming this virtual channel model matrix to a vector and using properties of Kronecker product [18] gives for the overall virtual channel model user matrix,

[TABLE]

We see that although an interesting development has occurred in this case, its applicability in the downlink precoding scenario is limited. We will see below that this changes dramatically as one moves to the frequency-selective case, due to the potential to employ OFDM on top of the JSDM model and achieve a number of additional benefits, e.g., increased capacity, easier group processing, and easier overall system deployment.

For the frequency-selective case, e.g., OFDM, the corresponding results need to be developed. Using the Tap Delay Line (TDL) model of an FS channel [20], we can write for the subcarrier domain uplink channel response of the UPA444Similar results are derived for any UPA or ULA configuration within the context of this paper. in time domain as

[TABLE]

where $\delta(\cdot)$ represents the Dirac delta function, $B$ is the system bandwidth, with $B\gg B_{COH}$ and where $B_{COH}$ is the coherence bandwidth of the channel. We can then write for the frequency response of the channel

[TABLE]

where ${\mathbf{F}}_{L,Q}$ is the last $Q-L$ row-truncated DFT matrix of order $Q$ , i.e., a matrix of size $L\times Q$ , ${\mathbf{f}}_{L,Q,l}$ is its $l$ th column, ${\mathbf{M}}_{h}$ is an $N_{u,x}N_{u,y}\times L$ matrix equal to $[{\mathbf{a}}_{y}(\theta_{lgkn},\phi_{lgkn})\otimes{\mathbf{a}}_{x}(\theta_{1gkn},\phi_{1gkn})\cdots{\mathbf{a}}_{y}(\theta_{Lgkn},\phi_{Lgkn})\otimes{\mathbf{a}}_{x}(\theta_{Lgkn},\phi_{Lgkn})]\mathrm{diag}[\beta_{1gkn}\cdots\beta_{Lgkn}]$ , where $\mathrm{diag}[\cdot]$ is the diagonal matrix of the vector in the brackets. Thus, ${\tilde{\mathbf{H}}}_{u,g,k,n}^{(f)}$ is of size $N_{u,x}N_{u,y}\times Q$ , with only a few non-zero entries on each column, all of them on the same row numbers. The $q$ th column of ${\tilde{\mathbf{H}}}_{u,g,k,n}^{(f)}$ is the uplink channel impulse response denoted as ${\mathbf{h}}_{u,g,k,n}^{(q)}$ . By recalling the fact that the spatial channel is sparse when projected to the virtual angles, exploiting the virtual channel domain representation, and after using the channel reciprocity between uplink and downlink due to TDD, for each subcarrier, we can rewrite the downlink channel of user’s $k$ , antenna $n$ , subcarrier $q$ , and group $g$ as ${(\mathbf{h}}_{u,g,k,n,v}^{(q)})^{h}$ . We can then write for the downlink channel over all subcarriers, ${\mathbf{H}}_{d,g,k,n}^{(f)},$

[TABLE]

then by stacking together all antennas for user $k$ , we get

[TABLE]

where ${\mathbf{H}}_{u,g,k,v}^{(q)}=\left[{\mathbf{H}}_{u,g,k,1,v}^{(q)}\cdots{\mathbf{H}}_{u,g,k,N_{d,k^{(g)}},v}^{(q)}\right],$ a size $|{\cal S}_{g}|\times N_{d,k^{(g)}}$ matrix. Each group, $g$ ( $1\leq g\leq G$ ) can be considered independently due to JSDM, as explained above. We can then employ different subcarriers for different users within a group or between different groups which is explained in more detail next.

IV-A Combined Frequency and Spatial Division and Multiplexing (CFSDM)

In certain scenarios, it is envisaged that there is partial overlapping between adjacent groups which can lead to significant reduction in system capacity as multiple common VCMBs need to be switched off to avoid cross-group interference. Furthermore, the user co-ordination-related issues within each group might make JSDM difficult to deploy, in general. One very promising solution to mitigate both of these problems, without sacrificing the overall system capacity, is proposed herein by virtue of a novel combination of the concept of CFSDM. This idea is described below.

In CFSDM, group support sets with common VCMBs are assigned different OFDM subcarriers. In addition, users with multiple antennas within each group are also assigned different OFDM subcarriers. Finally, for users with a single antenna on the downlink, offering multiple subcarriers is the only possibility toward higher data rates. The novelty of combining JSDM based on the VCM decomposition as proposed here and OFDM lies over the fact that it helps mitigate interference issues associated with common inter-group VCMBs as well as intra-group co-ordination. Due to the orthogonality among the subcarriers in OFDM, it is possible for two groups with common VCMBs to receive data on two different subcarriers without interference, while utilizing all the VCMBs available to them. In a similar fashion, for users within a group, assigning different subcarriers to each user makes it feasible that each user receives its data on a separate subcarrier utilizing its own receiving antennas only, thus obliterating the requirement for user co-ordination at the receiver. Specifically, let’s look at a system with FS and OFDM as described in the previous subsection. Assume the system groups are as in Section I and that the OFDM component contains $Q$ orthogonal subcarriers, for some “high enough” number, $Q$ (e.g., $Q\geq 64$ ). First, let’s assume that there is overlapping of the VCMBs between groups $g$ and $g^{\prime}$ , i.e., ${\cal S}_{g}\cap{\cal S}_{g^{\prime}}\neq\emptyset$ . The system then assigns these groups to different subcarrier groups, say ${\cal S}_{g,q}$ , ${\cal S}_{g^{\prime},q^{\prime}}$ , which will be defined explicitly after the user subcarriers are assigned. Since there are $K_{(g)}$ users in group $g$ , there is a need to assign $K_{g}$ subcarriers for group $g$ and $K_{g^{\prime}}$ for group $g^{\prime}$ , if no coordination exists between users in the groups. In order for the two groups to employ all spatial capability available to them, the two groups need to avoid interference over the common VCMBs, thus in total the two groups need $K_{g}+K_{g^{\prime}}$ different subcarriers assigned to them. Within each group, say for group $g$ , user $k^{(g)}$ employing subcarrier $q_{g,k}$ , there will be a PGP precoder employed in the subcarrier domain pertaining to the following receiver model

[TABLE]

Now, precoding is performed on a per user and subcarrier basis, without the need for user co-operation within the group. This CFSDM approach allows for more flexible data rate allocations on a per user basis as well as helps in overcoming issues associated with spatial overlapping between groups. The following lemma also helps simplify the precoder design when the number of group antennas $N_{d,g}$ is smaller than the number of available spatial dimensions $|{\cal S}_{g}|$ .

Lemma 4.

When all users in a group have the same number of antennas and with $L\ll\sqrt{Q}$ and subcarrier pairs $q,~{}q^{\prime}$ assigned within a group satisfying $|q-q^{\prime}|\ll\sqrt{Q}$ , then all subcarrier virtual downlink channel matrices, i.e., for all $q=1,2,\cdots,Q$ , $({\mathbf{H}}_{u,g,k,v}^{(q)})^{h}$ have the same singular values. Thus, the optimal precoder over all subcarriers is the same.

Proof.

We can easily rewrite (38) by employing Kronecker matrix products as

[TABLE]

where $({\mathbf{M}}_{u,g,k,v})$ is a $N_{k^{(g)}}L\times|{\cal S}_{g}|$ virtual channel matrix derived from (35) and ${\mathbf{f}_{q,L}}$ represents the $q$ th column of the matrix ${\mathbf{F}}_{L,Q}$ . Now, based on the assumptions of the lemma, for any different subcarriers assigned to the group and for all $1\leq l\leq L$ , we have $\exp(j2\pi\frac{(q-q^{\prime})l}{Q})\approx 1$ , from which we see that the matrices are approximately $\left({\mathbf{I}}_{N_{k^{(g)}}}\otimes{\mathbf{f}_{q,L}}^{h}\right)({\mathbf{M}}_{u,g,k,v})^{h}$ equal for all users in the group, thus they possess approximately equal singular values. ∎

Note that for a massive MIMO system a large number of $Q$ will be needed. In addition in the millimeter wavelength channels envisaged for 5th Generation (5G) cellular wireless systems, the assumption of $L\ll\sqrt{Q}$ will also be valid, since $L$ is small [21]. Thus, by assigning contiguous frequency subcarriers to different users within groups we can achieve the conditions of the above lemma. Based on the premise of this lemma, the optimal downlink precoder in the group is the same, independently of the subcarrier employed. This is due to the fact that for CSIT optimal precoding, the optimal precoder only depends on the singular values of the channel matrix [5, 14]. Thus, if many subcarriers are deployed to offer higher data rates, the precoding complexity stays the same.

It is important to stress that with CFSDM, within each group, all users share the same spatial subspace, e.g., based on the same VCMBs per group. In addition, the group users share the same time domain. However, users’ signals within the group are separated based on OFDM’s frequency domain orthogonality. Futhermore, based on the same principle, users between overlapping VCMBs, although they share some of the spatial subspace, are orthogonal in the frequency domain, thus they do not interfere. Finally, the requirement for different subcarriers between groups is only imposed if the two groups share many VCMBs. If only a few VCMBs are shared, an alternative possibility is to switch off those common VCMBs, thus obliterating the intergroup interference. However, one can still see advantages of employing CFSDM.

V Numerical Results

In this section, we present our numerical results based on ULA and UPA Massive MIMO systems with $N_{u}=100$ antennas at the base station. The systems employ QAM with size $M=16,~{}64$ . We present results for both systems with and without OFDM. We have used an $L=3$ Gauss-Hermite approximation [13] which results in $3^{2N_{r}}$ total nodes in the Gauss-Hermite approximation due to MIMO in order to facilitate results with optimal precoding in conjunction with QAM modulation. The implementation of the globally optimizing methodology is performed by employing two backtracking line searches, one for ${\mathbf{W}}$ and another one for ${\boldsymbol{\Sigma}}_{G}^{2}$ at each iteration, in a fashion similar to [11]. For the results presented, it is worth mentioning that only a few iterations (e.g., typically $<8$ ) are required to converge to the optimal solution results as presented in this paper. We apply the complexity reducing method of PGP [14] which offers semi-optimal results under exponentially lower transmitter and receiver complexity [14]. PGP divides the transmitting and receiving antennas into independent groups, thus achieving a much simpler detector structure while the precoder search is also dramatically reduced as well. We divide this section in three parts, the first part looks at the VCM sparse channel representations for ULA and UPA systems, the second one examines the performance of linear precoding for Massive MIMO without OFDM, while the third one studies systems with OFDM. We use $N_{t,v},N_{r,v}$ to denote the number of data symbol inputs, and the number of of antenna outputs, respectively, in the virtual domain. By employing PGP, one can trade in higher values of $N_{t,v},~{}N_{r,v}$ for higher overall throughput, albeit at a slightly increased complexity at the transmitter and receiver, as explained in detail in some of the examples below. Alternatively, one can employ a smaller number of $N_{t,v},~{}N_{r,v}$ , in order to achieve higher throughput, but at significantly lower complexity. In all cases, it is stressed that the actual number of transmission and reception antennas stays the same, while all physical antennas are employed always. The details of these techniques are omitted here due to space limitation.

It is worthwhile mentioning that for precoding methods with finite inputs, two types of channels are regularly present in the literature [5, 11, Xiao2, 12, 13, 14]: a) Type-I channels in which the precoder offers gain in the lower $\mathrm{SNR}$ regime, and b) Type II channels in which the precoder offers gain in the high $\mathrm{SNR}$ regime. Our results herein fully corroborate this type of behavior in all cases considered.

V-A VCM Channel Sparsity for ULA and UPA Scenarios

First, we present results for the sparse behavior of the VCM representation in the ULA case. We randomly create 5 groups of channels as per the ULA model presented. The base ULA is deployed along the $z$ direction with $N_{u}=100$ elements spaced at a normalized distance $D=0.5$ . There are $L=5$ paths in each channel (a smaller number of $L$ results in sparser representations). The elevation angles for groups $G_{1},~{}G_{2},~{}G_{3},~{}G_{4},~{}G_{5}$ are at $5^{\circ},~{}33^{\circ},~{}61^{\circ},~{}89^{\circ},~{}\text{and}~{}117^{\circ}$ , respectively. In addition, the groups possess $16,~{}2,~{}4,~{}4,~{}\text{and}~{}6$ antennas, respectively. The angular spread for all groups is taken to be $\pm~{}4^{\circ}$ around the elevation angle of each group. The channels are projected to the VCM space, then only components greater than 1 in absolute square power are selected. In all cases considered, this selection process results in more than $94\%$ of the total power of each channel selected. The corresponding, non-overlapping support sets are as follows (the numbers of each set correspond to the numbered components of the VCM representation vector, i.e., the significant VCMBs):

${\cal S}_{1}=[56,~{}57,~{}58,~{}59,~{}60,~{}61,~{}62,~{}63,~{}64,~{}65]$ ,

${\cal S}_{2}=[38,~{}39,~{}40,~{}41,~{}42,~{}43,~{}44]$ ,

${\cal S}_{3}=[27,~{}28,~{}29,~{}30,~{}31,~{}32,~{}33,~{}34]$ ,

${\cal S}_{4}=[1,~{}2,~{}3,~{}4,~{}5,~{}6,~{}7,~{}99,~{}100]$ ,

${\cal S}_{5}=[70,~{}71,~{}72,~{}73,~{}74,~{}75,~{}76,~{}77,~{}78,~{}79]$ .

We observe that a ULA allows for easy sparse non-overlapping support sets for multiple groups.

Next we present similar results for a UPA array along the $x,~{}y$ directions. In this example, there are 8 groups, $G_{1}$ through $G_{8}$ , formed. The normalized distance between successive elements in both directions is $D=0.6$ , while the number of elements on each direction is equal to 10, i.e., $N_{u,x}=N_{u,y}=10$ . There are a total of $16,~{}1,~{}4,~{}4,~{}6,~{}6,~{}\text{and}~{}1$ antennas available for each group. The angle spread per dimension is $\pm 2^{\circ}$ , while $L=2$ . The corresponding VCMBs per group are as follows:

${\cal S}_{1}=[1,~{}2,~{}3,~{}4,~{}5,~{}6,~{}7,~{}8,~{}9,~{}10,~{}11,~{}12,~{}20,~{}21,~{}31,~{}41,~{}51,~{}61,~{}71,~{}81,~{}91]$ ,

${\cal S}_{2}=[12,~{}13,~{}14,~{}15]$ ,

${\cal S}_{3}=[2,~{}3,~{}4,~{}5,~{}6,~{}11,~{}12,~{}13,~{}14,~{}15,~{}16,~{}17,~{}23,~{}24,~{}34,~{}44,~{}54,~{}64,~{}74,~{}84,~{}93,~{}94,~{}]$ ,

${\cal S}_{4}=[3,~{}4,~{}5,~{}6,~{}14,~{}15,~{}24,~{}34,~{}64,~{}74,~{}75,~{}84,~{}85,~{}91,~{}92,~{}93,~{}94,~{}95,~{}96,~{}97,~{}98,~{}99,~{}100]$ ,

${\cal S}_{5}=[74,~{}83,~{}84,~{}85,~{}94]$ ,

${\cal S}_{6}=[73,~{}83]$ ,

${\cal S}_{7}=[1,~{}2,~{}11,~{}21,~{}61,~{}71,~{}81,~{}82,~{}91,~{}92,~{}93,~{}94,~{}95,~{}96,~{}97,~{}98,~{}99,~{}100]$ ,

${\cal S}_{8}=[1,~{}2,~{}3,~{}4,~{}8,~{}9,~{}10,~{}11,~{}20,~{}21,~{}31,~{}41,~{}51,~{}61,~{}71,~{}81,~{}91,~{}92,~{}99,~{}100]$ .

It is easy to see that UPA deployments offer more VCMBs per group, however at a cost to orthogonality. In addition, UPAs offer better resolution compared to ULAs, thus they could in principle offer higher capacity. An additional benefit of a UPA is the fact that one gets more VCMBs per group thus the resulting throughput with precoding is higher. Due to the significant overlapping between different group VCMBs, there are two options when UPAs are selected for higher capacity: a) Release common VCMBs, i.e., leave the common VCMBs between groups unused, however at the expense of performance, or, b) Employ OFDM in parallel to the JSDM in the system. The latter approach can offer very high capacity due to its capability to mitigate overlapping in spatial domain while at the same time it takes advantage of orthogonality between non-overlapping VCMBs. Both approaches are explained in more detail below.

V-B Precoding Results without OFDM

As a first example, we present results for a ULA with 5 groups formed, shown as $G_{1},~{}G_{2},\cdots,G_{5}$ , respectively. Groups $1,~{}2,~{}3,~{}4,~{}5$ occupy the following groups of non-overlapping, i.e., disjoint VCMBs

${\cal S}_{1}=[57,~{}58,~{}59,~{}60,~{}61,~{}62,~{}63,~{}64]$ ,

${\cal S}_{2}=[39,~{}41,~{}42,43,~{}44,~{}45,~{}46,~{}47]$ ,

${\cal S}_{3}=[25,~{}26,~{}27,~{}28,~{}29,~{}30,~{}31,~{}32,~{}33,34,~{}35]$ ,

${\cal S}_{4}=[1,~{}2,~{}3,~{}4,~{}5,~{}6,~{}98,~{}99,100]$ , and

${\cal S}_{5}=[68,~{}69,~{}70,~{}71,~{}72,~{}73,~{}74,~{}75,~{}76,~{}77]$ ,

respectively. The groups include $4,~{}2,~{}4,~{}4,~{}6$ antennas at the User Equipment (UE), respectively. In the non-OFDM case, users within groups need to co-ordinate their downlink. Thus, the number of users within the group becomes irrelevant and only the number of antennas becomes essential. In Fig. 3 we present results for $G_{4}$ . We observe that high gains in throughput are available for low $\mathrm{SNR}$ , i.e., a Type-I channel behavior. For example, at $\mathsf{SNR}_{b}=-7~{}dB$ there is an 33% throughput increase by using PGP over the no precoding case. In addition, there is an precoding gain of $4-5~{}dB$ over the low $\mathrm{SNR}$ regime. As far as complexity is concerned, based on the analysis of [13], the PGP precoding example presented with $N_{t,v}=6$ require a complexity (both at the transmitter and receiver) on the order of $3M^{4}$ , while the no precoding example requires a complexity at the receiver on the order of $M^{18}$ , thus PGP needs $(1/3)M^{14}$ less complexity. For the $N_{t,v}=8$ case the complexity reduction with PGP over the no PGP case becomes $(1/4)M^{14}$ . Thus, we see that PGP helps keep the UE complexity low, while it gives significant gains in throughput and $\mathrm{SNR}$ .

In Fig. 4 we present results for $G_{5}$ . Here, we observe high gains in throughput in high $\mathrm{SNR}$ regime. Here we employ $N_{t,v}=6$ . We observe that this is a Type-II channel behavior. At $\mathsf{SNR}_{b}>0$ , the no precoding case throughput saturates at $40~{}bps/Hz$ . However, with PGP we get significantly higher throughput, e.g., at $\mathsf{SNR}_{b}=10~{}dB$ the throughput is $48~{}bps/Hz$ . Further, it takes PGP $(1/6)M^{16}$ less UE complexity than the no precoding one in order to achieve this additional throughput at the UE.

For a UPA along the $z,~{}x$ directions, with $N_{u,z}=N_{u,x}=10$ , $D=0.6$ , we get 8 groups with the following VCMBs:

${\cal S}_{1}=[1,2,3,10,11,12,21,31,41,71,81,91],$

${\cal S}_{2}=[3,4,11,12,13,14,15,16,17,20,23,33,93]$ ,

${\cal S}_{3}=[3,4,14,94,]$ ,

${\cal S}_{4}=[4,14,24,34,44,54,64,74,83,84,85,93,94,95]$ ,

${\cal S}_{5}=[53,63,72,73,74,83,93,]$ ,

${\cal S}_{6}=[62,72]$ ,

${\cal S}_{7}=[1,11,21,61,71,81,91,92,93,99,100,]$ ,

and ${\cal S}_{8}=[1,2,3,4,5,6,7,8,9,10,11,21,81,91,100]$ .

The corresponding number of each group UE antennas is $4,~{}2,~{}4,~{}4,~{}6,~{}1,~{}6,~{}\text{and}~{}8$ , respectively. We see that partial overlapping exists between different groups VCMBs. Without OFDM, we need to leave the common VCMBs unused to avoid primary interference between groups. We thus end up with the following revised sets:

${\cal S}_{1}=[31,~{}41]$ ,

${\cal S}_{2}=[13,~{}15,~{}16,~{}17]$ ,

${\cal S}_{3}=[94]$ ,

${\cal S}_{4}=[64,~{}84,~{}85,~{}95]$ ,

${\cal S}_{5}=[53,~{}63,~{}73]$ ,

${\cal S}_{6}=[62]$ ,

${\cal S}_{7}=[61,~{}92,~{}93,~{}99]$ , and

${\cal S}_{8}=\emptyset$ .

In Fig. 5 we present results on the $G_{1}$ downlink precoding where we have applied PGP with two additional “ficticious” inputs, similar to [13] and see dramatic improvements on downlink throughput. We see the dramatic impact of VCMB overlapping in the case of UPA. Notice that the complexity involved in the PGP is two times higher than the one on the no precoding case, due to $N_{t,v}=4$ “ficticious” antennas being introduced, while the incurred loss in $G_{1}$ due to the reduction on the number of useful VCMBs is highly mitigated. This example is a Type-II channel behavior in which PGP achieves double the throughput in high $\mathrm{SNR}$ , while the corresponding UE complexity is two times higher than the no precoding one, since $N_{t,v}=4>N_{t}$ .

For the same system, in $G_{2}$ we get the results presented in Fig. 6. For the PGP and plain beamforming cases we show results for both $M=16,~{}64$ . The PGP and plain beamforming results use $N_{t,v}=N_{t}=4$ “ficticious” antennas each, the same number as the no precoding case. We observe $\mathrm{SNR}$ and thoughput gains in low $\mathrm{SNR}$ . For example an $\mathrm{SNR}$ gain higher than 8 dB with PGP in the $\mathsf{SNR_{b}}$ over the no precoding case in low $\mathrm{SNR}$ , while the incurred UE receiver complexity with PGP is $(1/2)M^{4}$ times lower than the no precoding case.

V-C Precoding Results with OFDM

We next present results with OFDM. For a UPA deployed over the $x,~{}y$ directions, with $N_{u,x}=N_{u,y}=10$ , an OFDM system with $Q=64$ subcarriers, we get 3 groups with the following VCMB’s. $G_{1}$ has

${\cal S}_{1}=[1,~{}2,~{}10,~{}11,~{}21,~{}81,~{}91]$ ,

$G_{2}$ has ${\cal S}_{2}=[11,~{}12,~{}13,~{}14,~{}15,~{}16,~{}17,~{}18,~{}19,~{}20]$ ,

and $G_{3}$ has ${\cal S}_{3}=[3,~{}4,~{}5,~{}12,~{}13,~{}14,~{}15,~{}24,~{}34,~{}44,~{}54,~{}64,~{}74,~{}84,~{}94]$ .

$G_{1}$ comprises 2 users with two antennas each, $G_{2}$ and $G_{3}$ comprise 2 users with 4 antennas each. There is VCMB overlapping between the groups, however by employing CFSDM we can avoid the interference coming from overlapping VCMBs. In addition, by employing different subcarriers between the different users in each group in CFSDM, we can avoid joint decoding within the group level, i.e., the users decode their data totally independently. In this particular example we envisaged employing in total 6 OFDM subcarriers, 2 per group for all 3 groups. In Fig. 7 and Fig. 8 we present results for user 1, user 2 of $G_{1}$ , respectively. In both cases we see Type-I channel behavior. In this example, both users employ 2 receiving antennas. By virtue of CFSDM, the downlink can employ all VCMBs for both users, i.e., no need to partition the VCMB set. The example here applies 4 downlink pre-beamformers per user and in the PGP results we use 2 groups of size $4\times 4$ each, by extending the receiving antennas to 4, using 2 “ficticious” antennas, i.e., $N_{t,v}=4$ in a fashion similar to [13]. Futhermore, a revised, improved version of plain beamforming is used in which, only inputs with non-zero associated singular values are employed. We call this form of plain beamforming Singular Value Aware Plain Beamforming (SVAPB). We see very high throughput and $\mathrm{SNR}$ gains offered by PGP over the no precoding in low $\mathrm{SNR}$ , and the plain beamforming case, over all shown $\mathrm{SNR}$ , respectively, although the latter performs better than standard beamforming due to SVAPB. In Fig. 7 we show at $\mathsf{SNR}_{b}=5~{}dB$ more than 3 times better throughput with PGP than the no precoding case, while for a quite wide range of $\mathsf{SNR}_{b}$ we see gains on the order of $8~{}dB$ in $\mathrm{SNR}$ . The corresponding complexity with PGP is $(1/2)M^{10}$ times lower than the no precoding one. In Fig. 8 we observe a gain in throughput of $33\%$ at $\mathsf{SNR}_{b}=15~{}dB$ , while the $\mathrm{SNR}$ gain is on the order of $5~{}dB$ . The corresponding complexity with PGP is same with the one in Fig. 7, i.e., $(1/2)M^{10}$ lower than the no precoding one.

VI Conclusions

In this paper, a novel methodology for Massive MIMO systems is presented, allowing for optimal downlink linear precoding with finite-alphabet inputs, e.g., QAM and multiple antennas per user. The methodology is based on a sparse VMC decomposition of the downlink channels, which then allows for orthogonality between different user groups, due to non-overlapping sets of VCMBs. The methodology is applied in systems with or without OFDM and for ULA and UPA antenna configurations. By employing the PGP technique to the proposed system, we show very high gains are available on the downlink. However, in the non-OFDM deployment, the users in each group need to co-ordinate their detection processes in order to achieve precoding gains. When OFDM is available, there is more flexibility in system design. For example, users in the group can be assigned different subcarriers, thus ameliorating the need for intra-group detection coordination. In addition, in cases of partial overlapping of the available VCMB sets, by employing separate subcarriers, as in OFDM, the interfering groups can become completely orthogonal, thus fully mitigating the inter-group interference due to partial VCMB overlap. The novel combination of OFDM with the VCM-based JSDM system presented is called Combined Frequency and Spatial Division and Multiplexing (CFSDM) and offers additional advantages, such as high throughput to users with single antenna and it also obliterates the need for intragroup user decoding coordination. Our numerical results show high gains, e.g., typically higher than $60\%$ and in some cases as high as $200\%$ in throughput while the incurred precoding complexity is exponentially lower at both the transmitter and receiver sites.

Bibliography21

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] T. Marzetta, “Noncooperative Cellular Wireless with Unlimited Numbers of Base Station Antennas,” IEEE Transactions on Wireless Communications , vol. 9, pp. 3590–3600, November 2010.
2[2] J. Jose, A. Ashikhmin, T. Marzetta, and S. Vishwanath, “Pilot Contamination and Precoding in Multi-Cell TDD Systems,” IEEE Transactions on Wireless Communications , vol. 10, pp. 2640–2651, August 2011.
3[3] H. Ngo, E. Larsson, and T. Marzetta, “Energy and Spectral Efficiency of Very Large Multiuser MIMO Systems,” IEEE Transactions on Communications , vol. 61, pp. 1436–1449, April 2013.
4[4] J. Nam, J. Y. Ahn, A. Adhikary, and G. Caire, “Joint spatial division and multiplexing: The large-scale array regime,” IEEE Trans. Inf. Theory , vol. 59, pp. 6441–6463, October 2012.
5[5] C. Xiao, Y. Zheng, and Z. Ding, “Globally Optimal Linear Precoders for Finite Alphabet Signals Over Complex Vector Gaussian Channels,” IEEE Transactions on Signal Processing , vol. 59, pp. 3301–3314, July 2011.
6[6] M. Lamarca, “Linear Precoding for Mutual Information Maximization in MIMO Systems,” in Proceedings International Symposium of Wireless Communication Systems 2009 , 2009, pp. 26–30.
7[7] F. Perez-Cruz, M. Rodriguez, and S. Verdu, “MIMO Gaussian Channels with Arbitrary Inputs: Optimal Precoding and Power Allocation,” IEEE Transactions on Information Theory , vol. 56, pp. 1070–1086, March 2010.
8[8] M. Girnyk, M. Vehkapera, and L. K. Rasmussen, “Large System Analysis of Correlated MIMO Multiple Access Channels with Arbitrary Signaling in the Presence of Interference,” IEEE Transactions on Wireless Communications , vol. 4, pp. 2060–2073, April 2014.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Downlink Precoding for Massive MIMO Systems Exploiting Virtual Channel Model Sparsity

Abstract

I INTRODUCTION

II SYSTEM MODEL AND PROBLEM STATEMENT

III THE NARROWBAND SYSTEM DESCRIPTION UNDER THE VIRTUAL CHANNEL MODEL

III-A Uniform Linear Array (ULA) at the Base with Flat Fading

Lemma 1**.**

Proof.

Lemma 2**.**

Proof.

Theorem 1**.**

Proof.

Theorem 2**.**

Proof.

III-B Uniform Planar Array (UPA) at the Base

Lemma 3**.**

Proof.

IV THE WIDEBAND SYSTEM DESCRIPTION UNDER THE VIRTUAL CHANNEL MODEL

IV-A Combined Frequency and Spatial Division and Multiplexing (CFSDM)

Lemma 4**.**

Proof.

V Numerical Results

V-A VCM Channel Sparsity for ULA and UPA Scenarios

V-B Precoding Results without OFDM

V-C Precoding Results with OFDM

VI Conclusions

Lemma 1.

Lemma 2.

Theorem 1.

Theorem 2.

Lemma 3.

Lemma 4.