Continuous Analog Channel Estimation Aided Beamforming for Massive MIMO   Systems

Vishnu V. Ratnam; Andreas F. Molisch

arXiv:1901.08763·cs.IT·August 23, 2019

Continuous Analog Channel Estimation Aided Beamforming for Massive MIMO Systems

Vishnu V. Ratnam, Andreas F. Molisch

PDF

TL;DR

This paper proposes a novel continuous analog channel estimation (CACE) technique for massive MIMO systems that reduces pilot overhead, enhances phase-noise resilience, and enables efficient analog beamforming without digital processing.

Contribution

Introduction of CACE, a new CE method that eliminates digital processing, reduces overhead, and improves phase-noise robustness in massive MIMO beamforming.

Findings

01

CACE significantly reduces channel estimation overhead compared to conventional methods.

02

CACE provides resilience against oscillator phase-noise.

03

Simulation results show only a small SNR loss with CACE.

Abstract

Analog beamforming greatly reduces the implementation cost of massive antenna transceivers by using only one up/down-conversion chain. However, it incurs a large pilot overhead when used with conventional channel estimation (CE) techniques. This is because these CE techniques involve digital processing, requiring the up/down-conversion chain to be time-multiplexed across the antenna dimensions. This paper introduces a novel CE technique, called continuous analog channel estimation (CACE), that avoids digital processing, enables analog beamforming at the receiver and additionally provides resilience against oscillator phase-noise. By avoiding time-multiplexing of up/down-conversion chains, the CE overhead is reduced significantly and furthermore becomes independent of the number of antenna elements. In CACE, a reference tone is transmitted continuously with the data signals, and the…

Figures9

Click any figure to enlarge with its caption.

Equations117

\tilde{s}_{tx} (t)

\tilde{s}_{tx} (t)

H (t)

H (t)

H (f)

[\overset{ˉ}{a}_{rx} (ψ_{azi}^{rx}, ψ_{ele}^{rx})]_{M_{V} h + v}

[\overset{ˉ}{a}_{rx} (ψ_{azi}^{rx}, ψ_{ele}^{rx})]_{M_{V} h + v}

\displaystyle=\exp{\Big{\{}{\rm j}2\pi\frac{\Delta_{\rm H}h\sin(\psi^{\rm rx}_{\rm azi})\sin(\psi^{\rm rx}_{\rm ele})+\Delta_{\rm V}(v-1)\cos(\psi^{\rm rx}_{\rm ele})}{\lambda}\Big{\}}},\!\!\!\!\!\!\!

\tilde{s}_{rx, BB} (t)

\tilde{s}_{rx, BB} (t)

\displaystyle={\rm LPF}\Big{\{}\sum_{\ell=0}^{L-1}\alpha_{\ell}\mathbf{a}_{\rm rx}(\ell){\mathbf{a}_{\rm tx}(\ell)}^{{\dagger}}\mathbf{s}_{\rm tx}(t-\tau_{\ell})\sqrt{2}e^{-{\rm j}[2\pi f_{\rm c}t+\theta(t)]}\Big{\}}+\tilde{\mathbf{w}}(t)

\displaystyle=\frac{1}{\sqrt{T_{\rm s}}}\bigg{[}\boldsymbol{\mathcal{H}}(0)\mathbf{t}\sqrt{E^{(\rm r)}}+\!\!\!\!\sum_{k\in\mathcal{K}\setminus\mathcal{G}}\!\!\!\!\boldsymbol{\mathcal{H}}(f_{k})\mathbf{t}x_{k}e^{{\rm j}2\pi f_{k}t}\bigg{]}e^{-{\rm j}\theta(t)}+\tilde{\mathbf{w}}(t),\!\!\!\!

\frac{d θ ( t )}{d t} = w_{θ} (t),

\frac{d θ ( t )}{d t} = w_{θ} (t),

\hat{s}_{rx, BB} (t)

\hat{s}_{rx, BB} (t)

y [n] ≜ y (n T_{s} / K)

y [n] ≜ y (n T_{s} / K)

\displaystyle=\frac{1}{T_{\rm s}}\mathbf{t}^{{\dagger}}{\boldsymbol{\mathcal{H}}(0)}^{{\dagger}}\sqrt{E^{(\rm r)}}\bigg{[}\boldsymbol{\mathcal{H}}(0)\mathbf{t}\sqrt{E^{(\rm r)}}

\displaystyle\qquad\qquad+\sum_{k\in\mathcal{K}\setminus\mathcal{G}}\boldsymbol{\mathcal{H}}(f_{k})\mathbf{t}x_{k}e^{{\rm j}2\pi k\frac{n}{K}}\bigg{]}{A^{*}[n]e^{-{\rm j}\theta[n]}}

\displaystyle\ \ +\sqrt{\frac{1}{T_{\rm s}}}{\hat{\mathbf{w}}[n]}^{{\dagger}}\bigg{[}\boldsymbol{\mathcal{H}}(0)\mathbf{t}\sqrt{E^{(\rm r)}}+\!\!\!\!\sum_{k\in\mathcal{K}\setminus\mathcal{G}}\!\!\!\!\boldsymbol{\mathcal{H}}(f_{k})\mathbf{t}x_{k}e^{{\rm j}2\pi k\frac{n}{K}}\bigg{]}e^{-{\rm j}\theta[n]}

+ \frac{1}{T _{s}} t^{†} H (0)^{†} E^{(r)} \tilde{w} [n] A^{*} [n] + \hat{w} [n]^{†} \tilde{w} [n],

\tilde{w} [n]

\tilde{w} [n]

e^{- j θ [n]}

k \in K \sum Ω [k] Ω [k + k_{1}]^{*} = δ_{0, k_{1}}^{K},

k \in K \sum Ω [k] Ω [k + k_{1}]^{*} = δ_{0, k_{1}}^{K},

Δ_{k_{1}, k_{2}} ≜ E {Ω [k_{1}] Ω [k_{2}]^{*}}

\displaystyle\ \qquad\approx\frac{\delta_{k_{1},k_{2}}^{K}}{K}\bigg{[}\frac{1-e^{-(\frac{\sigma_{\theta}^{2}T_{\rm s}-{\rm j}4\pi k_{1}}{4})}}{e^{\frac{\sigma_{\theta}^{2}T_{\rm s}-{\rm j}4\pi k_{1}}{2K}}-1}+\frac{1-e^{-(\frac{\sigma_{\theta}^{2}T_{\rm s}+{\rm j}4\pi k_{1}}{4})}}{1-e^{-\frac{\sigma_{\theta}^{2}T_{\rm s}+{\rm j}4\pi k_{1}}{2K}}}\bigg{]},\!

E {W [k_{1}] W [k_{2}]^{†}}

E {W [k_{1}] W [k_{2}]^{†}}

E {W [k_{1}] W [k_{2}]^{T}}

\hat{w} [n]

\hat{w} [n]

A [n]

Y_{k} = \frac{T _{s}}{K} n = 0 \sum K - 1 y [n] e^{- j 2 π \frac{k n}{K}}

Y_{k} = \frac{T _{s}}{K} n = 0 \sum K - 1 y [n] e^{- j 2 π \frac{k n}{K}}

\displaystyle\quad=\sum_{\dot{k}\in\hat{\mathcal{G}}}\mathbf{t}^{{\dagger}}{\boldsymbol{\mathcal{H}}(0)}^{{\dagger}}\bigg{(}\sum_{\bar{k}\in\mathcal{K}\setminus\mathcal{G}}\boldsymbol{\mathcal{H}}(f_{\bar{k}})\mathbf{t}\sqrt{E^{(\rm r)}}x_{\bar{k}}\Omega^{*}[\dot{k}]\Omega[\dot{k}+k-\bar{k}]

\displaystyle\qquad\qquad\qquad+\boldsymbol{\mathcal{H}}(0)\mathbf{t}E^{(\rm r)}\Omega^{*}[\dot{k}]\Omega[\dot{k}+k]\bigg{)}

\displaystyle\qquad+\sum_{\dot{k}\in\hat{\mathcal{G}}}\sqrt{T_{\rm s}}{\mathbf{W}[\dot{k}]}^{{\dagger}}\bigg{(}\boldsymbol{\mathcal{H}}(0)\mathbf{t}\sqrt{E^{(\rm r)}}\Omega[k+\dot{k}]

\displaystyle\qquad\qquad\qquad+\sum_{\bar{k}\in\mathcal{K}\setminus\mathcal{G}}\boldsymbol{\mathcal{H}}(f_{\bar{k}})\mathbf{t}x_{\bar{k}}\Omega[k+\dot{k}-\bar{k}]\bigg{)}

+ \dot{k} \in \hat{G} \sum T_{s} t^{†} H (0) W [k + \dot{k}] E^{(r)} Ω^{*} [\dot{k}]

+ T_{s} \dot{k} \in \hat{G} \sum W^{†} [\dot{k}] W [k + \dot{k}] .

S_{k}

S_{k}

E {∣ S_{k} ∣^{2}}

E {∣ S_{k} ∣^{2}}

I_{k}

I_{k}

E {I_{k}}

E {I_{k}}

= (1) \dot{k} \in \hat{G} \sum M_{rx} β_{0, 0} E^{(r)} Δ_{\dot{k} + k, \dot{k}} = 0

E {∣ I_{k} ∣^{2}}

\displaystyle\stackrel{{\scriptstyle(2)}}{{=}}\!\!\sum_{\bar{k}\in\mathcal{K}\setminus[\mathcal{G}\cup\{k\}]}\!\!M_{\rm rx}^{2}{|\beta_{0,\bar{k}}|}^{2}E^{(\rm r)}E^{(\rm d)}\mathbb{E}\bigg{\{}{\bigg{|}\sum_{\dot{k}\in\hat{\mathcal{G}}}{\Omega[\dot{k}]}^{*}\Omega[\dot{k}+k-\bar{k}]\bigg{|}}^{2}\bigg{\}}

\displaystyle\quad+M_{\rm rx}^{2}{|\beta_{0,0}|}^{2}{[E^{(\rm r)}]}^{2}\mathbb{E}\bigg{\{}{\bigg{|}\sum_{\dot{k}\in\hat{\mathcal{G}}}{\Omega[\dot{k}]}^{*}\Omega[\dot{k}+k]\bigg{|}}^{2}\bigg{\}}

\displaystyle\stackrel{{\scriptstyle(3)}}{{\leq}}\sum_{\bar{k}\in\mathcal{K}\setminus\{k\}}M_{\rm rx}^{2}{|\beta_{\rm max}|}^{2}E^{(\rm r)}E^{(\rm d)}\mathbb{E}\bigg{\{}{\bigg{|}\sum_{\dot{k}\in\hat{\mathcal{G}}}{\Omega[\dot{k}]}^{*}\Omega[\dot{k}+k-\bar{k}]\bigg{|}}^{2}\bigg{\}}

\displaystyle\quad+M_{\rm rx}^{2}{|\beta_{\rm max}|}^{2}{[E^{(\rm r)}]}^{2}\mathbb{E}\bigg{\{}\Big{[}\sum_{\dot{k}\in\hat{\mathcal{G}}}{\big{|}\Omega[\dot{k}]\big{|}}^{2}\Big{]}\Big{[}\sum_{\dot{k}\in\hat{\mathcal{G}}}{\big{|}\Omega[\dot{k}+k]\big{|}}^{2}\Big{]}\bigg{\}}

\displaystyle\stackrel{{\scriptstyle(4)}}{{\leq}}M_{\rm rx}^{2}{|\beta_{\rm max}|}^{2}E^{(\rm r)}E^{(\rm d)}\mathbb{E}\bigg{\{}\sum_{\dot{k},\ddot{k}\in\hat{\mathcal{G}}}{\Omega[\dot{k}]}^{*}

\displaystyle\quad\times\bigg{[}\sum_{\bar{k}\in\mathcal{K}\setminus\{k\}}\Omega[\dot{k}+k-\bar{k}]{\Omega[\ddot{k}+k-\bar{k}]}^{*}\bigg{]}\Omega[\ddot{k}]\bigg{\}}

\displaystyle\quad+M_{\rm rx}^{2}\sum_{\dot{k}\in\hat{\mathcal{G}}}{|\beta_{\rm max}|}^{2}{[E^{(\rm r)}]}^{2}\mathbb{E}\big{\{}{\big{|}\Omega[\dot{k}+k]\big{|}}^{2}\big{\}}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Continuous Analog Channel Estimation Aided Beamforming for Massive MIMO Systems

Vishnu V. Ratnam, and Andreas F. Molisch V. V. Ratnam is with the Standards and Mobility Innovation Lab at Samsung Research America, Plano, TX, 75023 USA. A. F. Molisch is with the Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA, 90089 USA (e-mail: {ratnam, molisch}@usc.edu). This work was supported by the National Science Foundation. Part of this work was presented at IEEE ICC 2018 [1].

Abstract

Analog beamforming greatly reduces the implementation cost of massive antenna transceivers by using only one up/down-conversion chain. However, it incurs a large pilot overhead when used with conventional channel estimation (CE) techniques. This is because these CE techniques involve digital processing, requiring the up/down-conversion chain to be time-multiplexed across the antenna dimensions. This paper introduces a novel CE technique, called continuous analog channel estimation (CACE), that avoids digital processing, enables analog beamforming at the receiver and additionally provides resilience against oscillator phase-noise. By avoiding time-multiplexing of up/down-conversion chains, the CE overhead is reduced significantly and furthermore becomes independent of the number of antenna elements. In CACE, a reference tone is transmitted continuously with the data signals, and the receiver uses the received reference signal as a matched filter for combining the data signals, albeit via analog processing. We propose a receiver architecture for CACE, analyze its performance in the presence of oscillator phase-noise, and derive near-optimal system parameters and power allocation. Transmit beamforming and initial access procedure with CACE are also discussed. Simulations confirm that, in comparison to conventional CE, CACE provides phase-noise resilience and a significant reduction in the CE overhead, while suffering only a small loss in signal-to-interference-plus-noise-ratio.

Index Terms:

Hybrid beamforming, analog beamforming, massive MIMO, channel estimation, analog channel estimation, initial access, carrier recovery, carrier arraying.

I Introduction

Massive multiple-input-multiple-output (MIMO) systems, where the transmitter (TX) and/or receiver (RX) are equipped with a large array of antenna elements, are considered a key enabler of 5G cellular technologies due to the massive beamforming and/or spatial multiplexing gains they offer [2, 3]. This technology is especially attractive at millimeter (mm) wave and terahertz (THz) frequencies, where the massive antenna arrays can be built with small form factors, and where the resulting beamforming gain can help compensate for the large channel attenuation. Despite the numerous benefits, full complexity massive MIMO transceivers, where each antenna has a dedicated up/down-conversion chain, are hard to implement in practice. This is due to the cost and power requirements of the up/down-conversion chains – which include expensive and power hungry circuit components such as the analog-to-digital converters (ADCs) and digital-to-analog converters [4]. A key solution to reduce the implementation costs of massive MIMO while retaining many of its benefits is Hybrid Beamforming [5, 6, 7, 8, 9, 10], wherein a massive antenna array is connected to a smaller number of up/down-conversion chains via the use of analog hardware, such as phase-shifters and switches. While being comparatively cost and power efficient, the analog hardware can focus the transmit/receive power into the dominant channel directions, thus minimizing the performance loss in comparison to full complexity transceivers. In this paper, we focus on a special case of hybrid beamforming with only one up/down-conversion chain (for the in-phase and quadrature signal components each), referred to as analog beamforming.

A major challenge for analog beamforming (and also hybrid beamforming in general) is the acquisition of channel state information required for beamforming, henceforth referred to as rCSI. The rCSI may include, for example, average/statistical channel parameters in some beamformer designs [11, 12, 10] and instantaneous parameters in some other designs [7, 8, 13]. The rCSI is commonly obtained by transmitting known signals (pilots) at the TX and performing channel estimation (CE) at the RX, at least once per rCSI coherence time $T_{\rm rcoh}$ – which is the duration for which the rCSI remains approximately constant. Since one down-conversion chain has to be time-multiplexed across the RX antennas for CE in analog beamforming, several pilot re-transmissions are required for rCSI acquisition [14, 15, 16, 12]. As an example, exhaustive CE approaches [15] require ${\rm O}(M_{\rm tx}M_{\rm rx})$ pilots per $T_{\rm rcoh}$ , where $M_{\rm tx},M_{\rm rx}$ are the number of TX and RX antennas, respectively and ${\rm O}(\cdot)$ represents the scaling behavior in the big-‘o’ notation. Such a large pilot overhead may consume a significant portion of the time-frequency resources when $T_{\rm rcoh}$ is short, such as in vehicle-to-vehicle channels [17], in systems using narrow TX/RX beams, e.g., massive MIMO systems, or in channels with large carrier frequencies (high Doppler) and high blocking probabilities, e.g., at mm-wave, THz frequencies [18]. The overhead also increases system latency and makes the initial access111Initial access refers to the process wherein, a user equipment and base-station discover each other, synchronize, and coordinate to initiate communication. (IA) procedure cumbersome [19, 20, 21]. As a solution, several fast CE approaches have been proposed in literature, which are discussed below assuming $M_{\rm tx}=1$ for convenience.222For $M_{\rm tx}>1$ , the pilot overhead increases further, either multiplicatively or additively, by a function of $M_{\rm tx}$ , determined by the CE algorithm used at the TX. Side information aided CE approaches utilize spatial/temporal statistics of rCSI to reduce the pilot overhead [22, 23, 9, 16]. Compressed sensing based CE approaches [24, 14, 25, 26] exploit the sparse nature of the channel to reduce the number of pilots per $T_{\rm rcoh}$ up to ${\rm O}\big{(}L\log(M_{\rm rx}/L)\big{)}$ , where $L$ is the number of non-zero components of the channel in a certain basis. Iterative angular domain CE uses progressively narrower search beams at the RX to reduce the required pilots per $T_{\rm rcoh}$ to ${\rm O}(\log M_{\rm rx})$ [27, 28, 21]. Approaches that utilize side information to enhance iterative angular domain CE [29, 30] or perform angle domain tracking [31, 32, 33] have also been considered. Sparse ruler based approaches exploit the possible Toeplitz structure of the spatial correlation matrix to reduce pilots per $T_{\rm rcoh}$ to ${\rm O}(\sqrt{M_{\rm rx}})$ [34, 35, 36, 37, 12]. For $M_{\rm tx}>1$ , matrix completion based techniques [38] use the low channel rank to reduce pilots per $T_{\rm rcoh}$ up to $O\big{(}L(M_{\rm rx}+M_{\rm tx})\big{)}$ , where $L$ is the channel rank. In all these approaches, since the required pilots per $T_{\rm rcoh}$ still scale with $M_{\rm rx}$ , they are only partially successful in reducing the CE overhead. Some of these approaches also require side information and/or prior timing/frequency synchronization [39, 40], making them less suitable for the IA procedure. Some approaches also require a long $T_{\rm rcoh}$ that spans the pilot re-transmissions and/or are only applicable for certain antenna array configurations and channel models. Compressed sensing, sparse ruler, and matrix completion approaches may further incurr large computation and/or memory overheads, making them unsuitable for use at user equipments (UEs). Finally, since these CE techniques require time-multiplexing of the up/down-conversion chains, they are prone to the transient effects of the analog hardware [41].

The main reason for the overhead is that conventional CE approaches require processing in the digital domain, while the RX has only one down-conversion chain. Prior to the growth of digital hardware and digital processing capabilities, some legacy systems [42, 43, 44, 45] used an alternate RX beamforming approach in single path channels (for space communication), that did not require digital processing. In this approach, an analog phase locked loop (PLL) is used to recover the carrier tone from the received signal at each RX antenna, and the recovered carrier is then used for down-converting the received signal at that antenna to base-band. Since the carrier and data experience the same inter-antenna phase shift (in single path channels), the down-conversion leads to compensation of this phase shift, enabling coherent combining of the signals from each antenna (i.e., beamforming). Note that carrier recovery at each RX antenna can be interpreted as an implicit estimation of the channel phase using analog hardware.333The difference between ‘isolation/recovery’ and ‘estimation’ is somewhat vague in the context of analog signal processing. We shall refer to such schemes that use only analog hardware to acquire the rCSI as analog channel estimation (ACE) techniques. As ACE does not involve digital processing, it avoids time-multiplexing of the down-conversion chain and shows potential in reducing the CE overhead for analog beamforming. The delay domain counterpart of ACE was also explored for single antenna ultra-wideband systems, referred to as transmit reference schemes [46, 47]. However, these legacy ACE systems only exploit the carrier phase but not its amplitude, and thus are not directly applicable to multi-path channels. Additionally, they involve recovery of the carrier at the RX via a PLL, which is difficult at the low signal-to-noise ratios (SNRs) and high frequencies encountered in mm-wave/THz systems [48, 49]. The PLL aided recovery may also lead to a high RX phase-noise [48, 49], viz., random fluctuation in the instantaneous frequency of the recovered carrier, that degrades the system performance. Finally, prior works do not perform a detailed performance analysis of ACE or explore optimal system parameters. Therefore in this paper we explore a more generalized ACE approach for RX beamforming, called continuous ACE (CACE). Instead of using PLLs for carrier recovery, the CACE RX uses a local oscillator and low-pass filter combination to isolate/filter-out a received reference/carrier signal. This (i) enables exploiting both the amplitude and phase information of the channel response, which is essential for multi-path channels, (ii) avoids the poor performance of PLL based recovery at low SNRs and (iii) helps compensate for TX/RX oscillator phase noise.

In CACE, a reference tone, i.e. a sinusoidal tone at a known frequency, is continuously transmitted along with the data by the TX, as illustrated in Fig. 2a.444Since this reference need not be at the center frequency of the TX signal, we don’t refer to it as the carrier here. At the RX, the received signal at each antenna is converted to base-band by a bank of mixers and a local oscillator that is tuned (approximately) to the reference frequency, as illustrated in Fig. 1. The in-phase (I) and quadrature (Q) components of the resulting base-band signal at each antenna are then low-pass filtered to extract the received signals corresponding to the reference, as illustrated in Fig. 2b. These filtered outputs, which are implicit $\text{estimates}^{3}$ of the channel response (including amplitude and phase) at the reference frequency, are then used as control signals to a variable gain, analog phase-shifter array to generate the RX analog beam. The un-filtered base-band received signals at each antenna are processed by these phase-shifters, added and fed to a single ADC for demodulation. As shall be shown, this process emulates using the received signal for the reference as a matched filter for the received data signals, and it achieves a large RX beamforming gain in sparse, wide-band massive MIMO channels. This is because, while the reference and data signals have different frequencies and thus may experience different channel responses, the channel response exhibits a similar spatial signature across frequency, especially for large antenna arrays (see Remark IV.1). Furthermore, since any TX/RX oscillator phase-noise affects both the reference and data similarly, the match filtering helps partially mitigate the phase-noise from the demodulation outputs. As digital processing and time multiplexing of the down-conversion chain are not required, the computational complexity is low and the CE overhead does not scale with $M_{\rm rx}$ . Unlike conventional beamformer designs [11, 50, 7, 32], CACE aided beamforming also improves diversity against multi-path component (MPC) blocking by combining the received signal power from many channel MPCs. The phase shifts during the receive mode can also be utilized for transmit beamforming on the reverse link. By providing an option for digitally controlling these phase-shifter inputs, the proposed architecture can also support conventional RX beamforming approaches when required.

On the flip side, CACE may require additional analog hardware in comparison to conventional digital CE, including $2M_{\rm rx}$ mixers and low-pass filters. Additionally, the accumulation of power from multiple MPCs, while improving diversity, may cause performance degradation in rich scattering channels, as shall be shown. An improperly designed reference tone may also cause strong inter-carrier interference (ICI) to the data signals due to phase noise. Finally, the proposed approach in its suggested form does not support reception of multiple spatial data streams and can only be used for beamforming at one end of a communication link. This architecture is therefore more suitable for use at the UEs. The possible extensions to multiple spatial stream reception shall be explored in future work.

Some preliminary results of this work were published in the conference paper [1], albeit under a simplified RX model and without a detailed phase-noise analysis. As shall be shown here, the phase-noise analysis plays a vital role in CACE performance and system design. A different ACE technique that does not require continuous transmission of the reference, called periodic ACE (PACE), was proposed by us in [51]. While PACE prevents wastage of transit resources on a continuous reference, it involves PLL aided reference recovery and suffers severe performance degradation at very low SNRs. A third ACE technique, called multi-antenna frequency shift reference (MA-FSR), that uses square law components instead of filters and phase-shifters at the RX, was explored by us in [52]. While being resilient to phase-noise like CACE and having a low hardware cost, MA-FSR is a non-coherent technique that suffers from a poor bandwidth efficiency of $50\%$ . It should be emphasized that CACE, PACE and MA-FSR are three different ACE schemes to help reduce CE overhead in massive MIMO systems, each having their unique advantages and RX architectures, and each requiring separate performance analysis techniques. The detailed analysis presented in this paper for CACE, in combination with the analysis of PACE and MA-FSR in [51, 52], shall aid in a detailed comparison of these schemes for a specific application. The contributions of this paper are as follows:

We propose a novel transmission technique called CACE, that enables RX beamforming with low CE overhead, and propose a corresponding RX architecture for CACE. 2. 2.

We analytically characterize the achievable system throughput with CACE aided beamforming in a wide-band channel, and derive near-optimal system parameters. 3. 3.

In the process, the impact of oscillator phase-noise on performance and the ability of CACE to partially suppress phase-noise are explored. 4. 4.

Simulations under practically relevant channel models are presented to support the analytical results.

The organization of the paper is as follows: the system model is presented Section II; the signal, noise and interference components at the demodulation outputs are analyzed in Section III; the system performance is characterized in Section IV; IA and transmit beamforming are discussed in Section V; simulations results are presented in Section VI and finally conclusions are drawn in Section VII.

Notation: scalars are represented by light-case letters; vectors by bold-case letters; and sets by calligraphic letters. Additionally, ${\rm j}=\sqrt{-1}$ , $a^{*}$ is the complex conjugate of a complex scalar $a$ , $|\mathbf{a}|$ represents the $\ell_{2}$ -norm of a vector $\mathbf{a}$ , $\mathbf{A}^{\rm T}$ is the transpose of a matrix $\mathbf{A}$ and ${\mathbf{A}}^{{\dagger}}$ is the conjugate transpose of a complex matrix $\mathbf{A}$ . Finally, $\ThisStyle{\ooalign{$ \SavedStyle\mathbb{I} $\cr\kern-0.18pt$ \SavedStyle\mathbb{I} $\cr\kern 0.18pt$ \SavedStyle\mathbb{I} $}}_{a}$ is an $a\times a$ identity matrix, $\ThisStyle{\ooalign{$ \SavedStyle\mathbb{O} $\cr\kern-0.18pt$ \SavedStyle\mathbb{O} $\cr\kern 0.18pt$ \SavedStyle\mathbb{O} $}}_{a,b}$ is the $a\times b$ all zeros matrix, $\mathbb{E}\{\}$ represents the expectation operator, $\stackrel{{\scriptstyle\rm d}}{{=}}$ represents equality in distribution, $\mathrm{Re}\{\cdot\}$ / $\mathrm{Im}\{\cdot\}$ refer to the real/imaginary component, respectively, and $\mathcal{CN}(\mathbf{a},\mathbf{B})$ represents a circularly symmetric complex Gaussian vector with mean $\mathbf{a}$ and covariance matrix $\mathbf{B}$ .

II General Assumptions and System model

We consider a single cell system in downlink, where a $M_{\rm tx}$ antenna base-station (BS) transmits data to multiple UEs simultaneously via spatial multiplexing. Since we mainly focus on the downlink, we shall use the terms BS/TX and UE/RX interchangeably. Each UE is assumed to have a hybrid architecture, with $M_{\rm rx}$ antennas and one down-conversion chain, and it performs CACE aided RX beamforming. On the other hand, the BS may have an arbitrary architecture and it transmits a single spatial data-stream to each scheduled UE. For convenience, we consider the use of noise-less and perfectly linear antennas, filters, amplifiers and mixers at the BS and UEs. We assume the downlink BS-UE communication to be divided into three stages: (i) Initial Access (IA) (see footnote 1 on page 1) (ii) TX beamformer design - where the TX acquires rCSI for all the UEs and uses it to perform UE scheduling, TX beamforming and power allocation, and (iii) Data transmission - wherein the BS transmits data signals and the scheduled UEs use CACE to adapt the RX beams and receive the data. Through a major portion of this paper, we assume that the IA and TX beamformer design have been performed apriori and shall focus on the data transmission stage. However in Section V, we shall also discuss how CACE beamforming can help in stages (i) and (ii).

In stage (iii), we assume the BS to transmit spatially orthogonal signals to the scheduled UEs to mitigate inter-user interference. This can be achieved, for example, by careful UE scheduling and/or via avoiding transmission to common channel scatterers [23]. For this system model and for a given TX beamformer and power allocation, we shall restrict the analysis to a single representative UE, without loss of generality. The BS is assumed to transmit orthogonal frequency division multiplexing (OFDM) symbols to the representative UE, with $K=K_{1}+K_{2}+1$ sub-carriers indexed as $\mathcal{K}=\{-K_{1},...,K_{2}\}$ . The [math]-th sub-carrier is used as the reference tone, while data is transmitted on the $K_{1}-g$ lower and $K_{2}-g$ higher sub-carriers indexed as $\mathcal{K}\setminus\mathcal{G}$ , where $\mathcal{G}=\{-g,..,0,..,g\}$ defines the non-data sub-carriers and $g$ is a design parameter. The remaining $2g$ sub-carriers, with indices in $\mathcal{G}\setminus\{0\}$ , are blanked to act as a guard band between the reference and data sub-carriers, as illustrated in Fig.2a.555While not considered here explicitly, the results can also be extended to the case of a single-carrier system where the reference tone manifests as an un-suppressed carrier component. Since the BS can afford an accurate oscillator, by ignoring its phase-noise, the complex equivalent transmit signal for the [math]-th OFDM symbol of stage (iii) can then be expressed as:

[TABLE]

for $-T_{\rm cp}\leq t\leq T_{\rm s}$ , where $\mathbf{t}$ is the $M_{\rm tx}\times 1$ unit-norm TX beamforming vector for this UE (designed apriori in stage (ii)), $E^{(\rm r)}$ is the energy-per-symbol allocated to the reference tone, $x_{k}$ is the data signal on the $k$ -th OFDM sub-carrier, $f_{\rm c}$ is the reference frequency, $f_{k}\triangleq k/T_{\rm s}$ represents the frequency offset of the $k$ -th sub-carrier and $T_{\rm s},T_{\rm cp}$ are the symbol duration and the cyclic prefix duration, respectively. Here we define the complex equivalent signal such that the actual (real) transmit signal is given by $\mathbf{s}_{\rm tx}(t)=\mathrm{Re}\{\tilde{\mathbf{s}}_{\rm tx}(t)\}$ . For the data sub-carriers ( $k\in\mathcal{K}\setminus\mathcal{G}$ ), we assume the use of independent data streams with equal power allocation, and circularly symmetric Gaussian signaling, i.e., $x_{k}\sim\mathcal{CN}(0,E^{(\rm d)})$ . The transmit power constraint is then given by $E^{(\rm r)}+(K-|\mathcal{G}|)E^{(\rm d)}\leq E_{\rm s}$ , where $E_{\rm s}$ is the total OFDM symbol energy (excluding the cyclic prefix).

The channel to the representative UE is assumed to have $L$ MPCs with the $M_{\rm rx}\times M_{\rm tx}$ channel impulse response matrix and its Fourier transform, respectively, given as [18]:

[TABLE]

where $\alpha_{\ell}$ is the complex amplitude and $\tau_{\ell}$ is the delay and $\mathbf{a}_{\rm tx}(\ell),\mathbf{a}_{\rm rx}(\ell)$ are the $M_{\rm tx}\times 1$ TX and $M_{\rm rx}\times 1$ RX array response vectors, respectively, of the $\ell$ -th MPC. As an illustration, the $\ell$ -th RX array response vector for a uniform planar array with $M_{\rm H}$ horizontal and $M_{\rm V}$ vertical elements ( $M_{\rm rx}=M_{\rm H}M_{\rm V}$ ) is given by $\mathbf{a}_{\rm rx}(\ell)=\bar{\mathbf{a}}_{\rm rx}\big{(}\psi^{\rm rx}_{\rm azi}(\ell),\psi^{\rm rx}_{\rm ele}(\ell)\big{)}$ , where

[TABLE]

for $h\in\{0,..,M_{\rm H}-1\}$ and $v\in\{1,..,M_{\rm V}\}$ , $\psi^{\rm rx}_{\rm azi}(\ell)$ , $\psi^{\rm rx}_{\rm ele}(\ell)$ are the azimuth and elevation angles of arrival for the $\ell$ -th MPC, $\Delta_{\rm H},\Delta_{\rm V}$ are the horizontal and vertical antenna spacings and $\lambda$ is the carrier wavelength. Expressions for $\mathbf{a}_{\rm tx}(\ell)$ can be obtained similarly. Note that in (2) we implicitly ignore the frequency variation of individual MPC amplitudes $\{\alpha_{0},..,\alpha_{L-1}\}$ and the beam squinting effects [53], which are reasonable assumptions for moderate system bandwidths. It is emphasized that the complete channel response (including all MPCs) however still experiences frequency selective fading. To prevent inter-symbol interference, we let the cyclic prefix be longer than the maximum channel delay: $T_{\rm cp}>\tau_{L-1}$ . We also consider a generic temporal variation model, where the time scale at which the MPC parameters $\{\alpha_{\ell},\mathbf{a}_{\rm tx}(\ell),\mathbf{a}_{\rm rx}(\ell),\tau_{\ell}\}$ change is much larger than the symbol duration $T_{\rm s}$ . Finally, we do not assume any distribution prior or side information on $\{\alpha_{\ell},\mathbf{a}_{\rm tx}(\ell),\mathbf{a}_{\rm rx}(\ell),\tau_{\ell}\}$ .

Each RX antenna front-end is assumed to have a low noise amplifier (LNA) followed by a band-pass filter (BPF) that leaves the desired signal un-distorted but suppresses the out-of-band noise. The filtered signal is then converted to base-band by multiplication with the I and Q components of an RX local oscillator, as depicted in the base-band conversion block of Fig. 1. This oscillator is assumed to be independently generated at the RX (i.e., without locking to the received reference). While we model the RX oscillator to suffer from phase-noise, for ease of theoretical analysis we assume the mean RX oscillator frequency to be equal to the reference frequency $f_{\rm c}$ . This assumption shall be relaxed later in the simulation results in Section VI. Then, from (1)-(2), the received base-band signal for the [math]-th OFDM symbol is given by:

[TABLE]

for $0\leq t\leq T_{\rm s}$ , where the $\mathrm{Re}/\mathrm{Im}$ parts of $\tilde{\mathbf{s}}_{\rm rx,BB}(t)$ are the outputs corresponding to the I and Q components of the RX oscillator, ${\rm LPF}$ represents the low-pass filtering in the base-band conversion procedure, $\theta(t)$ is the phase-noise process of the RX oscillator and $\tilde{\mathbf{w}}(t)$ is the $M_{\rm rx}\times 1$ complex equivalent, base-band, stationary, additive, vector Gaussian noise process, with individual entries being circularly symmetric, independent and identically distributed (i.i.d.), and having a power spectral density: $\mathcal{S}_{\rm w}(f)=\mathrm{N_{0}}$ for $-f_{K_{1}}\leq f\leq f_{K_{2}}$ . Note that while (4) is obtained by assuming no TX phase-noise, the results can be generalized under some mild constraints by treating the TX phase-noise as a part of $\theta(t)$ [54]. We model the RX phase-noise $\theta(t)$ as a Wiener process, which is representative of a free running oscillator [55, 56, 57]. In Appendix C, we also discuss how the results can be extended to phase-noise modeled as an Ornstein Uhlenbeck (OU) process, which is representative of an oscillator either locked to the received reference, or synthesized from a stable low frequency source [58, 57]. For the Wiener model, $\theta(t)$ is a non-stationary Gaussian process which satisfies:

[TABLE]

where $w_{\theta}(t)$ is a real white Gaussian process with variance $\sigma^{2}_{\theta}$ . We assume the RX to have an apriori knowledge of $\sigma_{\theta}$ . As illustrated in Fig. 1, the base-band signal ${[\tilde{\mathbf{s}}_{{\rm rx,BB}}(t)]}_{m}$ at each RX antenna $m$ is further passed through a narrow band low-pass filter: ${\rm LPF}_{\hat{g}}$ to extract the received reference signal. For convenience, ${\rm LPF}_{\hat{g}}$ is assumed to be an ideal rectangular filter with a cut-off frequency of $f_{\hat{g}}=\hat{g}/T_{\rm s}$ , where $\hat{g}\leq g/2$ . Neglecting the contribution of the data sub-carriers to the filtered outputs (which is accurate for low phase-noise i.e., $\sigma^{2}_{\theta}\ll g/T_{\rm s}$ ), these outputs can be expressed as:

[TABLE]

where $0\leq t\leq T_{\rm s}$ , we define $A(t)\triangleq\mathrm{LPF}_{\hat{g}}\{e^{-{\rm j}\theta(t)}\}$ and $\hat{\mathbf{w}}(t)$ is the $M_{\rm rx}\times 1$ filtered Gaussian noise with power spectral density: $\mathcal{S}_{\rm w}(f)=\mathrm{N_{0}}$ for $-f_{\hat{g}}\leq f\leq f_{\hat{g}}$ . An illustration of this filtering operation is provided in Fig. 2b. The aim is to use $\hat{\mathbf{s}}_{\rm rx,BB}(t)$ as the combining weights for the received data signal. This is accomplished by using $\hat{\mathbf{s}}_{\rm rx,BB}(t)$ as the control signals to a variable gain phase-shifter array, through which the base-band received signal vector $\tilde{\mathbf{s}}_{\rm rx,BB}(t)$ is processed, as shown in Fig. 1. We assume the filter cut-off frequency $f_{\hat{g}}$ to be small enough to allow the phase-shifters to respond to the slowly time varying control signals $\hat{\mathbf{s}}_{\rm rx,BB}(t)$ .666A more detailed discussion about $\hat{g}$ is considered in Sections IV and VI. The phase-shifter outputs are then summed up to obtain $y(t)=\hat{\mathbf{s}}_{\rm rx,BB}(t)^{{\dagger}}\tilde{\mathbf{s}}_{\rm rx,BB}(t)$ , which is further fed to an ADC that samples at $K/T_{\rm s}$ samples/sec to obtain:

[TABLE]

for $0\leq n<K$ , where we define $A[n]\triangleq A\big{(}\frac{nT_{\rm s}}{K}\big{)}$ , $\theta[n]\triangleq\theta\big{(}\frac{nT_{\rm s}}{K}\big{)}$ , $\tilde{\mathbf{w}}[n]\triangleq\tilde{\mathbf{w}}\big{(}\frac{nT_{\rm s}}{K}\big{)}$ and $\hat{\mathbf{w}}[n]\triangleq\hat{\mathbf{w}}\big{(}\frac{nT_{\rm s}}{K}\big{)}$ . Conventional OFDM demodulation is performed on $y[n]$ to demodulate the transmitted data signals, as analyzed in Section III.

III Analysis of the demodulation outputs

In this section, we study the demodulation of the sampled signal $y[n]$ for a representative [math]-th OFDM symbol. To this end, we first characterize the statistics of $e^{{\rm j}\theta[n]}$ , $\tilde{\mathbf{w}}[n]$ and $A[n]$ . The OFDM demodulation outputs are subsequently analyzed in Sections III-A–III-C. Note that the sampled, band-limited additive noise $\tilde{\mathbf{w}}[n]$ and the sampled RX phase-noise $e^{-{\rm j}\theta[n]}$ for $0\leq n<K$ can be expressed using their normalized Discrete Fourier Transform (nDFT) expansions as:

[TABLE]

where $\mathbf{W}[k]=\frac{1}{K}\sum_{n=0}^{K-1}\tilde{\mathbf{w}}[n]e^{-{\rm j}2\pi kn/K}$ and $\Omega[k]=\frac{1}{K}\sum_{n=0}^{K-1}e^{-{\rm j}\theta[n]}e^{-{\rm j}2\pi kn/K}$ are the corresponding nDFT coefficients. Here nDFT is an unorthodox definition for Discrete Fourier Transform, where the normalization by $K$ is performed while finding $\mathbf{W}[k],\Omega[k]$ instead of in (8). These nDFT coefficients are periodic with period $K$ and satisfy the following lemma:

Lemma III.1.

The nDFT coefficients of $e^{-{\rm j}\theta[n]}$ for $0\leq n<K$ satisfy:

[TABLE]

Proof.

See Appendix A. ∎

To test the accuracy of the approximation in Lemma III.1, the Monte-Carlo simulations of $\Delta_{k,k},\Delta_{k,k+1}$ and $\Delta_{k,k+100}$ for a typical phase-noise process ( $-93$ dBc/Hz at $10$ MHz offset) are compared to (9b) in Fig. 3.

As is evident from the results, (9b) is accurate for $k_{1}=k_{2}$ . Similarly, the simulated values of $\Delta_{k,k+1},\Delta_{k,k+100}$ are $\geq 20$ dB lower than $\Delta_{k,k}$ $\forall k$ , and can be well approximated as [math] as in (9b). The analogous version of Lemma III.1 for phase-noise modeled as an OU process is presented in Appendix C. In a similar way, for the channel noise we have:

Lemma III.2.

The nDFT coefficients of $\tilde{\mathbf{w}}[n]$ , i.e., $\big{\{}\mathbf{W}[k]\ \big{|}\ \forall k\big{\}}$ , are jointly Gaussian with:

[TABLE]

for arbitrary integers $k_{1},k_{2}$ , where $\delta^{K}_{a,b}=1$ if $a=b\ ({\rm mod}\ K)$ or $\delta^{K}_{a,b}=0$ otherwise.

Proof.

See Appendix B. ∎

Note that using these nDFT coefficients, the low-pass filtered versions of $\tilde{\mathbf{w}}[n]$ and $e^{-{\rm j}\theta[n]}$ in (6) can be approximated as:

[TABLE]

where $\hat{\mathcal{G}}\triangleq\{-\hat{g},...,\hat{g}\}$ and the approximations are obtained by replacing the linear convolution of $\tilde{\mathbf{s}}_{\rm rx,BB}(t)$ and the filter response $\mathrm{LPF}_{\hat{g}}\{\}$ with a circular convolution. This is accurate when the filter response has a narrow support, i.e., for $\hat{g}\gg 1$ . The remaining results in this paper are based on the approximations in (9)–(11) and on an additional approximation discussed later in Remark III.1. While we still use the $\leq,=,\geq$ operators in the following results for convenience of notation, it is emphasized that these equations are true in the strict sense only if the aforementioned approximations are met with equality. However simulation results are also used in Section VI to test the validity of these approximations. Substituting (8) and (11) into (7), the $k$ -th OFDM demodulation output can be expressed as:

[TABLE]

We shall split $Y_{k}$ as $Y_{k}=S_{k}+I_{k}+Z_{k}$ where $S_{k}$ , referred to as the signal component, involves the terms in (12) containing $x_{k}$ and not containing the channel noise, $I_{k}$ , referred to as the interference component, involves the terms containing $E^{(\rm r)},\{x_{\bar{k}}\mid\bar{k}\in\mathcal{K}\setminus\{k\}\}$ and not containing the channel noise, and $Z_{k}$ , referred to as the noise component, containing the remaining terms. These signal, interference and noise components are analyzed in the following subsections.

III-A Signal Component Analysis

From (12), the signal component for $k\in\mathcal{K}\setminus\mathcal{G}$ can be expressed as:

[TABLE]

where we define $\beta_{k_{1},k_{2}}\triangleq\mathbf{t}^{{\dagger}}{\boldsymbol{\mathcal{H}}(k_{1})}^{{\dagger}}\boldsymbol{\mathcal{H}}(k_{2})\mathbf{t}\big{/}M_{\rm rx}$ . Since $\hat{\mathcal{G}}\subset\mathcal{K}$ , note that $\sum_{\dot{k}\in\hat{\mathcal{G}}}{|\Omega[\dot{k}]|}^{2}<1$ from Lemma III.1, i.e., the phase-noise causes some loss in power of the signal component. However this loss is much smaller than in PACE [51] or digital CE based beamforming, where only ${|\Omega[0]|}$ contributes to the signal component unless additional phase-noise compensation is used. As is evident from (13), CACE based beamforming utilizes the (filtered) received signal vector corresponding to the reference tone as weights to combine the received signal vector corresponding to the data sub-carriers, i.e., it emulates imperfect maximal ratio combining. The imperfection is because the reference tone and the $k$ -th data stream pass through slightly different channels owing to the difference in their modulating frequencies. However the resulting loss in beamforming gain is small for sparse channels, i.e., for $L\ll M_{\rm tx},M_{\rm rx}$ , a criterion usually satisfied for mm-wave massive MIMO channels. This is due to a common channel spatial signature across frequency, as shall be shown in Sections IV and VI. The second moment of the signal component, averaged over the phase-noise and data symbols, can be computed as:

[TABLE]

where we define $\mu(a,\hat{g})\triangleq\sum_{\dot{k}\in\hat{\mathcal{G}}}\Delta_{a+\dot{k},a+\dot{k}}$ , and (14) follows from Jensen’s inequality and (9b).

III-B Interference Component Analysis

From (12), the interference component for $k\in\mathcal{K}\setminus\mathcal{G}$ can be expressed as:

[TABLE]

As is clear from above, the demodulation output $Y_{k}$ suffers ICI from other sub-carrier data streams and reference tone due to the RX phase-noise. The first and second moments of $I_{k}$ , averaged over the other sub-carrier data $\{\bar{k}\in\mathcal{K}\setminus[\mathcal{G}\cup\{k\}]\}$ and the phase-noise, can be expressed as:

[TABLE]

where $\stackrel{{\scriptstyle(1)}}{{=}},\stackrel{{\scriptstyle(2)}}{{=}}$ are obtained using the fact that $\{x_{k}|k\in\mathcal{K}\}$ have a zero-mean and are independently distributed; $\stackrel{{\scriptstyle(3)}}{{\leq}}$ is obtained by defining $\beta_{\rm max}\triangleq\max_{k\in\mathcal{K}}|\beta_{k,k}|$ , observing $|\beta_{0,k}|\leq\beta_{\rm max}$ , and using $\mathcal{K}\setminus[\mathcal{G}\cup\{k\}]\subseteq\mathcal{K}\setminus\{k\}$ in first term and using the Cauchy-Schwarz inequality for the second term; $\stackrel{{\scriptstyle(4)}}{{\leq}}$ follows by changing the summation order in the first term and by using (9a) for the second term; $\stackrel{{\scriptstyle(5)}}{{=}}$ follows by using $\Omega[k]=\Omega[k+K]$ and (9a) for the first term and $\stackrel{{\scriptstyle(6)}}{{\leq}}$ follows by using (9b) and the Jensen’s inequality. As shall be shown in Section VI, (16b) may be a loose bound on ICI for lower subcarriers, i.e., $|k|\ll K$ .

Remark III.1.

A tighter approximation for $\mathbb{E}\{{|I_{k}|}^{2}\}$ can be obtained by replacing $\mu(k,\hat{g})$ in (16b) with $\tilde{\mu}(k,\hat{g})\triangleq\sum_{\dot{k}\in\hat{\mathcal{G}}}\Delta_{\dot{k},\dot{k}}\Delta_{\dot{k}+k,\dot{k}+k}$ .

This heurtistic is obtained by assuming $\Omega[\dot{k}]$ and $\Omega[\ddot{k}+k]$ to be independently distributed for $\dot{k},\ddot{k}\in\hat{\mathcal{G}}$ and $k\in\mathcal{K}\setminus\mathcal{G}$ in step $\stackrel{{\scriptstyle(2)}}{{=}}$ of (16b), but we skip the proof for brevity. As shall be verified in Section VI, Remark III.1 offers a much better ICI approximation $\forall k$ and hence we shall use $\tilde{\mu}(k,\hat{g})$ instead of $\mu(k,\hat{g})$ in the forthcoming derivations in Section VI.

III-C Noise Component Analysis

From (12), the noise component of $Y_{k}$ for $k\in\mathcal{K}\setminus\mathcal{G}$ can be expressed as:

[TABLE]

Note that the noise consists of both signal-noise and noise-noise cross product terms. From Lemma III.2, it can readily be verified that $\mathbb{E}\{Z_{k}\}=0$ and $\mathbb{E}\{Z^{(i)}_{k}{[Z^{(j)}_{k}]}^{*}\}=0$ for $i\neq j$ , where the expectation is taken over the noise realizations. Thus the second moment of $Z_{k}$ , averaged over the TX data, phase-noise and channel noise, can be expressed as $\mathbb{E}\{{|Z_{k}|}^{2}\}=\mathbb{E}\{{|Z_{k}^{(1)}|}^{2}\}+\mathbb{E}\{{|Z_{k}^{(2)}|}^{2}\}+\mathbb{E}\{{|Z_{k}^{(3)}|}^{2}\}$ , where:

[TABLE]

where $|\hat{\mathcal{G}}|=2\hat{g}+1$ , $\stackrel{{\scriptstyle(1)}}{{=}},\stackrel{{\scriptstyle(3)}}{{=}}$ follow from Lemma III.2; $\stackrel{{\scriptstyle(2)}}{{=}},\stackrel{{\scriptstyle(4)}}{{=}}$ follow from (9b), and $\stackrel{{\scriptstyle(5)}}{{=}}$ follows from Lemma III.2, (9b) and the result on the expectation of the product of four Gaussian random variables [59]. From (18), we can then upper-bound the noise power as:

[TABLE]

where we use the fact that $|\beta_{\dot{k},\dot{k}}|\leq\beta_{\rm max}$ , $\sum_{\dot{k}\in\hat{\mathcal{G}}}\big{[}\Delta_{\dot{k}+k,\dot{k}+k}+\Delta_{\dot{k},\dot{k}}\big{]}\leq 1$ for $k\in\mathcal{K}\setminus\mathcal{G}$ (as $\hat{g}\leq g/2$ ) and $\sum_{\bar{k}\in\mathcal{K}\setminus\mathcal{G}}\Delta_{k+\dot{k}-\bar{k},k+\dot{k}-\bar{k}}\leq 1$ , from (9a).

IV Performance Analysis

From (12)–(17), the effective single-input-single-output (SISO) channel between the $k$ -th sub-carrier input and corresponding output can be expressed for $k\in\mathcal{K}\setminus\mathcal{G}$ as:

[TABLE]

where $I_{k}$ and $Z_{k}$ are analyzed in Sections III-B and III-C, respectively. As is evident from (20), the signal component suffers from two kinds of fading: (i) a frequency-selective and channel dependent slow fading component represented by $\beta_{0,k}$ and (ii) a frequency-flat and phase-noise dependent fast fading component, represented by $\sum_{\dot{k}\in\hat{\mathcal{G}}}{|\Omega[\dot{k}]|}^{2}$ . The estimation of these fading coefficients is discussed later in this section. In this paper, we consider the simple demodulation approach where $x_{k}$ is estimated only from $Y_{k}$ , and the $I_{k},Z_{k}$ are treated as noise.777The estimation of $x_{k}$ from multiple OFDM sub-carriers outputs shall be explored in future work. For this demodulation approach, a lower bound to the signal-to-interference-plus-noise ratio (SINR) can be obtained from (14), (16b), Remark III.1 and (19), as shown in (21) at the top of next page,

where $\boldsymbol{\beta}\triangleq\{\beta_{0,k}|\mathcal{K}\}$ and we use the fact that $\mathbb{E}\{{|I_{k}+Z_{k}|}^{2}\}=\mathbb{E}\{{|I_{k}|}^{2}\}+\mathbb{E}\{{|Z_{k}|}^{2}\}$ .888Since $Z_{k}$ is the noise experienced while estimating $x_{k}$ , it is inaccurate to take an expectation of ${|Z_{k}|}^{2}$ with respect to $x_{k}$ , as in (18). However the impact of this error is negligible when $K\gg 1$ .

Remark IV.1.

If the RX array response vectors for the MPCs are mutually orthogonal i.e. ${\mathbf{a}_{\rm rx}(\ell_{1})}^{{\dagger}}\mathbf{a}_{\rm rx}(\ell_{2})=M_{\rm rx}\delta_{\ell_{1},\ell_{2}}^{\infty}$ , then $\beta_{\dot{k},\ddot{k}}=\sum_{\ell=0}^{L-1}{|\alpha_{\ell}|}^{2}{|{\mathbf{a}_{\rm tx}(\ell)}^{{\dagger}}\mathbf{t}|}^{2}e^{{\rm j}2\pi(f_{\dot{k}}-f_{\ddot{k}})\tau_{\ell}}$ and $\beta_{\rm max}=\bar{\beta}$ , where we define $\bar{\beta}\triangleq\sum_{\ell=0}^{L-1}{|\alpha_{\ell}|}^{2}\allowbreak{|{\mathbf{a}_{\rm tx}(\ell)}^{{\dagger}}\mathbf{t}|}^{2}$ .

The orthogonality of array response vectors is approximately satisfied if the MPCs are well separated and $M_{\rm rx}\gg L$ [60]. Additionally, while terms in $\beta_{\dot{k},\ddot{k}}$ combine incoherently, the resulting loss in $\gamma^{\rm LB}_{k}(\boldsymbol{\beta})$ is small for sparse wide-band channels with small $L$ . Thus the CACE technique is very well suited for mm-wave massive MIMO channels where these conditions are typically satisfied. From Remark IV.1, note that even without explicit CE at the RX $\gamma^{\rm LB}_{k}(\boldsymbol{\beta})$ scales with $M_{\rm rx}$ in the low SNR regime, which is a desired characteristic. Though the ICI term also scales with $M_{\rm rx}$ , its impact can be kept small in the desired SNR range by picking $\hat{g}$ such that $\mu(0,\hat{g})\approx 1$ . In a similar way, with perfect knowledge of the fading coefficients at the RX, an approximate lower bound to the ergodic capacity can be obtained as:999Here the ergodic capacity is computed assuming $\{\beta_{0,k}|k\in\mathcal{K}\}$ remain constant for infinite time but $\sum_{\dot{k}\in\hat{\mathcal{G}}}{|\Omega[\dot{k}]|}^{2}$ experiences many independent realizations. This capacity is representative of the throughput of practical codes that have a length spanning multiple OFDM symbols but smaller than coherence time of $\beta_{0,k}$ [61].

[TABLE]

where $\stackrel{{\scriptstyle(1)}}{{\geq}}$ is obtained by assuming $I_{k},Z_{k}$ to be Gaussian distributed and using the expression for ergodic capacity [62], $\stackrel{{\scriptstyle(2)}}{{\approx}}$ follows by sending the outer expectation into the $\log(\cdot)$ functions and $\stackrel{{\scriptstyle(3)}}{{\geq}}$ follows from (14), (16b) and (19). While $\stackrel{{\scriptstyle(2)}}{{\approx}}$ is an approximation, it typically yields a lower bound since ${\rm Variance}\{\sum_{\dot{k}\in\hat{\mathcal{G}}}{|\Omega[\dot{k}]|}^{2}\}\leq\mu(0,\hat{g})[1-\mu(0,\hat{g})]\ll\mu(0,\hat{g})^{2}$ (from (9a) and [63]).

Note that for demodulating $x_{k}$ ’s and achieving the above SINR and capacity, the RX requires estimates of $\mathrm{N}_{0}$ and the SISO channel fading coefficients $\boldsymbol{\beta}$ and $\sum_{\dot{k}\in\hat{\mathcal{G}}}{|\Omega[\dot{k}]|}^{2}$ . Since the RX has a good beamforming gain (21), the channel parameters $\boldsymbol{\beta},\mathrm{N}_{0}$ can be tracked accurately at the RX with a low estimation overhead using pilot symbols and blanked symbols. These values, along with phase-noise parameter $\sigma_{\theta}$ , can further be fed back to the TX for rate and power allocation. Note that since these pilots are only used to estimate the SISO channel parameters and not the actual MIMO channel, the advantages of simplified CE are still applicable for a CACE based RX. On the other hand, the low variance albeit fast varying component $\sum_{\dot{k}\in\hat{\mathcal{G}}}{|\Omega[\dot{k}]|}^{2}$ can be estimated for every symbol using the [math]-th sub-carrier output $Y_{0}$ . It can be shown from (12) that $Y_{0}=M_{\rm rx}\beta_{0,0}E^{(\rm r)}\Big{[}\sum_{\dot{k}\in\hat{\mathcal{G}}}{|\Omega[\dot{k}]|}^{2}\Big{]}+I_{0}+M_{\rm rx}|\hat{\mathcal{G}}|\mathrm{N}_{0}+Z_{0}$ , where we have $\mathbb{E}\{{|I_{0}|}^{2}\}\leq\mathbb{E}\{{|I_{k}|}^{2}\}$ and $\mathbb{E}\{{|Z_{0}|}^{2}\}\leq 2\mathbb{E}\{{|Z_{k}|}^{2}\}$ for any $k\in\mathcal{K}\setminus\mathcal{G}$ .101010While the derivations follow similar steps to those in Section III, the explicit proof is skipped for brevity. Thus $\sum_{\dot{k}\in\hat{\mathcal{G}}}{|\Omega[\dot{k}]|}^{2}$ can be estimated from $Y_{0}$ with an SINR $\geq\frac{E^{\rm r}\gamma^{\rm LB}_{k}(\boldsymbol{\beta})}{2E_{(\rm d)}}$ , which is usually a large value.

IV-A Optimizing the system parameters

In this section we find capacity maximizing values of the system parameters $g,E^{(\rm r)}$ and $\hat{g}$ . From (22), note that the approximate ergodic capacity $C_{\rm approx}(\boldsymbol{\beta})$ is a decreasing function of $g$ for $g\geq 2\hat{g}$ . Thus a $C_{\rm approx}(\boldsymbol{\beta})$ maximizing choice of $g$ should satisfy $g=2\hat{g}$ . To find a near-optimal values of $E^{(\rm r)}$ and $\hat{g}$ , we further lower bound $C_{\rm approx}(\boldsymbol{\beta})$ using (22) and (21), as:

[TABLE]

where $\Xi(\boldsymbol{\beta})$ is as given in (23b) at the top of this page, $\stackrel{{\scriptstyle(1)}}{{\geq}}$ follows from the fact that $\log(1+\gamma^{\rm LB}_{k}(\boldsymbol{\beta}))\geq\log(\gamma^{\rm LB}_{k}(\boldsymbol{\beta}))$ and by taking the summation over $k$ in (22) into the denominator of the logarithm; and $\stackrel{{\scriptstyle(2)}}{{=}}$ in (23b) follows from the fact that $\sum_{k\in\mathcal{K}\setminus\mathcal{G}}\tilde{\mu}(k,\hat{g})\leq\sum_{k\in\mathcal{K}\setminus\hat{\mathcal{G}}}\mu(0,\hat{g})\Delta_{k,k}$ and $E^{(\rm d)}({K-|\mathcal{G}|})+E^{(\rm r)}=E_{\rm s}$ . It can be verified that the numerator of $\Xi(\boldsymbol{\beta})$ is a differentiable, strictly concave function of $E^{(\rm r)}$ , while the denominator is a positive, affine function of $E^{(\rm r)}$ . Thus $\Xi(\boldsymbol{\beta})$ is a strictly pseudo-concave function of $E^{(\rm r)}$ [64], and the $C_{\rm approx}(\boldsymbol{\beta})$ maximizing power allocation can be obtained by setting $\frac{{\rm d}\Xi(\boldsymbol{\beta})}{{\rm d}E^{(\rm r)}}=0$ as:

[TABLE]

where $Q=M_{\rm rx}{|\beta_{\rm max}|}^{2}[1-\mu(0,\hat{g})]\mu(0,\hat{g})E_{\rm s}+\beta_{\rm max}\mathrm{N}_{0}(K-|\hat{\mathcal{G}}|-|\mathcal{G}|)$ and $R=\mathrm{N}_{0}|\hat{\mathcal{G}}|\big{[}\beta_{\rm max}+\mathrm{N}_{0}(K-|\mathcal{G}|)/E_{\rm s}\big{]}$ . As evident from (23b), $\hat{g}$ offers a trade-off between the phase-noise induced ICI and the channel noise accumulation. While finding a closed form expression for (23a) maximizing $\hat{g}$ is intractable, it can be computed numerically by performing a simple line search over $1\leq\hat{g}\leq\min\{K_{1},K_{2}\}/2$ , with $g=2\hat{g}$ and $E^{(\rm r)}$ as given by (24).

V Initial Access, TX beamforming and uplink beamforming

In this section we briefly discuss stages (i) and (ii) of downlink transmission (see Section II), and uplink TX beamforming for CACE aided UEs. In the suggested IA protocol for stage (i), the BS performs beam sweeping along different angular directions, possibly with different beam widths, similar to the approach of 3GPP New Radio (NR). For each TX beam, the BS transmits primary (PSS) and secondary synchronization sequences (SSS) with the reference signal, in a form similar to (1). The UEs use CACE aided RX beamforming, and initiate uplink random access to the BS upon successfully detecting a PSS/SSS. As shall be shown in Section VI, the SINR expression (21) is resilient to frequency mismatches between TX and RX oscillators, and thus is also applicable for the PSS/SSSs where frequency synchronization may not exist. Since angular beam-sweeping is only performed at the BS, the IA latency does not scale with $M_{\rm rx}$ and yet the PSS/SSS symbols can exploit the RX beamforming gain, thus improving cell discovery radius and/or reducing IA overhead. This is in contrast to digital CE at the UE, which would require sweeping through many RX beam directions for each TX direction, necessitating several repetitions of the PSS/SSS for each TX beam. During downlink stage (ii), note that scheduling of UEs, designing TX beamformer $\mathbf{t}$ and allocation of power requires knowledge of $\{|\alpha_{\ell}|,\mathbf{a}_{\rm tx}(\ell)\}$ for all the UEs. Such rCSI can be acquired at the BS either by downlink CE with rCSI feedback from the UEs or by uplink CE. The protocol for downlink CE with feedback is similar to the IA protocol, with the BS transmitting pilot symbols instead of PSS/SSS along different candidate $\mathbf{t}$ ’s. Uplink CE can be performed by transmitting orthogonal pilots from the UEs omni-directionally, and using any of the digital CE algorithms from Section I at the BS. Note that CACE cannot be used at the BS since the pilots from multiple UEs need to be separated via digital processing.

Note that the phase shifts used for RX beamforming at a CACE aided UE in downlink, can also be used for transmit beamforming in the uplink. However since the reference signal is not available at the UE during uplink transmission in time division duplexing systems, a mechanism for locking these phase shift values from a previous downlink transmission stage is required (similar to [51]). In contrast, frequency division duplexing can avoid such a mechanism due to continuous availability of the downlink reference, and consequently $\hat{\mathbf{s}}_{\rm rx,BB}(t)$ .

VI Simulation Results

For the simulations, we consider a single cell scenario with a $\lambda/2$ -spaced $32\times 8$ ( $M_{\rm tx}=256$ ) antenna BS and one representative UE: with a $\lambda/2$ -spaced $16\times 4$ ( $M_{\rm rx}=64$ ) antenna array, one down-conversion chain, using CACE aided beamforming and having perfect timing synchronization to the BS. The BS has apriori rCSI and transmits one spatial OFDM data stream with $T_{\rm s}=1\mu$ s, $K_{1}=K_{2}+1=512$ and $f_{\rm c}=30$ GHz along the strongest MPC, i.e., $\mathbf{t}=\mathbf{a}_{\rm tx}(\bar{l})$ for $\bar{l}=\rm{argmax}_{\ell}\{|\alpha_{\ell}|\}$ . The UE oscillator experiences phase-noise with variance $\sigma_{\theta}^{2}=1/\sqrt{T_{\rm s}}$ known both to the BS and UE. The UE also has perfect knowledge of $\boldsymbol{\beta},\mathrm{N}_{0}$ and $\sum_{\dot{k}\in\hat{\mathcal{G}}}{|\Omega[\dot{k}]|}^{2}$ . For convenience, we shall use $\bar{\beta}E_{\rm s}/K\mathrm{N}_{0}$ to quantify the simulation SNR, which reflects the mean SNR at any RX antenna without RX beamforming gain (see Remark IV.1).

For testing the validity of the analytical results, we first consider a sparse channel matrix $\mathbf{H}(t)$ with $L=3$ , $\hat{\tau}_{\ell}=\{0,20,40\}$ ns, angles of arrival $\psi^{\rm rx}_{\rm azi}=\{0,\pi/6,-\pi/6\}$ , $\psi^{\rm rx}_{\rm ele}=\{0.45\pi,\pi/2,\pi/2\}$ and normalized amplitudes $\frac{\alpha_{\ell}\mathbf{a}_{\rm tx}(\ell)^{{\dagger}}\mathbf{t}}{\sqrt{\bar{\beta}}}=\{\sqrt{0.6},-\sqrt{0.3},\sqrt{0.1}\}$ . The UE uses $\hat{g}=g/2=10$ and $E^{(\rm r)},E^{(\rm d)}$ from (24). For this model, the symbol error rates (SERs) for the sub-carriers, obtained by Monte-Carlo simulations, are compared to the analytical SERs for a Gaussian channel with SINR given by (21) (with/without Remark III.1) in Fig. 4. For the Monte-Carlo results, we use truncated sinc filters: ${\rm LPF}_{\hat{g}}(t)=\sin(2\pi\hat{g}t/T_{\rm s})\big{/}(\pi t)$ for $|t|\leq 2T_{\rm s}/\hat{g}$ . As observed from the results and mentioned in Section III-B, the use of Remark III.1 in (21) provides a tight SINR bound even for small $|k|$ . We also observe that the SER for $k=22\ (\approx\hat{g})$ is high due to the ICI caused from the high power reference signal. However this ICI diminishes very quickly with $k$ due to phase noise suppression, as evident from the SER for $k=-40$ . While the mean RX oscillator frequency was assumed to be perfectly matched to the TX oscillator for the analytical results (see Section II), we also plot in Fig. 4 the case with a $5$ MHz frequency mismatch. Results show a negligible degradation in performance, suggesting that the CACE design is resilient to oscillator frequency mismatches that are smaller than the cut-off frequency of ${\rm LPF}_{\hat{g}}$ . For computational tractability, and due to the accuracy of the bounds in Fig. 4, we shall henceforth use (21) and (22) to quantify the performance of CACE for the remaining results.

We next plot $C_{\rm approx}(\boldsymbol{\beta})$ from (22) as a function of $\hat{g}$ in Fig. 5, with (a) $C_{\rm approx}(\boldsymbol{\beta})$ maximizing $E^{(\rm r)}$ (obtained by exhaustive search over $0\leq E^{(\rm r)}\leq E_{\rm s}$ ) and (b) $E^{(\rm r)}$ chosen from (24), respectively. As observed from the results, the curves are very close, suggesting the accuracy of the power allocation in (24). Fig. 5 also demonstrates the trade-off characterized by $\hat{g}$ : where ICI degrades the performance for small $\hat{g}$ and the noise accumulation, spectral efficiency reduction degrade performance for large $\hat{g}$ . We also note that the optimal $\hat{g}$ increases with SNR.

Fig. 6a compares the achievable throughput (excluding CE overhead) for beamforming with digital CE and different ACE schemes: CACE, PACE [51], MA-FSR [52], respectively for the sparse channel defined above. For digital CE, the RX beamformer is aligned with the largest eigenvector of the effective RX correlation matrix $\mathbf{R}_{\rm rx}(\mathbf{t})=\frac{1}{K}\sum_{k\in\mathcal{K}}\boldsymbol{\mathcal{H}}(f_{k})\mathbf{t}\mathbf{t}^{{\dagger}}{\boldsymbol{\mathcal{H}}(f_{k})}^{{\dagger}}$ [11], which in turn is either (a) known apriori at the BS or (b) is estimated by nested array based sampling [34]. To decouple the loss in beamforming gain due to CE errors from loss due to phase-noise, we assume $\sigma_{\theta}\approx 0$ . As is evident from Fig. 6a, PACE and CACE suffer only a $\leq 2$ dB beamforming loss in compared to digital CE in sparse channels and above a threshold SNR. While CACE performs marginally worse than PACE at high SNR due to power wastage on a continuous reference, unlike PACE it does not suffer from PLL based carrier recovery losses at low SNR. While MA-FSR performs poorly due to low bandwidth efficiency, it requires much simpler hardware then all other schemes. To demonstrate the phase-noise suppressing capability of CACE (and MA-FSR), we also plot the throughput of CACE (with optimal $\hat{g}$ ) and digital CE, with $\sigma_{\theta}^{2}=1/T_{\rm s}$ and without any additional phase-noise mitigation. As is evident from the results, both CACE and MA-FSR aid in mitigating oscillator phase-noise in addition to enabling RX beamforming. To study the impact of more realistic channels and number of MPCs, we also consider a rich scattering stochastic channel in Fig. 6b, having $L/10$ resolvable MPCs and $10$ sub-paths per resolvable MPC. All channel parameters are generated according to the 3GPP TR38.900 Rel 14 channel model (UMi NLoS scenario) [65], with the resolvable MPCs and sub-paths modeled as clusters and rays, respectively. However to model the sub-paths of each MPC as unresolvable, we use an intra-cluster delay spread of $1ns$ and an intra-cluster angle spread of $\pi/50$ (for all elevation, azimuth, arrival and departure). As observed, the performance of ACE schemes degrades slightly faster with $L$ than of digital CE due to the incoherent combining of the MPC contributions in $\beta_{0,k}$ (see Remark IV.1).

Note that the results in Fig. 6 do not include the CE overhead for PACE and digital CE. While nested array digital CE requires $21$ dedicated pilot symbols ( $\approx 2\sqrt{M_{\rm rx}}$ ) for updating RX beamformer, PACE requires $6$ symbols ( ${\rm O}(1)$ ) and CACE, MA-FSR only require a continuous reference tone. The corresponding overhead reduction can be significant when downlink CE with rCSI feedback is used for rCSI acquisition at the BS (see Section V). For example, with exhaustive beam-scanning [15] at the BS and an rCSI coherence time of $10$ ms, the BS rCSI acquisition overhead reduces from $40$ % for nested array digital CE to $11$ % for PACE and $\approx|\mathcal{G}|/K<5\%$ for CACE.

VII Conclusions

This paper proposes a novel CE technique called CACE for designing the RX beamformer in massive MIMO systems. CACE enables both RX beamforming and phase-noise cancellation at very low CE overhead. The performance analysis suggests that in sparse channels and for low-pass filter bandwidth parameter $\hat{g}\gg 1$ , the SINR with CACE scales linearly with the number of receive antennas $M_{\rm rx}$ . The analysis and simulations also show that $\hat{g}$ yields a trade-off between phase-noise induced ICI and noise accumulation. Simulations suggest that CACE suffers only a small degradation in beamforming gain in comparison to digital CE based beamforming in sparse channels, and is resilient to TX-RX oscillator frequency mismatch. In comparison to other ACE schemes, CACE performs marginally worse than PACE at high SNR but performs much better at lower SNR. It also performs much better than MA-FSR, albeit at a higher RX hardware complexity. Finally, CACE also provides phase-noise suppression unlike most other CE schemes. The CE overhead reduction with CACE is significant, especially when downlink CE with feedback is required. The IA latency reduction with CACE aided beamforming is also discussed. While base-band phase shifters are sufficient for a CACE based RX unlike in conventional analog beamforming, $2M_{\rm rx}$ mixers may be required for the base-band conversion at the RX, thus adding to the hardware cost.

Appendix A

Proof of Lemma III.1.

Note that from the definition of $\Omega[k]$ , we have $e^{-{\rm j}\theta[n]}\stackrel{{\scriptstyle\mathcal{F}}}{{\longrightarrow}}\Omega[k]$ and $e^{{\rm j}\theta[n]}\stackrel{{\scriptstyle\mathcal{F}}}{{\longrightarrow}}\Omega^{*}[-k]$ , where $\mathcal{F}$ represents the nDFT Operation. Then using convolution property of the nDFT, we have:

[TABLE]

which proves property (9a). Property (9b) can be obtained as follows:

[TABLE]

where $\stackrel{{\scriptstyle(1)}}{{=}}$ follows by using the expression for the characteristic function of the Gaussian random variable $\theta[\dot{n}]-\theta[\ddot{n}]$ ; $\stackrel{{\scriptstyle(2)}}{{=}}$ follows by defining $u=\dot{n}-\ddot{n}$ and $\stackrel{{\scriptstyle(3)}}{{\approx}}$ follows by changing the inner summation limits which is accurate for $\sigma_{\theta}^{2}T_{\rm s}\gg 1$ and $\stackrel{{\scriptstyle(4)}}{{=}}$ follows from the expression for the sum of a geometric series. ∎

Appendix B

Proof of Lemma III.2.

Note that each component of $\tilde{\mathbf{w}}(t)$ is independent and identically distributed as a circularly symmetric Gaussian random process. Hence its nDFT coefficients, obtained as $\mathbf{W}[k]=\frac{1}{K}\sum_{n=0}^{K-1}\tilde{\mathbf{w}}(nT_{\rm s}/K)e^{-{\rm j}2\pi\frac{kn}{K}}$ are also jointly Gaussian and circularly symmetric. For these coefficients at RX antennas $a,b$ we obtain:

[TABLE]

where we use the auto-correlation function of the channel noise at any RX antenna as: $R_{\tilde{w}}(t)=\mathrm{N}_{0}\sin(\pi Kt/T_{\rm s})\exp{\{-{\rm j}\pi(K_{1}-K_{2})t/T_{\rm s}\}}\Big{/}{\pi t}$ . ∎

Appendix C

Here we model the RX phase-noise $\theta(t)$ as a zero mean Ornstein-Ulhenbeck (OU) process [66], which is representative of the output of a type-1 phase-locked loop with a linear phase detector [49, 58, 57]. For such a model, $\theta(t)$ satisfies:

[TABLE]

where, $w_{\theta}(t)$ is a standard real white Gaussian process, and $\eta_{\theta},\sigma_{\theta}$ are system parameters. From (5) it can be shown that $\theta(t)$ is a stationary Gaussian process (in steady state), with an auto-correlation function given by: $R_{\theta}(\tau)=\mathbb{E}\{\theta(t)\theta(t+\tau)\}=\frac{\sigma_{\theta}^{2}}{2\eta_{\theta}}e^{-\eta_{\theta}|\tau|}$ [57].

Lemma C.1.

For phase-noise modeled as an OU process we have:

[TABLE]

Proof of Lemma C.1.

Note that from the definition of $\Omega[k]$ , we have $e^{-{\rm j}\theta[n]}\stackrel{{\scriptstyle\mathcal{F}}}{{\longrightarrow}}\Omega[k]$ and $e^{{\rm j}\theta[n]}\stackrel{{\scriptstyle\mathcal{F}}}{{\longrightarrow}}\Omega^{*}[-k]$ , where $\mathcal{F}$ represents the nDFT Operation. Then using convolution property of the nDFT, we have:

[TABLE]

which proves property (9a). Property (9b) can be obtained as follows:

[TABLE]

where $\stackrel{{\scriptstyle(2)}}{{=}}$ follows from similar steps to (25) and $\stackrel{{\scriptstyle(3)}}{{\approx}}$ follows by noting that $R_{\theta}[u]$ has a limited support around $u=0$ and hence $R_{\theta}[u]\approx R_{\theta}[u-K]\approx 0$ for $u>(K-1)/2$ . Note that since $e^{-R_{\theta}[0]+R_{\theta}[u]}$ is an auto-correlation function, its nDFT is non-negative, thus ensuring that $\Delta_{k_{1},k_{1}}\geq 0$ in (29). ∎

Bibliography66

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] V. V. Ratnam and A. Molisch, “Reference tone aided transmission for massive MIMO: analog beamforming without CSI,” in Proc. IEEE Int. Conf. Commun. (ICC) , May 2018.
2[2] T. Marzetta, “Noncooperative Cellular Wireless with Unlimited Numbers of Base Station Antennas,” IEEE Trans. Wireless Commun. , vol. 9, pp. 3590–3600, Nov. 2010.
3[3] F. Boccardi, R. Heath, A. Lozano, T. Marzetta, and P. Popovski, “Five disruptive technology directions for 5G,” IEEE Commun. Mag. , vol. 52, pp. 74–80, Feb. 2014.
4[4] B. Murmann, “ADC performance survey 1997-2018 (ISSCC & VLSI Symposium).” available at: https://web.stanford.edu/~murmann/adcsurvey.html .
5[5] A. F. Molisch, V. V. Ratnam, S. Han, Z. Li, S. L. H. Nguyen, L. Li, and K. Haneda, “Hybrid beamforming for massive MIMO: A survey,” IEEE Commun. Mag. , vol. 55, pp. 134–141, Sept. 2017.
6[6] R. W. Heath, N. González-Prelcic, S. Rangan, W. Roh, and A. M. Sayeed, “An overview of signal processing techniques for millimeter wave MIMO systems,” IEEE J. Sel. Topics Signal Process. , vol. 10, pp. 436–453, Apr. 2016.
7[7] A. Alkhateeb, G. Leus, and R. W. Heath, “Limited feedback hybrid precoding for multi-user millimeter wave systems,” IEEE Trans. Wireless Commun. , vol. 14, pp. 6481–6494, Nov. 2015.
8[8] F. Sohrabi and W. Yu, “Hybrid digital and analog beamforming design for large-scale antenna arrays,” IEEE J. Sel. Topics Signal Process. , vol. 10, pp. 501–513, Apr. 2016.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Continuous Analog Channel Estimation Aided Beamforming for Massive MIMO Systems

Abstract

Index Terms:

I Introduction

II General Assumptions and System model

III Analysis of the demodulation outputs

Lemma III.1**.**

Proof.

Lemma III.2**.**

Proof.

III-A Signal Component Analysis

III-B Interference Component Analysis

Remark III.1**.**

III-C Noise Component Analysis

IV Performance Analysis

Remark IV.1**.**

IV-A Optimizing the system parameters

V Initial Access, TX beamforming and uplink beamforming

VI Simulation Results

VII Conclusions

Appendix A

Proof of Lemma III.1.

Appendix B

Proof of Lemma III.2.

Appendix C

Lemma C.1**.**

Proof of Lemma C.1.

Lemma III.1.

Lemma III.2.

Remark III.1.

Remark IV.1.

Lemma C.1.