FALP: Fast beam alignment in mmWave systems with low-resolution phase   shifters

Nitin Jonathan Myers; Amine Mezghani; Robert W. Heath Jr

arXiv:1902.05714·eess.SP·September 13, 2019·IEEE Trans. Commun.

FALP: Fast beam alignment in mmWave systems with low-resolution phase shifters

Nitin Jonathan Myers, Amine Mezghani, Robert W. Heath Jr

PDF

TL;DR

FALP introduces a rapid beam alignment method for mmWave systems using low-resolution phase shifters, significantly reducing training overhead and leveraging compressed sensing and Fourier transforms for efficient channel acquisition.

Contribution

The paper presents a novel framework called FALP that enables fast beam alignment in mmWave systems with ultra-low power, low-resolution phase shifters, and establishes a new link to magnetic resonance imaging.

Findings

01

FALP achieves faster beam alignment than exhaustive search methods.

02

The CS matrix in FALP satisfies the restricted isometry property.

03

FALP exploits FFT for efficient channel measurement processing.

Abstract

Millimeter wave (mmWave) systems can enable high data rates if the link between the transmitting and receiving radios is configured properly. Fast configuration of mmWave links, however, is challenging due to the use of large antenna arrays and hardware constraints. For example, a large amount of training overhead is incurred by exhaustive search-based beam alignment in typical mmWave phased arrays. In this paper, we present a framework called FALP for Fast beam Alignment with Low-resolution Phase shifters. FALP uses an efficient set of antenna weight vectors to acquire channel measurements, and allows faster beam alignment when compared to exhaustive scan. The antenna weight vectors in FALP can be realized in ultra-low power phase shifters whose resolution can be as low as one-bit. From a compressed sensing (CS) perspective, the CS matrix designed in FALP satisfies the restricted…

Figures18

Click any figure to enlarge with its caption.

Equations100

y [m] = ⟨ H, P [m]⟩ + v [m],

y [m] = ⟨ H, P [m]⟩ + v [m],

a (ω) = [1, e^{j ω}, e^{j 2 ω}, \dots, e^{j (N - 1) ω}]^{T} .

a (ω) = [1, e^{j ω}, e^{j 2 ω}, \dots, e^{j (N - 1) ω}]^{T} .

H = k = 1 \sum K γ_{k} a (ω_{e, k}) a^{T} (ω_{a, k}) .

H = k = 1 \sum K γ_{k} a (ω_{e, k}) a^{T} (ω_{a, k}) .

H = U_{N} X U_{N} .

H = U_{N} X U_{N} .

P [m] = J_{r [m]}^{T} P J_{c [m]} .

P [m] = J_{r [m]}^{T} P J_{c [m]} .

y [m] = ⟨ H, J_{r [m]}^{T} P J_{c [m]} ⟩ + v [m] .

y [m] = ⟨ H, J_{r [m]}^{T} P J_{c [m]} ⟩ + v [m] .

y [m] = k = 0 \sum N - 1 ℓ = 0 \sum N - 1 H (k, ℓ) P^{c} ((k - r [m])_{N}, (ℓ - c [m])_{N}) + v [m] .

y [m] = k = 0 \sum N - 1 ℓ = 0 \sum N - 1 H (k, ℓ) P^{c} ((k - r [m])_{N}, (ℓ - c [m])_{N}) + v [m] .

P_{FC} (k, ℓ) = P^{c} (- k_{N}, - ℓ_{N}) \forall k, ℓ \in I_{N} .

P_{FC} (k, ℓ) = P^{c} (- k_{N}, - ℓ_{N}) \forall k, ℓ \in I_{N} .

y [m] = (H ⊛ P_{FC})_{r [m], c [m]} + v [m] .

y [m] = (H ⊛ P_{FC})_{r [m], c [m]} + v [m] .

y = P_{Ω} (H ⊛ P_{FC}) + v .

y = P_{Ω} (H ⊛ P_{FC}) + v .

G = H ⊛ P_{FC} .

G = H ⊛ P_{FC} .

Z = N U_{N}^{*} P_{FC} U_{N}^{*} .

Z = N U_{N}^{*} P_{FC} U_{N}^{*} .

S

S

S

S

= X ⊙ Z .

y = P_{Ω} (U_{N} S U_{N}) + v .

y = P_{Ω} (U_{N} S U_{N}) + v .

\hat{\mathbf{S}}=\!\!\begin{array}[]{c}\mathrm{arg\,min\,\,}\|\mathbf{W}\|_{1},\,\mathrm{s.t\,}\|\mathbf{y}-\mathcal{P}_{\Omega}(\mathbf{U}_{N}\mathbf{W}\mathbf{U}_{N})\|_{2}\leq\sqrt{M}\sigma\end{array}.

\hat{\mathbf{S}}=\!\!\begin{array}[]{c}\mathrm{arg\,min\,\,}\|\mathbf{W}\|_{1},\,\mathrm{s.t\,}\|\mathbf{y}-\mathcal{P}_{\Omega}(\mathbf{U}_{N}\mathbf{W}\mathbf{U}_{N})\|_{2}\leq\sqrt{M}\sigma\end{array}.

\bigl{\|}\mathbf{X}-\hat{\mathbf{X}}\bigl{\|}_{F}\leq C_{1}\frac{Z_{\mathrm{max}}\left\|\mathbf{X}-\left(\mathbf{X}\right)_{k}\right\|_{1}}{\sqrt{k}Z_{\mathrm{min}}}+C_{2}\frac{N\sigma}{Z_{\mathrm{min}}},

\bigl{\|}\mathbf{X}-\hat{\mathbf{X}}\bigl{\|}_{F}\leq C_{1}\frac{Z_{\mathrm{max}}\left\|\mathbf{X}-\left(\mathbf{X}\right)_{k}\right\|_{1}}{\sqrt{k}Z_{\mathrm{min}}}+C_{2}\frac{N\sigma}{Z_{\mathrm{min}}},

⟨ P, J_{x} P J_{y}^{T} ⟩ = 0 \forall (x, y) \in I_{N} \times I_{N} ∖ (0, 0) .

⟨ P, J_{x} P J_{y}^{T} ⟩ = 0 \forall (x, y) \in I_{N} \times I_{N} ∖ (0, 0) .

P = \frac{1}{2} [11 1 - 1] .

P = \frac{1}{2} [11 1 - 1] .

\hat{X} = \hat{S} ⊙ Z^{c} .

\hat{X} = \hat{S} ⊙ Z^{c} .

β_{est} = β \in (0, 2 π / 2^{q}) argmin ∥ Q_{q} (F^{opt} (β)) - F^{opt} (β) ∥_{F} .

β_{est} = β \in (0, 2 π / 2^{q}) argmin ∥ Q_{q} (F^{opt} (β)) - F^{opt} (β) ∥_{F} .

G_{Ω} (r [m], c [m])

G_{Ω} (r [m], c [m])

G_{Ω} (k, ℓ)

S_{bl}

S_{bl}

X_{bl}

\mathbf{N}_{\Omega}(r,c)=\begin{cases}\begin{array}[]{c}1,\,\,\,\,\mathrm{if}\,\,(r,c)\in\Omega\\ 0,\,\,\,\,\mathrm{if\,\,(r,c)\notin\Omega}\end{array}\end{cases}.

\mathbf{N}_{\Omega}(r,c)=\begin{cases}\begin{array}[]{c}1,\,\,\,\,\mathrm{if}\,\,(r,c)\in\Omega\\ 0,\,\,\,\,\mathrm{if\,\,(r,c)\notin\Omega}\end{array}\end{cases}.

K_{bl} = \frac{N}{M} U_{N}^{*} N_{Ω} U_{N}^{*} .

K_{bl} = \frac{N}{M} U_{N}^{*} N_{Ω} U_{N}^{*} .

S_{bl}

S_{bl}

= \frac{M}{N ^{2}} S ⊛ K_{bl} .

F_{_{ZFB}} = Q_{q} (U_{N} (:, r_{_{ZFB}}) U_{N} (c_{_{ZFB}}, :)) .

F_{_{ZFB}} = Q_{q} (U_{N} (:, r_{_{ZFB}}) U_{N} (c_{_{ZFB}}, :)) .

ξ^{2} = \frac{1 - ρ}{ρ N ^{2}} .

ξ^{2} = \frac{1 - ρ}{ρ N ^{2}} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

FALP: Fast beam alignment in mmWave systems with low-resolution phase shifters

Nitin Jonathan Myers, Student Member, IEEE,

Amine Mezghani, Member, IEEE, and Robert W. Heath Jr., Fellow, IEEE N. J. Myers ([email protected]) and R. W. Heath Jr. ([email protected]) are with the Wireless Networking and Communications Group, The University of Texas at Austin, Austin, TX 78712 USA. A. Mezghani ([email protected]) is with the Department of Electrical and Computer Engineering, University of Manitoba, Winnipeg, MB, R3T 5V6, Canada. This material is based upon work supported by the National Science Foundation under Grant numbers NSF-CNS-1702800, NSF-CNS-1731658, and NSF ECCS-1711702.

Abstract

Millimeter wave (mmWave) systems can enable high data rates if the link between the transmitting and receiving radios is configured properly. Fast configuration of mmWave links, however, is challenging due to the use of large antenna arrays and hardware constraints. For example, a large amount of training overhead is incurred by exhaustive search-based beam alignment in typical mmWave phased arrays. In this paper, we present a framework called FALP for Fast beam Alignment with Low-resolution Phase shifters. FALP uses an efficient set of antenna weight vectors to acquire channel measurements, and allows faster beam alignment when compared to exhaustive scan. The antenna weight vectors in FALP can be realized in ultra-low power phase shifters whose resolution can be as low as one-bit. From a compressed sensing (CS) perspective, the CS matrix designed in FALP satisfies the restricted isometry property and allows CS algorithms to exploit the fast Fourier transform. The proposed framework also establishes a new connection between channel acquisition in phased arrays and magnetic resonance imaging.

Index Terms:

Perfect arrays, 2D sparse recovery, one-bit phase shifters, magnetic resonance imaging, mm-Wave

I Introduction

Millimeter wave (mmWave) communication, currently used in the IEEE 802.11ad standard and 5G, can support Gbps data rates by exploiting the large amount of bandwidth available at mmWave frequencies [1]. MmWave systems typically use large antenna arrays and directional beams to achieve such high data rates. The process of beam alignment, i.e., finding the best directional beam, can be challenging in mmWave hardware architectures like the phased array [2]. As phased arrays have fewer radio frequency (RF) chains than the antennas, standard techniques like exhaustive search-based beam alignment can result in a substantial training overhead when applied to mmWave systems[2].

Compressed sensing (CS) is a technique that allows reconstructing a sparse signal with fewer measurements when compared to the dimension of the signal [3]. Due to the sparse nature of mmWave channels in an appropriate dictionary, CS is a promising solution for mmWave channel estimation or beam alignment with sub-Nyquist channel measurements [2]. The channel measurements in CS are obtained by projecting the channel onto a lower dimensional subspace using a CS matrix [3]. The channel is then recovered from the lower dimensional projections using optimization techniques that exploit sparsity of the channel [4, 5]. The guarantees on the recovery of sparse signals and the complexity of the reconstruction algorithms, depend on the choice of the CS matrix used to obtain these projections. The restricted isometry property (RIP) [6] is one metric that characterizes the efficiency of a CS matrix in recovering sparse signals. Unfortunately, several random CS matrices that are known to satisfy the RIP cannot be realized in phased arrays due to hardware constraints [7]. To this end, prior work has used random IID phase shift-based CS matrices for sub-Nyquist mmWave channel estimation and beam alignment [4, 5]. CS techniques that use the random phase shift design, however, cannot exploit fast transforms and may result in a high complexity when applied to large antenna systems. Structured CS algorithms are promising for large antenna systems as they can perform sparse recovery with a reduced computational complexity [8].

Convolutional compressed sensing (CCS) is one form of structured CS in which the signal of interest is projected onto fewer circulantly shifted versions of a base sequence [9]. The convolutional structure of CS matrices in CCS allows sparse recovery algorithms to exploit the fast Fourier transform. In CCS of vectors, the choice of the base sequence is critical for the successful recovery of the sparse signal [9]. Prior work has shown that optimal base sequences of a certain length exist when the size of the alphabet is sufficiently large [9, 10]. For example, ideal base sequences of length $16$ exist in $\{1,\mathsf{j},-1,-\mathsf{j}\}^{16}$ , where $\mathsf{j}=\sqrt{-1}$ . Base sequences of the same length, however, do not exist in $\{1,-1\}^{16}$ [10]. For CCS in phased arrays, the size of the alphabet is determined by the resolution of the phase shifters. As typical mmWave phased arrays use low resolution phase shifters, applying CCS in such systems can be challenging. The difficulty lies in finding optimal base sequences that are compatible with the hardware. In this paper, we construct efficient structured CS matrices that satisfy the RIP, and can be realized in arbitrarily large phased arrays whose resolution can be as low as one-bit.

We propose a novel 2D-CCS framework called FALP for convolutional compressed sensing of mmWave channels with planar phased arrays. The channel measurements in our 2D-CCS framework are obtained by projecting the channel matrix onto 2D-circulant shifts of a base matrix. Similar to standard vector CCS [9], the performance of 2D-CCS depends on the choice of the base matrix. As the number of hardware compatible matrices in phased arrays is exponential in the array dimensions, a brute-force approach to find the best base matrix is not practical for large arrays. In our prior work called Swift-Link [7], we used Zadoff-Chu (ZC) sequences for efficient CCS-based channel estimation in planar arrays. For linear phased arrays, the efficiency of CCS-based channel estimation with ZC sequences was studied in [11]. The 2D-CCS equivalent of the sequences in [7] and [11], is a base matrix that is an outer product of two ZC sequences. Realizing base matrices using ZC-sequences, however, requires a phase shift resolution that is logarithmic in the number of antennas [11]. In large antenna arrays, it can be difficult to meet such a requirement as the use of high resolution phase shifters can result in a higher hardware cost and a higher power consumption.

One-bit phased arrays are promising in terms of the hardware complexity, cost and power consumption [12, 13]. The hardware associated with a one-bit phase shifter can be as simple as a combination of a switch and an inverter [14]. Both these components consume less power than a typical high resolution phase shifter. For instance, a one-bit phase shifter may require $10\,\mathrm{mW}$ while a four-bit phase shifter may need $45\,\mathrm{mW}$ [14]. The binary phase control capability in one-bit phased arrays, however, complicates the design of efficient base matrices for 2D-CCS. In this paper, we arrive at a surprising result that ideal base matrices for 2D-CCS exist for infinite array dimensions even over a binary alphabet. This result allows applying FALP to large phased arrays with one-bit phase shifters, and makes it a candidate solution for next generation wireless systems. We summarize our main contributions as follows.

•

We propose a compressive channel acquisition technique that acquires channel measurements with fewer 2D-circulant shifts of a base matrix. We determine the properties of base matrices that result in efficient 2D-CCS and are compatible with phased arrays.

•

We show that perfect arrays [15, 16] can be used as efficient base matrices in FALP. For a given resolution of phase shifters, such arrays exist for a family of array dimensions. For other cases, we derive the sub-optimality gap of CS algorithms when non-ideal base matrices are used in our framework.

•

We establish an equivalence between CS-based beam alignment with FALP and CS in magnetic resonance imaging (MRI) [17]. The equivalence allows direct application of k-space trajectories in MRI to the beam alignment problem. For a random trajectory, we derive the probability of successful beam alignment using zero filling-based reconstruction in MRI. We use this equivalence to show how low complexity beam alignment can be performed using a single 2D-fast Fourier transform.

•

Using simulations, we show that the use of perfect arrays in 2D-CCS results in better beam alignment when compared to 2D-CCS with a randomly chosen base matrix. We also show that the proposed CS technique performs slightly better than the common random phase shift-based approach, for a significantly reduced computational complexity.

We would like to highlight that Swift-Link [7] and FALP solve two independent problems. On the one hand, FALP develops efficient base matrices for 2D-CCS in low resolution phased arrays. On the other hand, Swift-Link designs trajectories to perform CS-based beam alignment that is robust to carrier frequency offset (CFO). For simplicity of exposition, we assume perfect frame timing and carrier synchronization. Nevertheless, Swift-Link’s trajectory can be used in FALP for CFO robust beam alignment in low-resolution phased arrays.

The rest of the paper is organized as follows. In Section II, we describe the system and channel model in a planar phased array-based system. Section III is the main technical section of the paper, where we explain how channel measurements are acquired in 2D-CCS and introduce the notion of base matrix. We mathematically show that perfect arrays [15, 16] are good candidates for ideal base matrices, and describe FALP in Section III. In Section IV, we explain how compressive beam alignment in FALP is analogous to CS in MRI. We use the analogy to develop a beam alignment technique that does not require any iterative optimization. Simulation results are presented in Section V, before the conclusions and future work in Section VI.

Notation $:$ $\mathbf{A}$ is a matrix, $\mathbf{a}$ is a column vector and $a,A$ denote scalars. Using this notation $\mathbf{A}^{T},\mathbf{A}^{\text{c}}$ and $\mathbf{A}^{\ast}$ represent the transpose, conjugate and conjugate transpose of $\mathbf{A}$ . We use $\mathrm{diag}\left(\mathbf{a}\right)$ to denote a diagonal matrix with entries of $\mathbf{a}$ on its diagonal. The scalar $a\left[m\right]$ denotes the $m^{\mathrm{th}}$ element of $\mathbf{a}$ . The $\ell_{2}$ norm of $\mathbf{a}$ is denoted by $\|\mathbf{a}\|_{2}$ . The $k^{\mathrm{th}}$ row and the $\ell^{\mathrm{th}}$ column of $\mathbf{A}$ are denoted by $\mathbf{A}(k,:)$ and $\mathbf{A}(:,\ell)$ . The scalar $\mathbf{A}\left(k,\ell\right)$ or $\mathbf{A}_{k,\ell}$ denotes the entry of $\mathbf{A}$ in the $k^{\mathrm{th}}$ row and the ${\ell}^{\mathrm{th}}$ column. The matrix $|\mathbf{A}|$ contains the element-wise magnitude of $\mathbf{A}$ , i.e., $|\mathbf{A}|_{k,\ell}=|\mathbf{A}_{k,\ell}|$ . The $\ell_{1}$ norm and the Frobenius norm of $\mathbf{A}$ are denoted by $\|\mathbf{A}\|_{1}$ and $\|\mathbf{A}\|_{\mathrm{F}}$ . The inner product of two matrices $\mathbf{A}$ and $\mathbf{B}$ is defined as $\langle\mathbf{A},\mathbf{B}\rangle=\sum_{k,\ell}\mathbf{A}\left(k,\ell\right){\mathbf{B}^{\text{c}}}\left(k,\ell\right)$ . We use $\mathbf{1}$ to denote an all-ones matrix and $\mathbf{I}$ to denote the identity matrix. The symbols $\odot$ and $\circledast$ are used for the Hadamard product and 2D circular convolution [18].

II System and channel model

In this section, we describe a planar phased antenna array system considered in FALP. To explain our framework, we assume a narrowband mmWave system and focus on the transmit beam alignment problem. We extend our algorithm to the wideband setting in Section V.

II-A System model

We consider an analog beamforming system in which the transmitter (TX) is equipped with a uniform planar array (UPA) of antennas as shown in Fig. 1. For ease of notation, we consider an equal number of antennas, i.e., $N$ , along each of the azimuth and elevation dimensions of the UPA. Our framework can also be extended to other rectangular arrays by using array response vectors of appropriate dimensions in the formulation. The beamforming architecture at the TX uses a single radio frequency (RF) chain as shown in Fig. 1. Each antenna element in the UPA is connected to the RF chain through a digitally controlled phase shifter. By appropriately configuring the phase shifters, the TX can perform directional transmission [19]. We define the set $\mathcal{I}_{J}=\{0,1,2,\cdots,J-1\}$ . The resolution of each phase shifter is assumed to be $q$ -bits; the set of possible phase shifts is defined as $\mathbb{Q}_{q}=\left\{e^{\mathsf{j}2\pi k/2^{q}}/N:k\in\mathcal{I}_{2^{q}}\right\}$ . As each antenna in the UPA is connected to a unique phase shifter, it is possible to configure $N^{2}$ phase shifters. Therefore, the phase shift matrix applied to the phased array at the TX is constrained to be an element in $\mathbb{Q}^{N\times N}_{q}$ . The transmit beam alignment problem is to determine a phase shift matrix at the TX that maximizes the SNR at the receiver (RX).

A possible approach to perform beam alignment is to estimate a reasonable approximation of the channel and use it to configure the phased array. For simplicity of exposition, we assume a single antenna at the RX and focus on the transmit beam alignment problem. Our framework can be extended to settings with UPAs at both the TX and the RX, by using fourth order tensors to model the channel. We index the antenna element in the $i^{\mathrm{th}}$ row and the $j^{\mathrm{th}}$ column of the transmit array as $(i,j)$ . For an $N\times N$ UPA, $i\in\mathcal{I}_{N}$ and $j\in\mathcal{I}_{N}$ . Let $\mathbf{H}\in\mathbb{C}^{N\times N}$ be the channel matrix between the UPA at the TX and the receive antenna. Specifically, $\mathbf{H}(i,j)$ represents the channel coefficient between the $(i,j)^{\mathrm{th}}$ antenna in the UPA and the antenna at the RX. The TX uses different phase shift configurations across multiple training slots for the RX to obtain channel measurements.

In the $m^{\mathrm{th}}$ training slot, the TX applies the phase shift matrix $\mathbf{P}[m]\in\mathbb{Q}_{q}^{N\times N}$ to its phased array, and the RX acquires the channel measurement $y[m]$ . We use $M$ to denote the total number of channel measurements acquired by the RX. In this paper, we assume perfect frame timing and carrier synchronization. Our assumption is valid in cellular scenarios where synchronization is performed using separate control channels. With the perfect synchronization assumption, the $m^{\mathrm{th}}$ channel measurement is

[TABLE]

where $v[m]\sim\mathcal{N}_{\mathrm{c}}\left(0,\sigma^{2}\right)$ is additive white Gaussian noise. As the measurement in (1) is a scalar projection of $\mathbf{H}$ , estimating a generic $N\times N$ channel matrix requires $M=N^{2}$ channel measurements. Exhaustive beam search is one such approach that obtains the projections of $\mathbf{H}$ on all the $N^{2}$ elements of the 2D-discrete Fourier transform (2D-DFT) dictionary [2]. Such a solution, however, does not scale well with the array dimensions. In this paper, we propose a novel set of phase shift matrices, $\{\mathbf{P}[m]\}^{M-1}_{m=0}$ , for compressive channel acquisition. We prove that a good approximation of mmWave channels can be obtained from $M=\mathcal{O}(\mathrm{log}N)$ channel measurements that are acquired using the proposed set. We also show that CS algorithms that use the proposed design have a lower computational complexity than those that use the common random phase shift-based design [4, 5].

II-B Channel model

We consider a geometric-ray-based model for the channel matrix $\mathbf{H}$ [19]. Let $\gamma_{k}$ , $\theta_{e,k}$ and $\theta_{a,k}$ denote the complex ray gain, elevation angle-of-departure and azimuth angle-of-departure of the $k^{\mathrm{th}}$ ray. We define the beamspace angles $\omega_{a,k}=\pi\,\mathrm{sin}\,\theta_{e,k}\mathrm{sin}\,\theta_{a,k}$ and $\omega_{e,k}=\pi\,\mathrm{sin}\,\theta_{e,k}\mathrm{cos}\,\theta_{a,k}$ . We define the Vandermonde vector $\mathbf{a}\left(\omega\right)\in\mathbb{C}^{N\times 1}$ as

[TABLE]

The wireless channel for a half wavelength spaced UPA in the baseband is given by

[TABLE]

As large antenna arrays are used in typical mmWave settings, the dimension of the channel, i.e., $N^{2}$ , can be large in mmWave systems when compared to conventional lower frequency systems.

Channel matrices at mmWave are sparse in a well chosen dictionary, because of the propagation characteristics of the environment [2]. For UPAs, the 2D-DFT basis is often chosen for a sparse representation of $\mathbf{H}$ [20]. We use $\mathbf{U}_{N}$ to denote the standard unitary DFT matrix of size $N\times N$ . Let $\mathbf{X}\in\mathbb{C}^{N\times N}$ denote the inverse 2D-DFT of $\mathbf{H}$ , such that

[TABLE]

The unitary nature of the DFT implies that $\mathbf{X}=\mathbf{U}^{\ast}_{N}\mathbf{H}\mathbf{U}^{\ast}_{N}$ . The matrix $\mathbf{X}$ is called the beamspace channel as it contains the received measurements when different directional 2D-DFT beams are used at the TX [20]. The sparsity of the mmWave channel in the angle domain translates to the sparsity of the beamspace channel matrix $\mathbf{X}$ . As the beamspace angles-of-departure (AoD) in the channel may not align exactly with those corresponding to the DFT dictionary, there can be leakage effects in the 2D-DFT representation [2]. Therefore, the matrix $\mathbf{X}$ is approximately sparse. In such a case, dictionaries that use a finer AoD domain representation can be used for a sparser representation of $\mathbf{H}$ [4]. Using such a dictionary, however, increases the dimensionality of the CS problem. For our analysis, we consider $\mathbf{X}$ to be perfectly sparse, while our simulation results are for the realistic case where $\mathbf{X}$ is approximately sparse.

III Convolutional CS in planar arrays

In this section, we explain the main motivation for 2D-CCS, and describe the notions of the base matrix and the sub-sampling set in 2D-CCS. Then, we identify the conditions on the base matrix that minimize the channel reconstruction error with 2D-CCS. Finally, we show that perfect arrays over small alphabets [15, 16] can be used as base matrices, for efficient 2D-CCS in low resolution phased arrays.

III-A Motivation for 2D-CCS

A possible approach to acquire measurements in CS is to obtain fewer projections of the sparse signal in an appropriate basis [3]. For example, CS can efficiently recover a sparse matrix from its subsampled 2D-DFT [3]; the reconstruction problem in this case is known as a partial 2D-DFT CS problem [21]. Partial 2D-DFT CS has a lower complexity and may achieve better signal reconstruction, when compared to other CS techniques [22, 23]. In the context of mmWave channels, partial 2D-DFT CS can estimate the sparse matrix $\mathbf{X}$ from fewer samples of its 2D-DFT, i.e., $\mathbf{H}$ . An illustration of the reconstruction is shown in Fig. 2. The direct application of partial 2D-DFT CS in mmWave phased arrays, however, is challenging. The difficulty arises because $\mathbf{H}$ cannot be directly subsampled using phased arrays, as required by partial 2D-DFT CS. For instance, acquiring $\mathbf{H}(0,0)=\langle\mathbf{H},\mathbf{e}_{0}\mathbf{e}^{T}_{0}\rangle$ in a single training slot requires the application of $\mathbf{e}_{0}\mathbf{e}^{T}_{0}$ to the antenna array. The matrix $\mathbf{e}_{0}\mathbf{e}^{T}_{0}$ , however, does not belong to the feasible set, i.e., $\mathbb{Q}^{N\times N}_{q}$ . Although introducing switches after each phase shifter can help realize $\mathbf{e}_{0}\mathbf{e}^{T}_{0}$ , the SNR in the resulting channel measurement can be poor. This is because $\mathbf{e}_{0}\mathbf{e}^{T}_{0}$ uses a single transmit antenna and per-antenna power constraints limit the power that can be transmitted from an antenna. In this paper, we develop a novel 2D-CCS technique that overcomes these practical challenges, and has all the advantages of partial 2D-DFT CS.

The motivation for 2D-CCS comes from the observation that the matrices used to obtain channel projections in partial 2D-DFT CS are 2D-circulant shifts of a particular matrix. It can be noticed from Fig. 2 that the matrices used to acquire channel projections, i.e., $\mathbf{e}_{0}\mathbf{e}^{T}_{0}$ , $\mathbf{e}_{1}\mathbf{e}^{T}_{2}$ , and $\mathbf{e}_{2}\mathbf{e}^{T}_{1}$ , are all 2D-circulant shifts of $\mathbf{e}_{0}\mathbf{e}^{T}_{0}$ . The channel projections in our framework are acquired by applying 2D-circulant shifts of a matrix $\mathbf{P}\in\mathbb{Q}^{N\times N}_{q}$ , instead of $\mathbf{e}_{0}\mathbf{e}^{T}_{0}$ , to the phased array. We define $\mathbf{P}$ as the base matrix in 2D-CCS. Due to the constant modulus nature of the matrices in $\mathbb{Q}^{N\times N}_{q}$ , our 2D-CCS framework uses all the antennas in the phased array for compressive channel acquisition.

Now, we explain how compressive channel measurements are obtained in 2D-CCS. In the $m^{\mathrm{th}}$ training slot, the TX applies a $(r[m],c[m])$ 2D-circulant shift of $\mathbf{P}$ to its phased array. The matrix $\mathbf{P}[m]$ is generated by circulantly shifting $\mathbf{P}$ by $r[m]$ units along the rows and $c[m]$ units along the columns. We define $\Omega$ as a set that contains the 2D-circulant shifts used to acquire the $M$ channel measurements, i.e., $\Omega=\{(r[m],c[m])\}_{m=0}^{M-1}$ . In this paper, the set of circulant shifts, i.e., $\Omega$ , is constructed by sampling $M$ distinct coordinates at random from $\mathcal{I}_{N}\times\mathcal{I}_{N}$ . We define $\mathbf{J}\in\mathbb{R}^{N\times N}$ as a circulant delay matrix with its first row as $(0,1,0,0,..,0)$ . The subsequent rows of $\mathbf{J}$ are generated by right circulantly shifting the previous row by $1$ unit. Using this notation, we define the $d$ circulant delay matrix as $\mathbf{J}_{d}=\mathbf{J}\cdot\mathbf{J}\cdots\mathbf{J}$ ( $d$ times). In 2D-CCS, the matrix applied to the phased array in the $m^{\mathrm{th}}$ slot is

[TABLE]

An illustration of the compressive channel acquisition procedure using 2D-CCS, for a base matrix $\mathbf{P}$ and a subsampling set $\Omega=\{(0,0),(1,2),(2,1)\}$ , is shown in Fig. 3. The base matrix determines the success of 2D-CCS-based recovery. As an example, consider a 2D-CCS technique that uses $\mathbf{P}=\mathbf{1}/N$ . Channel acquisition with such a matrix results in the same measurement, i.e., mean of the entries in $\mathbf{H}$ , for any 2D-circulant shift. As $\mathbf{H}$ cannot be estimated just from its mean, 2D-CCS with $\mathbf{P}=\mathbf{1}/N$ fails. In Sections III-B and III-C, we use ideas from partial 2D-DFT CS to study how the choice of $\mathbf{P}$ impacts the performance of 2D-CCS.

III-B Transforming convolutional CS to partial 2D-DFT CS

We derive a compact representation of the channel measurements in 2D-CCS. In the $m^{\mathrm{th}}$ training slot, the TX applies $\mathbf{P}[m]$ to its phased array and the RX receives

[TABLE]

We use $k_{N}$ to denote the modulo $-N$ remainder of $k$ . The $m^{\mathrm{th}}$ channel measurement is then

[TABLE]

We define $\mathbf{P}_{\mathrm{FC}}$ as a flipped and conjugated version of $\mathbf{P}$ , i.e.,

[TABLE]

By the definition of 2D-circular convolution [18], it can be observed from (7) that

[TABLE]

In 2D-CCS, the RX acquires the $(r[m],c[m])$ entry of $\mathbf{H}\circledast\mathbf{P}_{\mathrm{FC}}$ , when the TX applies an $(r[m],c[m])$ 2D-circulant shift of $\mathbf{P}$ to its phased array. In $M$ training slots, the TX applies $M$ distinct 2D-circulant shifts of $\mathbf{P}$ according to the coordinates in $\Omega$ . The vector of $M$ channel measurements received at the RX is then

[TABLE]

The measurement vector in 2D-CCS is a subsampled convolution of $\mathbf{H}$ and $\mathbf{P}_{\mathrm{FC}}$ .

Now, we show how channel measurements in 2D-CCS can be interpreted as partial 2D-DFT measurements of a transformed beamspace. We define the convolved channel $\mathbf{G}$ as

[TABLE]

The spectral mask corresponding to the base matrix $\mathbf{P}$ is defined as

[TABLE]

Similar to the definition of the beamspace $\mathbf{X}$ , we define the masked beamspace as

[TABLE]

An interesting property of the Fourier transform is that the 2D-DFT of $\mathbf{H}\circledast\mathbf{P}_{\mathrm{FC}}$ is a scaled element-wise product of the 2D-DFTs of $\mathbf{H}$ and $\mathbf{P}_{\mathrm{FC}}$ [18]. We use this property to rewrite (13) as

[TABLE]

The transformations that relate the matrices $\mathbf{H}$ , $\mathbf{X}$ , $\mathbf{G}$ , and $\mathbf{S}$ are shown in Fig. 4. The matrix $\mathbf{S}$ is called the masked beamspace because it is an element-wise multiplication of the beamspace $\mathbf{X}$ and the spectral mask $\mathbf{Z}$ . As the element-wise multiplication of a sparse matrix with any other matrix is a sparse matrix, $\mathbf{S}$ is sparse under the assumption that $\mathbf{X}$ is sparse. The vector $\mathbf{y}$ can be expressed using (10), (11) and (13) as

[TABLE]

The channel measurements in (16) can be interpreted as the subsampled 2D-DFT of the masked beamspace $\mathbf{S}$ . In a subsampling setting, i.e., $M<N^{2}$ , the masked beamspace can be recovered from $\mathbf{y}$ , using partial 2D-DFT CS techniques that exploit the sparsity of $\mathbf{S}$ .

III-C Conditions on the base matrix for efficient CS

In this section, we derive guarantees on channel recovery for partial 2D-DFT CS over the masked beamspace. Using these guarantees, we identify the conditions on the base matrix for efficient 2D-CCS-based recovery of the beamspace channel $\mathbf{X}$ .

The CS matrix that results from acquiring $M=\mathcal{O}(\mathrm{log}N)$ 2D-DFT samples of the sparse matrix $\mathbf{S}$ in (16), is known to satisfy the restricted isometry property with high probability [8]. The masked beamspace $\mathbf{S}$ can be estimated from the channel measurements in (16) using an $\ell_{1}$ optimization program [3]

[TABLE]

The optimization program in (17) encourages sparse masked beamspace solutions that are consistent with the received channel measurements [3]. It is important to note that successful recovery of $\mathbf{S}$ does not guarantee the reconstruction of the beamspace channel, i.e., $\mathbf{X}$ . The recovery of the beamspace depends on the spectral mask $\mathbf{Z}$ . For example, if $\mathbf{Z}(k,\ell)=0$ for some $k$ and $\ell$ , the masked beamspace component $\mathbf{S}(k,\ell)=0$ . In such case, $\mathbf{X}(k,\ell)$ cannot be recovered from the spectral mask equation, i.e., $\mathbf{S}=\mathbf{Z}\odot\mathbf{X}$ . To avoid such blanking effects in the masked beamspace, well conditioned spectral masks must be designed to estimate $\mathbf{X}$ from $\mathbf{S}$ .

We derive guarantees for compressive beamspace reconstruction using masked beamspace recovery with (17). Let $\hat{\mathbf{X}}$ be a solution to the beamspace channel. As $\mathbf{S}=\mathbf{Z}\odot\mathbf{X}$ , the estimate $\hat{\mathbf{X}}$ must satisfy $\hat{\mathbf{S}}=\mathbf{Z}\odot\hat{\mathbf{X}}$ . For the spectral mask $\mathbf{Z}$ , we define $Z_{\mathrm{max}}=\underset{k,\ell}{\mathrm{max}}\left|\mathbf{Z}(k,\ell)\right|$ and $Z_{\mathrm{min}}=\underset{k,\ell}{\mathrm{min}}\left|\mathbf{Z}(k,\ell)\right|$ . We use $\left(\mathbf{A}\right)_{k}$ to denote the $k$ sparse representation of $\mathbf{A}$ . The matrix $\left(\mathbf{A}\right)_{k}$ is obtained from $\mathbf{A}$ by retaining the $k$ largest entries in magnitude and setting the rest to [math].

Theorem 1.

For a fixed constant $\gamma\in\left(0,1\right)$ , a solution $\hat{\mathbf{X}}$ such that $\hat{\mathbf{S}}=\hat{\mathbf{X}}\odot\mathbf{Z}$ satisfies

[TABLE]

with a probability of at least $1-\gamma$ if $M\geq Ck\,\mathrm{max}\left\{2\mathrm{log}^{3}(2k)\,\mathrm{log}(N),\,\mathrm{log}(\gamma^{-1})\right\}$ . The constants $C,C_{1}$ and $C_{2}$ are independent of all the other parameters.

Proof.

See Section VII-A. ∎

For a given $\mathbf{X}$ , $M$ , and $\sigma$ , the result in (18) indicates that upper bound on the channel reconstruction error, i.e., $\bigl{\|}\mathbf{X}-\hat{\mathbf{X}}\bigl{\|}_{F}$ , is lower when $Z_{\mathrm{min}}$ is bounded away from [math]. The smallest entry in $|\mathbf{Z}|$ depends on the base matrix $\mathbf{P}$ . Note that $\mathbf{Z}$ is the inverse 2D-DFT of $\mathbf{P}_{\mathrm{FC}}$ , the flipped and conjugated version of $\mathbf{P}$ . As $Z_{\mathrm{min}}\leq Z_{\mathrm{max}}$ , a base matrix that achieves the smallest upper bound in (18) is one that has $Z_{\mathrm{min}}=Z_{\mathrm{max}}$ .

Now, we show that ideal base matrices in $\mathbb{Q}_{q}^{N\times N}$ must have a unimodular spectral mask, i.e., $Z_{\mathrm{max}}=1$ and $Z_{\mathrm{min}}=1$ , for efficient 2D-CCS in phased arrays. It can be observed from (12) that the norm of the spectral mask is $\|\mathbf{Z}\|_{F}=N$ for any $\mathbf{P}\in\mathbb{Q}_{q}^{N\times N}$ . The condition $Z_{\mathrm{min}}=Z_{\mathrm{max}}$ is achieved under the norm constraint only when all the entries of $|\mathbf{Z}|$ are equal to $1$ . Designing a base matrix $\mathbf{P}\in\mathbb{Q}^{N\times N}_{q}$ such that the spectral mask in (12) is unimodular, however, is a difficult problem. The main challenge in the design of $\mathbf{P}$ is due to the phase shift constraint, i.e., $\mathbf{P}\in\mathbb{Q}_{q}^{N\times N}$ . The canonical basis element $\mathbf{e}_{0}\mathbf{e}^{T}_{0}$ is a good example that has a unimodular spectral mask, but lies outside $\mathbb{Q}_{q}^{N\times N}$ . A brute force approach to find a matrix in $\mathbb{Q}_{q}^{N\times N}$ with a unimodular spectral mask is not practical as $\mathbb{Q}_{q}^{N\times N}$ contains a large number of matrices, i.e., $2^{qN^{2}}$ . Prior work has considered subsampled convolution using random sequences [8]; the 2D extension of such a technique is 2D-CCS using a $\mathbf{P}$ that is chosen at random from $\mathbb{Q}^{N\times N}_{q}$ . A random choice for $\mathbf{P}$ , however, may not result in a unimodular spectral mask. In Sec. III-D, we show that ideal base matrices exist for several combinations of $q$ and $N$ .

III-D Perfect arrays as ideal base matrices

In this section, we establish the equivalence between unimodularity of the spectral mask and perfect periodic spatial autocorrelation of the base matrix. Using this equivalence, we show that perfect arrays [15, 16], a class of matrices that have perfect periodic spatial autocorrelation, satisfy the properties of an ideal base matrix for 2D-CCS-based channel recovery.

The duality between perfect periodic spatial autocorrelation and unimodular 2D-DFT properties is explained in Theorem 19.

Theorem 2.

A matrix $\mathbf{P}\in\mathbb{Q}^{N\times N}_{q}$ has a unimodular spectral mask, i.e., $Z_{\mathrm{max}}=1$ and $Z_{\mathrm{min}}=1$ , if and only if $\mathbf{P}$ has perfect periodic spatial autocorrelation, i.e.,

[TABLE]

Proof.

See Section VII-B. ∎

The problem of finding a perfect array in $\mathbb{Q}^{N\times N}_{q}$ over small alphabets, i.e., a small $q$ , has been well investigated in [15] and [16]. Although there are several hardware constrained 2D-CS applications, perfect arrays over finite alphabets have not been used in the context of CS, to the best of our knowledge. The construction of perfect arrays over large alphabets, i.e., a large $q$ , can be trivial. For example, a matrix that is an outer product of two Zadoff-Chu sequences satisfies the conditions in Theorem 19, and is a perfect array. The ZC-based matrix, however, may not be realizable in low resolution phased arrays [7, 11]. An important step towards efficient 2D-CCS in $q$ -bit phased arrays, is to find matrices that are perfect arrays, i.e., matrices that satisfy the perfect periodic autocorrelation property in $\mathbb{Q}^{N\times N}_{q}$ .

Perfect arrays over $\mathbb{Q}^{N_{1}\times N_{2}}_{q}$ were constructed for binary and quaternary alphabets, i.e., for $q=\mathrm{log}_{2}{2}$ and $q=\mathrm{log}_{2}{4}$ in [15] and [16]. In this paper, we consider square arrays, i.e., arrays of size $N\times N$ for simplicity. We also consider the extreme case of perfect binary arrays, i.e., $q=1$ , as such arrays can be implemented in phase shifters of any resolution. An example of a perfect binary array for $N=2$ is

[TABLE]

The matrix in (20) satisfies the perfect periodic autocorrelation property in (19). The binary nature of $\mathbf{P}$ in (20) allows its application to $2\times 2$ phased arrays with a resolution of one-bit. As typical mmWave systems have large antenna arrays, it is useful to construct perfect arrays of large dimensions for efficient 2D-CCS. An interesting result from [15] is that perfect binary arrays in $\mathbb{Q}^{N\times N}_{1}$ exist when $N=2^{k}$ or $N=3\cdot 2^{k}$ , where $k$ is any natural number. A recursive method to generate perfect binary arrays was provided in [15] for square arrays and other rectangular configurations. An implementation of the construction in [15] is available on our GitHub page [24]. For $q=2$ , several perfect arrays, for which perfect binary arrays of the same dimension do not exist, were proposed in [16]. The arrays in [16] can be used for efficient 2D-CCS in $2$ -bit phased arrays. FALP uses perfect binary arrays in 2D-CCS, and allows the CS algorithm in (17) to achieve the smallest upper bound on the channel reconstruction error in (18).

III-E Perfect array-based compressive beam alignment in FALP

We show how channel matrices are estimated with FALP, using Fig. 5. FALP uses a partial 2D-DFT CS algorithm to estimate the masked beamspace matrix $\mathbf{S}$ , from 2D-CCS-based channel measurements that are acquired with a perfect array. It can be observed from Theorem 19 that the spectral mask $|\mathbf{Z}|$ is unimodular for any perfect array, i.e., $|\mathbf{Z}(k,\ell)|=1\,\forall k,\ell$ . In such a case, the transformation between the beamspace $\mathbf{X}$ and the masked beamspace $\mathbf{S}$ in (15), can be inverted using $\mathbf{X}(k,\ell)=\mathbf{S}(k,\ell){\mathbf{Z}^{\text{c}}}(k,\ell)\,\forall k,\ell$ . For a masked beamspace $\hat{\mathbf{S}}$ obtained from partial 2D-DFT CS, the beamspace matrix can be estimated as

[TABLE]

The channel estimate $\hat{\mathbf{H}}=\mathbf{U}_{N}\hat{\mathbf{X}}\mathbf{U}_{N}$ is then used for beam alignment.

Now, we explain a two step procedure for beam alignment with the channel estimate $\hat{\mathbf{H}}$ . The first step relaxes the $q$ -bit constraint to find an $\mathbf{F}\in\mathbb{Q}^{N\times N}_{\infty}$ that maximizes $|\langle\hat{\mathbf{H}},\mathbf{F}\rangle|$ . Note that the phased array implementation requires $|\mathbf{F}_{k,\ell}|=1/N$ for every $k$ and $\ell$ . By the dual norm inequality [25], we have $|\langle\hat{\mathbf{H}},\mathbf{F}\rangle|\leq\mathrm{max}(|\mathbf{F}|)\|\hat{\mathbf{H}}\|_{1}$ . Therefore, $|\langle\hat{\mathbf{H}},\mathbf{F}\rangle|\leq\|\hat{\mathbf{H}}\|_{1}/N$ . The upper bound in the dual norm inequality is achieved by an $\mathbf{F}^{\mathrm{opt}}(\beta)$ such that $|\mathbf{F}^{\mathrm{opt}}_{k,\ell}(\beta)|=1/N$ and $\mathrm{phase}(\mathbf{F}^{\mathrm{opt}}_{k,\ell}(\beta))=\beta+\mathrm{phase}(\hat{\mathbf{H}}_{k,\ell})$ for any $\beta\in(-\pi,\pi]$ . The scalar $\beta$ corresponds to the global phase in $\mathbf{F}^{\mathrm{opt}}(\beta)$ . As the angles in $\mathbf{F}^{\mathrm{opt}}(\beta)$ may not be integer multiples of $2\pi/2^{q}$ , $\mathbf{F}^{\mathrm{opt}}(\beta)$ may not be directly realized in $q$ -bit phased arrays. In such a case, a $q$ -bit phase quantized version of $\mathbf{F}^{\mathrm{opt}}(\beta)$ can be used in the phased array, for an appropriate choice of global phase $\beta$ .

The second step of our beam alignment procedure finds the best $\beta$ that minimizes phase errors due to $q$ -bit phase quantization of $\mathbf{F}^{\mathrm{opt}}(\beta)$ . This step is important in low resolution phased arrays [26, 12]. Let $\mathcal{Q}_{q}(\mathbf{F}^{\mathrm{opt}}(\beta))$ denote the $q$ -bit phase quantized version of $\mathbf{F}^{\mathrm{opt}}(\beta)$ . Note that $\mathcal{Q}_{q}(\cdot)$ performs element-wise phase quantization. The global phase term $\beta_{\mathrm{est}}$ that minimizes the phase quantization error can be expressed as

[TABLE]

To solve for the scalar $\beta_{\mathrm{est}}$ in (22), we define a phase set $\mathcal{B}$ that contains $K_{\mathcal{B}}$ uniformly spaced values in $(0,2\pi/2^{q})$ . The optimization in (22) is performed using line search over the elements in $\mathcal{B}$ for a sufficiently large $K_{\mathcal{B}}$ . The phase shift matrix used at the TX with CS-based beamforming is then $\mathbf{F}_{\mathrm{CS}}=\mathcal{Q}_{q}(\mathbf{F}^{\mathrm{opt}}(\beta_{\mathrm{est}}))$ .

The partial 2D-DFT CS algorithm in FALP requires a lower computational complexity when compared to other standard CS techniques. Let $\mathbf{A}_{\mathrm{CS}}\in\mathbb{C}^{M\times N^{2}}$ be the CS matrix corresponding to the partial 2D-DFT CS problem in (16). CS algorithms that solve (16) typically perform iterative optimization over an $N^{2}$ dimensional variable. For example, each iteration in the orthogonal matching pursuit (OMP) algorithm [27] requires computing matrix-vector products of the form $\mathbf{A}_{\mathrm{CS}}\mathbf{w}$ and $\mathbf{A}^{\ast}_{\mathrm{CS}}\mathbf{d}$ . As $\mathbf{A}_{\mathrm{CS}}$ is a partial 2D-DFT CS matrix for the model in (16), the matrix-vector products in CS can be implemented using the 2D-FFT [22]. In the subsampling regime where $M=\alpha N^{2}$ for some constant $\alpha<1$ , the 2D-FFT-based implementation has a complexity of $\mathcal{O}(N^{2}\,\mathrm{log}N)$ while the complexity of standard matrix-vector product is $\mathcal{O}(N^{4})$ [22].

IV An MRI-inspired approach for ultra-low complexity beam alignment

In this section, we use insights from zero filling reconstruction in MRI [28], to develop a different sub-Nyquist beam alignment technique based on FALP. The proposed technique does not require any iterative optimization, unlike standard CS or partial 2D-DFT CS. Specifically, the zero filling-based approach uses the perfect array-based training, and estimates a reasonable beamformer with just a single 2D-FFT computation. We prove that our method can achieve a beam alignment performance that is comparable to exhaustive scan with the 2D-DFT dictionary.

IV-A Connection between CS in MRI and CS using FALP

The measurements in MRI are defined by a trajectory that acquires samples from the Fourier transform of an MR image, also known as the k-space [28]. Prior work on CS-MRI has shown that MR images can be reconstructed using subsampling k-space trajectories that acquire fewer samples from the k-space [17]. For example, CS can reconstruct sparse angiogram images from fewer samples of their 2D-DFT [3]. In this section, we explain the equivalent of k-space trajectory in FALP.

The channel measurements in FALP, i.e., $\mathcal{P}_{\Omega}(\mathbf{G})$ , are samples from the 2D-DFT of the sparse masked beamspace $\mathbf{S}$ . The subsampling pattern over $\mathbf{G}$ is determined by the set $\Omega$ , as shown in Fig. 3. The masked beamspace $\mathbf{S}$ is analogous to sparse angiogram image in MRI. The k-space, which represents the Fourier transform of the MR image, is equivalent to the matrix $\mathbf{G}$ as $\mathbf{G}=\mathbf{U}_{N}\mathbf{S}\mathbf{U}_{N}$ . The analogue of a k-space trajectory in FALP is a 2D-curve in $\mathcal{I}_{N}\times\mathcal{I}_{N}$ that sequentially traverses through the coordinates in $\Omega$ . An example of a trajectory for $M=9$ and $N=5$ is shown in Fig. 6a. The trajectory in Fig. 6a sequentially acquires the channel measurements $\{\mathbf{G}_{01},\mathbf{G}_{22},\mathbf{G}_{04},\cdots,\mathbf{G}_{32}\}$ for the subsampling set $\Omega=\{(0,1),(2,2),(0,4),\cdots,(3,2)\}$ .

We would like to mention that k-space trajectories in MRI are typically constrained to be continuous, i.e., random k-space trajectories may not be realized. Random trajectories over the matrix $\mathbf{G}$ , however, can be realized in FALP as any circulant shift of a base matrix can be applied to the phased array. In this paper, we use ideas from CS-MRI to investigate how beam alignment can be performed without any iterative optimization.

IV-B Zero filling-based reconstruction: From MRI to beam alignment

A traditional approach for MR imaging is to use a trajectory that fully samples the k-space. The MR image is then recovered by applying a 2D-Fourier transform over the acquired samples. Zero filling-based technique is an approach to estimate MR images using trajectories that subsample the k-space. The idea in zero filling-based reconstruction is to fill the unsampled entries in the k-space with zeros, and invert the Fourier transform to estimate the MR image. In this section, we show that zero filling-based recovery can also be used to recover a reasonable one-sparse approximation of the beamspace. To provide a better illustration and for a tractable analysis, we ignore the measurement noise in the system, i.e., $\sigma=0$ . The simulation results in Section V include the impact of measurement noise.

Now, we explain zero filling-based beamspace reconstruction using FALP. We define the subsampling ratio in FALP as $\rho=M/N^{2}$ . The matrix $\mathbf{G}_{\Omega}\in\mathbb{C}^{N\times N}$ is defined to contain the entries of $\mathbf{G}$ at the locations in $\Omega$ , and zeros in the other locations. Specifically, the entries of $\mathbf{G}_{\Omega}$ are given by

[TABLE]

The matrix $\mathbf{G}_{\Omega}$ can be constructed by populating the $M$ channel measurements in FALP at the locations in $\Omega$ . In a subsampling setting, i.e., $M<N^{2}$ , the construction of $\mathbf{G}_{\Omega}$ is equivalent to zero filling in MRI. The equivalence follows from the observation that $\mathbf{G}_{\Omega}$ is [math] at the unsampled $\mathbf{G}$ -space locations. In a full sampling setting, the beamspace $\mathbf{X}$ can be estimated from the transformations $\mathbf{S}=\mathbf{U}^{\ast}_{N}\mathbf{G}\mathbf{U}^{\ast}_{N}$ and $\mathbf{X}=\mathbf{S}\odot{\mathbf{Z}^{\text{c}}}$ . The zero filling-based reconstruction procedure applies the same transformations over $\mathbf{G}_{\Omega}$ , when $M<N^{2}$ . We define the zero filling-based estimates corresponding to $\mathbf{S}$ and $\mathbf{X}$ as

[TABLE]

Note that $\mathbf{X}_{{\mathrm{bl}}}=\mathbf{X}$ , when $M=N^{2}$ . A natural question that arises is how does subsampling impact the zero filling-based estimate $\mathbf{X}_{{\mathrm{bl}}}$ when compared to $\mathbf{X}$ .

Now, we show that subsampling the $\mathbf{G}$ -space results in a blurred $\mathbf{X}_{{\mathrm{bl}}}$ . For ease of notation, we investigate the impact of blur on $\mathbf{S}_{{\mathrm{bl}}}$ ; our study is justified by the fact that $\mathbf{X}_{{\mathrm{bl}}}$ in (26) is just a phase modulated version of $\mathbf{S}_{{\mathrm{bl}}}$ for a unimodular $\mathbf{Z}$ . To characterize the impact of subsampling on $\mathbf{S}_{{\mathrm{bl}}}$ , we define a binary matrix $\mathbf{N}_{\Omega}\in\mathbb{C}^{N\times N}$ such that

[TABLE]

We define a kernel matrix $\mathbf{K}_{\mathrm{bl}}$ as the scaled inverse 2D-DFT of $\mathbf{N}_{\Omega}$ , i.e.,

[TABLE]

It can be observed from (27) that $\mathbf{G}_{\Omega}=\mathbf{G}\odot\mathbf{N}_{\Omega}$ . As element-wise multiplication of two matrices results in 2D-circular convolution of their inverse 2D-DFTs [18], (25) can be simplified to

[TABLE]

It can be observed from (30) that the matrices $\mathbf{S}_{{\mathrm{bl}}}$ and $\mathbf{S}$ differ by a convolutional distortion due to $\mathbf{K}_{\mathrm{bl}}$ . The matrix $\mathbf{K}_{\mathrm{bl}}$ is similar to the point spread function (PSF) in MRI. For the special case of $M=N^{2}$ , it can be observed that $\mathbf{K}_{\mathrm{bl}}=\mathbf{e}_{0}\mathbf{e}^{T}_{0}$ and $\mathbf{S}_{{\mathrm{bl}}}=\mathbf{S}$ . In the subsampling regime, however, the PSF $\mathbf{K}_{\mathrm{bl}}$ is not a perfect dirac matrix. Therefore, $\mathbf{K}_{\mathrm{bl}}$ induces distortion in $\mathbf{S}_{{\mathrm{bl}}}$ for $M<N^{2}$ . For the masked beamspace in Fig. 7a, an example of the distortion induced due to subsampling is shown in Fig. 7b. The amount of distortion in $\mathbf{S}_{{\mathrm{bl}}}$ is a function of the subsampling ratio, i.e., $\rho$ .

IV-C Beam alignment with the zero filling-based estimate

We define the zero filling-based beam alignment technique (ZFB) as one that chooses the beamformer based on the coordinate that maximizes $|\mathbf{S}_{{\mathrm{bl}}}|$ , i.e., the zero filling-based masked beamspace estimate. If $|\mathbf{S}_{{\mathrm{bl}}}|$ achieves its maximum at $(r_{{}_{\mathrm{ZFB}}},c_{{}_{\mathrm{ZFB}}})$ , the transmit beamformer in ZFB is defined as

[TABLE]

It is important to note that exhaustive scan with the 2D-DFT dictionary selects the coordinate that maximizes $|\mathbf{X}|$ , which is same as the one that maximizes $|\mathbf{S}|$ , for a unimodular $\mathbf{Z}$ . In an ideal setting, i.e., $M=N^{2}$ , $\mathbf{S}_{{\mathrm{bl}}}=\mathbf{S}$ and ZFB results in the same beamformer as exhaustive scan with the 2D-DFT dictionary. In this section, we identify the subsampling regime for which ZFB results in the same beamformer as exhaustive scan.

Zero filling-based beam alignment is successful when the coordinate that maximizes $\mathbf{S}_{{\mathrm{bl}}}$ also maximizes $\mathbf{S}$ . As $\mathbf{S}_{{\mathrm{bl}}}=\mathbf{S}\circledast\mathbf{K}_{{\mathrm{bl}}}$ , it is important to characterize the entries of $\mathbf{K}_{{\mathrm{bl}}}$ to determine the success of ZFB. The matrix $\mathbf{K}_{{\mathrm{bl}}}$ in (28) is a function of the subsampling set $\Omega$ , that is chosen at random. It can be observed from (27) that $\mathbf{N}_{\Omega}$ has $M$ ones and $N^{2}-M$ zeros. Therefore, $\mathbf{K}_{\mathrm{bl}}(0,0)$ , the scaled DC-component of $\mathbf{N}_{\Omega}$ , is $1$ . The other entries of $\mathbf{K}_{\mathrm{bl}}$ explicitly depend on the elements in the sampled set $\Omega$ unlike $\mathbf{K}_{\mathrm{bl}}(0,0)$ . As $\Omega$ is sampled at random, $\mathbf{K}_{\mathrm{bl}}(r,c)$ can be modelled as a random variable for any $(r,c)\neq(0,0)$ with a variance [17]

[TABLE]

Note that the variance of the PSF at the $N^{2}-1$ locations other than $(0,0)$ is exactly the same when $\Omega$ is chosen uniformly at random. The magnitude of $|\mathbf{K}_{\mathrm{bl}}|$ , is shown in Fig. 6b for a particular realization of $\Omega$ .

We explain the setting used to investigate beam alignment with the zero-filling based approach. For simplicity of analysis, we consider a $2$ -path channel such that the beamspace angles of departure of each path are aligned with those defined by the 2D-DFT dictionary. Without loss of generality, we consider $\mathbf{S}(0,0)=1$ and $\mathbf{S}(r_{o},c_{o})=a$ , such that $|a|<1$ and $(r_{o},c_{o})$ is some coordinate other than $(0,0)$ . The remaining $N^{2}-2$ entries of $\mathbf{S}$ are equal to [math] as seen in Fig. 7a. For such a setting, beam alignment via ZFB is successful when $|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{0,0}|$ is the largest entry in $|\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}}|$ . The probability that ZFB is successful can be expressed as

[TABLE]

The statistics of the PSF, i.e., $\mathbf{K}_{\mathrm{bl}}$ , can be used to determine the entries in $\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}}$ . Prior work in MRI [17] and partial 2D-DFT CS [29] has modelled $\mathbf{K}_{\mathrm{bl}}(r,c)$ as $\mathcal{N}_{\mathrm{c}}(0,\xi^{2})$ for any $(r,c)\neq(0,0)$ . It is important to note that the random variables $\mathbf{K}_{\mathrm{bl}}(r_{1},c_{1})$ and $\mathbf{K}_{\mathrm{bl}}(r_{2},c_{2})$ can be coupled for $(r_{1},c_{1})\neq(0,0)$ and $(r_{2},c_{2})\neq(0,0)$ . For instance, as $\mathbf{N}_{\Omega}$ is a real matrix, its inverse 2D-DFT must be conjugate symmetric [18], i.e., $\mathbf{K}_{\mathrm{bl}}(N-r,N-c)={\mathbf{K}^{\text{c}}}_{\mathrm{bl}}(r,c)$ for any $(r,c)\in\mathcal{I}_{N}\times\mathcal{I}_{N}$ . In this paper, we account for such dependencies to derive a lower bound on the probability of success with ZFB.

Now, we describe the statistics of the entries in $\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}}$ . For distinct coordinates $(r_{o},c_{o})$ and $(r_{1},c_{1})$ , we model $\mathbf{K}_{\mathrm{bl}}(r_{o},c_{o})$ , $\mathbf{K}_{\mathrm{bl}}(r_{1},c_{1})$ and $\mathbf{K}_{\mathrm{bl}}(r_{1}-r_{o},c_{1}-c_{o})$ as IID random variables $\mathsf{x}$ , $\mathsf{b}$ and $\mathsf{w}$ . To validate the independence assumption, we conducted empirical studies on the total variation distance between the joint distribution of $\mathsf{x}$ , $\mathsf{b}$ and $\mathsf{w}$ , and the product of their marginals. Our studies indicate that the variables can be considered “independent” except for the case when $(r_{1},c_{1})=(2r_{o},2c_{o})$ or $((r_{o}+c_{o})/2,(r_{o}+c_{o})/2)$ or $(-r_{o},-c_{o})$ . We ignore these three scenarios to assume that $\mathsf{x}$ , $\mathsf{b}$ and $\mathsf{w}$ are independent; such an assumption was also made in [29]. Each of the random variables $\mathsf{x}$ , $\mathsf{b}$ and $\mathsf{w}$ is distributed as $\mathcal{N}_{\mathrm{c}}(0,\xi^{2})$ . From the conjugate symmetry property, it follows that $\mathbf{K}_{\mathrm{bl}}(N-r_{o},N-c_{o})=\mathsf{x}^{\ast}$ . The $(k,\ell)^{\mathrm{th}}$ entry in $\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}}$ can be expressed as

[TABLE]

For a $2$ -sparse $\mathbf{S}$ defined in this section, it can be shown from (34) that $(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{0,0}$ , $(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{r_{o},c_{o}}$ and $(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{r_{1},c_{1}}$ are $1+a\mathsf{x}^{\ast}$ , $a+\mathsf{x}$ and $\mathsf{b}+a\mathsf{w}$ .

We now derive a lower bound on the probability of successful beam alignment for the $2-$ sparse channel. We define $\mathcal{Q}_{1}(\alpha,\beta)$ as the first order Marcum-Q function with parameters $\alpha$ and $\beta$ [30]. A lower bound on the beam alignment probability in (33) is derived in Theorem 3.

Theorem 3.

For a $2-$ sparse beamspace channel, the probability that zero filling-based beam alignment is successful can be lower bounded as

[TABLE]

Proof.

See Section VII-C. ∎

We show that the phase transition region that follows from our bound in (35) matches well with that observed from simulations. We consider a setting with $N=32$ , and a $2-$ sparse $\mathbf{S}$ with $\mathbf{S}(0,0)=1$ and $\mathbf{S}(r_{o},c_{o})=a$ for some $|a|<1$ . In our simulations, $(r_{o},c_{o})$ was chosen uniformly at random, and $M$ entries of $\mathbf{G}$ were sampled at random by applying $M$ random circulant shifts of a $32\times 32$ perfect binary array at the TX. Beam alignment is declared successful if the zero filling-based estimate, i.e., $|\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}}|$ , achieves its maximum at $(0,0)$ . The phase transition regions corresponding to the bound in (35) and the observed beam alignment probability are shown in Fig. 8a and Fig. 8b. The plots indicate that ZFB performs well with sub-Nyquist channel measurements. The proposed zero filling-based technique estimates a good one-sparse approximation of $\mathbf{S}$ that is consistent with the channel measurements. CS algorithms can estimate better sparse approximations of $\mathbf{S}$ , but require iterative optimization. ZFB can provide a reasonable beamformer with a single 2D-FFT, and is advantageous over CS-based beam alignment in terms of computational complexity.

V Simulations

In this section, we explain how beam alignment can be performed in a wideband system. Then, we describe the system and channel parameters used in our simulations. Finally, we present numerical results that show the performance of CS-based beam alignment and zero filling-based beam alignment using FALP.

V-A Beam alignment in a wideband system

We use a Golay sequence-based frame structure [31] and extend the beam alignment techniques in Sec. III-E and Sec. IV-C to a wideband system. For an elaborate description of the wideband extension, we refer the reader to [23]. We consider an $L$ tap wideband channel $\left\{\mathbf{H}[\ell]\right\}_{\ell=0}^{L-1}$ , where $\mathbf{H}[\ell]\in\mathbb{C}^{N\times N}$ . For each phase shift configuration in $\{\mathbf{P}[m]\}^{M}_{n=1}$ , the TX transmits a Golay complementary sequence of length $2N_{\mathrm{s}}$ followed by a guard interval of $L-1$ zeros. The use of guard interval prevents inter-frame interference and allows sufficient time to configure the phase shifters [5]. The RX uses the perfect autocorrelation property of complementary Golay sequences to obtain the channel impulse response (CIR) for each phase shift configuration. The CIR obtained in the $m^{\mathrm{th}}$ training slot is a noisy version of $\{\langle\mathbf{H}[\ell],\mathbf{P}[m]\rangle\}_{\ell=0}^{L-1}$ . Using several spatial channel projections, it is possible to reconstruct the wideband channel. We define $\mathbf{Y}_{\mathrm{blk}}\in\mathbb{C}^{M\times L}$ as a matrix that contains noisy wideband channel projections. The noise in $\mathbf{Y}_{\mathrm{blk}}$ is modelled using $\mathbf{V}_{s}\in\mathbb{C}^{M\times L}$ ; $\mathbf{Y}_{\mathrm{blk}}(m,\ell)$ is then

[TABLE]

As a spreading gain of $2N_{s}$ is achieved at the output of the Golay correlator, it can be observed that the entries in $\mathbf{V}_{s}$ are independent and identically distributed as $\mathcal{N}_{\mathrm{c}}(0,\sigma^{2}/(2N_{s}))$ .

In this paper, a single tap of the wideband channel is used to determine the transmit beamformer. Nevertheless, CS-based wideband channel estimation can also be performed at the expense of a higher complexity [4, 5]. The tap used to perform beam alignment is given by $\ell_{o}={\mathrm{argmax}}_{\ell}\|\mathbf{Y}_{\mathrm{blk}}(:,\ell)\|_{2}$ . The channel measurements considered in FALP are compressive spatial projections of $\mathbf{H}[\ell_{o}]$ , i.e., $\mathbf{y}=\mathbf{Y}_{\mathrm{blk}}(:,\ell_{o})$ . We use $\mathbf{H}[\ell_{\mathrm{opt}}]$ to denote the channel tap that has the maximum energy of the $L$ taps. In practice, $\mathbf{H}[\ell_{o}]$ can be different from $\mathbf{H}[\ell_{\mathrm{opt}}]$ as $\ell_{o}$ is determined from lower-dimensional spatial projections of the $L$ channel taps. The matrix $\hat{\mathbf{H}}[\ell_{o}]$ obtained using CS over $\mathbf{y}$ , can be considered as an equivalent narrowband channel estimate that is used for beam alignment. Note that our approach ignores the correlation between channel taps as it performs beam alignment based on a single tap. Developing better beam alignment strategies that account for such correlations is an interesting direction.

V-B System and channel description

We consider an analog beamforming system in Fig. 1, where the TX is equipped with a half-wavelength spaced UPA of size $32\times 32$ , i.e., $N=32$ . The resolution of phase shifters is set to $q=1$ bit. We consider a carrier frequency of $28\,\mathrm{GHz}$ and an operating bandwidth of $100\,\mathrm{MHz}$ , which corresponds to a symbol duration of $10\,\mathrm{ns}$ . The height of the TX and the RX were $5\,\mathrm{m}$ and $2\,\mathrm{m}$ in our simulation setup. The separation between the TX and the RX is set to $60\,\mathrm{m}$ . The transmit power at the TX is assumed to be $20\,\mathrm{dBm}$ . The RX is equipped with a single antenna element.

The mmWave channels in our simulations were derived from the QuaDRiga channel simulator for the 3GPP 38.901 UMi-NLoS scenario [32]. For a TX-RX separation of $60\,\mathrm{m}$ , the omnidirectional RMS delay spread was found to be less than $176\,\mathrm{ns}$ in more than $90\%$ of the channel realizations. Considering the leakage effects due to pulse shaping, the wideband channel is modelled using $L=64$ taps corresponding to a duration of $640\,\mathrm{ns}$ . The channel measurements for CS-based beam alignment are acquired using Golay complementary sequences along the time dimension, where each sequence is of length $N_{\mathrm{s}}=64$ . For our simulation settings, it can be observed that the duration of the guard interval that follows a Golay pair is $630\,\mathrm{ns}$ . The guard interval is sufficient enough to change the phase shift configuration at the TX, as phase shifters with a settling time of about $30\,\mathrm{ns}$ at $28\,\mathrm{GHz}$ have been reported in [33]. The standard 2D-DFT is used as a sparsifying dictionary for the spatial channel, unless otherwise stated. The simulation results we report are the averages over $100$ channel realizations.

V-C Performance evaluation

The 2D-CCS-based technique in Sec. III-E estimates the beamspace $\mathbf{X}$ , while ZFB in Sec. IV-C estimates a one-sparse approximation of $\mathbf{X}$ . Therefore, we define different metrics and benchmarks to evaluate the two techniques.

We describe the metrics and benchmarks used to evaluate 2D-CCS with FALP. In this paper, the error in the channel estimate is defined for a single tap, as CS is performed over the channel measurements corresponding to one tap. We use the OMP algorithm for CS-based channel estimation [27]. The stopping threshold and the maximum number of iterations used in OMP were $\sigma\sqrt{M/2N_{\mathrm{s}}}$ and $50$ [27]. The normalized squared error in the channel estimate is defined as

[TABLE]

Using the channel estimate $\hat{\mathbf{H}}[\ell_{o}]$ , the TX constructs the beamformer $\mathbf{F}_{\mathrm{CS}}$ according to the method in Sec. III-E. The number of elements in $\mathcal{B}$ was chosen as $K_{\mathcal{B}}=6$ . The effective wideband single-input single-output (SISO) channel after CS-based beam alignment is then $\{\langle\mathbf{H}[\ell],\mathbf{F}_{\mathrm{CS}}\rangle\}^{L-1}_{\ell=0}$ . The achievable rate is computed for the effective channel using the water filling algorithm.

We compare the proposed CS-based approach with the perfect channel state information (CSI) scenario in which the beamformer is computed using $\mathbf{H}[\ell_{\mathrm{opt}}]$ . We also evaluate random 2D-CCS, and standard CS with random IID phase shift matrices[5]. In random 2D-CCS, the base matrix $\mathbf{P}$ is chosen at random from the feasible set, i.e., $\mathbb{Q}^{N\times N}_{q}$ [8]. It is important to note the transformation $\mathbf{S}=\mathbf{X}\odot\mathbf{Z}$ may not be invertible for a random base matrix. As a result, the approach in Fig. 5, that solves for the beamspace through a partial 2D-DFT problem cannot be used for random 2D-CCS. The complexity of random 2D-CCS, however, is lower than standard CS. We use a low complexity version of OMP that exploits the 2D-convolutional structure of the training in random 2D-CCS.

Now, we define benchmarks for ZFB using FALP. As a one-sparse approximation of $\mathbf{X}$ is estimated in ZFB, its CS counterparts are those that estimate a single beamspace component. We consider the single step matching pursuit (MP) algorithm in [34] for two different training designs. The first design uses fewer 2D-circulant shifts of a random base matrix, while the second one is the common IID phase shift-based design [5]. We would like to highlight the fact that both ZFB and matching pursuit with 2D-circulant shifts of a random base matrix, exploit the 2D-FFT for fast estimation. The complexity of both these algorithms is lower than the one that uses the IID phase shift-based design, as shown in Fig. 9.

We compare the achievable rate obtained using the proposed techniques and the benchmarks. It can be observed from Fig. 10 that CS using the perfect array-based training in FALP achieves about $90\%$ of the perfect CSI rate, with just $120$ channel measurements. In contrast, exhaustive scan-based beam alignment with the 2D-DFT dictionary requires $1024$ channel measurements. While both FALP and random 2D-CCS use 2D-convolutional channel acquisition, it can be observed that the rate achieved with FALP is significantly larger than that with random 2D-CCS. Similarly, ZFB performs better than single step MP that uses 2D-circulant shifts of a random base matrix. The difference in performance between the two techniques is due to the choice of the base matrix, i.e., $\mathbf{P}$ . FALP uses a carefully designed base matrix, i.e., a perfect array, that satisfies the optimality conditions for efficient 2D-CCS. The loss in achievable rate when compared to the perfect CSI case is due to noise in the channel measurements and leakage effects in the beamspace representation.

The plot in Fig. 10 indicates that partial 2D-DFT CS-based beam alignment with FALP performs slightly better than standard CS with the IID random phase shift-based design. It is important to note, however, that the computational complexity of the CS algorithm in FALP is significantly lower than the standard approach that uses IID random phase shifts. As CS algorithms typically involve iterative optimization, it is useful to reduce the complexity of computations in each iteration without compromising the performance of the algorithm. The random 2D-CCS-based approach has a lower complexity, but results in a poor achievable rate. CS-based algorithms in FALP achieve the best of both worlds, i.e., better beam alignment at a reduced computational complexity.

In Fig. 11, we plot the channel estimation accuracy with CS-based techniques. The metric used in Fig. 11 is the mean of the normalized squared error in (37), i.e., $\mathbb{E}[\mathrm{NSE}]$ . We would like to point out that this metric is different from the usual normalized mean squared error (NMSE). We use mean of NSE, as the normalized mean squared error (NMSE) is dominated by poor channel realizations that result in a low received power. As the primary objective of our simulations is to compare different training solutions, we propose to use $\mathbb{E}[\mathrm{NSE}]$ , where the mean is taken over the NSE in dB. Such a metric is robust to fluctuations in the norm of the channel. The proposed CS-based beam alignment technique results in a slightly lower mean NSE than the random phase shift-based approach. Although the mean NSE approaches just $-7\,\mathrm{dB}$ , it can be observed from Fig. 10 that the achievable rate with the proposed approach is reasonable for $M=200$ . It is because beam alignment depends on how well the CS algorithm reconstructs the phase of the channel instead of the full channel. From Fig. 11, it can be observed that the use of an oversampled 2D-DFT dictionary, by a factor of $2$ along both the azimuth and elevation dimensions, results in a lower mean NSE. In such a case, CS algorithms have a higher complexity as the dimensionality of the underlying optimization problem is quadrupled.

The low complexity nature of CS algorithms in FALP makes it a promising solution for beam alignment in massive and low resolution phased arrays. In Fig. 12, we show that CS-based beam alignment using FALP works well for a wide range of antenna dimensions. We would like to mention that FALP can be applied in one-bit phased arrays only when a perfect binary array exists, i.e., when $N=2^{k}$ or $N=3\cdot 2^{k}$ , for a natural number $k$ . Designing perfect arrays for other combinations of phase shift resolution and antenna configurations, is an interesting research direction.

VI Conclusions and future work

In this paper, we have proposed FALP, a framework for compressive beam alignment or channel estimation using a perfect array-based codebook. The existence of perfect arrays over small alphabets allows fast and efficient compressed sensing in low resolution phased arrays through FALP. We have derived guarantees on channel reconstruction from sub-Nyquist sampling using FALP. In addition, we have shown how zero filling-based reconstruction in MRI can be used for rapid and low complexity beam alignment using a single 2D-FFT.

FALP establishes a new platform to translate CS ideas from MRI to channel estimation or beam alignment in mmWave systems. As CS matrices in FALP can be parameterized by trajectories, k-space trajectories in MRI can be used for beam alignment or channel estimation. Furthermore, the trajectories can be optimized so that the CS matrix is robust to hardware non-idealities like CFO, phase noise, and frame synchronization errors. Investigating the performance of existing MRI trajectories in the beam alignment problem, and designing new trajectories are interesting research directions.

VII Proof of Theorems

VII-A Proof of Theorem 1

For the conditions in Theorem 1, the reconstruction error in the masked beamspace obtained using the $\ell_{1}$ -norm optimization program in (17) can be bounded as

[TABLE]

The upper bound on the reconstruction error in (38) follows from Theorems 1 and 3 of [8]. Using the spectral mask equation, i.e., $\mathbf{S}=\mathbf{X}\odot\mathbf{Z}$ , and (38), we translate the guarantee on $\hat{\mathbf{S}}$ to the true beamspace estimate, i.e., $\hat{\mathbf{X}}$ .

We first obtain an upper bound on the $\ell_{1}$ approximation error of the masked beamspace in (38). We define $\Gamma\subseteq\mathcal{I}_{N}\times\mathcal{I}_{N}$ and its complement as $\Gamma^{\mathrm{c}}$ . The cardinality of $\Gamma$ is denoted by $|\Gamma|$ . With the definition of $\mathbf{N}_{\Omega}$ in (27), we express the sparse approximation error $\|\mathbf{S}-(\mathbf{S})_{k}\|_{1}$ as

[TABLE]

From the spectral mask equation, we have $\mathbf{S}(\ell,m)=\mathbf{X}(\ell,m)\mathbf{Z}(\ell,m)$ . As a result, $|\mathbf{S}(\ell,m)|\leq Z_{\mathrm{max}}|\mathbf{X}(\ell,m)|$ . The $\ell_{1}$ approximation error in (40) is upper bounded as

[TABLE]

Using the definition of the $\ell_{1}$ error in a $k$ sparse approximation of $\mathbf{X}$ in (41), we have $\|\mathbf{S}-(\mathbf{S})_{k}\|_{1}\leq Z_{\mathrm{max}}\|\mathbf{X}-(\mathbf{X})_{k}\|_{1}$ .

Now, we derive a lower bound on the error in the masked beamspace estimate in terms of the error in the true beamspace estimate. The squared error in $\hat{\mathbf{S}}$ is $\|\mathbf{S}-\hat{\mathbf{S}}\|_{F}^{2}=\sum_{\ell,m}|\mathbf{Z}(\ell,m)(\mathbf{X}(\ell,m)-\mathbf{\hat{X}}(\ell,m))|^{2}$ . By definition, $|\mathbf{Z}(\ell,m)|\geq Z_{\mathrm{min}}$ for every $\ell$ and $m$ . Therefore, the error in the masked beamspace estimate is lower bounded as $\|\mathbf{S}-\hat{\mathbf{S}}\|_{F}\geq Z_{\mathrm{min}}\|\mathbf{X}-\hat{\mathbf{X}}\|_{F}$ . The result in Theorem 1 follows by using $\|\mathbf{S}-(\mathbf{S})_{k}\|_{1}\leq Z_{\mathrm{max}}\|\mathbf{X}-(\mathbf{X})_{k}\|_{1}$ and $\|\mathbf{S}-\hat{\mathbf{S}}\|_{F}\geq Z_{\mathrm{min}}\|\mathbf{X}-\hat{\mathbf{X}}\|_{F}$ in (38).

VII-B Proof of Theorem 19

The spectral mask $\mathbf{Z}$ is a scaled inverse 2D-DFT of $\mathbf{P}_{\mathrm{FC}}$ , i.e., $\mathbf{Z}=N\mathbf{U}^{\ast}_{N}\mathbf{P}_{\mathrm{FC}}\mathbf{U}^{\ast}_{N}$ . It can be observed that the unimodularity of $\mathbf{Z}$ is equivalent to the unimodularity of $\mathbf{Z}_{\mathrm{FC}}$ , a flipped and conjugated version of $\mathbf{Z}$ . Using the properties of the Fourier transform, it can be shown that $\mathbf{Z}_{\mathrm{FC}}=N\mathbf{U}_{N}\mathbf{P}\mathbf{U}_{N}$ [18]. Now, we use the result that $N\mathbf{U}_{N}\mathbf{P}\mathbf{U}_{N}$ is unimodular if and only if $\mathbf{P}$ has perfect periodic spatial autocorrelation [35, Sec. III-A]. The result in Theorem 19 follows by putting these observations together.

VII-C Proof of Theorem 3

From (33), the probability that $|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})|$ achieves maximum at $(0,0)$ can be expressed as

[TABLE]

Note that $\mathbf{S}$ is non-zero only at the locations $(0,0)$ and $(r_{o},c_{o})$ . In this section, we derive closed form expressions for $\mathrm{Pr}(|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{0,0}|\leq|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{r_{o},c_{o}}|)$ and $\mathrm{Pr}(|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{0,0}|\leq|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{r_{1},c_{1}}|)$ for some $(r_{1},c_{1})\notin\{(0,0),(r_{o},c_{o})\}$ . We then derive a lower bound on $p$ in (42).

The matrix $|\mathbf{S}|$ is $1$ at $(0,0)$ , $|a|$ at $(r_{o},c_{o})$ , and [math] at the remaining $N^{2}-2$ locations. Although $|\mathbf{S}|$ is maximum at $(0,0)$ for $|a|<1$ , it is possible that $|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})|$ achieves maximum at $(r_{o},c_{o})$ . We define $p_{1}=\mathrm{Pr}(|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{0,0}|\leq|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{r_{o},c_{o}}|)$ . Using (34), $p_{1}$ can be expressed as

[TABLE]

The inequality $|1+a\mathsf{x}^{\ast}|\leq|a+\mathsf{x}|$ is equivalent to $1+|a|^{2}|\mathsf{x}|^{2}+2\mathrm{Re}\{a\mathsf{x}^{\ast}\}\leq|a|^{2}+|\mathsf{x}|^{2}+2\mathrm{Re}\{a\mathsf{x}^{\ast}\}$ and can be simplified to $(1-|a|^{2})|\mathsf{x}|^{2}\geq 1-|a|^{2}$ . For any $|a|<1$ , $p_{1}$ is given by

[TABLE]

As $\mathsf{x}\sim\mathcal{N}_{\mathrm{c}}(0,\xi^{2})$ for a large $N$ , $2|\mathsf{x}|^{2}/\xi^{2}$ follows the central $\chi^{2}$ -distribution of order $2$ . Therefore, $p_{1}$ in (45) can be expressed in terms of the Marcum Q-function [30] as

[TABLE]

An interesting observation from (46) is that $p_{1}$ is independent of the strength of the second best path, i.e., $|a|$ .

Now, we derive the probability that $|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{0,0}|\leq|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{r_{1},c_{1}}|$ for some $(r_{1},c_{1})$ such that $\mathbf{S}(r_{1},c_{1})=0$ . We define $p_{2}=\mathrm{Pr}(|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{0,0}|\leq|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{r_{1},c_{1}}|)$ . Using (34), $p_{2}$ can be expressed as

[TABLE]

We assume that the variables $\mathsf{x}$ , $\mathsf{b}$ and $\mathsf{w}$ are independent. With this assumption, it can be observed that $\mathsf{b}+a\mathsf{w}\sim\mathcal{N}_{\mathrm{c}}(0,(1+|a|^{2})\xi^{2})$ and $1+a\mathsf{x}^{\ast}\sim\mathcal{N}_{\mathrm{c}}(1,|a|^{2}\xi^{2})$ . We use $\chi^{2}_{{}_{\mathrm{C}}}$ and $\chi^{2}_{{}_{\mathrm{NC}}}$ to denote the central and non-central chi-squared random variables of degree $2$ [30]. The non-centrality parameter of $\chi^{2}_{{}_{\mathrm{NC}}}$ is set as $\lambda_{{}_{\mathrm{NC}}}=2/{|a|^{2}\xi^{2}}$ . Using these definitions, it can be shown that $|\mathsf{b}+a\mathsf{w}|^{2}\sim\xi^{2}(1+|a|^{2})\chi^{2}_{{}_{\mathrm{C}}}/2$ and $|1+a\mathsf{x}^{\ast}|^{2}\sim|a|^{2}\xi^{2}\chi^{2}_{{}_{\mathrm{NC}}}/2$ . We use $f(t)$ to denote the probability density of $\chi^{2}_{{}_{\mathrm{NC}}}$ at $t$ . The probability in (47) is then

[TABLE]

We use the complementary cumulative distribution function of $\chi^{2}_{{}_{\mathrm{C}}}$ to express (49) as

[TABLE]

It can be observed from (50) that $p_{2}$ is the moment generating function of $\chi^{2}_{{}_{\mathrm{NC}}}$ [30] evaluated at $-|a|^{2}/(2+2|a|^{2})$ . The probability in (50) is then

[TABLE]

As a sanity check, it can be observed that $p_{2}=p_{1}$ for $a=0$ .

We use $p_{1}$ and $p_{2}$ in (46) and (51), to derive a lower bound on (42). The probability that $|\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}}|$ does not achieve its maximum at $(0,0)$ can be upper bounded using a union bound as

[TABLE]

The right hand side in (52) comprises of two distinct terms that are $p_{1}=\mathrm{Pr}(|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{0,0}|\leq|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{r_{o},c_{o}}|)$ , and $p_{2}=\mathrm{Pr}(|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{0,0}|\leq|(\mathbf{S}\circledast\mathbf{K}_{\mathrm{bl}})_{r_{1},c_{1}}|)$ for any $(r_{1},c_{1})\notin\{(0,0),(r_{o},c_{o})\}$ . As there are $N^{2}-2$ terms of the second kind, we can write

[TABLE]

The result in Theorem 3 follows by substituting (46) and (51) in (53).

Bibliography35

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] S. Rangan, T. S. Rappaport, and E. Erkip, “Millimeter-wave cellular wireless networks: Potentials and challenges,” in Proc. of the IEEE , vol. 102, no. 3, pp. 366–385, 2014.
2[2] R. W. Heath, N. Gonzalez-Prelcic, S. Rangan, W. Roh, and A. M. Sayeed, “An overview of signal processing techniques for millimeter wave MIMO systems,” IEEE J. Sel. Topics Signal Process. , vol. 10, no. 3, pp. 436–453, 2016.
3[3] D. L. Donoho, “Compressed sensing,” IEEE Trans. on Inform. Theory , vol. 52, no. 4, pp. 1289–1306, 2006.
4[4] Z. Marzi, D. Ramasamy, and U. Madhow, “Compressive channel estimation and tracking for large arrays in mm-wave picocells,” IEEE J. Sel. Topics Signal Process. , vol. 10, no. 3, pp. 514–527, 2016.
5[5] J. Rodríguez-Fernández, N. González-Prelcic, K. Venugopal, and R. W. Heath, “Frequency-domain compressive channel estimation for frequency-selective hybrid millimeter wave MIMO systems,” IEEE Trans. on Wireless Commun. , vol. 17, no. 5, pp. 2946–2960, 2018.
6[6] E. J. Candes, “The restricted isometry property and its implications for compressed sensing,” Comptes rendus mathematique , vol. 346, no. 9-10, pp. 589–592, 2008.
7[7] N. J. Myers, A. Mezghani, and R. W. Heath, “Swift-Link: A compressive beam alignment algorithm for practical mm Wave radios,” IEEE Trans. on Signal Process. , vol. 67, no. 4, pp. 1104–1119, 2019.
8[8] F. Krahmer and H. Rauhut, “Structured random measurements in signal processing,” GAMM-Mitteilungen , vol. 37, no. 2, pp. 217–238, 2014.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

FALP: Fast beam alignment in mmWave systems with low-resolution phase shifters

Abstract

Index Terms:

I Introduction

II System and channel model

II-A System model

II-B Channel model

III Convolutional CS in planar arrays

III-A Motivation for 2D-CCS

III-B Transforming convolutional CS to partial 2D-DFT CS

III-C Conditions on the base matrix for efficient CS

Theorem 1**.**

Proof.

III-D Perfect arrays as ideal base matrices

Theorem 2**.**

Proof.

III-E Perfect array-based compressive beam alignment in FALP

IV An MRI-inspired approach for ultra-low complexity beam alignment

IV-A Connection between CS in MRI and CS using FALP

IV-B Zero filling-based reconstruction: From MRI to beam alignment

IV-C Beam alignment with the zero filling-based estimate

Theorem 3**.**

Proof.

V Simulations

V-A Beam alignment in a wideband system

V-B System and channel description

V-C Performance evaluation

VI Conclusions and future work

VII Proof of Theorems

VII-A Proof of Theorem 1

VII-B Proof of Theorem 19

VII-C Proof of Theorem 3

Theorem 1.

Theorem 2.

Theorem 3.