Rate-Splitting Unifying SDMA, OMA, NOMA, and Multicasting in MISO Broadcast Channel: A Simple Two-User Rate Analysis

Bruno Clerckx, Yijie Mao, Robert Schober, and H. Vincent Poor

arXiv:1906.04474 · cs.IT · June 12, 2019

TL;DR

This paper demonstrates that rate-splitting with SIC in a two-user MISO broadcast channel unifies and generalizes SDMA, OMA, NOMA, and multicasting, providing a flexible framework that adapts to channel conditions.

Contribution

It introduces a simple rate-splitting approach that encompasses multiple transmission strategies, showing how it outperforms and unifies existing methods based on channel disparity and directions.

Findings

1. RS unifies SDMA, OMA, NOMA, and multicasting.
2. RS outperforms traditional strategies under certain channel conditions.
3. Analytical characterization of sum-rate performance.

Abstract

Considering a two-user multi-antenna Broadcast Channel, this paper shows that linearly precoded Rate-Splitting (RS) with Successive Interference Cancellation (SIC) receivers is a flexible framework for non-orthogonal transmission that generalizes, and subsumes as special cases, four seemingly different strategies, namely Space Division Multiple Access (SDMA) based on linear precoding, Orthogonal Multiple Access (OMA), Non-Orthogonal Multiple Access (NOMA) based on linearly precoded superposition coding with SIC, and physical-layer multicasting. The paper studies the sum-rate and shows analytically how RS unifies, outperforms, and specializes to SDMA, OMA, NOMA, and multicasting as a function of the disparity of the channel strengths and the angle between the user channel directions.

Tables

t^{\star}=\min\left(-\frac{a}{2b}-\frac{c}{2d},1\right)=\min\left(\frac{\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{2\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}-\rho}+\frac{1}{2\rho}\left(\frac{1}{\left\|\mathbf{h}_{1}\right\|^{2}}+\frac{1}{\left\|\mathbf{h}_{2}\right\|^{2}}\right)\left(\frac{2\rho-2\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{2\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}-\rho}\right)\frac{1}{P},1\right).

(14)

Equations

\mathbf{x}=\mathbf{p}_{\mathrm{c}}s_{\mathrm{c}}+\mathbf{p}_{1}s_{1}+\mathbf{p}_{2}s_{2}.

y_{k}=\mathbf{h}_{k}^{H}\mathbf{x}+n_{k},\quad k=1,2,

R_{\mathrm{c}}=\min\left(\log_{2}\left(1+\frac{\left|\mathbf{h}_{1}^{H}\mathbf{p}_{\mathrm{c}}\right|^{2}}{1+\left|\mathbf{h}_{1}^{H}\mathbf{p}_{1}\right|^{2}+\left|\mathbf{h}_{1}^{H}\mathbf{p}_{2}\right|^{2}}\right),\log_{2}\left(1+\frac{\left|\mathbf{h}_{2}^{H}\mathbf{p}_{\mathrm{c}}\right|^{2}}{1+\left|\mathbf{h}_{2}^{H}\mathbf{p}_{1}\right|^{2}+\left|\mathbf{h}_{2}^{H}\mathbf{p}_{2}\right|^{2}}\right)\right),

R_{k}=\log_{2}\left(1+\frac{\left|\mathbf{h}_{k}^{H}\mathbf{p}_{k}\right|^{2}}{1+\left|\mathbf{h}_{k}^{H}\mathbf{p}_{j}\right|^{2}}\right),\quad k\neq j.

\max_{\mathbf{p}_{\mathrm{c}}}\min\left(\frac{\left|\mathbf{h}_{1}^{H}\mathbf{p}_{\mathrm{c}}\right|^{2}}{1+\left|\mathbf{h}_{1}^{H}\mathbf{p}_{1}\right|^{2}},\frac{\left|\mathbf{h}_{2}^{H}\mathbf{p}_{\mathrm{c}}\right|^{2}}{1+\left|\mathbf{h}_{2}^{H}\mathbf{p}_{2}\right|^{2}}\right).

\max_{\mathbf{p}_{\mathrm{c}}}\min\left(\big{|}\tilde{\mathbf{h}}_{1}^{H}\mathbf{p}_{\mathrm{c}}\big{|}^{2},\big{|}\tilde{\mathbf{h}}_{2}^{H}\mathbf{p}_{\mathrm{c}}\big{|}^{2}\right).

\mathbf{f}_{\mathrm{c}}=\frac{1}{\lambda}\left(\mu_{1}\tilde{\mathbf{h}}_{1}+\mu_{2}\tilde{\mathbf{h}}_{2}e^{-j\angle\alpha_{12}}\right),

\lambda=\frac{\alpha_{11}\alpha_{22}-\left|\alpha_{12}\right|^{2}}{\alpha_{11}+\alpha_{22}-2\left|\alpha_{12}\right|},

\left[\begin{array}[]{c}\mu_{1}\\ \mu_{2}\end{array}\right]=\frac{1}{\alpha_{11}+\alpha_{22}-2\left|\alpha_{12}\right|}\left[\begin{array}[]{c}\alpha_{22}-\left|\alpha_{12}\right|\\ \alpha_{11}-\left|\alpha_{12}\right|\end{array}\right],

\left[\begin{array}[]{cc}\alpha_{11}&\alpha_{12}\\ \alpha_{12}^{*}&\alpha_{22}\end{array}\right]=\left[\begin{array}[]{c}\tilde{\mathbf{h}}_{1}^{H}\\ \tilde{\mathbf{h}}_{2}^{H}\end{array}\right]\left[\begin{array}[]{cc}\tilde{\mathbf{h}}_{1}&\tilde{\mathbf{h}}_{2}\end{array}\right].

R_{\mathrm{s}}=\log_{2}\left(\gamma_{1}^{2}\right)+\log_{2}\left(\gamma_{2}^{2}+\big{|}\mathbf{h}_{2}^{H}\mathbf{p}_{\mathrm{c}}\big{|}^{2}\right).

P_{k}=\max\left(\mu-\frac{1}{\left\|\mathbf{h}_{k}\right\|^{2}\rho},0\right),\quad k=1,2,

R_{\mathrm{s}}=\log_{2}\left(ac+(ad+bc)t+bdt^{2}\right),

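\max_{\mathbf{f}_{\mathrm{c}}}\min\left(\left|\tilde{\mathbf{h}}_{1}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2},\left|\tilde{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}\right)\Leftrightarrow\max_{\mathbf{f}_{\mathrm{c}}}\min\left(\frac{\left|\mathbf{h}_{1}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{\gamma_{1}^{2}},\frac{\left|\mathbf{h}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{\gamma_{2}^{2}}\right)\Leftrightarrow\max_{\mathbf{f}_{\mathrm{c}}}\min\left(\frac{\left\|\mathbf{h}_{1}\right\|^{2}\left\|\mathbf{h}_{2}\right\|^{2}\left|\bar{\mathbf{h}}_{1}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{f(t)},\frac{\left\|\mathbf{h}_{1}\right\|^{2}\left\|\mathbf{h}_{2}\right\|^{2}\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{f(t)}\right)\Leftrightarrow\max_{\mathbf{f}_{\mathrm{c}}}\min\left(\left|\bar{\mathbf{h}}_{1}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2},\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}\right)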

R_{\mathrm{s}}\stackrel{{\scriptstyle P\nearrow}}{{=}}\log_{2}\big{(}\left\|\mathbf{h}_{1}\right\|^{2}\!\rho\big{)}+2\log_{2}\left(P\right)+\log_{2}\left(et^{2}+ft\right)

t^{\star}=\min\left(\frac{-f}{2e},1\right)=\min\left(\frac{\big{|}\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\big{|}^{2}}{2\big{|}\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\big{|}^{2}-\rho},1\right),

\Delta R_{\mathrm{s}}=\left.R_{\mathrm{s}}\right|_{t^{\star}}-\left.R_{\mathrm{s}}\right|_{t=1}=\log_{2}\left(\frac{\big{|}\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\big{|}^{4}}{\rho\left(2\big{|}\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\big{|}^{2}-\rho\right)}\right).

Full text

Bruno Clerckx, Yijie Mao, Robert Schober, and H. Vincent Poor

B. Clerckx is with Imperial College London, London SW7 2AZ, UK (email: [email protected]). Y. Mao is with The University of Hong Kong, Hong Kong, China (email: [email protected]). R. Schober is with University of Erlangen-Nuremberg, 91058 Erlangen, Germany (email: [email protected]). H. V. Poor is with Princeton University, Princeton, NJ 08544 USA (email: [email protected]). This work has been partially supported by the EPSRC of the UK under grant EP/N015312/1.

Index Terms:

Rate-splitting, multi-antenna broadcast channel, rate analysis, SDMA, OMA, NOMA, multicasting

I Introduction

Linearly precoded Rate-Splitting (RS) with Successive Interference Cancellation (SIC) receivers has recently appeared as a powerful non-orthogonal transmission and robust interference management strategy for multi-antenna wireless networks [1]. Though originally introduced for the two-user Single-Input Single-Output Interference Channel (IC) in [2], RS has become an underpinning communication-theoretic strategy to tackle modern interference-related problems and has recently been successfully investigated in several Multiple-Input Single-Output (MISO) Broadcast Channel (BC) settings, namely, unicast-only transmission with perfect Channel State Information at the Transmitter (CSIT) [3, 4] and imperfect CSIT [5, 6, 7, 8, 9, 10, 11, 12, 13], (multigroup) multicast-only transmission [14], as well as superimposed unicast and multicast transmission [15]. Results highlight that RS provides significant benefits in terms of spectral efficiency [3, 6, 7, 9, 13, 14, 15], energy efficiency [4], robustness [8], and CSI feedback overhead reduction [6, 12] over conventional strategies used in LTE-A/5G that rely on fully treating interference as noise (e.g., conventional multi-user linear precoding and Space Division Multiple Access, SDMA) or fully decoding interference (e.g., power-domain Non-Orthogonal Multiple Access, NOMA [16]). The key behind realizing those benefits is the ability of RS, through splitting messages into common and private parts, to partially decode interference and partially treat interference as noise. Additionally, RS is an enabler for powerful multiple access designs that subsume SDMA and NOMA as special cases and outperform them both for a wide range of network loads (underloaded/overloaded regimes) and user deployments (for diverse channel directions/strengths and CSIT qualities) [3].

In this work, we build upon this last observation and show, by considering a simple two-user MISO BC with perfect CSIT, that RS is a flexible framework for non-orthogonal transmission that generalizes, and subsumes as special cases, four seemingly very different strategies, namely SDMA based on linear precoding, Orthogonal Multiple Access (OMA) where a resource is fully taken up by a single user, power-domain NOMA based on linearly precoded superposition coding with SIC, and physical-layer multicasting. This is the first paper to show analytically how RS unifies, outperforms, and specializes to SDMA, OMA, NOMA, and multicasting as a function of the disparity of the user channel strengths and the angle between the user channel directions. In that respect, the paper differs from, and complements, past works that analytically studied the rate performance of RS with imperfect CSIT [6, 9, 12] or looked at RS from an optimization perspective [3, 7, 8].

Notation: $|.|$ and $\left\|.\right\|$ refer to the absolute value of a scalar and the $l_{2}$ -norm of a vector. $\mathbf{I}$ is the identity matrix. $\mathbf{a}^{H}$ denotes the Hermitian transpose of vector $\mathbf{a}$ . I.i.d. stands for independent and identically distributed. $\mathcal{CN}(0,\sigma^{2})$ denotes the Circularly Symmetric Complex Gaussian distribution with zero mean and variance $\sigma^{2}$ . $\sim$ stands for “distributed as”.

II System Model: Rate-Splitting Architecture

We consider a MISO BC consisting of one transmitter with $n_{t}$ antennas and two single-antenna users. As per Fig. 1, the architecture relies on rate-splitting of two messages $W_{1}$ and $W_{2}$ intended for user-1 and user-2, respectively. To that end, the message $W_{k}$ of user- $k$ is split into a common part $W_{\mathrm{c},k}$ and a private part $W_{\mathrm{p},k}$ . The common parts $W_{\mathrm{c},1},W_{\mathrm{c},2}$ of both users are combined into the common message $W_{\mathrm{c}}$ , which is encoded into the common stream $s_{\mathrm{c}}$ using a codebook shared by both users. Hence, $s_{\mathrm{c}}$ is a common stream required to be decoded by both users, and contains parts of the messages $W_{1}$ and $W_{2}$ intended for user-1 and user-2, respectively. The private parts $W_{\mathrm{p},1}$ and $W_{\mathrm{p},2}$ , respectively containing the remaining parts of the messages $W_{1}$ and $W_{2}$ , are independently encoded into the private stream $s_{1}$ for user-1 and $s_{2}$ for user-2. Out of the two messages $W_{1}$ and $W_{2}$ , three streams $s_{\mathrm{c}}$ , $s_{1}$ , and $s_{2}$ are therefore created. The streams are linearly precoded such that the transmit signal is given by

$$\mathbf{x}=\mathbf{p}_{\mathrm{c}}s_{\mathrm{c}}+\mathbf{p}_{1}s_{1}+\mathbf{p}_{2}s_{2}. \tag{1}$$

Defining $\mathbf{s}=[s_{\mathrm{c}},s_{1},s_{2}]^{T}$ and assuming that $\mathbb{E}[\mathbf{s}\mathbf{s}^{H}]=\mathbf{I}$ , the average transmit power constraint is written as $P_{\mathrm{c}}+P_{1}+P_{2}\leq P$ where $P_{\mathrm{c}}=\left\|\mathbf{p}_{\mathrm{c}}\right\|^{2}$ and $P_{k}=\left\|\mathbf{p}_{k}\right\|^{2}$ with $k=1,2$ . We refer to $\mathbf{h}_{k}$ as the channel vector of user- $k$ , such that the signal received at user- $k$ can be written as

$$y_{k}=\mathbf{h}_{k}^{H}\mathbf{x}+n_{k},\quad k=1,2, \tag{2}$$

where $n_{k}\sim\mathcal{CN}(0,1)$ is Additive White Gaussian Noise (AWGN). We further write the channel vectors as the product of their norm and direction as $\mathbf{h}_{k}=\left\|\mathbf{h}_{k}\right\|\bar{\mathbf{h}}_{k}$ , and assume without loss of generality $\left\|\mathbf{h}_{1}\right\|\geq\left\|\mathbf{h}_{2}\right\|$ . We also assume perfect CSI at the transmitter and the receivers.

At each user- $k$ , the common stream $s_{\mathrm{c}}$ is first decoded into $\widehat{W}_{\mathrm{c}}$ by treating the interference from the private streams as noise. Using SIC, $\widehat{W}_{\mathrm{c}}$ is re-encoded, precoded, and subtracted from the received signal, such that user- $k$ can decode its private stream $s_{k}$ into $\widehat{W}_{\mathrm{p},k}$ by treating the remaining interference from the other private stream as noise. User- $k$ reconstructs the original message by extracting $\widehat{W}_{\mathrm{c},k}$ from $\widehat{W}_{\mathrm{c}}$ , and combining $\widehat{W}_{\mathrm{c},k}$ with $\widehat{W}_{\mathrm{p},k}$ into $\widehat{W}_{k}$ . Assuming Gaussian signalling and ideal SIC, the rate of the common stream is given by

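$$R_{\mathrm{c}}=\min\left(\log_{2}\left(1+\frac{\left|\mathbf{h}_{1}^{H}\mathbf{p}_{\mathrm{c}}\right|^{2}}{1+\left|\mathbf{h}_{1}^{H}\mathbf{p}_{1}\right|^{2}+\left|\mathbf{h}_{1}^{H}\mathbf{p}_{2}\right|^{2}}\right),\log_{2}\left(1+\frac{\left|\mathbf{h}_{2}^{H}\mathbf{p}_{\mathrm{c}}\right|^{2}}{1+\left|\mathbf{h}_{2}^{H}\mathbf{p}_{1}\right|^{2}+\left|\mathbf{h}_{2}^{H}\mathbf{p}_{2}\right|^{2}}\right)\right), \tag{3}$$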

and the rates of the two private streams are obtained as

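$$R_{k}=\log_{2}\left(1+\frac{\left|\mathbf{h}_{k}^{H}\mathbf{p}_{k}\right|^{2}}{1+\left|\mathbf{h}_{k}^{H}\mathbf{p}_{j}\right|^{2}}\right),\quad k\neq j. \tag{4}$$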

The rate of user- $k$ is given by $R_{k}+R_{\mathrm{c},k}$ where $R_{\mathrm{c},k}$ is the rate of the common part of the $k$ th user’s message, i.e., $W_{\mathrm{c},k}$ , and it satisfies $R_{\mathrm{c},1}+R_{\mathrm{c},2}=R_{\mathrm{c}}$ . The sum-rate is therefore simply written as $R_{\mathrm{s}}=\sum_{k=1,2}\left(R_{k}+R_{\mathrm{c},k}\right)=R_{\mathrm{c}}+R_{1}+R_{2}$ .
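The rate expressions above can be evaluated numerically for any choice of precoders. The following is a minimal sketch, assuming unit-variance noise as in the system model; the helper names `inner` and `rs_rates` are illustrative and do not come from the paper.

```python
import math

def inner(a, b):
    """Hermitian inner product a^H b for vectors given as lists of numbers."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def rs_rates(h1, h2, pc, p1, p2):
    """Return (R_c, R_1, R_2, R_s) in bit/s/Hz for unit-variance noise."""
    def gain(h, p):
        return abs(inner(h, p)) ** 2

    # Common stream: decoded first by both users, private streams act as noise.
    sinr_c = [gain(h, pc) / (1.0 + gain(h, p1) + gain(h, p2)) for h in (h1, h2)]
    r_c = min(math.log2(1.0 + s) for s in sinr_c)

    # Private streams: decoded after the common stream has been cancelled.
    r_1 = math.log2(1.0 + gain(h1, p1) / (1.0 + gain(h1, p2)))
    r_2 = math.log2(1.0 + gain(h2, p2) / (1.0 + gain(h2, p1)))
    return r_c, r_1, r_2, r_c + r_1 + r_2

# Hypothetical example: orthogonal channels and no common stream.
rates = rs_rates([1, 0], [0, 1], [0, 0], [math.sqrt(3), 0], [0, 1])
```

Taking the minimum over the two users in `r_c` reflects the requirement that both users decode the common stream.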

By adjusting the message split and the power allocation to the common stream and the private streams, RS enables the decoding of part of the interference (thanks to the presence of the common stream) and treating the remaining part (the private stream of the other user) as noise. Therefore, the introduced RS architecture allows the exploration of a wide range of strategies. Among those strategies, there are four extreme cases, namely, SDMA, NOMA, OMA, and physical-layer multicasting. Indeed, SDMA is obtained by allocating no power to the common stream ( $P_{\mathrm{c}}=0$ ) such that $W_{k}$ is encoded directly into $s_{k}$ . No interference is decoded at the receiver using the common message, and the interference between $s_{1}$ and $s_{2}$ is fully treated as noise. NOMA is obtained by encoding $W_{2}$ entirely into $s_{\mathrm{c}}$ (i.e., $W_{\mathrm{c}}=W_{2}$ ) and $W_{1}$ into $s_{1}$ , and turning off $s_{2}$ ( $P_{2}=0$ ). In this way, user-1 fully decodes the interference created by the message of user-2. OMA is a sub-strategy of SDMA and NOMA and is obtained when only user-1 (with the stronger channel gain) is scheduled ( $P_{\mathrm{c}}=0,P_{2}=0$ ). Multicasting is obtained by combining and encoding both $W_{1}$ and $W_{2}$ into $s_{\mathrm{c}}$ , and turning off $s_{1}$ and $s_{2}$ ( $P_{1}=0,P_{2}=0$ ). The mapping of the messages to the streams is further illustrated in Fig. 2.
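The four extreme cases described above differ only in which streams receive power. As a rough illustration, the sketch below labels a power allocation accordingly; the function name `rs_special_case`, the tolerance, and the `"other"` label for allocations not discussed in the text are assumptions made for this example.

```python
def rs_special_case(p_c, p_1, p_2, tol=1e-12):
    """Label a power allocation (P_c, P_1, P_2), with user-1 the stronger user."""
    on_c, on_1, on_2 = (p > tol for p in (p_c, p_1, p_2))
    if on_c and not on_1 and not on_2:
        return "multicasting"   # only the common stream is active
    if not on_c and on_1 and not on_2:
        return "OMA"            # only user-1 is scheduled
    if not on_c and on_1 and on_2:
        return "SDMA"           # two private streams, no common stream
    if on_c and on_1 and not on_2:
        return "NOMA"           # user-2 is served through the common stream
    if on_c and on_1 and on_2:
        return "RS"             # all three streams are active
    return "other"              # allocations not covered by the text above
```

For instance, `rs_special_case(0.0, 60.0, 40.0)` is labeled as SDMA, while any allocation with all three powers strictly positive is labeled as RS.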

Remark 1

Recall that the maximum number of interference-free streams (also called Degrees-of-Freedom, DoF) in a two-user MISO BC is equal to 2. From the above system model, both SDMA and RS can achieve such a DoF by precoding $s_{1}$ and $s_{2}$ using zero-forcing (ZF). On the other hand, OMA, NOMA, and multicasting can achieve at most a DoF of 1 (irrespective of how the precoders and power allocation are optimized), which leads to a rate loss at high Signal-to-Noise Ratio (SNR) in general multi-antenna settings, as already highlighted in [14, 3].

III Sum-Rate Analysis

Our objective is to derive tractable and insightful sum-rate expressions to illustrate the flexibility of RS in unifying SDMA, OMA, NOMA, and multicasting. To that end, we do not optimize the precoding directions jointly with the power allocation as in [7, 3] but rather fix the precoding directions using ZF for the private streams, and adjust the power allocation among all the streams (simulations in Section IV show that the conclusions drawn with the simple precoders also hold with the numerically optimized precoders of [7, 3]). This leads to $\left|\mathbf{h}_{2}^{H}\mathbf{p}_{1}\right|=0$ , $\left|\mathbf{h}_{1}^{H}\mathbf{p}_{2}\right|=0$ , and $\left|\mathbf{h}_{k}^{H}\mathbf{p}_{k}\right|^{2}=\left\|\mathbf{h}_{k}\right\|^{2}\rho P_{k}$ , $k=1,2$ , where $\rho=1-\left|\bar{\mathbf{h}}_{1}^{H}\bar{\mathbf{h}}_{2}\right|^{2}$ ( $\rho=0$ corresponds to aligned channels and $\rho=1$ to orthogonal channels). The precoder of the common stream is then to be designed such that

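$$\max_{\mathbf{p}_{\mathrm{c}}}\min\left(\frac{\left|\mathbf{h}_{1}^{H}\mathbf{p}_{\mathrm{c}}\right|^{2}}{1+\left|\mathbf{h}_{1}^{H}\mathbf{p}_{1}\right|^{2}},\frac{\left|\mathbf{h}_{2}^{H}\mathbf{p}_{\mathrm{c}}\right|^{2}}{1+\left|\mathbf{h}_{2}^{H}\mathbf{p}_{2}\right|^{2}}\right). \tag{5}$$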
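The ZF properties used above can be checked numerically. The sketch below builds each private precoder direction by projecting a user's channel onto the subspace orthogonal to the other user's channel, which is one standard way of obtaining a ZF direction and is assumed here rather than taken from the paper; the helper names and the example channels are illustrative.

```python
import cmath
import math

def inner(a, b):
    """Hermitian inner product a^H b."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def rho(h1, h2):
    """rho = 1 - |h1_bar^H h2_bar|^2, computed from unnormalized channels."""
    return 1.0 - abs(inner(h1, h2)) ** 2 / (inner(h1, h1).real * inner(h2, h2).real)

def zf_direction(h_own, h_other):
    """Unit-norm direction of h_own with its component along h_other removed."""
    scale = inner(h_other, h_own) / inner(h_other, h_other).real
    q = [x - scale * y for x, y in zip(h_own, h_other)]
    n = math.sqrt(inner(q, q).real)
    return [x / n for x in q]

# Hypothetical two-antenna example with a 90 degree phase offset on h2.
h1 = [1 / math.sqrt(2), 1 / math.sqrt(2)]
h2 = [0.5 / math.sqrt(2), 0.5 * cmath.exp(1j * math.pi / 2) / math.sqrt(2)]
q1 = zf_direction(h1, h2)   # direction of p_1, orthogonal to h_2
q2 = zf_direction(h2, h1)   # direction of p_2, orthogonal to h_1
```

With a unit-norm direction, the effective gain of user- $k$ is $\left\|\mathbf{h}_{k}\right\|^{2}\rho$ , so a private stream with power $P_{k}$ is received with power $\left\|\mathbf{h}_{k}\right\|^{2}\rho P_{k}$ .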

Defining $\gamma_{k}^{2}=1+\left|\mathbf{h}_{k}^{H}\mathbf{p}_{k}\right|^{2}=1+\left\|\mathbf{h}_{k}\right\|^{2}\rho P_{k}$ , $k=1,2$ , and $\tilde{\mathbf{h}}_{k}=\mathbf{h}_{k}/\gamma_{k}$ , the problem is re-written as

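$$\max_{\mathbf{p}_{\mathrm{c}}}\min\left(\left|\tilde{\mathbf{h}}_{1}^{H}\mathbf{p}_{\mathrm{c}}\right|^{2},\left|\tilde{\mathbf{h}}_{2}^{H}\mathbf{p}_{\mathrm{c}}\right|^{2}\right). \tag{6}$$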

Following [17], the solution of (6) is $\mathbf{p}_{\mathrm{c}}=\sqrt{P_{\mathrm{c}}}\mathbf{f}_{\mathrm{c}}$ with the precoder direction $\mathbf{f}_{\mathrm{c}}$ ( $\left\|\mathbf{f}_{\mathrm{c}}\right\|^{2}=1$ ) given by

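$$\mathbf{f}_{\mathrm{c}}=\frac{1}{\lambda}\left(\mu_{1}\tilde{\mathbf{h}}_{1}+\mu_{2}\tilde{\mathbf{h}}_{2}e^{-j\angle\alpha_{12}}\right), \tag{7}$$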

where

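$$\lambda=\frac{\alpha_{11}\alpha_{22}-\left|\alpha_{12}\right|^{2}}{\alpha_{11}+\alpha_{22}-2\left|\alpha_{12}\right|}, \tag{8}$$

$$\left[\begin{array}{c}\mu_{1}\\ \mu_{2}\end{array}\right]=\frac{1}{\alpha_{11}+\alpha_{22}-2\left|\alpha_{12}\right|}\left[\begin{array}{c}\alpha_{22}-\left|\alpha_{12}\right|\\ \alpha_{11}-\left|\alpha_{12}\right|\end{array}\right], \tag{9}$$

$$\left[\begin{array}{cc}\alpha_{11}&\alpha_{12}\\ \alpha_{12}^{*}&\alpha_{22}\end{array}\right]=\left[\begin{array}{c}\tilde{\mathbf{h}}_{1}^{H}\\ \tilde{\mathbf{h}}_{2}^{H}\end{array}\right]\left[\begin{array}{cc}\tilde{\mathbf{h}}_{1}&\tilde{\mathbf{h}}_{2}\end{array}\right]. \tag{10}$$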
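The construction of the common precoder direction can be sketched numerically as follows. This is an illustration under two assumptions that are not stated in the text: the weights $\mu_{1}$ and $\mu_{2}$ are required to be non-negative (the sketch raises an error otherwise), and the final scaling is done by dividing by the Euclidean norm directly so that $\left\|\mathbf{f}_{\mathrm{c}}\right\|^{2}=1$ holds by construction. The function name `common_direction` is hypothetical.

```python
import cmath
import math

def inner(a, b):
    """Hermitian inner product a^H b."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def common_direction(h1, h2, gamma1_sq=1.0, gamma2_sq=1.0):
    """Unit-norm common precoder direction from the weighted channel combination."""
    ht1 = [x / math.sqrt(gamma1_sq) for x in h1]   # effective channel of user-1
    ht2 = [x / math.sqrt(gamma2_sq) for x in h2]   # effective channel of user-2
    a11 = inner(ht1, ht1).real
    a22 = inner(ht2, ht2).real
    a12 = inner(ht1, ht2)
    denom = a11 + a22 - 2.0 * abs(a12)
    if denom <= 0.0:
        raise ValueError("effective channels coincide; weights are undefined")
    mu1 = (a22 - abs(a12)) / denom
    mu2 = (a11 - abs(a12)) / denom
    if mu1 < 0.0 or mu2 < 0.0:
        # Assumption of this sketch: only the non-negative-weight case is handled.
        raise ValueError("negative weight; outside the case covered here")
    rot = cmath.exp(-1j * cmath.phase(a12))        # phase alignment term
    v = [mu1 * x + mu2 * y * rot for x, y in zip(ht1, ht2)]
    n = math.sqrt(inner(v, v).real)
    return [x / n for x in v]

# Hypothetical example channels with gamma_1^2 = gamma_2^2 = 1.
h1 = [1 / math.sqrt(2), 1 / math.sqrt(2)]
h2 = [0.8 / math.sqrt(2), 0.8 * cmath.exp(1j * math.pi / 2) / math.sqrt(2)]
f_c = common_direction(h1, h2)
```

In this example the two effective gains $\big|\tilde{\mathbf{h}}_{1}^{H}\mathbf{f}_{\mathrm{c}}\big|^{2}$ and $\big|\tilde{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\big|^{2}$ come out equal, which is the balancing property expected from a max-min design.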

III-A Sum-Rate at Finite SNR

The sum-rate with the above precoder designs can be written as $R_{\mathrm{s}}=R_{\mathrm{c}}+\log_{2}\left(\gamma_{1}^{2}\right)+\log_{2}\left(\gamma_{2}^{2}\right)$ , where $R_{\mathrm{c}}\!=\!\min\!\big{(}\log_{2}\big{(}1\!+\!\big{|}\tilde{\mathbf{h}}_{1}^{H}\mathbf{p}_{\mathrm{c}}\big{|}^{2}\big{)},\log_{2}\big{(}1\!+\!\big{|}\tilde{\mathbf{h}}_{2}^{H}\mathbf{p}_{\mathrm{c}}\big{|}^{2}\big{)}\big{)}$ . With $\mathbf{p}_{\mathrm{c}}$ as per (7), following [17], $\big{|}\tilde{\mathbf{h}}_{1}^{H}\mathbf{p}_{\mathrm{c}}\big{|}\!=\!\big{|}\tilde{\mathbf{h}}_{2}^{H}\mathbf{p}_{\mathrm{c}}\big{|}$ , and we can write $R_{\mathrm{c}}=\log_{2}\big{(}1+\big{|}\tilde{\mathbf{h}}_{2}^{H}\mathbf{p}_{\mathrm{c}}\big{|}^{2}\big{)}$ , and the sum-rate simply as

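$$R_{\mathrm{s}}=\log_{2}\left(\gamma_{1}^{2}\right)+\log_{2}\left(\gamma_{2}^{2}+\left|\mathbf{h}_{2}^{H}\mathbf{p}_{\mathrm{c}}\right|^{2}\right). \tag{11}$$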

Consider that a fraction $t$ of the total transmit power $P$ is allocated to the private streams such that $P_{1}+P_{2}=tP$ and the remaining power $P_{\mathrm{c}}=\left(1-t\right)P$ is allocated to the common stream. For a given $t$ , the optimal values of $P_{1}$ and $P_{2}$ , maximizing the sum-rate of the private streams, are given by the Water-Filling (WF) solution

$$P_{k}=\max\left(\mu-\frac{1}{\left\|\mathbf{h}_{k}\right\|^{2}\rho},0\right),\quad k=1,2, \tag{12}$$

with the water level $\mu$ chosen such that $P_{1}\!+\!P_{2}\!=\!tP$ , and set as $\mu\!=\!\frac{tP}{2}\!+\!\frac{1}{2\rho}\left[\frac{1}{\left\|\mathbf{h}_{1}\right\|^{2}}\!+\!\frac{1}{\left\|\mathbf{h}_{2}\right\|^{2}}\right]$ in the sequel. Let us also introduce $\Gamma\!=\!\frac{1}{\rho}\left[\frac{1}{\left\|\mathbf{h}_{2}\right\|^{2}}\!-\!\frac{1}{\left\|\mathbf{h}_{1}\right\|^{2}}\right]$ , which is a function of two main parameters: $\rho$ reflecting the angle between the user channel directions, and $\frac{1}{\left\|\mathbf{h}_{2}\right\|^{2}}-\frac{1}{\left\|\mathbf{h}_{1}\right\|^{2}}$ reflecting the disparity of the channel strengths. We can then identify two main regimes.
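The water-filling split and the threshold $\Gamma$ can be sketched in a few lines. The function name `water_filling` and the example numbers are illustrative; the sketch assumes $\left\|\mathbf{h}_{1}\right\|\geq\left\|\mathbf{h}_{2}\right\|$ and $\rho>0$ , as in the text.

```python
def water_filling(h1_sq, h2_sq, rho, t, P):
    """Return (P_1, P_2) for private power t*P, assuming h1_sq >= h2_sq and rho > 0."""
    big_gamma = (1.0 / h2_sq - 1.0 / h1_sq) / rho
    if t * P <= big_gamma:
        # Water level below user-2's threshold: all private power goes to user-1.
        return t * P, 0.0
    # Both users are active: water level mu = tP/2 + (1/(2 rho)) (1/h1_sq + 1/h2_sq).
    mu = t * P / 2.0 + (1.0 / h1_sq + 1.0 / h2_sq) / (2.0 * rho)
    return mu - 1.0 / (h1_sq * rho), mu - 1.0 / (h2_sq * rho)

# Hypothetical example: ||h1||^2 = 1, ||h2||^2 = 0.25, rho = 0.5, so Gamma = 6.
both_active = water_filling(1.0, 0.25, 0.5, 1.0, 100.0)
single_user = water_filling(1.0, 0.25, 0.5, 0.05, 100.0)
```

In the first call $tP=100>\Gamma$ , so both private streams receive power; in the second call $tP=5\leq\Gamma$ , so only user-1 does.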

III-A1 OMA/NOMA/Multicasting Regime

If $\mu\leq\frac{1}{\left\|\mathbf{h}_{2}\right\|^{2}\rho}$ , i.e., $tP\leq\Gamma$ , we set $P_{2}=0$ and $P_{1}=tP$ according to (12), and RS specializes to multicasting for $t=0$ , NOMA for $0<t<1$ , and OMA for $t=1$ . In this regime, $t$ needs to be adjusted so as to identify the best strategy among OMA, NOMA, and multicasting, and therefore efficiently allocate power across the common stream $s_{\mathrm{c}}$ and the private stream $s_{1}$ .

III-A2 RS/SDMA Regime

If $\mu>\frac{1}{\left\|\mathbf{h}_{2}\right\|^{2}\rho}$ , i.e., $tP>\Gamma$ , the WF solution (12) leads to $P_{1}=\mu-\frac{1}{\left\|\mathbf{h}_{1}\right\|^{2}\rho}=\frac{tP}{2}+\frac{\Gamma}{2}>0$ and $P_{2}=\mu-\frac{1}{\left\|\mathbf{h}_{2}\right\|^{2}\rho}=\frac{tP}{2}-\frac{\Gamma}{2}>0$ . RS specializes to SDMA whenever $t$ is set to 1, but does not specialize to any other known scheme for $0<t<1$ . In this regime, $t$ needs to be adjusted, as explained in the sequel, so as to allocate the power efficiently across the common stream and the two private streams. Substituting the expressions of $P_{k}$ and $\gamma_{k}^{2}$ , $k=1,2$ , into (11), we can write

$$R_{\mathrm{s}}=\log_{2}\left(ac+(ad+bc)t+bdt^{2}\right), \tag{13}$$

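where $b=\frac{\left\|\mathbf{h}_{1}\right\|^{2}\rho P}{2}$ , $a=1+\frac{\Gamma}{P}b$ , $d=\frac{\left\|\mathbf{h}_{2}\right\|^{2}\rho P}{2}-|\mathbf{h}_{2}^{H}\mathbf{f}_{\mathrm{c}}|^{2}P$ , and $c=1-\frac{\Gamma}{P}d+|\mathbf{h}_{2}^{H}\mathbf{f}_{\mathrm{c}}|^{2}(P-\Gamma)$ . The value of $t$ that maximizes $R_{\mathrm{s}}$ is the solution of $\frac{\partial R_{\mathrm{s}}}{\partial t}=0$ , which is written as $t=-\frac{a}{2b}-\frac{c}{2d}$ . Since $t\leq 1$ , the optimal value $t^{\star}$ is given in closed form by

$$t^{\star}=\min\left(-\frac{a}{2b}-\frac{c}{2d},1\right)=\min\left(\frac{\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{2\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}-\rho}+\frac{1}{2\rho}\left(\frac{1}{\left\|\mathbf{h}_{1}\right\|^{2}}+\frac{1}{\left\|\mathbf{h}_{2}\right\|^{2}}\right)\left(\frac{2\rho-2\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{2\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}-\rho}\right)\frac{1}{P},1\right). \tag{14}$$

For $t^{\star}<1$ , RS yields a non-zero sum-rate enhancement over SDMA.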
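As a sanity check, the closed-form power split can be compared with a brute-force search over $t$ . The sketch below takes $x=\big|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\big|^{2}$ as an input and assumes $2x>\rho$ (so that the quadratic in $t$ is concave) and $tP>\Gamma$ ; the function names and the numerical values are illustrative only.

```python
import math

def rs_coefficients(h1_sq, h2_sq, rho, x, P):
    """Coefficients a, b, c, d of the sum-rate expression, plus Gamma."""
    big_gamma = (1.0 / h2_sq - 1.0 / h1_sq) / rho
    g = h2_sq * x                          # |h_2^H f_c|^2
    b = h1_sq * rho * P / 2.0
    a = 1.0 + big_gamma / P * b
    d = h2_sq * rho * P / 2.0 - g * P
    c = 1.0 - big_gamma / P * d + g * (P - big_gamma)
    return a, b, c, d, big_gamma

def sum_rate(t, a, b, c, d):
    """Sum-rate in the RS/SDMA regime as a function of the private power fraction t."""
    return math.log2(a * c + (a * d + b * c) * t + b * d * t * t)

def t_star(h1_sq, h2_sq, rho, x, P):
    """Closed-form optimal fraction, assuming 2x > rho."""
    s = 1.0 / h1_sq + 1.0 / h2_sq
    den = 2.0 * x - rho
    return min(x / den + s / (2.0 * rho) * (2.0 * rho - 2.0 * x) / den / P, 1.0)

# Hypothetical example values.
h1_sq, h2_sq, rho, P = 1.0, 0.25, 0.5, 100.0
x = (1.0 + math.sqrt(0.5)) / 2.0
a, b, c, d, big_gamma = rs_coefficients(h1_sq, h2_sq, rho, x, P)
t_closed = t_star(h1_sq, h2_sq, rho, x, P)
grid = [i / 10000 for i in range(int(big_gamma / P * 10000) + 1, 10001)]
t_search = max(grid, key=lambda t: sum_rate(t, a, b, c, d))
```

In this example the grid search restricted to $tP>\Gamma$ lands on the same value as the closed form, and the sum-rate at that point exceeds the SDMA value obtained with $t=1$ .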

Remark 2

It is important to note that the solution $t=-\frac{a}{2b}-\frac{c}{2d}$ holds because the coefficients $a$ , $b$ , $c$ , $d$ are not functions of $t$ . This could appear surprising since $c$ and $d$ are functions of $\mathbf{f}_{c}$ , which, according to (6), is a function of $P_{1}$ and $P_{2}$ and therefore of $t$ . However, interestingly, in the regime where $P_{1}>0$ and $P_{2}>0$ , we can show that $\mathbf{f}_{c}$ is not a function of $t$ . Making use of $P_{1}=\frac{tP}{2}+\frac{\Gamma}{2}$ and $P_{2}=\frac{tP}{2}-\frac{\Gamma}{2}$ , we can write $\gamma_{k}^{2}=1+\left\|\mathbf{h}_{k}\right\|^{2}\rho P_{k}=\frac{f(t)}{\left\|\mathbf{h}_{j}\right\|^{2}}$ , $k,j=1,2$ and $k\neq j$ , with $f(t)=\frac{\left\|\mathbf{h}_{1}\right\|^{2}+\left\|\mathbf{h}_{2}\right\|^{2}+{\left\|\mathbf{h}_{1}\right\|^{2}\left\|\mathbf{h}_{2}\right\|^{2}\rho P}t}{2}$ . We then obtain

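$$\begin{aligned}\max_{\mathbf{f}_{\mathrm{c}}}\min\left(\left|\tilde{\mathbf{h}}_{1}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2},\left|\tilde{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}\right)&\Leftrightarrow\max_{\mathbf{f}_{\mathrm{c}}}\min\left(\frac{\left|\mathbf{h}_{1}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{\gamma_{1}^{2}},\frac{\left|\mathbf{h}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{\gamma_{2}^{2}}\right)\\&\Leftrightarrow\max_{\mathbf{f}_{\mathrm{c}}}\min\left(\frac{\left\|\mathbf{h}_{1}\right\|^{2}\left\|\mathbf{h}_{2}\right\|^{2}\left|\bar{\mathbf{h}}_{1}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{f(t)},\frac{\left\|\mathbf{h}_{1}\right\|^{2}\left\|\mathbf{h}_{2}\right\|^{2}\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{f(t)}\right)\\&\Leftrightarrow\max_{\mathbf{f}_{\mathrm{c}}}\min\left(\left|\bar{\mathbf{h}}_{1}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2},\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}\right),\end{aligned} \tag{15}$$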

which reveals that $\mathbf{f}_{\mathrm{c}}$ is not a function of $t$ and the channel strength disparity, but only of the channel directions.
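The argument of this remark can be illustrated numerically: in the regime where both private streams are active, the two effective channels $\tilde{\mathbf{h}}_{1}$ and $\tilde{\mathbf{h}}_{2}$ end up with the same norm for every $t$ , so the weights $\mu_{1}$ and $\mu_{2}$ are both equal to $1/2$ . The sketch below is a check under the stated water-filling split; the function names and the numbers are illustrative.

```python
def effective_norms_sq(h1_sq, h2_sq, rho, t, P):
    """Squared norms of the effective channels h_k / gamma_k in the RS/SDMA regime."""
    big_gamma = (1.0 / h2_sq - 1.0 / h1_sq) / rho
    if t * P <= big_gamma:
        raise ValueError("outside the RS/SDMA regime (P_2 would be zero)")
    p1 = (t * P + big_gamma) / 2.0
    p2 = (t * P - big_gamma) / 2.0
    gamma1_sq = 1.0 + h1_sq * rho * p1
    gamma2_sq = 1.0 + h2_sq * rho * p2
    return h1_sq / gamma1_sq, h2_sq / gamma2_sq

def weights(a11, a22, a12_abs):
    """Weights (mu_1, mu_2) of the common precoder for given Gram-matrix entries."""
    denom = a11 + a22 - 2.0 * a12_abs
    return (a22 - a12_abs) / denom, (a11 - a12_abs) / denom

# Hypothetical example: the effective norms coincide for each tested value of t.
checks = [effective_norms_sq(1.0, 0.25, 0.5, t, 100.0) for t in (0.2, 0.5, 1.0)]
```

Because the two effective norms coincide, the weighted combination reduces to an equal-weight sum of the two phase-aligned channel directions, which does not involve $t$ .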

III-B Sum-Rate at High SNR

At high SNR, considering $0<t\leq 1$ and $\rho>0$ , the solution in (12) allocates power uniformly across the two private streams as $P_{1}=P_{2}=\frac{tP}{2}>0$ . Hence, only RS and SDMA are suitable strategies at high SNR. The sum-rate in (11) can then be written as

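$$R_{\mathrm{s}}\stackrel{P\nearrow}{=}\log_{2}\left(\left\|\mathbf{h}_{1}\right\|^{2}\rho\right)+2\log_{2}\left(P\right)+\log_{2}\left(et^{2}+ft\right) \tag{16}$$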

with $e=\frac{\left\|\mathbf{h}_{2}\right\|^{2}\rho}{4}\!-\!\frac{\left|\mathbf{h}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{2}$ , $f=\frac{\left|\mathbf{h}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{2}$ . Not surprisingly, a DoF of 2 is achieved in (16). More interesting is the fact that RS brings a constant sum-rate enhancement over SDMA. Indeed, the value of $t$ that maximizes (16) is given by

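$$t^{\star}=\min\left(\frac{-f}{2e},1\right)=\min\left(\frac{\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}}{2\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}-\rho},1\right), \tag{17}$$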

which coincides with (14) when $P\rightarrow\infty$ , and leads to a high SNR non-zero (whenever $0<t^{\star}<1$ ) sum-rate gap between RS and SDMA ( $t=1$ ) given by

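$$\Delta R_{\mathrm{s}}=\left.R_{\mathrm{s}}\right|_{t^{\star}}-\left.R_{\mathrm{s}}\right|_{t=1}=\log_{2}\left(\frac{\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{4}}{\rho\left(2\left|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\right|^{2}-\rho\right)}\right). \tag{18}$$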

$t^{\star}$ increases and $\Delta R_{\mathrm{s}}$ decreases as $\rho$ increases, and neither is a function of the channel strengths. The sum-rate gap between RS and NOMA/OMA/multicasting grows unbounded as $P\!\rightarrow\!\infty$ due to the difference in DoF (Remark 1).
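The high-SNR power split and the resulting gap over SDMA depend only on $\rho$ and $x=\big|\bar{\mathbf{h}}_{2}^{H}\mathbf{f}_{\mathrm{c}}\big|^{2}$ , which the following sketch makes explicit. The function name `high_snr_split` is hypothetical, and the guard that returns $t^{\star}=1$ with a zero gap when $2x\leq\rho$ is an assumption added for numerical safety rather than part of the derivation.

```python
import math

def high_snr_split(x, rho):
    """Return (t_star, gap in bit/s/Hz) at high SNR for x = |h2_bar^H f_c|^2."""
    den = 2.0 * x - rho
    if den <= 0.0:
        return 1.0, 0.0            # guard (assumption): treat as the SDMA case
    t = x / den
    if t >= 1.0:
        return 1.0, 0.0            # RS specializes to SDMA, no gap
    return t, math.log2(x * x / (rho * den))

# Hypothetical example values.
rho = 0.5
x = (1.0 + math.sqrt(0.5)) / 2.0
t_opt, gap = high_snr_split(x, rho)

# Cross-check against the e*t^2 + f*t form with an arbitrary channel strength.
h2_sq = 0.3
e = h2_sq * rho / 4.0 - h2_sq * x / 2.0
f = h2_sq * x / 2.0
gap_direct = math.log2(e * t_opt ** 2 + f * t_opt) - math.log2(e + f)
```

The direct evaluation uses an arbitrary value of $\left\|\mathbf{h}_{2}\right\|^{2}$ and returns the same gap, consistent with the statement that the gap does not depend on the channel strengths.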

III-C Discussions

We can draw several insights from the above analysis. First, for given $t$ , $\rho$ , $\left\|\mathbf{h}_{1}\right\|^{2}$ , and $\left\|\mathbf{h}_{2}\right\|^{2}$ , as $P$ increases, the SNRs of the private streams increase, while the Signal-to-Interference-plus-Noise Ratio (SINR) of the common stream ultimately saturates (interference limited regime). This suggests that the common message can only provide a constant rate improvement at high SNR, while the two private streams provide the DoF of 2. Second, the quantity $\rho$ is present in the SNRs of both private streams and has the effect of increasing/decreasing the SNRs of those two streams. A lower $\rho$ indicates that both private streams effectively operate at a lower SNR. According to (12), for a given $t$ , a low $\rho$ favors power allocation to a single private stream (NOMA/OMA/Multicasting regime) over a wider range of $P$ , and also leads to a smaller interference power (and therefore a higher rate) for the common stream. A higher $\rho$ leads to a higher effective SNR and therefore a better capability to support two private streams (RS/SDMA regime). Third, as the disparity of channel strengths increases, the WF solution allocates a larger amount of power to the stronger user (user-1) over a wider range of $P$ (for a given $t$ ). Beyond a certain disparity, for given $t$ , $P$ , and $\rho$ , $P_{2}$ is turned off and RS specializes to NOMA/OMA.

IV Evaluations

In this section, we first illustrate the above analysis and the preferred regions for the operation of NOMA, OMA, SDMA, and RS. We assume $n_{t}=2$ , and channel vectors given by $\mathbf{h}_{1}=1/\sqrt{2}\>[1,1]^{H}$ and $\mathbf{h}_{2}=\gamma/\sqrt{2}\>[1,e^{j\theta}]^{H}$ .

Assuming the precoding strategies in Section III and the WF power allocation (12), the colors in Fig. 3(a) and (b) illustrate the optimum value (obtained from exhaustive search whenever not available in closed form) of $t$ that maximizes the sum-rate and the corresponding preferred communication strategy (RS, SDMA, NOMA, OMA) as a function of $\rho=1-\left|\bar{\mathbf{h}}_{1}^{H}\bar{\mathbf{h}}_{2}\right|^{2}$ (ranging from 0 to 1) and $\gamma_{\mathrm{dB}}=20\log_{10}(\gamma)$ (ranging from 0 to -20dB), i.e., user-1 and user-2 have a long-term SNR of 20dB and $0\mathrm{dB}\leq 20\mathrm{dB}+\gamma_{\mathrm{dB}}\leq 20\mathrm{dB}$ , respectively. Recall that SDMA is characterized by $t=1,P_{1}>0,P_{2}>0$ , NOMA by $0<t<1,P_{1}>0,P_{2}\!=\!0$ , OMA by $t\!=\!1,P_{1}\!=\!P,P_{2}\!=\!0$ , and multicast by $t\!=\!0,P_{1}\!=\!0,P_{2}\!=\!0$ . For all other regimes, RS does not specialize to any other well-established scheme and is simply referred to as RS. We observe that NOMA is preferred for deployments with small $\rho$ , i.e., closely aligned users, and small $\gamma$ , SDMA is preferred whenever $\rho$ is sufficiently large, i.e., semi-orthogonal users, and RS bridges those two extremes. OMA is preferred whenever $\gamma$ is very small.

Recall that Fig. 3 is obtained for $P=100$ W. In Fig. 4, we assess the evolution of the regions as a function of $P$ for $P=10$ W and $P=1000$ W (where the long term SNR is 10 dB and 30 dB, respectively). As $P$ increases, RS becomes the dominant strategy for most deployment conditions.

Fig. 5 shows the relative sum-rate gain [%] of RS over dynamic switching between SDMA and NOMA, defined as $\frac{R_{\mathrm{s}}^{\mathrm{RS}}-\max(R_{\mathrm{s}}^{\mathrm{SDMA}},R_{\mathrm{s}}^{\mathrm{NOMA}})}{\max(R_{\mathrm{s}}^{\mathrm{SDMA}},R_{\mathrm{s}}^{\mathrm{NOMA}})}\!\times\!100$ , for $P\!=\!10,100,1000$ W and the precoders from Section III. RS provides explicit gains over dynamic switching for medium values of $\rho$ . The values in brackets indicate the relative sum-rate gains over SDMA and NOMA, respectively, i.e., $\big{(}\frac{R_{\mathrm{s}}^{\mathrm{RS}}-R_{\mathrm{s}}^{\mathrm{SDMA}}}{R_{\mathrm{s}}^{\mathrm{SDMA}}}\!\times\!100,\frac{R_{\mathrm{s}}^{\mathrm{RS}}-R_{\mathrm{s}}^{\mathrm{NOMA}}}{R_{\mathrm{s}}^{\mathrm{NOMA}}}\!\times\!100\big{)}$ . Large gains over SDMA are observed for low to medium values of $\rho$ , and over NOMA for medium to large values of $\rho$ at low SNR and for all values of $\rho$ and $\gamma_{\mathrm{dB}}$ at higher SNR. Values $(0,0)$ indicate that OMA is the preferred strategy, and that RS, SDMA, and NOMA all specialize to OMA.
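The relative gain metrics defined above are simple to compute once the three sum-rates are available. The sketch below is illustrative; the function names and the sum-rate values in the example are made up and are not results from the paper.

```python
def relative_gain_over_switching(r_rs, r_sdma, r_noma):
    """Relative sum-rate gain [%] of RS over the better of SDMA and NOMA."""
    best = max(r_sdma, r_noma)
    return (r_rs - best) / best * 100.0

def relative_gains(r_rs, r_sdma, r_noma):
    """Pair of relative gains [%] over SDMA and over NOMA, in that order."""
    return ((r_rs - r_sdma) / r_sdma * 100.0,
            (r_rs - r_noma) / r_noma * 100.0)

# Made-up sum-rates in bit/s/Hz, for illustration only.
gain_switching = relative_gain_over_switching(5.5, 5.0, 4.0)
gain_pair = relative_gains(5.5, 5.0, 4.0)
```

A pair of zeros from `relative_gains` corresponds to the case where all three strategies achieve the same sum-rate.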

Fig. 6 is similar to Fig. 5 but now the Weighted Minimum Mean Square Error (WMMSE) precoding optimization framework for RS developed in [7, 3] is adopted. This framework optimizes all precoders ( $\mathbf{p}_{\mathrm{c}},\mathbf{p}_{1},\mathbf{p}_{2}$ ) jointly with the power allocations so as to maximize the weighted sum-rate $\sum_{k=1,2}u_{k}\left(R_{k}+R_{\mathrm{c},k}\right)$ . In those evaluations, the convergence tolerance of the WMMSE algorithm is set to $\epsilon\!=\!10^{-3}$ [3]. When allocating equal weights or higher weights to the user with the stronger channel (namely user-1), NOMA has no benefit over SDMA. When a higher weight is given to the weaker user (user-2), NOMA is able to outperform SDMA. RS, on the other hand, always provides the same or better performance than both SDMA and NOMA for all weights, $\rho$ , and $\gamma_{\mathrm{dB}}$ . Though the precoders of Section III are simple and not optimal, the insights obtained from the analysis and Fig. 5 are in line with those obtained from Fig. 6. Hence, irrespective of the precoding strategy, i.e., simple or optimized, RS unifies and outperforms SDMA, OMA, NOMA, and multicasting.

We now change the channel model and assume i.i.d. Rayleigh fading, i.e., the entries of $\mathbf{h}_{1}$ and $\mathbf{h}_{2}$ are $\mathcal{CN}(0,1/n_{t})$ and $\mathcal{CN}(0,\gamma^{2}/n_{t})$ . We generate 10000 channel realizations. Making use of the precoders in Section III, we identify the preferred (i.e., sum-rate maximizing) strategy for each channel realization. Fig. 7 displays the percentage of channel realizations for which a given strategy is the preferred option as a function of $P$ and $\gamma_{\mathrm{dB}}$ for $n_{t}=2$ . OMA is preferred for low $P$ and low $\gamma_{\mathrm{dB}}$ , and RS becomes the preferred option as $P$ and/or $\gamma_{\mathrm{dB}}$ increase. At high SNR, RS is the preferred option for about 75% of the channel realizations and SDMA for the remaining 25%. Results with $n_{t}=4$ (not reproduced here due to the space constraint) show that NOMA almost disappears from the set of preferred strategies, and SDMA becomes more dominant (for about 60% of the channel realizations and RS for the remaining 40%). This is natural since, as $n_{t}$ increases, the likelihood of experiencing a large $\rho$ increases, and $t^{\star}$ has a higher chance of being equal to 1.
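The trend with $n_{t}$ can be explored with a small Monte Carlo experiment restricted to the high-SNR limit. The sketch below is a simplified stand-in for the full evaluation, not a reproduction of Fig. 7: it assumes the equal-weight common direction suggested by Remark 2, uses the high-SNR expression of $t^{\star}$ , and counts how often $t^{\star}=1$ (SDMA preferred). The function names, the seed, and the number of trials are illustrative choices.

```python
import cmath
import math
import random

def inner(a, b):
    """Hermitian inner product a^H b."""
    return sum(x.conjugate() * y for x, y in zip(a, b))

def unit(v):
    n = math.sqrt(inner(v, v).real)
    return [x / n for x in v]

def random_channel(n_t, variance, rng):
    """Vector with i.i.d. CN(0, variance / n_t) entries."""
    s = math.sqrt(variance / (2.0 * n_t))   # standard deviation per real dimension
    return [complex(rng.gauss(0.0, s), rng.gauss(0.0, s)) for _ in range(n_t)]

def sdma_fraction(n_t, gamma=1.0, trials=10000, seed=0):
    """Fraction of realizations with high-SNR t_star equal to 1."""
    rng = random.Random(seed)
    count = 0
    for _ in range(trials):
        d1 = unit(random_channel(n_t, 1.0, rng))
        d2 = unit(random_channel(n_t, gamma ** 2, rng))
        corr = inner(d1, d2)
        rho = 1.0 - abs(corr) ** 2
        rot = cmath.exp(-1j * cmath.phase(corr))
        f_c = unit([u + v * rot for u, v in zip(d1, d2)])  # equal-weight direction
        x = abs(inner(d2, f_c)) ** 2
        den = 2.0 * x - rho
        t_star = 1.0 if den <= 0.0 else min(x / den, 1.0)
        if t_star >= 1.0:
            count += 1
    return count / trials

frac_nt2 = sdma_fraction(2)
frac_nt4 = sdma_fraction(4)
```

Under these assumptions the fraction for $n_{t}=4$ comes out larger than for $n_{t}=2$ , and both should land near the 25% and 60% shares quoted above for SDMA at high SNR.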

V Conclusions

RS unifies SDMA, OMA, NOMA, and multicasting under a single approach and provides a powerful framework for the design and optimization of non-orthogonal transmission, multiple access, and interference management strategies. Thanks to its versatility, RS has the potential to tackle challenges of modern communication systems and is a gold mine of research problems for academia and industry, spanning fundamental limits, optimization, PHY and MAC layers, and standardization.

Bibliography

[1] B. Clerckx, H. Joudeh, C. Hao, M. Dai, and B. Rassouli, “Rate Splitting for MIMO Wireless Networks: A Promising PHY-Layer Strategy for LTE Evolution,” IEEE Commun. Mag., pp. 98-105, May 2016.
[2] T. Han et al., “A New Achievable Rate Region for the Interference Channel,” IEEE Trans. Inf. Theory, vol. 27, no. 1, pp. 49-60, Jan. 1981.
[3] Y. Mao, B. Clerckx, and V.O.K. Li, “Rate-Splitting Multiple Access for Downlink Communication Systems: Bridging, Generalizing and Outperforming SDMA and NOMA,” EURASIP JWCN, May 2018.
[4] Y. Mao et al., “Energy Efficiency of Rate-Splitting Multiple Access, and Performance Benefits over SDMA and NOMA,” in Proc. IEEE ISWCS 2018.
[5] S. Yang, M. Kobayashi, D. Gesbert, and X. Yi, “Degrees of Freedom of Time Correlated MISO Broadcast Channel with Delayed CSIT,” IEEE Trans. Inf. Theory, vol. 59, no. 1, pp. 315-328, Jan. 2013.
[6] C. Hao, Y. Wu, and B. Clerckx, “Rate Analysis of Two-Receiver MISO Broadcast Channel with Finite Rate Feedback: A Rate-Splitting Approach,” IEEE Trans. Commun., vol. 63, no. 9, pp. 3232-3246, Sept. 2015.
[7] H. Joudeh et al., “Sum-Rate Maximization for Linearly Precoded Downlink Multiuser MISO Systems with Partial CSIT: A Rate-Splitting Approach,” IEEE Trans. Commun., vol. 64, no. 11, pp. 4847-4861, Nov. 2016.
[8] H. Joudeh and B. Clerckx, “Robust Transmission in Downlink Multiuser MISO Systems: A Rate-Splitting Approach,” IEEE Trans. Signal Process., vol. 64, no. 23, pp. 6227-6242, Dec. 2016.