Improper Signaling for SISO Two-user Interference Channels with Additive   Asymmetric Hardware Distortion

Mohammad Soleymani; Christian Lameiro; Ignacio Santamaria; Peter J. Schreier

arXiv:1901.05821·eess.SP·September 12, 2019·IEEE Trans. Commun.


TL;DR

This paper demonstrates that improper Gaussian signaling (IGS) can enhance the rate performance of two-user interference channels affected by asymmetric hardware distortion, through novel iterative optimization algorithms.

Contribution

It introduces two new algorithms for optimizing IGS parameters in interference channels with asymmetric hardware impairments, outperforming existing methods.

Findings

01. IGS improves rate performance in asymmetric HWD scenarios

02. Proposed algorithms outperform proper Gaussian signaling

03. Simplified algorithm reduces computational complexity

Abstract

Hardware non-idealities are among the main performance restrictions for upcoming wireless communication systems. Asymmetric hardware distortions (HWD) happen when the impairments of the I/Q branches are correlated or imbalanced, which in turn generate improper additive interference at the receiver side. When the interference is improper, as well as in other interference-limited scenarios, improper Gaussian signaling (IGS) has been shown to provide rate and/or power efficiency benefits. In this paper, we investigate the rate benefits of IGS in a two-user interference channel (IC) with additive asymmetric HWD when interference is treated as noise. We propose two iterative algorithms to optimize the parameters of the improper transmit signals. We first rewrite the rate region as a pseudo-signal-to-interference-plus-noise-ratio (PSINR) region and employ majorization minimization and…


Tables2

Algorithm I: Proposed sequential optimization algorithm.

Initialization: Set $\epsilon$, $M$, $\mathbf{p}^{(0)}=\mathbf{0}$, $\mathbf{q}^{(0)}=\mathbf{0}$, $m=1$, convergence $=0$

While convergence $=0$ and $m\leq M$ do

  Construct $\tilde{E}_{k}^{(m)}(\mathbf{p},\mathbf{q})=\tilde{u}_{k}^{(m)}(\mathbf{p},\mathbf{q})/\tilde{v}_{k}^{(m)}(\mathbf{p},\mathbf{q})$ for $k=1,2$ using Lemma 3

  Obtain $\mathbf{p}^{(m+1)}$ and $\mathbf{q}^{(m+1)}$ by solving (22), i.e., run Algorithm II

  If $\|\mathbf{p}^{(m)}-\mathbf{p}^{(m+1)}\|/\|\mathbf{p}^{(m)}\|<\epsilon$ and $\|\mathbf{q}^{(m)}-\mathbf{q}^{(m+1)}\|/\|\mathbf{q}^{(m)}\|<\epsilon$

    convergence $=1$

    $\mathbf{p}^{\star}=\mathbf{p}^{(m+1)}$ and $\mathbf{q}^{\star}=\mathbf{q}^{(m+1)}$

  End (If)

  $m=m+1$

End (While)

Return $\mathbf{p}^{\star}$ and $\mathbf{q}^{\star}$.

Algorithm II: Generalized Dinkelbach algorithm.

Initialization: Set $\epsilon$, $L$, $l=0$, $\mu^{(l)}=\min_{k=1,2}\left(\frac{\tilde{E}_{k}(\mathbf{p}^{(m)},\mathbf{q}^{(m)})-1}{\alpha_{k}}\right)$

Compute $\hat{E}_{k}(\mathbf{p},\mathbf{q},\mu^{(l)})$ for $k=1,2$ by (23)

While $\min_{k=1,2}\{\hat{E}_{k}(\mathbf{p},\mathbf{q},\mu^{(l)})\}\geq\epsilon$ and $l\leq L$ do

  $l=l+1$

  Obtain $\mathbf{p}^{(l)}$ and $\mathbf{q}^{(l)}$ by solving (25)

  If $\min_{k=1,2}\{\hat{E}_{k}(\mathbf{p},\mathbf{q},\mu^{(l)})\}<\epsilon$

    $\mathbf{p}^{\star}=\mathbf{p}^{(l)}$ and $\mathbf{q}^{\star}=\mathbf{q}^{(l)}$

  Else

    Update $\mu^{(l)}$ by (24)

  End (If)

End (While)

Return $\mathbf{p}^{\star}$ and $\mathbf{q}^{\star}$.

Equations131

$$y=\sqrt{P}\,h\,(x+\eta)+n, \tag{1}$$

$$y_{k}=\sqrt{p_{1}}\,h_{1k}(x_{1}+\eta_{1k})+\sqrt{p_{2}}\,h_{2k}(x_{2}+\eta_{2k})+n_{k}, \tag{2}$$

$$R_{k}=\frac{1}{2}\log_{2}\frac{\left(\sigma^{2}+\sum_{j=1}^{2}p_{j}|h_{jk}|^{2}\left(1+\sigma_{\eta_{jk}}^{2}\right)\right)^{2}-\left|\sum_{j=1}^{2}\left(q_{j}+p_{j}\tilde{\sigma}_{\eta_{jk}}^{2}\right)h_{jk}^{2}\right|^{2}}{\left(\sigma^{2}+\sum_{j=1}^{2}p_{j}|h_{jk}|^{2}\left(1+\sigma_{\eta_{jk}}^{2}\right)-p_{k}|h_{kk}|^{2}\right)^{2}-\left|\sum_{j=1}^{2}\left(q_{j}+p_{j}\tilde{\sigma}_{\eta_{jk}}^{2}\right)h_{jk}^{2}-q_{k}h_{kk}^{2}\right|^{2}}, \tag{3}$$

Symbols defined in (5)-(9): $\mathbf{a}_{k}$, $\tilde{\mathbf{f}}_{k}$, $\mathbf{b}_{1}$, $\mathbf{b}_{2}$, $\mathbf{q}$ (only the symbol names survive in this listing).

$$R_{k}=\frac{1}{2}\log_{2}\left(\frac{(\sigma^{2}+\mathbf{a}_{k}^{T}\mathbf{p})^{2}-|\mathbf{f}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}|^{2}}{(\sigma^{2}+\mathbf{b}_{k}^{T}\mathbf{p})^{2}-|\mathbf{g}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}|^{2}}\right), \tag{4}$$

$$\begin{aligned}\underset{R,\mathbf{p},\mathbf{q}}{\text{maximize}}\quad & R\\ \text{s.t.}\quad & R_{k}\geq\lambda_{k}R,\\ & 0\leq p_{k}\leq P_{k},\\ & |q_{k}|\leq p_{k},\end{aligned} \tag{10}$$

$$\begin{aligned}\underset{E,\mathbf{p},\mathbf{q}}{\text{maximize}}\quad & E\\ \text{s.t.}\quad & E_{k}(\mathbf{p},\mathbf{q})\geq 1+\alpha_{k}E,\\ & 0\leq p_{k}\leq P_{k},\\ & |q_{k}|\leq p_{k},\end{aligned} \tag{11}$$

$$E_{k}(\mathbf{p},\mathbf{q})\triangleq\frac{(\sigma^{2}+\mathbf{a}_{k}^{T}\mathbf{p})^{2}-|\mathbf{f}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}|^{2}}{(\sigma^{2}+\mathbf{b}_{k}^{T}\mathbf{p})^{2}-|\mathbf{g}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}|^{2}}=\frac{u_{k}(\mathbf{p},\mathbf{q})}{v_{k}(\mathbf{p},\mathbf{q})}. \tag{12}$$

$$\underset{0\leq p_{k}\leq P_{k},\,|q_{k}|\leq p_{k}}{\text{maximize}}\;\;\underset{k=1,2}{\min}\left\{\frac{E_{k}(\mathbf{p},\mathbf{q})-1}{\alpha_{k}}\right\}. \tag{13}$$

Symbols defined in (14)-(15): $V(\mu)$, $\bar{\mu}$ (only the symbol names survive in this listing).

$$\mu^{(l)}=\min\left(\frac{u_{1}(\mathbf{x}^{(l-1)})}{v_{1}(\mathbf{x}^{(l-1)})},\frac{u_{2}(\mathbf{x}^{(l-1)})}{v_{2}(\mathbf{x}^{(l-1)})}\right)>0, \tag{16}$$

$$\mathbf{x}^{(l-1)}=\underset{\mathbf{x}}{\arg\max}\;\underset{i}{\min}\left(u_{i}(\mathbf{x})-\mu^{(l-1)}v_{i}(\mathbf{x})\right). \tag{17}$$

Symbols defined in (18)-(19): $u_{k}(\mathbf{p},\mathbf{q})$, $v_{k}(\mathbf{p},\mathbf{q})$ (only the symbol names survive in this listing).

$$\tilde{u}_{k}^{(m)}(\mathbf{p},\mathbf{q})=\ldots+2(\sigma^{2}+\mathbf{a}_{k}^{T}\mathbf{p}^{(m)})\mathbf{a}_{k}^{T}(\mathbf{p}-\mathbf{p}^{(m)}), \tag{20}$$

$$\tilde{v}_{k}^{(m)}(\mathbf{p},\mathbf{q})=\ldots-2\mathfrak{R}\left[\tilde{\mathbf{f}}_{k}^{H}(\mathbf{g}_{k}^{H}\mathbf{q}^{(m)}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}^{(m)})^{*}\right](\mathbf{p}-\mathbf{p}^{(m)})-2\mathfrak{R}\left[(\mathbf{g}_{k}^{H}\mathbf{q}^{(m)}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}^{(m)})^{*}\mathbf{g}_{k}^{H}(\mathbf{q}-\mathbf{q}^{(m)})\right], \tag{21}$$

$$\begin{aligned}\underset{E^{\prime},\mathbf{p},\mathbf{q}}{\text{maximize}}\quad & E^{\prime}\\ \text{s.t.}\quad & \tilde{E}_{k}^{(m)}(\mathbf{p},\mathbf{q})\geq 1+\alpha_{k}E^{\prime},\\ & 0\leq p_{k}\leq P_{k},\\ & |q_{k}|\leq p_{k},\end{aligned} \tag{22}$$

$$\hat{E}_{k}(\mathbf{p},\mathbf{q},\mu^{(l)})\triangleq\tilde{u}_{k}^{(m)}(\mathbf{p},\mathbf{q})-(\mu^{(l)}\alpha_{k}+1)\tilde{v}_{k}^{(m)}(\mathbf{p},\mathbf{q}), \tag{23}$$

$$\mu^{(l)}=\underset{k=1,2}{\min}\left(\frac{\tilde{E}_{k}(\mathbf{p}^{(l-1)},\mathbf{q}^{(l-1)})-1}{\alpha_{k}}\right). \tag{24}$$

$$\begin{aligned}\underset{E^{\prime},\mathbf{p},\mathbf{q}}{\text{maximize}}\quad & E^{\prime}\\ \text{s.t.}\quad & \hat{E}_{k}(\mathbf{p},\mathbf{q},\mu^{(l)})\geq E^{\prime},\\ & 0\leq p_{k}\leq P_{k},\\ & |q_{k}|\leq p_{k}.\end{aligned} \tag{25}$$

$$\underset{E,\mathbf{p}}{\text{maximize}}\;\ldots$$


Full text


Mohammad Soleymani∗, Student Member, IEEE, Christian Lameiro∗, Member, IEEE, Ignacio Santamaria†, Senior Member, IEEE, and Peter J. Schreier∗, Senior Member, IEEE

∗Mohammad Soleymani, Christian Lameiro and Peter J. Schreier are with the Signal and System Theory Group, Universität Paderborn, Germany, http://sst.upb.de (emails: {mohammad.soleymani,christian.lameiro,peter.schreier}@sst.upb.de). †Ignacio Santamaria is with the Department of Communications Engineering, University of Cantabria (email: [email protected]). The work of M. Soleymani, C. Lameiro and P. J. Schreier was supported by the German Research Foundation (DFG) under grants LA 4107/1-1, SCHR 1384/7-1 and SCHR 1384/8-1. The work of I. Santamaria was supported by MINECO of Spain and AEI/FEDER funds of the E.U., under grant TEC2016-75067-C4-4-R (CARMEN).

Abstract

Hardware non-idealities are among the main performance restrictions for upcoming wireless communication systems. Asymmetric hardware distortions (HWD) happen when the impairments of the I/Q branches are correlated or imbalanced, which in turn generate improper additive interference at the receiver side. When the interference is improper, as well as in other interference-limited scenarios, improper Gaussian signaling (IGS) has been shown to provide rate and/or power efficiency benefits. In this paper, we investigate the rate benefits of IGS in a two-user interference channel (IC) with additive asymmetric HWD when interference is treated as noise. We propose two iterative algorithms to optimize the parameters of the improper transmit signals. We first rewrite the rate region as a pseudo-signal-to-interference-plus-noise-ratio (PSINR) region and employ majorization minimization and fractional programming to find a suboptimal solution for the achievable user rates. Then, we propose a simplified algorithm based on a separate optimization of the powers and complementary variances of the users, which exhibits lower computational complexity. We show that IGS can improve the performance of the two-user IC with additive HWD. Our proposed algorithms outperform proper Gaussian signaling and competing IGS algorithms in the literature that do not consider asymmetric HWD.

Index Terms:

Achievable rate region, asymmetric hardware distortions, difference of convex programming, generalized Dinkelbach algorithm, improper Gaussian signaling, interference channel.

I Introduction

One of the targets of 5G is reaching a data rate more than 1000 times greater than the data rate of current cellular systems [1]. However, reaching this goal entails many challenges. Among them is to overcome the non-idealities, i.e., hardware distortions (HWD), of devices which can result in a substantial performance degradation [2, 3, 4]. HWD are due to various imperfections in transceivers, including I/Q imbalance, non-linear power amplifiers, imperfect and/or low resolution analog-to-digital and digital-to-analog converters, frequency/phase offset and so on [3, 4, 5, 6, 7, 8, 9, 10]. Another main challenge for data-rate enhancement is to handle interference from other users, and hence interference management techniques play a key role in 5G [1]. Recently, it has been shown that improper Gaussian signaling (IGS) can improve the performance of various interference-limited systems [11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26]. In IGS schemes, the real and imaginary parts of the signal are correlated and/or have unequal powers [27, 28]. While proper Gaussian signaling (PGS) achieves channel capacity for point-to-point communications in the presence of proper Gaussian noise [29], this is not the case under improper Gaussian noise that arises as a result of asymmetric HWD [4, 11, 30, 31].

I-A Related work

The effect of HWD is studied in [5, 6, 7, 8, 9, 10] for various scenarios. In [5], the secrecy performance of downlink massive multiple-input multiple-output (MIMO) systems was considered with HWD and a passive multiple-antenna eavesdropper. The paper [6] analyzed the achievable rate of massive MIMO systems with Rician channels and HWD. In [7], the authors considered a full-duplex massive MIMO relay with HWD and proposed a scheme to mitigate the distortion by exploiting statistical knowledge of the channels. In [8], the authors studied a massive MIMO system with a new system model for HWD at the transceivers. The paper [10] studied the performance of dual-hop relaying with different protocols in the presence of HWD.

In the aforementioned papers, symmetric HWD are considered. Nevertheless, HWD can, in general, provoke asymmetric or improper distortion in both the transmitted and received signal [30, 32, 4, 11, 31]. The paper [4] considered IGS in a single-input, multiple-output (SIMO) system with additive asymmetric HWD and showed that IGS improved the performance of the system. In [11], the authors investigated the effect of IGS in a relay network with additive asymmetric HWD. They maximized the achievable rate of the relay network by optimizing the complementary variance of the transmitted signal in the source and relay nodes.

Improper signaling schemes have also been proposed to improve different performance metrics in interference-limited networks with ideal devices [12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 26]. In [12], IGS was considered as an interference management tool for the first time in the literature, where the authors considered a three-user interference channel (IC) and showed that IGS can improve the degrees-of-freedom (DoF) in this scenario. The paper [13] showed that IGS can increase the DoF of MIMO X channels. In [14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 25], the authors studied the performance of IGS when Treating Interference as Noise (TIN) was the strategy used for decoding. The paper [14] showed that IGS can improve the performance of the two-user interference channel, while in [15] IGS was used to optimize the rate of the $K$ -user MIMO interference channel. Moreover, the authors in [15] derived the rate region of the two-user single-input, single-output (SISO) IC with TIN by solving a semidefinite programming (SDP) problem, showing that IGS can enlarge the rate region and improve the performance of the system. The paper [16] showed that IGS can reduce the symbol error rate of the $K$ -user IC. In [17, 18, 19], benefits of IGS were studied in different Z-IC scenarios. In [20, 21], the authors showed that IGS improves the performance of underlay and overlay cognitive radio systems, respectively. Finally, [26] showed that IGS can improve the performance of full-duplex relaying systems with fading channels.

I-B Contribution

In this paper, we study the performance of IGS in a two-user IC with additive asymmetric HWD with TIN. To the best of our knowledge, it is the first work addressing the SISO IC with asymmetric HWD. We assume that the transceivers of both users produce additive asymmetric HWD noise, and model the HWD as an additive improper Gaussian noise, similar to [30, 4, 11, 31]. We devise two iterative algorithms to derive suboptimal solutions for the achievable rate region of the two-user IC. To this end, we rewrite the rate region as a pseudo-signal-to-interference-plus-noise-ratio (PSINR) region and employ sequential optimization approaches to solve the resulting problems.

In our first proposed algorithm, we employ majorization minimization (MM) as well as fractional programming (FP) and the well-known generalized Dinkelbach algorithm. MM is an iterative algorithm and consists of two steps in every iteration: i) majorization, and ii) minimization [33]. In the majorization step, the objective function is approximated by a surrogate function. Then, the approximated problem is solved in the minimization step. In other words, MM solves a non-convex optimization problem by solving a sequence of surrogate optimization problems, which are easier to solve than the original problem [33]. In our algorithm, to solve each surrogate problem, we apply the generalized Dinkelbach algorithm, which is a powerful tool to solve multiple-ratio maximin problems [34, 35]. In Dinkelbach-based algorithms, an iterative optimization is performed, in which the fractional functions are replaced by surrogate functions at each iteration. The generalized Dinkelbach algorithm solves fractional programs efficiently and yields the globally optimal solution of the original optimization problem if the optimization problem at each iteration is perfectly solved, i.e., its global optimum is obtained [34, 35, 36, 37].

In our second proposed algorithm, we employ a separate optimization of powers and complementary variances. We first optimize the powers transmitted by the users by employing the well-known bisection method, which transforms the original problem into a sequence of feasibility problems, and derive a closed-form solution for the feasibility problem. In order to obtain the complementary variances, we employ difference of convex programming (DCP), which is a special case of sequential convex programming (SCP) and falls into MM [33, 38]. In DCP, the objective function and/or constraints are a difference of two convex/concave functions. DCP solves a non-convex problem by solving a sequence of convex optimization problems and converges to a stationary point of the original problem [38]. (A stationary point of a constrained optimization problem satisfies the corresponding Karush-Kuhn-Tucker (KKT) conditions [38].)

The main contributions of this paper are as follows:

• We first propose an iterative algorithm based on a sequential optimization method, in which we solve a sequence of fractional optimization problems [33, 39]. We derive the global optimal solution of each surrogate problem by FP and the generalized Dinkelbach algorithm. Our first proposed algorithm obtains a stationary point of the PSINR region.

• We also propose a simplified algorithm that is computationally less expensive than our proposed algorithm with FP. This simplified algorithm is based on a separate optimization of powers and complementary variances of users. We employ a bisection method to obtain the powers and derive a closed-form solution for powers in each iteration. Then, we employ DCP to find the complementary variances.

• Our results show that IGS enlarges the achievable rate region of the two-user IC in the presence of additive asymmetric HWD, and that IGS yields a significant performance improvement for highly asymmetric HWD noise. Moreover, both of our proposed algorithms outperform existing PGS and other existing IGS algorithms.

I-C Paper outline

The rest of this paper is organized as follows. Section II describes the scenario and formulates the achievable-rate-region problem. In Section III, we propose our algorithm based on MM and FP, and in Section IV, we develop a simplified version of this algorithm. Finally, Section V presents some numerical results.

II System Model

II-A Preliminaries of IGS

Let us consider a zero-mean complex Gaussian random variable $x$ with variance $p_{x}=\mathbb{E}\{|x|^{2}\}$ and complementary variance $q_{x}=\mathbb{E}\{x^{2}\}$ [27, 28]. Note that the complementary variance is complex and $|q_{x}|\leq p_{x}$ . We denote the probability distribution of $x$ by $x\sim\mathcal{CN}(m_{x},p_{x},q_{x})$ , where $m_{x}=0$ is the mean of $x$ . We define the complex correlation coefficient of $x$ as $\tilde{\kappa}_{x}=\frac{q_{x}}{p_{x}}$ , where $|\tilde{\kappa}_{x}|\in[0,1]$ is the so-called circularity coefficient. If $\tilde{\kappa}_{x}=0$ , $x$ is proper; otherwise, it is improper [27, 28]. We call $x$ maximally improper if $|\tilde{\kappa}_{x}|=1$ .
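The definitions above can be illustrated numerically. The following is a minimal sketch, not part of the paper: the helper names `sample_improper_gaussian` and `circularity_coefficient`, the sample size, and the test values of $p_{x}$ and $q_{x}$ are assumptions chosen for illustration. It draws zero-mean complex Gaussian samples with a prescribed variance and complementary variance and estimates the circularity coefficient $|\tilde{\kappa}_{x}|$.

```python
import numpy as np

def sample_improper_gaussian(p, q, n, rng):
    """Draw n zero-mean complex Gaussian samples with variance p and
    complementary variance q (hypothetical helper for illustration)."""
    q = complex(q)
    if abs(q) > p:
        raise ValueError("a valid complementary variance needs |q| <= p")
    # For x = a + jb: Var(a) = (p + Re q)/2, Var(b) = (p - Re q)/2,
    # Cov(a, b) = Im(q)/2, so that E|x|^2 = p and E[x^2] = q.
    cov = 0.5 * np.array([[p + q.real, q.imag],
                          [q.imag, p - q.real]])
    parts = rng.multivariate_normal(np.zeros(2), cov, size=n)
    return parts[:, 0] + 1j * parts[:, 1]

def circularity_coefficient(x):
    """Sample estimate of |q_x / p_x|."""
    return abs(np.mean(x ** 2) / np.mean(np.abs(x) ** 2))

rng = np.random.default_rng(0)
x_proper = sample_improper_gaussian(1.0, 0.0, 200_000, rng)
x_improper = sample_improper_gaussian(1.0, 0.6 + 0.3j, 200_000, rng)
print(circularity_coefficient(x_proper), circularity_coefficient(x_improper))
```

The proper signal should give an estimate near 0, and the improper one an estimate near $|0.6+0.3j|$, up to sampling error.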

II-B Hardware distortion model

In this paper, we employ the distortion model in [30, 11, 4, 31] and model the aggregated effect of HWD on the transceiver of a communication link with an improper Gaussian additive noise as

$$y=\sqrt{P}\,h\,(x+\eta)+n, \tag{1}$$

where $y$ , $x$ , $P$ , $h$ , $\eta$ , and $n$ are the received signal, transmitted symbol, transmission power, channel coefficient, aggregated HWD noise and additive complex proper Gaussian noise, respectively. The aggregated HWD noise is modeled as an improper complex Gaussian random variable with probability distribution $\eta\sim\mathcal{CN}(0,\sigma_{\eta}^{2},\tilde{\sigma}_{\eta}^{2})$ , where $\sigma_{\eta}^{2}=\sigma_{\eta_{\text{TX}}}^{2}+\sigma_{\eta_{\text{RX}}}^{2}$ and $\tilde{\sigma}_{\eta}^{2}=\tilde{\sigma}_{\eta_{\text{TX}}}^{2}+\tilde{\sigma}_{\eta_{\text{RX}}}^{2}$ are the variance and complementary variance of $\eta$ , respectively, both of which are composed of contributions at the transmitter side (denoted TX) and the receiver side (denoted RX). Please refer to [30, Lemma 1] for more details about this model.

It is worth mentioning that this model is an extension of the model in [5, 6, 7, 8, 9, 10], where the HWD is modeled as additive proper Gaussian noise. However, as indicated in e.g., [30, 11, 40, 4, 31, 41, 42, 32], the aggregated HWD is, in general, improper due to I/Q imbalance. Note that the variances and complementary variances of HWD noise are not only a function of device parameters, but also a linear function of the transmission power and channel gain, meaning that higher transmission power results in higher HWD noise [4, 30]. Moreover, even if the channel noise is proper, the aggregated distortion is improper due to the asymmetric HWD.
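To make the last remark concrete, the sketch below simulates one link under the additive HWD model with a proper transmit symbol and proper channel noise, and checks that the received signal is nonetheless improper. It is an illustration under stated assumptions, not the authors' code: the amplitude scaling $\sqrt{P}$, the unit-power symbol, the helper `cn_samples`, and all numerical values are assumptions.

```python
import numpy as np

def cn_samples(var, cvar, n, rng):
    """Zero-mean complex Gaussian samples with variance `var` and
    complementary variance `cvar` (hypothetical helper)."""
    cvar = complex(cvar)
    cov = 0.5 * np.array([[var + cvar.real, cvar.imag],
                          [cvar.imag, var - cvar.real]])
    s = rng.multivariate_normal(np.zeros(2), cov, size=n)
    return s[:, 0] + 1j * s[:, 1]

rng = np.random.default_rng(2)
n = 400_000
P, h, sigma2 = 2.0, 0.8 * np.exp(1j * 0.7), 0.5   # illustrative values
var_eta, cvar_eta = 0.2, 0.15                     # asymmetric HWD: cvar_eta != 0

x = cn_samples(1.0, 0.0, n, rng)                  # proper, unit-power symbol (assumption)
eta = cn_samples(var_eta, cvar_eta, n, rng)       # aggregated improper HWD noise
noise = cn_samples(sigma2, 0.0, n, rng)           # proper channel noise
y = np.sqrt(P) * h * (x + eta) + noise            # assumed amplitude scaling sqrt(P)

emp_var, emp_cvar = np.mean(np.abs(y) ** 2), np.mean(y ** 2)
th_var = P * abs(h) ** 2 * (1 + var_eta) + sigma2  # grows linearly with P
th_cvar = P * h ** 2 * cvar_eta                    # nonzero, so y is improper
print(emp_var, th_var, emp_cvar, th_cvar)
```

Both the variance and the complementary variance of the distortion seen at the receiver scale with $P$, which matches the statement that higher transmission power results in higher HWD noise.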

II-C Network scenario and signal model

We consider a two-user IC with additive asymmetric HWD at the transmitters and receivers of both users, as depicted in Fig. 1. (It is worth mentioning that our algorithms can easily be extended to the $K$-user IC. However, we consider only the two-user IC for ease of illustration.) We assume that users are allowed to employ IGS and treat the interference as noise. Using the proposed HWD model, the received signal at receiver $k\in\{1,2\}$ is

$$y_{k}=\sqrt{p_{1}}\,h_{1k}(x_{1}+\eta_{1k})+\sqrt{p_{2}}\,h_{2k}(x_{2}+\eta_{2k})+n_{k}, \tag{2}$$

where $x_{k}$, $h_{jk}$, $n_{k}$, and $\eta_{jk}$ for $j,k\in\{1,2\}$ are, respectively, the transmit signal of user $k$, the channel between transmitter $j$ and receiver $k$, independent zero-mean proper complex Gaussian noise with variance $\sigma^{2}$, and the aggregated HWD noise of the link between transmitter $j$ and receiver $k$. Since the transmitted signals $x_{1}$ and $x_{2}$ are improper complex Gaussian, the achievable rate of user $k\in\{1,2\}$ is [11, 19, 15]

$$R_{k}=\frac{1}{2}\log_{2}\frac{\left(\sigma^{2}+\sum_{j=1}^{2}p_{j}|h_{jk}|^{2}\left(1+\sigma_{\eta_{jk}}^{2}\right)\right)^{2}-\left|\sum_{j=1}^{2}\left(q_{j}+p_{j}\tilde{\sigma}_{\eta_{jk}}^{2}\right)h_{jk}^{2}\right|^{2}}{\left(\sigma^{2}+\sum_{j=1}^{2}p_{j}|h_{jk}|^{2}\left(1+\sigma_{\eta_{jk}}^{2}\right)-p_{k}|h_{kk}|^{2}\right)^{2}-\left|\sum_{j=1}^{2}\left(q_{j}+p_{j}\tilde{\sigma}_{\eta_{jk}}^{2}\right)h_{jk}^{2}-q_{k}h_{kk}^{2}\right|^{2}}, \tag{3}$$

where $p_{k}$, $q_{k}$, $\sigma_{\eta_{jk}}^{2}$, and $\tilde{\sigma}_{\eta_{jk}}^{2}$ for $j,k\in\{1,2\}$ are, respectively, the transmission power of user $k$, the complementary variance of the transmitted signal of user $k$, and the aggregated variance and complementary variance of the HWD noise in the link between user $j$ and user $k$. The rate of user $k\in\{1,2\}$ can be written using vector notation as

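$$R_{k}=\frac{1}{2}\log_{2}\left(\frac{(\sigma^{2}+\mathbf{a}_{k}^{T}\mathbf{p})^{2}-|\mathbf{f}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}|^{2}}{(\sigma^{2}+\mathbf{b}_{k}^{T}\mathbf{p})^{2}-|\mathbf{g}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}|^{2}}\right), \tag{4}$$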

where the corresponding parameters are defined in (5)-(9). We also define $\Omega=\{p_{k},q_{k}:0\leq p_{k}\leq P_{k},|q_{k}|\leq p_{k},k=1,2\}$ as the feasible set of the design parameters, where $P_{k}$ is the power budget of user $k$. Note that since $q_{k}$ for $k=1,2$ is the complementary variance of user $k$, its absolute value cannot exceed the transmission power of user $k$, i.e., $|q_{k}|\leq p_{k}$.
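As a reading aid, the rate expression in (3) can be evaluated directly from its scalar form. The sketch below is an illustration and not the authors' implementation: the function name `rates`, the array layout (`h[j, k]` for the channel from transmitter $j$ to receiver $k$, zero-based indices), and the example values are assumptions.

```python
import numpy as np

def rates(p, q, h, sigma2, var_eta, cvar_eta):
    """Evaluate R_1 and R_2 from the scalar rate expression (3).

    p, q       : powers and complementary variances of the two users
    h[j, k]    : channel from transmitter j to receiver k
    var_eta,
    cvar_eta   : variance / complementary variance of the HWD noise on link j -> k
    """
    p = np.asarray(p, float)
    q = np.asarray(q, complex)
    h = np.asarray(h, complex)
    var_eta = np.asarray(var_eta, float)
    cvar_eta = np.asarray(cvar_eta, complex)
    R = np.zeros(2)
    for k in range(2):
        # Variance and complementary variance of the whole received signal.
        tot_var = sigma2 + np.sum(p * np.abs(h[:, k]) ** 2 * (1 + var_eta[:, k]))
        tot_cvar = np.sum((q + p * cvar_eta[:, k]) * h[:, k] ** 2)
        # Same quantities after removing the desired signal of user k.
        int_var = tot_var - p[k] * np.abs(h[k, k]) ** 2
        int_cvar = tot_cvar - q[k] * h[k, k] ** 2
        R[k] = 0.5 * np.log2((tot_var ** 2 - abs(tot_cvar) ** 2)
                             / (int_var ** 2 - abs(int_cvar) ** 2))
    return R

# Sanity case: no interference, no HWD, proper signaling.
R_ideal = rates([1.0, 1.0], [0.0, 0.0], np.eye(2), 1.0,
                np.zeros((2, 2)), np.zeros((2, 2)))
print(R_ideal)
```

In the sanity case the expression collapses to the familiar $\log_{2}(1+p_{k}|h_{kk}|^{2}/\sigma^{2})$, which is a quick way to check an implementation before adding interference and distortion.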

It is to be noted that, in practice, discrete rather than Gaussian signaling is employed (see, e.g., [43, 42, 16]), which will lead to performance degradation with respect to IGS. The significance of studying improper Gaussian signals is that it shows us whether improper signaling may in principle achieve performance improvements over proper signaling. In this paper, we focus on IGS and leave the analysis and design of improper discrete constellations for future work.

II-D Problem Statement

In this paper, we aim at obtaining the boundary of the achievable rate region for the described two-user IC. To this end, we employ the following definition of the Pareto boundary for the achievable rate region.

Definition 1 ([19, 44]).

The rate pair ( $R_{1},R_{2}$ ) is called Pareto-optimal if ( $R_{1}^{\prime},R_{2}$ ) and ( $R_{1},R_{2}^{\prime}$ ), with $R_{1}^{\prime}>R_{1}$ and $R_{2}^{\prime}>R_{2}$ , are not achievable.

The rate region is the union of all these achievable rate tuples, i.e., $\mathcal{R}=\underset{\{\mathbf{p},\mathbf{q}\}\in\Omega}{\bigcup}(R_{1},R_{2})$ , and its boundary can be derived by the rate profile technique as in the following optimization problem [15]

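$$\begin{aligned}\underset{R,\mathbf{p},\mathbf{q}}{\text{maximize}}\quad & R\\ \text{s.t.}\quad & R_{k}\geq\lambda_{k}R,\\ & 0\leq p_{k}\leq P_{k},\\ & |q_{k}|\leq p_{k},\end{aligned} \tag{10}$$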

where $\lambda_{1},\lambda_{2}\geq 0$ are fixed and $\lambda_{1}+\lambda_{2}=1$. We can obtain the boundary of the rate region by solving (10) for different rate-profile parameters, i.e., $\lambda_{1}$ and $\lambda_{2}$. Note that there are efficient algorithms to derive the global optimal solution of convex optimization problems [45, 39, 46]. However, we are unable to apply these algorithms to (10) because the rates are not concave functions of the optimization variables, which makes (10) non-convex [45, 39, 46]. Hence, in this paper we propose numerical algorithms to derive suboptimal solutions to (10).

The paper [30] proposed an algorithm based on DCP to maximize the achievable rate of a multihop relay system with additive asymmetric HWD, in which all nodes transmit with maximum power, by optimizing over the complementary variances. Such algorithms cannot be applied for a joint optimization of powers and complementary variances since, in this case, the rates are not a difference of two jointly concave/convex functions in $\mathbf{p}$ and $\mathbf{q}$ . Hence, we solve (10) by MM and FP. In MM, the objective function and constraints of an optimization problem are not required to follow a very specific structure such as being a difference of two convex/concave functions, which makes it more powerful than DCP.

To solve (10), we rewrite it such that it is more suitable to be solved with MM and FP. To this end, we employ the PSINR profile technique in [47, 48] to write an optimization problem that results in the solution of (10). We define the PSINR profile as

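$$\begin{aligned}\underset{E,\mathbf{p},\mathbf{q}}{\text{maximize}}\quad & E\\ \text{s.t.}\quad & E_{k}(\mathbf{p},\mathbf{q})\geq 1+\alpha_{k}E,\\ & 0\leq p_{k}\leq P_{k},\\ & |q_{k}|\leq p_{k},\end{aligned} \tag{11}$$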

where $\alpha_{1}\geq 0$ and $\alpha_{2}\geq 0$ are constants, $\alpha_{1}+\alpha_{2}=1$ , and

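$$E_{k}(\mathbf{p},\mathbf{q})\triangleq\frac{(\sigma^{2}+\mathbf{a}_{k}^{T}\mathbf{p})^{2}-|\mathbf{f}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}|^{2}}{(\sigma^{2}+\mathbf{b}_{k}^{T}\mathbf{p})^{2}-|\mathbf{g}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}|^{2}}=\frac{u_{k}(\mathbf{p},\mathbf{q})}{v_{k}(\mathbf{p},\mathbf{q})}. \tag{12}$$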

We can derive the boundary of the PSINR region by varying $\alpha_{1}\in[0,1]$ . Note that $E_{k}(\mathbf{p},\mathbf{q})\geq 1$ for $k=1,2$ since the rates are non-negative. Moreover, the numerator and denominator of $E_{k}(\mathbf{p},\mathbf{q})$ are strictly positive because the rates are bounded and non-negative. In the following lemma, we show that this technique results in the boundary of the rate region in (10).

Lemma 1 ([47, 48]).

Every point in the boundary of the rate region corresponds to a point in the boundary of the PSINR region, and vice versa.

Proof.

Assume there exists a pair $(R_{1},R_{2})$ on the boundary of the achievable rate region that is not on the boundary of the PSINR region. In other words, the pair $(E_{1}=2^{2R_{1}},E_{2}=2^{2R_{2}})$, which is a feasible PSINR pair, is not on the boundary of the PSINR region, and hence there exist $E_{1}^{\prime}$ and/or $E_{2}^{\prime}$ such that the pairs $(E_{1}^{\prime}>E_{1},E_{2})$ and/or $(E_{1},E_{2}^{\prime}>E_{2})$ are feasible. Since the logarithm is monotonically increasing, the rate pairs $(0.5\log_{2}(E_{1}^{\prime})>R_{1},R_{2})$ and/or $(R_{1},0.5\log_{2}(E_{2}^{\prime})>R_{2})$ are achievable, which implies that $(R_{1},R_{2})$ is not on the boundary of the rate region. Similarly, it can be shown that every point on the boundary of the PSINR region corresponds to a point on the boundary of the rate region. ∎

Note that we can rewrite (11) as the following maximin optimization problem by removing the variable $E$

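$$\underset{0\leq p_{k}\leq P_{k},\,|q_{k}|\leq p_{k}}{\text{maximize}}\;\;\underset{k=1,2}{\min}\left\{\frac{E_{k}(\mathbf{p},\mathbf{q})-1}{\alpha_{k}}\right\}. \tag{13}$$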
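The step from (11) to (13) can be spelled out in one line. The derivation below is added as a reading aid rather than taken from the paper, and it assumes $\alpha_{k}>0$ for both users so that dividing by $\alpha_{k}$ is valid:

```latex
\begin{aligned}
E_{k}(\mathbf{p},\mathbf{q}) \geq 1+\alpha_{k}E,\ k=1,2
\;&\Longleftrightarrow\;
E \leq \frac{E_{k}(\mathbf{p},\mathbf{q})-1}{\alpha_{k}},\ k=1,2 \\
\;&\Longleftrightarrow\;
E \leq \min_{k=1,2}\frac{E_{k}(\mathbf{p},\mathbf{q})-1}{\alpha_{k}} .
\end{aligned}
```

For a fixed feasible pair $(\mathbf{p},\mathbf{q})$, the largest admissible $E$ is therefore the minimum on the right-hand side, and maximizing that minimum over the feasible set gives the maximin form.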

III Boundary of the rate region by Fractional Programming

In this section, we solve the PSINR profile problem in (11) by MM, which results in solving a sequence of fractional optimization problems. We solve each fractional optimization problem by FP and the generalized Dinkelbach algorithm [36, 35, 37]. Our proposed algorithm converges to a stationary point of (11). We first provide preliminaries on the generalized Dinkelbach algorithm in Section III-A and then propose our algorithm in Section III-B.

III-A Preliminaries of generalized Dinkelbach’s algorithm

Dinkelbach’s algorithm is a powerful tool that solves FP problems, which was proposed to handle single-ratio functions. The generalized Dinkelbach algorithm is a modified Dinkelbach algorithm to solve maximin multiple-ratio problems [34]. The generalized Dinkelbach algorithm is an iterative approach, in which the fractional functions are approximated by surrogate functions at each iteration. In the following lemma, we present some conditions that are used in the generalized Dinkelbach algorithm.

Lemma 2 ([34, 35]).

Consider the fractional functions $\frac{u_{i}(\mathbf{x})}{v_{i}(\mathbf{x})}$ , where $u_{i}(\mathbf{x})$ and $v_{i}(\mathbf{x})$ are continuous in $\mathbf{x}$ , $v_{i}(\mathbf{x})$ is strictly positive in $\mathbf{x}$ , and $\mathbf{x}$ is a vector with dimension $n$ that belongs to a compact set $\mathcal{X}$ . Let us define

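$$V(\mu)=\underset{\mathbf{x}\in\mathcal{X}}{\max}\;\underset{i}{\min}\left(u_{i}(\mathbf{x})-\mu v_{i}(\mathbf{x})\right), \tag{14}$$

$$\bar{\mu}=\underset{\mathbf{x}\in\mathcal{X}}{\max}\;\underset{i}{\min}\left(\frac{u_{i}(\mathbf{x})}{v_{i}(\mathbf{x})}\right), \tag{15}$$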

where $V(\mu)$ , $\bar{\mu}$ , and $\mu$ are real and scalar, and have the following properties.

1. $V(\mu)$ is continuous and strictly decreasing in $\mu$.

2. The optimization problems (14) and (15) always have optimal solutions.

3. $\bar{\mu}$ is finite and $V(\bar{\mu})=0$.

4. $V(\mu)$ has a unique root, and $V(\mu)=0$ implies $\mu=\bar{\mu}$.

The generalized Dinkelbach algorithm employs the surrogate function $V(\mu)$ in (14) and tries to iteratively find the unique root of $V(\mu)$ , i.e., $\bar{\mu}$ . The algorithm starts with an initial point, e.g., $\mu^{(0)}=0$ , then it updates $\mu$ to obtain $\bar{\mu}$ . Assuming $u_{i}(\mathbf{x})\geq 0$ , which is the case we consider in this paper, $V(0)=\underset{\mathbf{x}}{\max}\,\,\,\underset{i}{\min}\left(u_{i}(\mathbf{x})\right)>0$ . Since $V(\mu)$ is continuous and strictly decreasing in $\mu$ , $\mu$ is chosen monotonically increasing at each iteration ( $\mu^{(l)}>\mu^{(l-1)}$ ) until $V(\mu)$ approaches 0. At the $l$ th iteration, $\mu^{(l)}$ is

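$$\mu^{(l)}=\min\left(\frac{u_{1}(\mathbf{x}^{(l-1)})}{v_{1}(\mathbf{x}^{(l-1)})},\frac{u_{2}(\mathbf{x}^{(l-1)})}{v_{2}(\mathbf{x}^{(l-1)})}\right)>0, \tag{16}$$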

where $\mathbf{x}^{(l-1)}$ is

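$$\mathbf{x}^{(l-1)}=\underset{\mathbf{x}}{\arg\max}\;\underset{i}{\min}\left(u_{i}(\mathbf{x})-\mu^{(l-1)}v_{i}(\mathbf{x})\right). \tag{17}$$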

The generalized Dinkelbach algorithm updates $\mu^{(l)}$ and $\mathbf{x}^{(l-1)}$ based on (16) and (17), respectively, until a convergence metric is met, e.g., $V(\mu^{(l)})<\epsilon$ , where $\epsilon>0$ . This algorithm converges linearly to the optimal solution [34].

Note that in order to apply the generalized Dinkelbach algorithm, it is not required that $u_{i}(\mathbf{x})$ and $v_{i}(\mathbf{x})$ fulfill any other condition (except those in the lemma), which makes this algorithm a powerful tool to solve different types of fractional problems. If $u_{i}(\mathbf{x})$ and $v_{i}(\mathbf{x})$ are concave and convex functions, respectively, the optimization problem at each iteration is convex and can easily be solved. However, in the general case, it might be difficult to efficiently solve the optimization problem at each iteration.
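The iteration described in this subsection can be sketched on a toy problem. The code below is a minimal illustration, not the paper's Algorithm II: the function name `generalized_dinkelbach`, the two toy ratios, and the grid are assumptions, and the inner maximization in (17) is solved by exhaustive search over a finite candidate set so that each iteration is solved exactly.

```python
import numpy as np

def generalized_dinkelbach(u_funcs, v_funcs, candidates, eps=1e-9, max_iter=100):
    """Maximize min_i u_i(x)/v_i(x) over a finite candidate set.

    The inner problem max_x min_i (u_i(x) - mu * v_i(x)) is solved by
    exhaustive search (an assumption made only to keep the sketch exact).
    """
    U = np.array([[u(x) for x in candidates] for u in u_funcs])  # shape (I, N)
    V = np.array([[v(x) for x in candidates] for v in v_funcs])  # shape (I, N)
    mu, best = 0.0, 0
    for _ in range(max_iter):
        aux = np.min(U - mu * V, axis=0)   # min_i (u_i - mu * v_i) per candidate
        best = int(np.argmax(aux))         # maximizer of the auxiliary problem, cf. (17)
        if aux[best] < eps:                # V(mu) has reached its root
            break
        mu = float(np.min(U[:, best] / V[:, best]))  # ratio update, cf. (16)
    return candidates[best], mu

# Toy ratios with u_i >= 0 and v_i > 0 on [0, 3] (illustrative choices).
u1, v1 = (lambda x: 1 + 2 * x), (lambda x: 1 + x ** 2)
u2, v2 = (lambda x: 4 - x), (lambda x: 2 + x)
grid = np.linspace(0.0, 3.0, 301)

x_star, mu_star = generalized_dinkelbach([u1, u2], [v1, v2], grid)
brute_force = max(min(u1(x) / v1(x), u2(x) / v2(x)) for x in grid)
print(x_star, mu_star, brute_force)
```

On the grid, the value returned by the iteration should coincide with the brute-force maximin ratio. In the paper's setting the exhaustive search would be replaced by a convex solver, which is possible when the numerators are concave and the denominators are convex.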

III-B Proposed algorithm

We can apply the generalized Dinkelbach algorithm to derive the boundary of the PSINR region since the optimization problem can be written as a maximin weighted problem as indicated in (13). However, since $u_{k}(\mathbf{p},\mathbf{q})$ and $v_{k}(\mathbf{p},\mathbf{q})$ are not, respectively, concave and convex in optimization variables, the corresponding optimization problem in each iteration of the generalized Dinkelbach algorithm is not convex. Indeed, $u_{k}(\mathbf{p},\mathbf{q})$ and $v_{k}(\mathbf{p},\mathbf{q})$ are a difference of two convex/concave functions:

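$$u_{k}(\mathbf{p},\mathbf{q})=(\sigma^{2}+\mathbf{a}_{k}^{T}\mathbf{p})^{2}-|\mathbf{f}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}|^{2}, \tag{18}$$

$$v_{k}(\mathbf{p},\mathbf{q})=(\sigma^{2}+\mathbf{b}_{k}^{T}\mathbf{p})^{2}-|\mathbf{g}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}|^{2}. \tag{19}$$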

Hence, to solve (13), we employ a sequential optimization approach by approximating $E_{k}(\mathbf{p},\mathbf{q})$ with a lower bound $\tilde{E}_{k}(\mathbf{p},\mathbf{q},\mu)$ in each iteration [33, 39]. Then, we obtain the global optimal solution of each surrogate optimization problem by the generalized Dinkelbach algorithm. To this end, in each iteration, we first approximate $u_{k}(\mathbf{p},\mathbf{q})$ by a lower bound concave function $\tilde{u}_{k}(\mathbf{p},\mathbf{q})$ and $v_{k}(\mathbf{p},\mathbf{q})$ by an upper bound convex function $\tilde{v}_{k}(\mathbf{p},\mathbf{q})$ as in the following lemma.

Lemma 3.

A concave lower bound for $u_{k}(\mathbf{p},\mathbf{q})$ in the $m$ th iteration is

[TABLE]

Moreover, a convex upper bound for $v_{k}(\mathbf{p},\mathbf{q})$ in the $m$ th iteration is

[TABLE]

where $\mathbf{p}^{(m)}$ and $\mathbf{q}^{(m)}$ are the power and complementary variances at the $m$ th iteration, which are the solution of the previous iteration. Furthermore, $\mathfrak{R}\left[x\right]$ takes the real part of $x$ .

Proof.

Please refer to Appendix A. ∎

Now, we are able to write the surrogate optimization problem in $m$ th iteration as

[TABLE]

where $\tilde{E}_{k}^{(m)}(\mathbf{p},\mathbf{q})=\frac{\tilde{u}_{k}^{(m)}(\mathbf{p},\mathbf{q})}{\tilde{v}_{k}^{(m)}(\mathbf{p},\mathbf{q})}$ and $E_{k}(\mathbf{p},\mathbf{q})$ fulfill the following conditions:

1. $\tilde{E}_{k}^{(m)}(\mathbf{p},\mathbf{q})\leq E_{k}(\mathbf{p},\mathbf{q})$ for all feasible $\mathbf{p},\mathbf{q}$ and $k=1,2$ .

2. $\tilde{E}_{k}^{(m)}(\mathbf{p}^{(m)},\mathbf{q}^{(m)})=E_{k}(\mathbf{p}^{(m)},\mathbf{q}^{(m)})$ for $k=1,2$ .

3. $\frac{\partial\tilde{E}_{k}^{(m)}(\mathbf{p}^{(m)},\mathbf{q}^{(m)})}{\partial\mathbf{p}}=\frac{\partial E_{k}(\mathbf{p}^{(m)},\mathbf{q}^{(m)})}{\partial\mathbf{p}}$ and $\frac{\partial\tilde{E}^{(m)}_{k}(\mathbf{p}^{(m)},\mathbf{q}^{(m)})}{\partial\mathbf{q}}=\frac{\partial E_{k}(\mathbf{p}^{(m)},\mathbf{q}^{(m)})}{\partial\mathbf{q}}$ for $k=1,2$ .

These properties guarantee that the algorithm converges to a stationary point of (11) [39, Section II.B]. To solve (22) and obtain $\mathbf{p}^{(m+1)}$ and $\mathbf{q}^{(m+1)}$ , we employ the generalized Dinkelbach algorithm, which gives the global optimal solution of (22), as in the following. We summarize this procedure in Algorithm I.

Now we solve (22) and obtain its global optimal solution by FP, which is also an iterative algorithm as explained in Section III-A. To this end, we introduce the following functions, which are the corresponding surrogate functions of $\frac{\tilde{E}^{(m)}_{k}-1}{\alpha_{k}}$ for $k=1,2$ :

[TABLE]

where $\mu^{(l)}\in\mathbb{R}$ is fixed and given by

[TABLE]

It is worth mentioning that the generalized Dinkelbach algorithm requires an initial point $\mu^{(0)}$ , which can be obtained by substituting $\mathbf{p}^{(m)}$ and $\mathbf{q}^{(m)}$ in (24). By substituting (23) in (22), the optimization problem at each iteration of the generalized Dinkelbach algorithm is

[TABLE]

We solve (25) for the given $\mu^{(l)}$ , which results in $\mathbf{p}^{(l)}$ and $\mathbf{q}^{(l)}$ . Then, we update $\mu^{(l)}$ by (24) and repeat the procedure until a convergence metric is met. As indicated in Section III-A, the convergence rate of the generalized Dinkelbach algorithm is linear. The optimization problem (25) is convex, and its global optimal solution can be efficiently obtained [45]. We summarize this procedure in Algorithm II.

To sum up, the proposed algorithm works as follows. We solve the PSINR profile in (11) by solving a sequence of fractional optimization problems. Indeed, we employ a sequential optimization approach and approximate the PSINR term of each user by a lower bound. In order to derive the global optimal solution of each fractional optimization problem, we perform another iterative algorithm, i.e., the generalized Dinkelbach algorithm. It is worth mentioning that this algorithm does not converge to the Pareto-optimal solution; however, it obtains a stationary point of (11).

IV Simplified algorithm

In this section, we propose a simplified version of the algorithm from Section III, which exhibits a lower computational complexity. In the simplified algorithm, we first optimize the transmission power $\mathbf{p}$ for PGS, i.e., for $\mathbf{q}=\mathbf{0}$ . This problem is addressed in Section IV-A. Then, in Section IV-B, we optimize the complementary variances for the resulting transmit power $\mathbf{p}$ such that the rates of all users are simultaneously increased.

IV-A Power optimization

In this subsection, we optimize the transmission power vector $\mathbf{p}$ for PGS, i.e., when $\mathbf{q}=\mathbf{0}$ . In this case, deriving the boundary of the PSINR region can be cast as the optimization problem

[TABLE]

for $\alpha_{1},\alpha_{2}\geq 0$ and $\alpha_{1}+\alpha_{2}=1$ . Unfortunately, the optimization problem in (26) is not convex due to (26b). In the following lemma, we derive a lower bound for (26b), which allows us to simplify (26) and derive a low-complexity algorithm.

Lemma 4.

A lower bound for the left-hand side of (26b) is

[TABLE]

where the equality in (27) holds if and only if the HWD noise is proper, i.e., $\tilde{\mathbf{f}}_{i}=\mathbf{0}$ .

Proof.

It is easy to verify that $0\leq|\tilde{\mathbf{f}}_{i}^{H}\mathbf{p}|^{2}<(\sigma^{2}+\mathbf{b}_{i}^{T}\mathbf{p})^{2}<(\sigma^{2}+\mathbf{a}_{i}^{T}\mathbf{p})^{2}$ . Let us define

[TABLE]

where $0\leq t<\beta_{2}<\beta_{1}$ . The lower bound in (27) is then satisfied if $f(t)$ is increasing in $t$ . This function is strictly increasing in $t\in[0,\beta_{2})$ since

[TABLE]

Thus, we have

[TABLE]

with equality if and only if $t=0$ . ∎
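
The monotonicity argument can be checked numerically with a short sketch. It assumes that the function in the proof has the form $f(t)=(\beta_{1}-t)/(\beta_{2}-t)$ with $\beta_{1}=(\sigma^{2}+\mathbf{a}_{i}^{T}\mathbf{p})^{2}$, $\beta_{2}=(\sigma^{2}+\mathbf{b}_{i}^{T}\mathbf{p})^{2}$, and $t=|\tilde{\mathbf{f}}_{i}^{H}\mathbf{p}|^{2}$, which is inferred from the ordering $0\leq t<\beta_{2}<\beta_{1}$; the helper names and the numerical values are hypothetical.

```python
def ratio(t, beta1, beta2):
    """Assumed form of the function in the proof: (beta1 - t) / (beta2 - t)."""
    return (beta1 - t) / (beta2 - t)


def ratio_derivative(t, beta1, beta2):
    """Closed-form derivative (beta1 - beta2) / (beta2 - t)**2."""
    return (beta1 - beta2) / (beta2 - t) ** 2


beta1, beta2 = 9.0, 4.0            # illustrative values with beta2 < beta1
ts = [0.0, 0.5, 1.0, 2.0, 3.5]     # samples of t in [0, beta2)
values = [ratio(t, beta1, beta2) for t in ts]
slopes = [ratio_derivative(t, beta1, beta2) for t in ts]
```

Under these assumptions the derivative is positive whenever $\beta_{1}>\beta_{2}$, so the smallest value of the ratio is attained at $t=0$, where it equals $\beta_{1}/\beta_{2}$.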

For each point characterized by $\alpha_{1}$ and $\alpha_{2}$ , we solve (26) for the lower bound in (27) as the optimization problem

[TABLE]

It is worth mentioning that the lower bound in Lemma 4 is employed to simplify (26) and obtain the powers, and the actual rates are derived by substituting the obtained powers in (3). Note that the region achieved by solving (26) includes the region achieved by solving (31). If the additive HWD noise is proper, (31) is equivalent to (26). (Footnote 3: This is in line with [49], where it was shown that proper Gaussian noise is the worst case in a $K$ -user MIMO IC with ideal devices.) The global optimum solution of (31) can be derived by employing a bisection method and solving a sequence of feasibility problems [39]. That is, we fix $E$ as $E^{\prime}$ and consider the feasibility problem (32), shown at the top of the next page.

If (32) is feasible for a given $E^{\prime}$ , the optimal solution of (31) is greater than or equal to $E^{\prime}$ , i.e., $E^{\star}\geq E^{\prime}$ . Otherwise, $E^{\star}<E^{\prime}$ . In order to find $E^{\star}$ , we employ the well-known bisection method over $E^{\prime}$ solving (32) at each iteration, which yields, upon convergence, the global optimal solution of (31) [45]. Constraints (32b) and (32c) are linear in $\mathbf{p}$ , which permits deriving a closed-form expression for a feasible point, as presented in the following theorem. It is worth mentioning that this algorithm does not attain the global optimal solution of (26). There might be optimization approaches to obtain its global optimal solution such as the monotonic optimization framework [50, 51, 52], although the computational complexity of these approaches is high.

Theorem 1.

The optimization problem in (32) is feasible for a given $E^{\prime}$ if and only if $0\leq p_{k}^{\prime}\leq P_{k}$ , for $k=1,2$ , where

[TABLE]

Moreover, $\mathbf{A}$ is given by (38), shown at the top of the next page.

Proof.

Please refer to Appendix B. ∎
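
The bisection over $E^{\prime}$ with the closed-form feasibility test can be sketched as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: the rows of $\mathbf{A}$ are taken as $(\mathbf{a}_{k}-\sqrt{1+\alpha_{k}E^{\prime}}\,\mathbf{b}_{k})^{T}$, which is inferred from the form of $\mathbf{y}$ in Appendix B because the expression for $\mathbf{A}$ is not reproduced here. The function names, the upper limit `E_hi` (assumed infeasible), and the numerical values of $\mathbf{a}_{k}$ and $\mathbf{b}_{k}$ are hypothetical.

```python
import math


def min_power_pair(E, a, b, alpha, sigma2):
    """Return p' = A^{-1} y for the two linear constraints, or None if A is singular."""
    s = [math.sqrt(1.0 + alpha[k] * E) for k in range(2)]
    # Assumed structure: row k of A is a_k - sqrt(1 + alpha_k E) * b_k
    A = [[a[k][j] - s[k] * b[k][j] for j in range(2)] for k in range(2)]
    y = [(s[k] - 1.0) * sigma2 for k in range(2)]
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    if det == 0.0:
        return None
    p1 = (A[1][1] * y[0] - A[0][1] * y[1]) / det
    p2 = (A[0][0] * y[1] - A[1][0] * y[0]) / det
    return p1, p2


def is_feasible(E, a, b, alpha, sigma2, P):
    """Feasibility test in the spirit of Theorem 1: 0 <= p'_k <= P_k for k = 1, 2."""
    pair = min_power_pair(E, a, b, alpha, sigma2)
    return pair is not None and all(0.0 <= p <= Pk for p, Pk in zip(pair, P))


def bisect_psinr(a, b, alpha, sigma2, P, E_hi, tol=1e-9):
    """Largest feasible E' in [0, E_hi], assuming E_hi itself is infeasible."""
    lo, hi = 0.0, E_hi
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if is_feasible(mid, a, b, alpha, sigma2, P):
            lo = mid
        else:
            hi = mid
    return lo


# Symmetric toy example (illustrative numbers only)
a = [[1.0, 0.25], [0.25, 1.0]]   # hypothetical a_1, a_2
b = [[0.0, 0.25], [0.25, 0.0]]   # hypothetical b_1, b_2
E_star = bisect_psinr(a, b, alpha=[0.5, 0.5], sigma2=1.0, P=[10.0, 10.0], E_hi=1e3)
```

For this symmetric example both users transmit at full power at the optimum, so the result can be verified by hand: $\sqrt{1+0.5E^{\star}}=13.5/3.5$.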

We note that this algorithm leads to the optimal PGS only when HWD noise is proper. Note that PGS is suboptimal, in point-to-point communications, in the presence of asymmetric HWD [4, 31]. Thus, the users may improve the performance by employing IGS in additive asymmetric HWD. It is worth noting that, in this paper, we aim at proposing PGS and IGS schemes for the two-user IC with additive asymmetric HWD, but we do not derive sufficient and necessary conditions for the optimality of IGS or PGS in the two-user IC with additive asymmetric HWD, which remains an open problem.

IV-B Complementary variance design

In this subsection, we optimize the complementary variances $\mathbf{q}$ for a given $\mathbf{p}^{\star}$ , which has been obtained by solving (31). We obtain $\mathbf{q}$ such that the rates of both users exceed the rates achieved by PGS, which are the rates achievable with $\mathbf{q}=\mathbf{0}$ and the power vector $\mathbf{p}^{\star}$ obtained by solving (31). In other words, we want to solve the optimization problem (35), shown at the top of this page, where $p_{k}^{\star}$ is the $k$ th element of $\mathbf{p}^{\star}$ . Moreover, $E_{p,k}$ is fixed and given by

[TABLE]

Unfortunately, (35) is not convex due to (35b). Hence, in order to efficiently solve (35), we first rewrite (35b) as

[TABLE]

where $t_{k}=t\left[(\sigma^{2}+\mathbf{b}_{k}^{T}\mathbf{p}^{\star})^{2}-|\mathbf{g}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}^{H}_{k}\mathbf{p}^{\star}|^{2}\right]$ . We then relax the relation between $t_{1}$ , $t_{2}$ , and $\mathbf{q}$ and treat $t_{1}$ and $t_{2}$ as new optimization variables. In other words, we approximate (35) as (38), shown at the top of this page. If $\min(t_{1},t_{2})>0$ , the rates of both users are simultaneously increased by employing IGS. Otherwise, we set $\mathbf{q}=\mathbf{0}$ and employ PGS. Note that the constraint (38b) can be rewritten as

[TABLE]

which is a difference of two convex functions. Thus, (38) is not a convex optimization problem, but it can be efficiently solved by difference of convex programming and a convex-concave procedure similar to (25) [53, 54, 55, 38, 33]. Hence, we employ difference of convex programming (DCP) and solve (38) iteratively. At each iteration, we approximate the left-hand side of (39) by a concave function. To this end, we employ the first-order Taylor expansion and approximate the convex part of (39) around the point $\mathbf{q}^{(l)}$ by an affine function as

[TABLE]

where $\mathbf{q}^{(l)}$ contains the complementary variances of the users in the $l$ th iteration. It is worth mentioning that $|\mathbf{g}_{i}^{H}\mathbf{q}+\tilde{\mathbf{f}}^{H}_{i}\mathbf{p}^{\star}|^{2}$ is always greater than or equal to the right-hand side of (40), and consequently, no trust region is required in DCP [53, 54, 55]. Then, in the $l$ th iteration, (39) can be approximated by

[TABLE]

Finally, the convex optimization problem in the $l$ th iteration is

[TABLE]

This problem can be easily solved by standard numerical tools [45]. Moreover, the proposed DCP algorithm converges to a stationary point of (38) [53, 54, 55, 33, 38]. It is worth mentioning that a stationary point of (38) is not necessarily a stationary point of (35).
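
The behavior of such a convex-concave iteration can be seen on a scalar toy problem, which is not problem (38) but uses the same mechanism. The sketch minimizes $f(x)=x^{4}-2x^{2}$ for $x>0$, a difference of two convex functions, by linearizing the subtracted term $2x^{2}$ around the current point; the paper applies the mirrored version (maximization with a concave lower bound). The function name `ccp_toy` and the starting point are hypothetical.

```python
def ccp_toy(x0, iters=40):
    """Convex-concave procedure for f(x) = x**4 - 2*x**2 on x > 0 (toy problem)."""
    def f(x):
        return x ** 4 - 2.0 * x ** 2

    x, history = x0, [f(x0)]
    for _ in range(iters):
        # Linearize the subtracted convex term 2*x**2 around the current x_l.
        # The convex surrogate x**4 - 4*x_l*x + const is minimized where
        # 4*x**3 = 4*x_l, i.e., at the cube root of x_l.
        x = x ** (1.0 / 3.0)
        history.append(f(x))
    return x, history


x_final, objective_history = ccp_toy(0.5)
```

The objective values are non-increasing along the iterations and the iterate approaches $x=1$, a stationary point of $f$. No step-size or trust-region parameter appears because each surrogate is a global upper bound that is tight at the current point.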

The proposed simplified algorithm can be summarized as follows. The joint optimization problem for $\mathbf{p}$ and $\mathbf{q}$ is decoupled into two separate optimization problems. We derive the transmission powers by employing the well-known bisection method, which results, in each iteration, in a feasibility problem that has a closed-form solution. Then, we employ the DCP algorithm to derive the complementary variances for the given transmission powers.

V Numerical Results

In this section, we present some numerical results to illustrate our findings. For all examples, we set $\sigma^{2}=1$ , $P_{1}=P_{2}=P$ , $\epsilon=10^{-4}$ , and $L=M=20$ , where $\epsilon$ , $L$ , and $M$ are, respectively, the threshold for convergence, and the maximum number of iterations for Algorithms I and II. Moreover, the maximum number of iterations for the algorithm in Section IV-B is 40. We also define the signal-to-noise ratio (SNR) as the ratio of the power budget to $\sigma^{2}$ , i.e., SNR $=\frac{P}{\sigma^{2}}$ . We compare our proposed algorithms with PGS and the joint variance and complementary variance optimization algorithm in [15] for IGS, which is designed for ideal devices. To the best of our knowledge, there exists no PGS algorithm for additive asymmetric HWD in the literature. Because of that, we optimize the PGS scheme by using the first step of our simplified algorithm (see Section IV-A). In the figures, we use the following labels:

• S-IGS: our proposed simplified design in Section IV,

• FP-IGS: our proposed design with FP in Section III,

• PGS: the proposed PGS design in Section IV-A,

• I-IGS: the joint variance and complementary variance IGS design in [15] for ideal devices,

• S-TS: our proposed design in Section IV with time sharing,

• F-TS: our proposed design in Section III with time sharing,

• P-TS: the proposed PGS design in Section IV-A with time sharing.

V-A Ideal devices

In this subsection, we compare the performance of our proposed algorithms with the joint variance and covariance IGS algorithm in [15] when there is no HWD.

In Fig. 2, we show the average symmetric rate, i.e., the minimum rate allocated to the users, which is the fairness point of the rate region boundary and is obtained with $\alpha_{1}=\alpha_{2}=0.5$ . We average the results over 100 channel realizations, where each channel realization is taken from a complex proper Gaussian distribution with variance 1, i.e., $\mathcal{CN}(0,1,0)$ . As can be observed, our proposed algorithm based on FP outperforms the algorithm in [15], especially at high SNR. Our simplified algorithm performs similarly to the algorithm in [15] at low SNR. However, the algorithm in [15] performs better than the simplified algorithm in the moderate SNR regime. The reason is that the benefit of employing IGS increases with SNR. Thus, the performance differences of the IGS algorithms are clearer at higher SNR.

In Fig. 3, we also provide rate region examples for ideal devices and the channel realization

[TABLE]

where $[\mathbf{H}_{1}]_{ij}=h_{ij}$ for $i,j\in\{1,2\}$ . As can be observed, IGS can enlarge the achievable rate region for this channel realization and $P=10$ . Since the benefits of IGS are minor for low SNR, IGS does not provide any gain for $P=1$ . This is also in line with the averaged results in Fig. 2, where IGS has minor benefits at low SNR, while it improves the performance of the system significantly at moderate SNR. For this channel realization, our proposed algorithms and the algorithm in [15] perform very closely to each other. In Fig. 3b, we also consider the effect of time sharing on the achievable rate region. (Footnote 4: We derive the achievable rate region with TS by taking the convex hull operation over the corresponding achievable rate regions [15]. It is worth mentioning that time sharing results in the convex hull operation when a power constraint is considered for each operational point. The achievable rate region with time sharing might be enlarged if an average power constraint over different operational points is considered [56]. However, this analysis is outside the scope of this paper.) As can be observed, IGS with time sharing outperforms PGS with time sharing for this example. Since the IGS designs perform similarly, for this example, we provide only the time sharing for our proposed IGS design in Section III.
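
The convex hull operation used for the time-sharing curves can be sketched with a standard monotone-chain hull over a set of achievable rate pairs. This is an illustrative sketch only: the function name `time_sharing_region` and the rate pairs are hypothetical, and the origin is added so that the hull describes a region anchored at zero rate.

```python
def time_sharing_region(rate_pairs):
    """Vertices of the convex hull of the given (R1, R2) pairs plus the origin."""
    pts = sorted(set(rate_pairs) | {(0.0, 0.0)})
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        # z-component of (a - o) x (b - o); positive for a counter-clockwise turn
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


# Illustrative rate pairs: two single-user points and two interior points
points = [(1.0, 0.0), (0.0, 1.0), (0.4, 0.4), (0.8, 0.8)]
hull = time_sharing_region(points)
```

In this example the pair $(0.4,0.4)$ lies strictly inside the hull and is dropped, while $(0.8,0.8)$ remains a vertex of the time-sharing region.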

The joint variance and covariance IGS algorithm in [15] is an iterative algorithm, based on a bisection method over the minimum weighted rates of users, and is proposed for ideal devices. The algorithm employs semidefinite relaxation (SDR) programming in order to solve the corresponding feasibility problem at each iteration of the bisection method. Since the solution of the SDR in [15] is not ensured to be rank-one, it does not necessarily obtain a valid solution, and a Gaussian randomization procedure is employed to obtain a rank-one solution. The solution obtained by the randomization procedure is not ensured to fulfill any optimality condition, which is in contrast with our proposed algorithm, which converges to a stationary point of (11). That may be the reason why our algorithm provides a better average symmetric rate than SDR for high SNR in this scenario. It is also worth mentioning that our proposed algorithms are more general since they consider additive asymmetric HWD, while the algorithm in [15] can only be applied for ideal devices.

V-B Non-ideal devices

In this subsection, we consider the effect of HWD on the performance of the two-user IC. Throughout this subsection, we consider the same statistics for HWD in all devices. In Fig. 4, we show the rate region for $\mathbf{H}_{1}$ and $P=1$ under maximally improper HWD noise. (Footnote 5: Maximally improper HWD happens when the in-phase and quadrature-phase noises are completely correlated [30].) As shown in Fig. 3a, IGS brings negligible gains when the transceivers are ideal, but, as observed in Fig. 4, IGS can significantly enlarge the rate region if there is additive asymmetric HWD. Note that even in point-to-point communications, PGS is in general suboptimal for asymmetric HWD, as shown in Fig. 4 for either $R_{1}=0$ or $R_{2}=0$ .

In Fig. 5, we show the achievable rate region for $\tilde{\sigma}_{\eta}^{2}=0$ , $P=1$ (SNR $=0\,$ dB), and channel realization

[TABLE]

We take $\tilde{\sigma}^{2}_{\eta}=0$ , i.e., symmetric (proper) HWD. We can observe that IGS enlarges the rate region even for proper HWD with high noise variance, i.e., $\sigma^{2}_{\eta}=0.5$ and $\sigma^{2}_{\eta}=1$ . It is worth mentioning that the PGS design is Pareto-optimal in the presence of additive symmetric HWD. As can be observed, our IGS design in Section III with time sharing outperforms the Pareto-optimal PGS with time sharing for these examples.

In the following, we provide some averaged results for different parameters to illustrate different aspects of employing IGS. Similar to Fig. 2, we average the results over 100 channel realizations, where each channel realization is taken from a complex proper Gaussian distribution with variance 1, i.e., $\mathcal{CN}(0,1,0)$ .
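
A minimal sketch of how such channel realizations can be drawn is given below. It assumes that a proper complex Gaussian variable with variance 1 is generated from independent real and imaginary parts, each with variance $1/2$; the helper names and the seed are hypothetical, and the resulting list would then be passed to whichever design is being averaged.

```python
import random


def draw_proper_complex_gaussian(rng, variance=1.0):
    """One sample of CN(0, variance) with zero complementary variance."""
    std = (variance / 2.0) ** 0.5   # per-component standard deviation
    return complex(rng.gauss(0.0, std), rng.gauss(0.0, std))


def draw_channel_matrix(rng):
    """2x2 matrix of i.i.d. channel coefficients h_ij for the two-user IC."""
    return [[draw_proper_complex_gaussian(rng) for _ in range(2)] for _ in range(2)]


rng = random.Random(0)              # fixed seed for reproducibility
realizations = [draw_channel_matrix(rng) for _ in range(100)]

# Empirical check on a larger batch: variance near 1, complementary variance near 0
batch = [draw_proper_complex_gaussian(rng) for _ in range(20000)]
sample_variance = sum(abs(h) ** 2 for h in batch) / len(batch)
sample_complementary = sum(h * h for h in batch) / len(batch)
```

The empirical complementary variance, i.e., the average of $h^{2}$, is close to zero for proper samples, in contrast to the improper HWD noise considered in this subsection.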

In Fig. 6, we consider the effect of the variance of the HWD noise on the average symmetric rate of users ( $\alpha_{1}=\alpha_{2}=0.5$ ) for $P=20$ . In this figure, we consider proper ( $\tilde{\sigma}^{2}_{\eta}=0$ ) and maximally improper ( $\tilde{\sigma}^{2}_{\eta}=\sigma^{2}_{\eta}$ ) HWD noise. We observe that our proposed algorithm with FP outperforms the other algorithms for maximally improper HWD noise. Moreover, in Fig. 6a, our proposed IGS algorithms perform better than PGS for proper HWD noise with different variances, which is Pareto-optimal PGS in this case. Furthermore, our simplified algorithm outperforms the IGS algorithm in [15] in the presence of HWD. However, the performance improvement by our algorithms is minor for proper HWD with high noise variance, where our algorithms only provide $5\%$ improvement over PGS when $\sigma^{2}_{\eta}=1$ for this example.

Figure 7 shows the effect of the circularity coefficient of the HWD noise on the symmetric rate for $P=20$ . As can be observed, the benefits of employing IGS increase with the circularity coefficient of the HWD noise, and there is a considerable performance improvement by IGS in maximally improper HWD noise. We emphasize that PGS is suboptimal, even in interference-free communications, under asymmetric HWD. Our proposed IGS design with FP outperforms the other algorithms, especially in highly asymmetric HWD noise. When the variance of the HWD noise is small, the gain of employing IGS is larger. The other interesting result in this figure is that our simplified algorithm performs very similarly to our proposed algorithm based on FP for proper HWD. Since the simplified algorithm has less computational cost, it can be employed for proper HWD noise when the variance of the HWD noise is high, i.e., $\sigma^{2}_{\eta}\geq 0.5$ . However, our proposed algorithm based on FP outperforms the other algorithms in low-power HWD noise and/or highly asymmetric HWD noise. Note that, since the IGS algorithm in [15] is proposed for ideal devices and does not consider HWD, it performs worse than the proposed PGS, which considers additive symmetric HWD, from the average symmetric rate point of view, even when the HWD noise is maximally improper.

In Fig. 8, we consider the effect of the power budget on the symmetric rate of users. There is an almost constant performance gap between our proposed algorithms and the other algorithms. Similar to the other figures, our proposed IGS with FP outperforms our simplified algorithm.

VI Conclusion

In this paper, we considered a two-user IC with additive asymmetric HWD at the transceivers. Treating interference as noise, we addressed the problem of obtaining the achievable rate region for IGS and proposed two suboptimal algorithms. The first algorithm, which is based on MM and the generalized Dinkelbach algorithm, obtains a stationary point of the PSINR region. In this algorithm, we jointly optimize the powers and complementary variances. We also proposed a simplified algorithm that has lower computational complexity. This simplified algorithm is based on the separate optimization of the powers and complementary variances. Through numerical examples, we showed that the proposed approaches enlarge the achievable rate region and outperform PGS and existing IGS algorithms, especially as the HWD becomes more asymmetric.

Appendix A Proof of Lemma 3

In order to approximate $u_{k}(\mathbf{p},\mathbf{q})$ and $v_{k}(\mathbf{p},\mathbf{q})$ , we employ convex-concave (or concave-convex) procedure (CCP), in which the convex (concave) part is approximated as an affine function by the first-order approximation of the Taylor expansion. Note that we take the first-order term and employ an affine approximation since an affine function is the nearest concave approximation to a convex function. The first-order approximation of a real function $u(\mathbf{x})$ around the point $\mathbf{x}_{0}$ is obtained through its Taylor expansion as [27, 57]

[TABLE]

where $\mathbf{x}$ is a complex vector. In order to apply the CCP to $u_{k}(\mathbf{p},\mathbf{q})$ , we have to differentiate the convex part in (18) with respect to $\mathbf{p}$ , which is straightforward since it is a real function on a real domain and consequently, analytic in $\mathbf{p}$ . The derivative of $(\sigma^{2}+\mathbf{a}_{k}^{T}\mathbf{p})^{2}$ with respect to $\mathbf{p}$ is

[TABLE]

and the resulting first-order approximation around the power vector in the $m$ th iteration, $\mathbf{p}^{(m)}$ , is given by

[TABLE]

By substituting (47) in (18), we can derive $\tilde{u}_{k}(\mathbf{p},\mathbf{q})$ .
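
The first-order step can be checked numerically. The sketch below linearizes the convex term $(\sigma^{2}+\mathbf{a}_{k}^{T}\mathbf{p})^{2}$ around $\mathbf{p}^{(m)}$ using the gradient $2(\sigma^{2}+\mathbf{a}_{k}^{T}\mathbf{p}^{(m)})\mathbf{a}_{k}$; the function names and the numerical values of $\mathbf{a}_{k}$ and $\mathbf{p}^{(m)}$ are hypothetical.

```python
def convex_term(p, a, sigma2):
    """(sigma^2 + a^T p)^2 for real vectors a and p."""
    return (sigma2 + sum(ai * pi for ai, pi in zip(a, p))) ** 2


def affine_lower_bound(p, p_m, a, sigma2):
    """First-order Taylor expansion of convex_term around p_m."""
    base = sigma2 + sum(ai * pi for ai, pi in zip(a, p_m))
    grad = [2.0 * base * ai for ai in a]        # gradient at p_m
    return base ** 2 + sum(g * (pi - pmi) for g, pi, pmi in zip(grad, p, p_m))


a_k, sigma2 = [1.0, 0.3], 1.0                    # illustrative values
p_m = [2.0, 1.0]                                 # expansion point
test_points = [[0.0, 0.0], [1.0, 3.0], [2.0, 1.0], [5.0, 0.5]]
gaps = [convex_term(p, a_k, sigma2) - affine_lower_bound(p, p_m, a_k, sigma2)
        for p in test_points]
```

Every gap equals $(\mathbf{a}_{k}^{T}(\mathbf{p}-\mathbf{p}^{(m)}))^{2}\geq 0$ and vanishes at $\mathbf{p}=\mathbf{p}^{(m)}$, which is what makes the linearized term a global lower bound that is tight at the expansion point.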

In order to convexify $v_{k}(\mathbf{p},\mathbf{q})$ , we have to differentiate the concave part in (19) with respect to $\mathbf{p}$ and $\mathbf{q}$ . The derivative of $|\mathbf{g}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}^{H}_{k}\mathbf{p}|^{2}$ with respect to $\mathbf{p}$ is also straightforward since it is analytic in $\mathbf{p}$ :

[TABLE]

The term $|\mathbf{g}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}^{H}_{k}\mathbf{p}|^{2}$ , on the other hand, is not analytic in $\mathbf{q}$ since it is a real-valued function while $\mathbf{q}$ is a complex vector [27, 57]. Thus, we have to employ Wirtinger calculus to obtain the derivative of $|\mathbf{g}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}^{H}_{k}\mathbf{p}|^{2}$ with respect to $\mathbf{q}$ . By Wirtinger calculus, we treat $\mathbf{q}$ and $\mathbf{q}^{*}$ as two independent complex variables [27, 57]. Thus, we take the derivative of $|\mathbf{g}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}^{H}_{k}\mathbf{p}|^{2}$ with respect to $\mathbf{q}$ while treating $\mathbf{q}^{*}$ as a constant, which results in

[TABLE]

Now by (45), we can approximate $|\mathbf{g}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}^{H}_{k}\mathbf{p}|^{2}$ as an affine function as

[TABLE]

By substituting (50) in (19), we can obtain $\tilde{v}_{k}(\mathbf{p},\mathbf{q})$ .
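
The same check can be made for the complex-valued term. Writing $c=\mathbf{g}_{k}^{H}\mathbf{q}+\tilde{\mathbf{f}}_{k}^{H}\mathbf{p}$ and $c^{(m)}$ for its value at the expansion point, the first-order expansion obtained with Wirtinger calculus can be written as $|c^{(m)}|^{2}+2\mathfrak{R}[(c^{(m)})^{*}(c-c^{(m)})]$, which is assumed here to agree with the affine approximation above up to rearrangement. The helper names and all numerical values are hypothetical.

```python
def inner(g, q, f, p):
    """c = g^H q + f^H p for complex lists g, q, f and a real list p."""
    return (sum(gi.conjugate() * qi for gi, qi in zip(g, q))
            + sum(fi.conjugate() * pi for fi, pi in zip(f, p)))


def affine_bound(c, c_m):
    """|c_m|^2 + 2 Re[conj(c_m) (c - c_m)], a global lower bound on |c|^2."""
    return abs(c_m) ** 2 + 2.0 * (c_m.conjugate() * (c - c_m)).real


g = [0.8 + 0.2j, -0.3 + 0.5j]      # illustrative coefficients
f = [0.1 - 0.05j, 0.02 + 0.07j]
p_m, q_m = [1.0, 1.0], [0.2 + 0.1j, -0.1 + 0.3j]
p, q = [0.5, 1.0], [0.4 - 0.2j, 0.1 + 0.1j]

c_m = inner(g, q_m, f, p_m)
c = inner(g, q, f, p)
gap = abs(c) ** 2 - affine_bound(c, c_m)
```

The gap equals $|c-c^{(m)}|^{2}$, so the affine expression never exceeds $|c|^{2}$ and coincides with it at the expansion point.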

Appendix B Proof of Theorem 1

A given $E^{\prime}$ is feasible if and only if there exists at least a pair $(p_{1},p_{2})$ that satisfies all the constraints in (32). Let us first consider the two linear constraints in (32b), which can be written as (51) and (52), shown at the top of the next page.

We can construct $\mathbf{A}$ in (38) by the coefficients of $p_{1}$ and $p_{2}$ in (51) and (52). It is worth mentioning that the non-diagonal elements of $\mathbf{A}$ in (38) are non-positive since $\sqrt{1+\alpha_{1}E^{\prime}}\geq 1$ and $\sqrt{1+\alpha_{2}E^{\prime}}\geq 1$ . Thus, if the diagonal elements of $\mathbf{A}$ are not positive, there is no positive power pair that satisfies (51) and (52) simultaneously. Hence, in the following, we assume without loss of generality that $\mathbf{A}$ has strictly positive diagonal elements and strictly negative non-diagonal elements.

We can rewrite (51) and (52) as

[TABLE]

where $\mathbf{y}=\left[\begin{array}[]{cc}(\sqrt{1+\alpha_{1}E^{\prime}}-1)\sigma^{2}&(\sqrt{1+\alpha_{2}E^{\prime}}-1)\sigma^{2}\end{array}\right]^{T}$ . Moreover, $[\mathbf{A}]_{ij}$ , and $y_{i}$ for $i,j\in\{1,2\}$ are the $ij$ th element of $\mathbf{A}$ , and the $i$ th element of $\mathbf{y}$ , respectively. If we decouple the inequalities, we end up with

[TABLE]

The right-hand sides (RHS) in (55) and (56) are positive for a feasible $E^{\prime}$ as mentioned before. Note that if $\det(\mathbf{A})<0$ , there are no positive power pairs that satisfy (55) and (56) for the given structure of $\mathbf{A}$ in (38). Thus, we consider $\det(\mathbf{A})>0$ , which yields

[TABLE]

or equivalently $\mathbf{p}=[\begin{array}[]{cc}p_{1}&p_{2}\end{array}]^{T}\geq\mathbf{A}^{-1}\mathbf{y}$ , where $p_{1}^{\prime}$ and $p_{2}^{\prime}$ are the intersecting point given in (37). Hence, the intersecting point provides the minimum positive power pairs that satisfy (51) and (52). If $p_{1}^{\prime}$ and $p_{2}^{\prime}$ satisfy the power constraint, $E^{\prime}$ is feasible (Fig. 9.a). Otherwise, $E^{\prime}$ is infeasible (Fig. 9.b).

Bibliography

[1] J. G. Andrews, S. Buzzi, W. Choi, S. V. Hanly, A. Lozano, A. C. Soong, and J. C. Zhang, “What will 5G be?” IEEE J. Sel. Areas Commun., vol. 32, no. 6, pp. 1065–1082, 2014.
[2] S. Buzzi, I. Chih-Lin, T. E. Klein, H. V. Poor, C. Yang, and A. Zappone, “A survey of energy-efficient techniques for 5G networks and challenges ahead,” IEEE J. Sel. Areas Commun., vol. 34, no. 4, pp. 697–709, 2016.
[3] R. W. Heath, N. Gonzalez-Prelcic, S. Rangan, W. Roh, and A. M. Sayeed, “An overview of signal processing techniques for millimeter wave MIMO systems,” IEEE J. Sel. Topics Signal Process., vol. 10, no. 3, pp. 436–453, 2016.
[4] S. Javed, O. Amin, S. S. Ikki, and M.-S. Alouini, “Asymmetric hardware distortions in receive diversity systems: Outage performance analysis,” IEEE Access, vol. 5, pp. 4492–4504, 2017.
[5] J. Zhu, D. W. K. Ng, N. Wang, R. Schober, and V. K. Bhargava, “Analysis and design of secure massive MIMO systems in the presence of hardware impairments,” IEEE Trans. Wireless Commun., vol. 16, no. 3, pp. 2001–2016, 2017.
[6] J. Zhang, L. Dai, X. Zhang, E. Björnson, and Z. Wang, “Achievable rate of Rician large-scale MIMO channels with transceiver hardware impairments,” IEEE Trans. Veh. Technol., vol. 65, no. 10, pp. 8800–8806, 2016.
[7] X. Xia, D. Zhang, K. Xu, W. Ma, and Y. Xu, “Hardware impairments aware transceiver for full-duplex massive MIMO relaying,” IEEE Trans. Signal Process., vol. 63, no. 24, pp. 6565–6580, 2015.
[8] E. Björnson, J. Hoydis, M. Kountouris, and M. Debbah, “Massive MIMO systems with non-ideal hardware: Energy efficiency, estimation, and capacity limits,” IEEE Trans. Inf. Theory, vol. 60, no. 11, pp. 7112–7139, 2014.
