Truth-Telling Mechanism for Secure Two-Way Relay Communications with   Energy-Harvesting Revenue

Muhammad R. A. Khandaker; Kai-Kit Wong; Gan Zheng

arXiv:1703.06547·cs.IT·March 22, 2017

Truth-Telling Mechanism for Secure Two-Way Relay Communications with Energy-Harvesting Revenue

Muhammad R. A. Khandaker, Kai-Kit Wong, Gan Zheng

PDF

Open Access

TL;DR

This paper introduces a truthful relay selection mechanism in secure two-way energy-harvesting communications, ensuring relays reveal true information and optimizing beamforming and power with reduced computational complexity.

Contribution

It proposes a novel incentive mechanism for truthful relay selection and a low-complexity SOCP-based joint beamforming and power optimization scheme.

Findings

01

The incentive mechanism effectively enforces truth-telling among relays.

02

The SOCP approach reduces computational complexity significantly.

03

Numerical results demonstrate improved secrecy sum rate and energy harvesting performance.

Abstract

This paper brings the novel idea of paying the utility to the winning agents in terms of some physical entity in cooperative communications. Our setting is a secret two-way communication channel where two transmitters exchange information in the presence of an eavesdropper. The relays are selected from a set of interested parties such that the secrecy sum rate is maximized. In return, the selected relay nodes' energy harvesting requirements will be fulfilled up to a certain threshold through their own payoff so that they have the natural incentive to be selected and involved in the communication. However, relays may exaggerate their private information in order to improve their chance to be selected. Our objective is to develop a mechanism for relay selection that enforces them to reveal the truth since otherwise they may be penalized. We also propose a joint cooperative relay…

Tables1

Table 1. TABLE I: Proposed alternating algorithm for solving problem ( 35 )

Step	Action
1	Initialize $p_{s, 1} = p_{s, 2} = p_{r} = \frac{P_{\max}}{K + 2}$ .
2	Repeat
	a) Solve the SOCP problem (41) using existing solvers,
	e.g., CVX [33].
	b) Solve the linear programming problem (42).
3	Until convergence

Equations91

y_{r, i}

y_{r, i}

y_{e}^{(1)}

y_{r} = p_{s, 1} h_{1, r} s_{1} + p_{s, 2} h_{2, r} s_{2} + n_{r},

y_{r} = p_{s, 1} h_{1, r} s_{1} + p_{s, 2} h_{2, r} s_{2} + n_{r},

P_{h, i} = ξ_{i} (1 - ρ_{i}) (p_{s, 1} ∣ h_{1, i} ∣^{2} + p_{s, 2} ∣ h_{2, i} ∣^{2} + σ^{2}),

P_{h, i} = ξ_{i} (1 - ρ_{i}) (p_{s, 1} ∣ h_{1, i} ∣^{2} + p_{s, 2} ∣ h_{2, i} ∣^{2} + σ^{2}),

y_{s, 1} = h_{1, r}^{T} x_{r} + n_{s, 1} = p_{s, 1} h_{1, r}^{T} F h_{1, r} s_{1} + p_{s, 2} h_{1, r}^{T} F h_{2, r} s_{2} + h_{1, r}^{T} F n_{r} + n_{s, 1},

y_{s, 1} = h_{1, r}^{T} x_{r} + n_{s, 1} = p_{s, 1} h_{1, r}^{T} F h_{1, r} s_{1} + p_{s, 2} h_{1, r}^{T} F h_{2, r} s_{2} + h_{1, r}^{T} F n_{r} + n_{s, 1},

y_{s, 2} = h_{2, r}^{T} x_{r} + n_{s, 2} = p_{s, 1} h_{2, r}^{T} F h_{1, r} s_{1} + p_{s, 2} h_{2, r}^{T} F h_{2, r} s_{2} + h_{2, r}^{T} F n_{r} + n_{s, 2},

y_{s, 2} = h_{2, r}^{T} x_{r} + n_{s, 2} = p_{s, 1} h_{2, r}^{T} F h_{1, r} s_{1} + p_{s, 2} h_{2, r}^{T} F h_{2, r} s_{2} + h_{2, r}^{T} F n_{r} + n_{s, 2},

y_{e}^{(2)} = h_{r, e}^{T} x_{r} + n_{e}^{(2)} = p_{s, 1} h_{r, e}^{T} F h_{1, r} s_{1} + p_{s, 2} h_{r, e}^{T} F h_{2, r} s_{2} + h_{r, e}^{T} F n_{r} + n_{e}^{(2)},

y_{e}^{(2)} = h_{r, e}^{T} x_{r} + n_{e}^{(2)} = p_{s, 1} h_{r, e}^{T} F h_{1, r} s_{1} + p_{s, 2} h_{r, e}^{T} F h_{2, r} s_{2} + h_{r, e}^{T} F n_{r} + n_{e}^{(2)},

y_{s, 1}

y_{s, 1}

= p_{s, 2} f^{H} H_{1, r} h_{2, r} s_{2} + \overset{n}{ˉ}_{s, 1}

= p_{s, 2} f^{H} h_{2, 1} s_{2} + \overset{n}{ˉ}_{s, 1},

y_{s, 2}

y_{s, 2}

= p_{s, 1} f^{H} H_{2, r} h_{1, r} s_{1} + \overset{n}{ˉ}_{s, 2}

= p_{s, 1} f^{H} h_{1, 2} s_{1} + \overset{n}{ˉ}_{s, 2},

\begin{array}[]{c}\underbrace{\left[\begin{array}[]{c}y_{\rm e}^{(1)}\\ y_{\rm e}^{(2)}\end{array}\right]}=\\ {\bf y}_{\rm e}\end{array}\begin{array}[]{c}\underbrace{\left[\begin{array}[]{cc}\sqrt{p_{{\rm s},1}}h_{1,{\rm e}}&\sqrt{p_{{\rm s},2}}h_{2,{\rm e}}\\ \sqrt{p_{{\rm s},1}}{\bf f}^{H}{\bar{\bf h}}_{1,{\rm e}}&\sqrt{p_{{\rm s},2}}{\bf f}^{H}{\bar{\bf h}}_{2,{\rm e}}\end{array}\right]}\\ {\bf H}_{\rm e}\end{array}\begin{array}[]{c}\underbrace{\left[\begin{array}[]{c}s_{1}\\ s_{2}\end{array}\right]}\\ {\bf s}\end{array}\\ \begin{array}[]{c}\underbrace{+\left[\begin{array}[]{c}{n}_{\rm e}^{(1)}\\ {\bar{n}}_{\rm e}^{(2)}\end{array}\right]},\\ {\bf n}_{\rm e}\end{array}

\begin{array}[]{c}\underbrace{\left[\begin{array}[]{c}y_{\rm e}^{(1)}\\ y_{\rm e}^{(2)}\end{array}\right]}=\\ {\bf y}_{\rm e}\end{array}\begin{array}[]{c}\underbrace{\left[\begin{array}[]{cc}\sqrt{p_{{\rm s},1}}h_{1,{\rm e}}&\sqrt{p_{{\rm s},2}}h_{2,{\rm e}}\\ \sqrt{p_{{\rm s},1}}{\bf f}^{H}{\bar{\bf h}}_{1,{\rm e}}&\sqrt{p_{{\rm s},2}}{\bf f}^{H}{\bar{\bf h}}_{2,{\rm e}}\end{array}\right]}\\ {\bf H}_{\rm e}\end{array}\begin{array}[]{c}\underbrace{\left[\begin{array}[]{c}s_{1}\\ s_{2}\end{array}\right]}\\ {\bf s}\end{array}\\ \begin{array}[]{c}\underbrace{+\left[\begin{array}[]{c}{n}_{\rm e}^{(1)}\\ {\bar{n}}_{\rm e}^{(2)}\end{array}\right]},\\ {\bf n}_{\rm e}\end{array}

γ_{1} = \frac{p _{s, 2} f ^{H} h _{2, 1} h _{2, 1}^{H} f}{σ ^{2} ( f ^{H} C _{n, 1} f + 1 )},

γ_{1} = \frac{p _{s, 2} f ^{H} h _{2, 1} h _{2, 1}^{H} f}{σ ^{2} ( f ^{H} C _{n, 1} f + 1 )},

γ_{2} = \frac{p _{s, 1} f ^{H} h _{1, 2} h _{1, 2}^{H} f}{σ ^{2} ( f ^{H} C _{n, 2} f + 1 )}

γ_{2} = \frac{p _{s, 1} f ^{H} h _{1, 2} h _{1, 2}^{H} f}{σ ^{2} ( f ^{H} C _{n, 2} f + 1 )}

C_{1}

C_{1}

C_{2}

C_{e} = \frac{1}{2} lo g_{2} det (I_{2} + H_{e} H_{e}^{H} C_{n, e}^{- 1}),

C_{e} = \frac{1}{2} lo g_{2} det (I_{2} + H_{e} H_{e}^{H} C_{n, e}^{- 1}),

C_{s} = [C_{1} + C_{2} - C_{e}]^{+}

C_{s} = [C_{1} + C_{2} - C_{e}]^{+}

t_{i} (\hat{θ}_{i}, \hat{θ}_{- i})

t_{i} (\hat{θ}_{i}, \hat{θ}_{- i})

- j = 1 \sum K v_{j} (O_{- i} (\hat{θ}_{j}, \hat{θ}_{- j}), θ_{j})

= j = 1 \sum K v_{j} (O (\hat{θ}_{j}, \hat{θ}_{- j}), θ_{j})

- j = 1 \sum K v_{j} (O_{- i} (\hat{θ}_{j}, \hat{θ}_{- j}), θ_{j})

- v_{i} (O (\hat{θ}_{i}, \hat{θ}_{- i}), θ_{i}) .

t_{i}(\hat{\theta}_{i},\hat{{\theta}}_{-i})=\left\{\begin{array}[]{l}\!-v_{K+1}\left({\mathcal{O}}(\hat{\theta}_{j},\hat{{\theta}}_{-j}),\theta_{j}\right),~{}\text{for }k=1,\dots,K,\\ \!0,\qquad\qquad\text{for }k=K+1,\dots,N.\end{array}\right.

t_{i}(\hat{\theta}_{i},\hat{{\theta}}_{-i})=\left\{\begin{array}[]{l}\!-v_{K+1}\left({\mathcal{O}}(\hat{\theta}_{j},\hat{{\theta}}_{-j}),\theta_{j}\right),~{}\text{for }k=1,\dots,K,\\ \!0,\qquad\qquad\text{for }k=K+1,\dots,N.\end{array}\right.

u_{i} (\hat{θ}_{i}, \hat{θ}_{- i})

u_{i} (\hat{θ}_{i}, \hat{θ}_{- i})

i = 1 \sum K v_{i} (O (\hat{θ}_{i}, \hat{θ}_{- i}), θ_{i}) \geq j = 1 \sum K v_{j} (O_{- i} (\hat{θ}_{j}, \hat{θ}_{- j}), θ_{j}),

i = 1 \sum K v_{i} (O (\hat{θ}_{i}, \hat{θ}_{- i}), θ_{i}) \geq j = 1 \sum K v_{j} (O_{- i} (\hat{θ}_{j}, \hat{θ}_{- j}), θ_{j}),

t_{1} (\hat{θ}_{1}, \hat{θ}_{- 1})

t_{1} (\hat{θ}_{1}, \hat{θ}_{- 1})

j = 1 \sum 3

j = 1 \sum 3

\tilde{y}_{s, 1} = α_{i} p_{r} p_{s, 2} h_{1, i} h_{2, i} s_{2} + α_{i} p_{r} h_{1, i} n_{r, i} + n_{s, 1}

\tilde{y}_{s, 1} = α_{i} p_{r} p_{s, 2} h_{1, i} h_{2, i} s_{2} + α_{i} p_{r} h_{1, i} n_{r, i} + n_{s, 1}

\tilde{y}_{s, 2} = α_{i} p_{r} p_{s, 1} h_{2, i} h_{1, i} s_{1} + α_{i} p_{r} h_{2, i} n_{r, i} + n_{s, 2},

\tilde{y}_{s, 2} = α_{i} p_{r} p_{s, 1} h_{2, i} h_{1, i} s_{1} + α_{i} p_{r} h_{2, i} n_{r, i} + n_{s, 2},

v_{i} (g_{i}) ≜ C_{i, s} (g_{i}) = \frac{1}{2} [lo g_{2} (1 + \frac{α _{i}^{2} p _{r} p _{s, 1} ∣ h _{1, i} ∣ ^{2} ∣ h _{2, i} ∣ ^{2}}{σ ^{2} ( α _{i}^{2} p _{r} ∣ h _{1, i} ∣ ^{2} + 1 )}) + lo g_{2} (1 + \frac{α _{i}^{2} p _{r} p _{s, 2} ∣ h _{2, i} ∣ ^{2} ∣ h _{1, i} ∣ ^{2}}{σ ^{2} ( α _{i}^{2} p _{r} ∣ h _{2, i} ∣ ^{2} + 1 )})] .

v_{i} (g_{i}) ≜ C_{i, s} (g_{i}) = \frac{1}{2} [lo g_{2} (1 + \frac{α _{i}^{2} p _{r} p _{s, 1} ∣ h _{1, i} ∣ ^{2} ∣ h _{2, i} ∣ ^{2}}{σ ^{2} ( α _{i}^{2} p _{r} ∣ h _{1, i} ∣ ^{2} + 1 )}) + lo g_{2} (1 + \frac{α _{i}^{2} p _{r} p _{s, 2} ∣ h _{2, i} ∣ ^{2} ∣ h _{1, i} ∣ ^{2}}{σ ^{2} ( α _{i}^{2} p _{r} ∣ h _{2, i} ∣ ^{2} + 1 )})] .

O (\overset{g}{^}_{i}, \overset{g}{^}_{- i}) ≜ ar g {R_{k}} max k = 1 \sum K C_{i, s} (\overset{g}{^}_{i}) .

O (\overset{g}{^}_{i}, \overset{g}{^}_{- i}) ≜ ar g {R_{k}} max k = 1 \sum K C_{i, s} (\overset{g}{^}_{i}) .

u_{i}\left(\hat{g}_{i}\right)=\left\{\begin{array}[]{l}\pi_{i}C_{i,{\rm s}}\left(\hat{g}_{i}\right),\qquad\text{if }{\sf R}_{i}\text{ is selected},\\ 0,\qquad\qquad\qquad\text{otherwise.}\end{array}\right.

u_{i}\left(\hat{g}_{i}\right)=\left\{\begin{array}[]{l}\pi_{i}C_{i,{\rm s}}\left(\hat{g}_{i}\right),\qquad\text{if }{\sf R}_{i}\text{ is selected},\\ 0,\qquad\qquad\qquad\text{otherwise.}\end{array}\right.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWireless Communication Security Techniques · Energy Harvesting in Wireless Networks · Cooperative Communication and Network Coding

Full text

Truth-Telling Mechanism for Secure Two-Way Relay Communications with Energy-Harvesting Revenue††thanks: This work is supported by EPSRC under grant EP/K015893/1.

Muhammad R. A. Khandaker, , Kai-Kit Wong, , and Gan Zheng, , M. R. A. Khandaker and K. K. Wong are with the Department of Electronic and Electrical Engineering, University College London, WC1E 7JE, United Kingdom (e-mail: $\rm [email protected]$ ; $\rm kai\text{-}[email protected]$ ).G. Zheng is with The Wolfson School of Mechanical, Electrical and Manufacturing Engineering, Loughborough University, United Kingdom (e-mail: $\rm [email protected]$ ).

Abstract

This paper brings the novel idea of paying the utility to the winning agents in terms of some physical entity in cooperative communications. Our setting is a secret two-way communication channel where two transmitters exchange information in the presence of an eavesdropper. The relays are selected from a set of interested parties such that the secrecy sum rate is maximized. In return, the selected relay nodes’ energy harvesting requirements will be fulfilled up to a certain threshold through their own payoff so that they have the natural incentive to be selected and involved in the communication. However, relays may exaggerate their private information in order to improve their chance to be selected. Our objective is to develop a mechanism for relay selection that enforces them to reveal the truth since otherwise they may be penalized. We also propose a joint cooperative relay beamforming and transmit power optimization scheme based on an alternating optimization approach. Note that the problem is highly non-convex since the objective function appears as a product of three correlated Rayleigh quotients. While a common practice in the existing literature is to optimize the relay beamforming vector for given transmit power via rank relaxation, we propose a second-order cone programming (SOCP)-based approach in this paper which requires a significantly lower computational task. The performance of the incentive control mechanism and the optimization algorithm has been evaluated through numerical simulations.

Index Terms

Cooperative beamforming; energy harvesting; mechanism design; secrecy; two-way relay.

I. Introduction

Relaying is a promising technique to extend wireless coverage and increase the achievable rate [1, 2, 3, 4], and in recent years it has also been recognized as a spectrally efficient way to exchange information over distance between two transceivers via two-way relaying [5, 6, 7, 8]. Relays, if used collaboratively, can also form focused signal or noise beams to provide physical-layer security [9, 3, 4]. Collaborative relays follow the same idea as multiple antennas to exploit the spatial degrees of freedom for enhancing the signals to the legitimate receiver and worsening the interception of the eavesdropper by transmitting artificial noises [10, 11, 12, 13].

There is a huge scope of research for selecting the best relay nodes in maximizing the system performance. A meaningful setting would be to let the selected relays earn some form of revenue for relaying others’ information. In this case, challenge arises because the candidates may behave selfishly to maximize their own revenues. To tackle this, game theory is a popular tool to analyze the conflict of interests among intelligent rational competitors [14, 15, 16]. Auction and pricing schemes were proposed for efficient selection of a social choice, but most of them were based on the assumption that the players are honest and ready to disclose their true private information [15, 16], which may not be the case in practice. Also, in the literature, the “revenues” are usually some abstract quantities that may not be meaningful [15, 16, 17, 18, 19].

Nevertheless, a recent development in wireless communications, which promotes energy transfer over wireless channels, may be the answer to help quantify the revenues one may gain from contributing to others’ communications. Through simultaneous wireless information and power transfer (SWIPT), mobile users are provided with access to both energy and data at the same time which brings enormous prospects of new applications [20, 21, 22, 23, 24, 25]. The concept of SWIPT was first introduced in [20] in a single noisy line, and later extended in [21] to frequency-selective channels. Practical SWIPT schemes, namely, time switching and power splitting, have also been proposed [22, 23]. Recent studies further considered the combination of SWIPT with physical-layer security [24, 25], one-way relaying [26], and two-way relaying [27].

The focus of this paper is fundamentally different from the literature. While we consider relay selection for a two-way communication system in which two nodes exchange information with the help of a set of relay nodes in the presence of an eavesdropper, rather than concentrating primarily on reaping the benefits of relaying for secrecy communications, our aim is to develop an efficient mechanism to ensure that the relays reveal their true private information for relay selection optimization. In this particular problem, the channel coefficients from a relay to the two sources and the eavesdropper are regarded as the private information of that relay. The participation of relays is incentivised by the possible energy earning from the sources. In particular, the source transmitters will ensure that the energy harvesting requirements of the selected relays are fulfilled up to a certain threshold (or the expected payoff level).

The problem is that under this setup, the relays may exaggerate their private information to improve their chance to be selected, hoping to maximize their energy earning. The objective for a self-enforcing truth-revealing mechanism is to ensure that the relays reveal their actual private information to avoid being punished to pay for any damage caused. Note that mechanism design approaches have already been considered for suppressing cheating in cognitive radio networks [17], wireless video caching [18], and one-way relaying [19]. However, in [17, 18, 19], the revenue was paid in terms of some virtual entity, which does not directly relate to the concerned participants, while in this paper, the revenue is physically defined as harvested energy. In the context of energy harvesting facility considered in this paper, it is assumed that only the selected relays can harvest their required energy, and the unselected relay nodes will harvest almost nothing. It is also assumed that the relays will participate in the mechanism, as is common in conventional relaying [3, 4, 5, 6], even in the absence of dedicated energy transmission. However, there is no guard mechanism to prevent any relay from announcing its undermined channel condition in an attempt not to be selected so it can harvest energy without paying any penalty. In that case, the relay may remain unselected even with a better channel condition. But the reality is that the channel state information (CSI) of each relay is its own private information and none of the relays actually knows the channel conditions of the other relays. Hence none of them can define any threshold downplaying by which may guarantee its non-selection. Although it may be generally assumed that any unselected relay will be able to harvest some extent of energy, there is no guarantee that the harvested energy would be above a useful level. Thus the key motivation for the relays to participate in the mechanism is that through the proposed mechanism they yield QoS guarantee (at least minimum incentive) in terms of energy earning. On the other hand, the unselected relays have no such guarantee.

With the mechanism, we then propose a joint collaborative relay beamforming and transmit power optimization scheme for maximizing the sum secrecy rate while guaranteeing the expected payoff of each selected relay node in the form of its harvested energy. The optimization problem appears to be highly non-convex as the objective function is a product of three correlated Rayleigh quotients. While a common practice tends to optimize the collaborative relay beamforming vector for a given transmit power using rank relaxation, our proposed approach requires no rank relaxation. Instead, we formulate the relay beamforming problem as a second-order cone program (SOCP), which has lower computational overhead.

To the best of our knowledge, the closest work in the existing literature to this paper can be found in [19]. However, our contribution is three-fold compared to the work in [19]. Firstly, we consider two-way amplify-and-forward relaying, whereas one-way decode-and-forward (DF) relaying was considered in [19]. The DF relaying vastly simplifies the utility characterization for mechanism design. Hence the system model is different. Secondly, we define the utility of the auctioneers (relays) in terms of some practically appealing quantity (harvested energy) as opposed to the virtual payment considered in [19] and many other existing works [18]. Note that the virtual payment system does not provide enough incentives to the players for participating in the auction. Thirdly, in addition to the incentive controlling mechanism design, we develop an optimal joint transmit power and relay beamforming design algorithm whereas [19] considered only truthful mechanism design for relay selection. We also note that collaborative relay beamforming problems for two-way relay systems were studied in [3, 4] but with a fixed number of relays, and without mechanism design and payments for the selected relays in terms of harvested energy.

The remainder of this paper is organized as follows. In Section II, the system model for a two-way relay network in the presence of an eavesdropper is described. Truth-telling mechanism design strategies are then briefly introduced in Section III. The joint-optimal collaborative relay beamforming and transmit power optimization algorithm is developed in Section IV. Section V presents the simulation results to illustrate the importance of the proposed mechanism design and we conclude the paper in Section VI.

Notations—Throughout the paper, boldface lowercase and uppercase letters are used to represent vectors and matrices, respectively. The symbol ${\bf I}_{n}$ denotes an $n\times n$ identity matrix, while $\bf 0$ is a zero vector or matrix. Also, ${\bf A}^{T}$ , ${\bf A}^{H}$ , ${\bf A}^{\dagger}$ , ${\rm tr}({\bf A})$ , ${\rm rank}({\bf A})$ , and ${\rm det}({\bf A})$ represent transpose, the Hermitian (conjugate) transpose, matrix projection, trace, rank and determinant of a matrix ${\bf A}$ , respectively; $\|\cdot\|$ represents the Euclidean norm; ${\bf A}\succeq{\bf 0}\,({\bf A}\succ{\bf 0})$ means that ${\bf A}$ is a Hermitian positive semidefinite (definite) matrix; $[{\bf A}]_{i,j}$ denotes the $(i,j)$ th element of ${\bf A}$ . The notation ${\bf x}\sim\mathcal{CN}(\boldsymbol{\mu},{\boldsymbol{\Sigma}})$ means that ${\bf x}$ is a random vector following a complex circularly symmetric Gaussian distribution with the mean vector $\boldsymbol{\mu}$ and the covariance matrix of ${\boldsymbol{\Sigma}}$ .

II. System Model

We consider a two-way relay network consisting of two sources, ${\sf S}_{1}$ and ${\sf S}_{2}$ , wishing to communicate with each other, $N$ relay nodes, $\{{\sf R}_{i}\}_{i=1}^{N}$ , and an eavesdropper, ${\sf E}$ , as illustrated in Fig. 1. There is no direct link between the two source nodes, so communication has to be done via the relays. Assuming the more practical half-duplex relays, the communication is accomplished in two time slots. In the first time slot, the source nodes broadcast their signals $s_{1}$ and $s_{2}$ to all the relay nodes. In the second time slot, the source nodes decide which of those $N$ relays will be selected to forward their messages to the corresponding destination nodes based on some predesigned mechanism which we will describe later. During the whole process, the eavesdropper node overhears the messages from the source nodes as well as the relay nodes. The source nodes aim at maximizing the secrecy sum-rate by properly selecting $K\leq N$ relay nodes. It is assumed that each relay node only knows its own CSI between itself and the transmitters as well as the eavesdropper. The relays then report their CSI to the mechanism designer (which may be one of the two sources or a centralized processor)111Note that the same node performs the transmit power and relay beamforming optimization and/or relay selection operations as well. as their bids to be selected.

The messages, $s_{1}$ and $s_{2}$ , transmitted from the sources need to be kept confidential to ${\sf E}$ . It is assumed that $s_{1}$ and $s_{2}\sim\mathcal{CN}(0,1)$ , and the transmit power from ${\sf S}_{1}$ and ${\sf S}_{2}$ is, respectively, $p_{{\rm s},1}$ and $p_{{\rm s},2}$ . In the first time slot, the received signals at ${\sf R}_{i}$ and ${\sf E}$ are, respectively, given by

[TABLE]

where $h_{i,j}$ for $i=1,2$ and $j=1,\dots,N,$ denote the complex channel gains between ${\sf S}_{i}$ and ${\sf R}_{j}$ and $h_{i,{\rm e}}$ for $i=1,2$ , are that between ${\sf S}_{i}$ and ${\sf E}$ , $n_{{\rm r},i}\sim\mathcal{CN}(0,\sigma^{2})$ and $n_{{\rm e}}^{(1)}\sim\mathcal{CN}(0,\sigma^{2})$ represent the complex additive white Gaussian noises (AWGNs) at ${\sf R}_{i}$ and ${\sf E}$ during the first time slot, respectively.

In vector form, the signals received at all the relays can be expressed as

[TABLE]

where ${\bf h}_{1,{\rm r}}\triangleq\left[h_{1,1},\dots,h_{1,N}\right]^{T}$ , ${\bf h}_{2,{\rm r}}\triangleq\left[h_{2,1},\dots,h_{2,N}\right]^{T}$ denote the channel vectors between the two sources and the relays, and ${\bf n}_{{\rm r}}\triangleq\left[n_{{\rm r},1},\dots,n_{{\rm r},N}\right]^{T}$ indicates the AWGN vector at the relay nodes. We assume that each relay node is equipped with a power splitting device to coordinate harvesting energy and forwarding the received signal. In particular, the received signal at the $i$ th relay, ${\sf R}_{i}$ , is split such that a $\rho_{i}\in[0,1]$ portion of the signal power is passed to the information forwarding block and the remaining $1-\rho_{i}$ portion of the power is sent to the energy harvesting block of the relay. Several power splitting schemes have been considered in the literature [22, 23] including fixed power splitting and dynamic power splitting. In order to keep our main focus on mechanism design, we consider fixed power splitting in this paper. Interested readers are referred to [22, 23] for more about the dynamic power splitting schemes.

From (1), the harvested power at the $i$ th relay node, ${\sf R}_{i}$ , is given by

[TABLE]

where $\xi_{i}\in(0,1]$ denotes the energy conversion efficiency of the energy transducers at the $i$ th relay that accounts for the loss in the energy transducers for converting the harvested energy to electrical energy to be stored. For convenience, we assume, without loss of generality, that $\xi_{k}=1,\forall k$ , in this paper. It is worth pointing out that the relays do not need to convert the received signal from the radio frequency (RF) band to the baseband in order to harvest the carried energy using modern energy transducers. Therefore, according to the law of energy conservation, it is assumed that the total harvested RF band power (energy normalized by the baseband symbol period) at each relay is proportional to the normalised energy of the received baseband signal.

In the second time slot, ${\sf R}_{i}$ amplifies the received signal $\sqrt{\rho_{i}}y_{{\rm r},i}$ by a complex weighting coefficient $f^{*}_{i}$ and then transmits $x_{{\rm r},i}=\sqrt{\rho_{i}}f^{*}_{i}y_{{\rm r},i}$ . Combining the transmit signals from all the relay nodes, we have ${\bf x}_{{\rm r}}={\bf F}{\bf y}_{{\rm r}}$ where ${\bf F}$ is the combined diagonal weight matrix in the form ${\bf F}={\rm diag}\left({\bf f}^{*}\right)$ , with ${\bf f}\triangleq\left[\sqrt{\rho_{1}}f_{1},\dots,\sqrt{\rho_{N}}f_{N}\right]^{T}$ . Note that for notational simplicity, the power splitting coefficients have been incorporated in the definition of the relay beamforming vector ${\bf f}$ . It is also assumed that the channel coefficients between the transmitters and the relays are block-fading reciprocal. The block-fading reciprocal channel assumption has been widely used in two-way relay literature, e.g., [3, 4, 5]. The assumption essentially means that channels for the two phases are reciprocal, which is based on the time division duplex (TDD) operation with synchronized time-slot. The TDD operation greatly reduces signalling overhead and leads to an SOCP-based problem formulation with reduced complexity, which we will elaborate in section IV. Thus, the received signal at ${\sf S}_{1}$ in the second time slot can be expressed as

[TABLE]

where $n_{{\rm s},1}\sim\mathcal{CN}(0,\sigma^{2})$ denotes the AWGN signal at source node ${\sf S}_{1}$ .

Similarly, the received signal at ${\sf S}_{2}$ can be expressed as

[TABLE]

and that at ${\sf E}$ can be written as

[TABLE]

where $n_{{\rm s},2}\sim\mathcal{CN}(0,\sigma^{2})$ and $n_{\rm e}^{(2)}\sim\mathcal{CN}(0,\sigma^{2})$ are the noises at ${\sf S}_{2}$ and ${\sf E}$ in the second time slot.

Since $s_{1}$ and $s_{2}$ are known, respectively, at ${\sf S}_{1}$ and ${\sf S}_{2}$ , the residual received signals after self-interference cancellation (typical for two-way channels) are, respectively, given by

[TABLE]

and

[TABLE]

where ${\bf H}_{i,{\rm r}}\triangleq{\rm diag}({\bf h}_{i,{\rm r}})$ , ${\bf h}_{j,i}\triangleq{\bf H}_{i,{\rm r}}{\bf h}_{j,{\rm r}}$ , for $i,j=1,2$ , and $j\neq i$ , ${\bar{n}}_{{\rm s},i}\triangleq{\bf h}_{i,{\rm r}}^{T}{\bf F}{\bf n}_{{\rm r}}+n_{{\rm s},i}$ , for $i=1,2$ , and we have used the identity ${\bf a}^{H}{\rm diag}({\bf b})={\bf b}^{H}{\rm diag}({\bf a})$ . Note that each transmission phase brings some opportunity for ${\sf E}$ to overhear the information. Hence, combining the received signals in (2) and (7) at ${\sf E}$ over two time slots, an equivalent multiple-input multiple-output (MIMO) channel is formed, i.e.,

[TABLE]

where ${\bar{\bf h}}_{i,{\rm e}}\triangleq{\bf H}_{\rm r,e}{\bf h}_{i,{\rm r}}$ , for $i=1,2$ , ${\bf H}_{\rm r,e}\triangleq{\rm diag}({\bf h}_{\rm r,e})$ , and ${\bar{n}}_{\rm e}^{(2)}\triangleq{\bf h}_{\rm r,e}^{T}{\bf F}{\bf n}_{{\rm r}}+n_{\rm e}^{(2)}$ .

As a result, the corresponding signal-to-noise ratio (SNR) for the equivalent transmission link from ${\sf S}_{2}$ to ${\sf S}_{1}$ can be expressed as

[TABLE]

where ${\bf C}_{{\rm n},1}\triangleq{\bf H}_{1,{\rm r}}{\bf H}_{1,{\rm r}}^{H}$ . Similarly, the SNR for the equivalent transmission link from ${\sf S}_{1}$ to ${\sf S}_{2}$ is

[TABLE]

with ${\bf C}_{{\rm n},2}\triangleq{\bf H}_{2,{\rm r}}{\bf H}_{2,{\rm r}}^{H}$ . Thus, the channel capacities at ${\sf S}_{1}$ , ${\sf S}_{2}$ , and ${\sf E}$ are given, respectively, by

[TABLE]

and

[TABLE]

where ${\bf C}_{\rm n,e}\triangleq{\rm diag}\left(\sigma^{2},\sigma^{2}\left(1+{\bf f}^{H}{\bf H}_{\rm r,e}{\bf f}\right)\right)$ is the equivalent noise covariance matrix at the eavesdropper ${\sf E}$ over the two time slots and the scalar factor $\frac{1}{2}$ is due to the fact that two time slots are required in order to accomplish one successful transmission. Then the achievable secrecy sum rate is given by [3, 4]

[TABLE]

where $[a]^{+}=\max(0,a)$ . Note that the secrecy sum-rate in (20) is the sum of secrecy rates provided by all the relay nodes. Since all the relay nodes may not have sufficiently strong fading channels in order to make a useful contribution to the secrecy sum-rate, selecting the appropriate relays as helpers can play a significant role in improving secrecy performance. In the next section, we will focus on the mechanism design approach in order to select the $K$ best relays that can make the most significant contribution.

However, since the relays selected will have greater opportunity222Note that in the proposed beamforming algorithm, the transmitters will transmit with sufficient power such that the energy harvesting requirements of all the selected relay nodes are satisfied at least to equality assuming that the relays report their true channel information. to harvest energy from the received signal, all the relays will be naturally interested in participating in the mechanism. The issue is that some of them may intentionally exaggerate their true information in order to be selected. We will focus on the incentive control mechanisms so that the participating relays are self-enforced to reveal the truth.

III. Truth-Telling Mechanism Design

This section provides a brief introduction of mechanism design. A mechanism $\mathcal{M}$ is defined by the tuple $\left({\mathcal{S}},t_{1},\dots,t_{N}\right)$ where $t_{i}$ for $i=1,\dots,N,$ represents the transfer payment of agent $i$ (or player $i$ )333In this paper, the terms “player” and “agent” will be used interchangeably. when the social choice is ${\mathcal{S}}$ . The transfer payment is the compensation paid by an agent in return to the social damage it causes to the others by being selected. Mechanism design (sometimes called reverse game theory) is a game theoretical tool that studies solutions for a class of private information games in order to achieve a specific system-wide outcome even though the agents are selfish [28]. In a mechanism, each agent reports its private information (referred to as ‘type’ in the native literature) to the designer that serves as the parameter of a valuation function quantifying its bid on a specific allocation outcome and the transfer payment. The most desirable criteria that the mechanism designers tend to achieve are incentive compatibility and social optimality. A mechanism is said to be incentive compatible if truth-telling becomes the dominant (best) strategy in the mechanism while the mechanism is social optimum if it can ensure the maximum aggregate utilities of all the agents in the system. The Vickrey-Clarke-Groves (VCG) mechanism [29, 30, 31] is well known to achieve these two goals. Hence, we consider the VCG mechanism in the relay selection problem in order to maximize the secrecy sum-rate.

A. VCG Mechanism

In the VCG mechanism, agents are the members of the society. All the agents announce their valuations for the auctioned items simultaneously. Hence, there is no way to know whether the agents are telling the truth. The design objective is to give the agents the right incentives to tell the truth. The social choice is a set of $K$ agents from a set of $N$ alternatives for $K$ identical auctioned items. In VCG mechanism, each winning agent must pay some compensation (i.e., transfer payment) for the social damage it causes. The more the damage, the higher is the transfer payoff. We will now present the framework to quantify how much each agent $i$ contributes to the rest of the society if selected.

Let $v_{i}\left({\mathcal{X}},\theta_{i}\right)$ denote the valuation by agent $i$ from alternative ${\mathcal{X}}$ given the true information $\theta_{i}$ . We also denote ${\mathcal{O}}(\hat{\theta}_{i},\hat{{\theta}}_{-i})$ as the utilitarian alternative (i.e., outcome of the mechanism) chosen from the available set of alternatives based on the reported information $\{\hat{\theta}_{i}\}_{i=1}^{N}$ , as opposed to the true information $\{\theta_{i}\}_{i=1}^{N}$ , where the variable $\hat{{\theta}}_{-i}\triangleq\{\hat{\theta}_{1},\dots,\hat{\theta}_{i-1},\hat{\theta}_{i+1},\dots,\hat{\theta}_{N}\}$ is defined as the set of reported information of all the agents except agent $i$ . Also, ${\mathcal{O}}_{-i}(\hat{\theta}_{j},\hat{{\theta}}_{-j})$ represents the utilitarian alternative when agent $i$ does not take part in the mechanism. Note that the type profile $\hat{{\theta}}\triangleq\{\hat{\theta}_{1},\dots,\hat{\theta}_{N}\}$ is an ordered list in the decreasing manner.

The total welfare of the society (excluding $i$ ) is thus given by $\sum_{j\neq i}^{K}v_{j}\left({\mathcal{O}}(\hat{\theta}_{j},\hat{{\theta}}_{-j}),\theta_{j}\right)$ . If agent $i$ were not a member of the society, then the social welfare would be changed to $\sum_{j=1}^{K}v_{j}\left({\mathcal{O}}_{-i}(\hat{\theta}_{j},\hat{{\theta}}_{-j}),\theta_{j}\right)$ . The difference in the social welfare with and without the presence of agent $i$ is a measure of how much agent $i$ contributes to the rest of the society. In the VCG mechanism, agent $i$ receives a monetary transfer payment equal to the amount it contributes to the rest of the society. As a result, the VCG mechanism is characterized by the following monetary transfer payment function

[TABLE]

Note that the two summation operations in (21) and (22) are conducted within two different sets of alternatives namely ${\mathcal{O}}(\hat{\theta}_{i},\hat{{\theta}}_{-i})$ and ${\mathcal{O}}_{{-i}}(\hat{\theta}_{j},\hat{{\theta}}_{-j})$ . The first sum $\sum_{j\neq i}^{K}v_{j}\left({\mathcal{O}}(\hat{\theta}_{j},\hat{{\theta}}_{-j}),\theta_{j}\right)$ in (17) includes ( $K-1$ ) terms while the second sum $\sum_{j=1}^{K}v_{j}\left({\mathcal{O}}_{-i}(\hat{\theta}_{j},\hat{{\theta}}_{-j}),\theta_{j}\right)$ includes $K$ different terms. Thus given a type profile $\hat{\theta}$ , the monetary transfer to agent $i$ is defined by the total value of all agents other than $i$ when agent $i$ is present in the system minus the total value of all agents when agent $i$ is absent in the system. The value is always negative since the sum of apparently (in absence of the $i$ th item) highest $K$ valuations is subtracted from the sum of the highest $(K-1)$ valuations. Note that the transfer payment of agent $i$ is independent of its own valuation $v_{i}$ . The difference of the first two terms in (22) represents the marginal contribution of agent $i$ to the system which is given as a discount to agent $i$ by the VCG payment mechanism. It is evident from (22) that all the $K$ winning bidders pay a social damage recovery payment equal to the highest non-winning (i.e., the $(K+1)$ -st) bid, whereas a losing bidder pays nothing, i.e.,

[TABLE]

In the VCG mechanism, the highest $K$ bidders win and the winning bidder $i$ attains a utility (payoff) of

[TABLE]

Note that the penalty method to prevent reporting false information by agent $i$ is imposed by the transfer payment $t_{i}\left(\hat{\theta}_{i},\hat{\theta}_{-i}\right)$ in (25) which distinguishes mechanism design from conventional game theory. In conventional game theory, the agents can exaggerate their private information arbitrarily in order to be selected such that their own payoff is maximized. But in the VCG mechanism, the transfer payment will penalize them if they do so. Thus the selected utilitarian alternative maximizes the sum of the announced valuations, i.e.,

[TABLE]

where the equality holds only when all the agents reveal their true private information. Let us now elaborate the VCG payment mechanism through a simple numerical example.

Example 1 VCG Transfer Payment

Consider five agents $\{1,2,3,4,5\}$ with valuations $v_{1}=22$ , $v_{2}=18$ , $v_{3}=15$ , $v_{4}=12$ and $v_{5}=8$ participating in a sealed bid auction for three identical items available for auction. Each bidder can bid for one item only. Applying the VCG mechanism, bidders $1,2,$ and $3$ should win since their bids confirm the maximum social welfare $(22+18+15=55)$ . The transfer payment by bidder $1$ is calculated as

[TABLE]

Thus bidder $1$ pays an amount $(12)$ equal to the highest non-winning bid $v_{4}=12$ for the social damage caused by its selection. Similarly, the transfer payments paid by bidders $2$ and $3$ both equal to $12$ . Note that the payments are consistent with their respective marginal contributions. The marginal contribution of agent $1$ is given by

[TABLE]

which is given as a discount to agent $1$ resulting in a transfer payment of $10-22=-12$ . Similarly, the marginal contribution of agents $2$ and $3$ can be computed as $(22+18+15)-(22+15+12)=6$ and $(22+18+15)-(22+18+12)=3$ .

Thus the utilities of the agents can be computed as $u_{1}=22-12=10,u_{2}=18-12=6,u_{3}=15-12=3,u_{4}=0,u_{5}=0$ .

Let us now assume that agent $4$ announces an exaggerated valuation of $v_{4}=22$ , as opposed to its true valuation $12$ , with a desire to win. Thus the agents $\{1,2,4\}$ win and their transfer payments can be obtained as $t_{1}=t_{2}=t_{4}=-15$ , which is equal to the highest non-winning bid. The corresponding payoffs of the winning bids are computed as $u_{1}=22-15=7,u_{2}=18-15=3,u_{4}=12-15=-3$ . Note that a negative utility of agent $4$ indicates that the agent must pay additional amount from its own pocket in order to comply with the auction rules. Now the total social welfare counts to $\sum_{i=1}^{5}u_{i}=7+3-3+0+0=7$ as opposed to $19$ if all the agents would have announced their true valuations. Thus the VCG mechanism gives the incentives that if any of the agents announces untrue valuation, that may damage the total social benefit as well as its own utility. $\blacksquare$

In the following, we apply the VCG mechanism for relay selection in a two-way communication system in presence of an eavesdropper.

B. VCG Mechanism for Relay Selection

We consider the channel coefficients of each relay node with the two source nodes and the eavesdropping node as the private information of that relay node. The relay nodes report their channel information $\hat{g}_{i}\triangleq\{\hat{h}_{1,i},\hat{h}_{2,i},\hat{h}_{{\rm e},i}\}$ to the source nodes (or the mechanism designer) simultaneously. Through reporting their CSI, the relay nodes actually commit to the mechanism designer the level of secrecy rates they can provide for the two source nodes. We assume that the selected relay nodes must keep their commitments during their transmission in the second phase. Although the reported information may not be the same as the true ones, the mechanism designer will select the relays treating them as true. Let $g_{i}\triangleq\{{h}_{1,i},{h}_{2,i},{h}_{{\rm e},i}\}$ denote ${\sf R}_{i}$ ’s true channel information and $C_{i,{\rm s}}(g_{i})$ denote the achievable secrecy sum rate through relay ${\sf R}_{i}$ . Note that the information leakage during the first time slot is not affected by the social choice of relays and we assume that the relays do cooperative null space beamforming towards the eavesdropper’s channel.444This will be elaborated in Section IV Hence, $C_{i,{\rm s}}(g_{i})$ can be defined as a function of the equivalent two-way single-input single-output (SISO) channel only. After removing the self-interference, the equivalent SISO channel from ${\sf S}_{2}$ to ${\sf S}_{1}$ via ${\sf R}_{i}$ can be modelled as

[TABLE]

and that from ${\sf S}_{1}$ to ${\sf S}_{2}$ is given by

[TABLE]

where $\alpha_{i}\triangleq\left(p_{{\rm s},1}|h_{1,i}|^{2}+p_{{\rm s},2}|h_{2,i}|^{2}+\sigma^{2}\right)^{-\frac{1}{2}}$ is the amplification factor satisfying the power constraint at relay $i$ and $p_{\rm r}$ is the available relay power budget. Thus ${\sf R}_{i}$ ’s independent valuation can be defined as

[TABLE]

Note that by dividing the numerator and the denominator of both logarithmic terms in (28) by $\alpha_{i}^{2}p_{\rm r}$ , $C_{i,{\rm s}}\left(g_{i}\right)$ can be shown as an increasing function of $p_{{\rm s},1},p_{{\rm s},2}$ and $p_{\rm r}$ . Hence during the mechanism design phase, we obtain $C_{i,{\rm s}}(g_{i})$ assuming $p_{{\rm s},1},p_{{\rm s},2}$ and $p_{\rm r}$ hold their maximum possible value. Thus the utilitarian alternative ${\mathcal{O}}\left(\hat{g}_{i},\hat{g}_{-i}\right)$ based on the reported channel information can be defined as

[TABLE]

Note that based on the definitions of the two sets ${\mathcal{O}}(\cdot)$ and ${\mathcal{O}}_{-i}(\cdot)$ , the output in (29) of the proposed mechanism design is a set $\{{\sf R}_{k}\}$ of $K$ relay nodes.

Let us define that $\pi_{i}$ is the average harvested power (price paid) against per unit of secrecy rate achieved by relay $i$ . It is worth mentioning that the unit price $\pi_{i}$ may vary amongst the relays depending on their channel fading conditions. Thus the utility of ${\sf R}_{i}$ can be defined independently as

[TABLE]

Note that in the existing game-theoretic approaches adopted in secrecy communication, the agents receive some virtual payment usually in terms of secrecy rate or transmit power [19], which has no operational meaning to them. However, we propose the utility to be paid through some physical entity (e.g., harvested energy) for the first time. In this paper, we assume that only the relay nodes selected by the mechanism designer can get payoff i.e., harvest required energy from the first time slot. Although this may not always be the case in practice, it is a valid (reasonable) assumption since the mechanism designer selects the relays with the best channel conditions. Essentially, the unselected relay nodes, which have worse channel conditions as guaranteed by the proposed mechanism design, will harvest almost nothing. Applying energy beamforming555We do not consider energy beamforming in this paper. Readers are referred to [27, 24] for energy beamforming strategies. [24] at both transmitting nodes, one can fully guarantee that the unselected relays will not be able to harvest any energy from the transmitters’ signals. However, designing such spatially selective energy beamforming is a complicated task [12, 27, 24] and requires additional resources (e.g., physical antennas) at the two transmitters, which is not compatible with the system settings (single-antenna transmitters) considered in this paper. Hence, in order to keep the main focus of this paper on mechanism design, we would like to leave transmit energy beamforming design as a potential future work. Since only the selected relay nodes can get payoff, some dishonest relays may exaggerate their channel information in order to create greater opportunity to be selected. This may result in an unfair selection and damage the expected payoff of the unselected relay nodes. Essentially, this will adversely affect the secrecy sum rate and no equilibrium can be achieved under this condition [30, 19]. Hence we aim at designing a useful mechanism that can assist in controlling the incentives of the relays through imposing some penalty functions for the dishonest relay nodes. The penalty function will ensure that if any relay node is selected based on its exaggerated channel information, it will pay more transfer payment for the social damage caused from its own source of power in order to guarantee the required level of secrecy rate at each source node.

In order to better clarify the motivation that drives the relays to exaggerate their true valuations (i.e., CSI in this case), we introduce the probability of being selected affecting their valuation decision. The higher the valuation, the higher the probability of being selected, and so is the expected payoff. In this context, we assume that the relay nodes do not know the channel information of the other relays before they actually enact their channel information but generally know that the secrecy rate of each relay obeys certain probability density function $\left(0\leq C_{i,{\rm s}}(g_{i})<\infty\right)$ . Thus we define the reported valuation of ${\sf R}_{i}$ as

[TABLE]

where ${\rm Pr}(A)$ indicates the probability that the event $A$ occurs. Accordingly, the expected payoff of ${\sf R}_{i}$ can be defined as

[TABLE]

Given the relay selection criterion (29), the natural incentive of a relay would thus be to exaggerate its achievable secrecy rate $C_{i,{\rm s}}\left(\hat{g}_{i}\right)$ to $\infty$ in order to get the maximum expected payoff, which eventually increases their probability of being selected. Hence we introduce the following VCG transfer payment function

[TABLE]

where ${\mathcal{O}}_{-i}(\cdot)$ is the relay selection outcome when ${\sf R}_{i}$ does not participate in the mechanism. It is obvious from (33) that if a relay node claims a higher secrecy rate by tempering $\hat{h}_{1,i}$ or $\hat{h}_{2,i}$ , it may have more chances to be selected, but runs the risk of paying extra transfer payoff through spending from its own source of power.666The exact mechanism to implement this will be discussed in Section IV. On the other hand, if a relay node reports a lower secrecy rate, it will receive a higher monetary compensation but at the cost of lower probability to be selected. Hence truth-telling is the dominant strategy in the proposed VCG mechanism. The idea will be elaborated in Section V through numerical examples. In the VCG mechanism based relay selection algorithm, the total payoff of ${\sf R}_{i}$ is given by

[TABLE]

The following theorem describes the strength of the VCG mechanism for relay selection.

Theorem 1

Announcing truthfully, i.e., $\hat{g}_{i}=g_{i}$ is a dominant strategy for each relay $i$ .

*Proof: * We need to prove that announcing $\hat{g}_{i}=g_{i}$ is the best strategy for relay $i$ no matter what other relays announce. If relay ${\sf R}_{i}$ announces $\hat{g}_{i}$ and others announce $\hat{g}_{-i}$ , then according to (34), ${\sf R}_{i}$ ’s utility is $u_{i}\left(\hat{g}_{i},\hat{g}_{-i}\right)=v_{i}\left({\mathcal{O}}(\hat{g}_{i},\hat{g}_{-i}),g_{i}\right)+\sum_{j\neq i}^{K}v_{j}\left({\mathcal{O}}(\hat{g}_{j},\hat{g}_{-j}),g_{j}\right)-\sum_{j=1}^{K}v_{j}\left({\mathcal{O}}_{-i}(\hat{g}_{j},\hat{g}_{-j}),g_{j}\right)$ . Relay $i$ has to decide which $\hat{g}_{i}$ to announce; however, it cannot determine ${\mathcal{O}}_{-i}(\hat{g}_{j},\hat{g}_{-j})$ since it is excluded from that society. Hence, we can ignore the last term in $u_{i}\left(\hat{g}_{i},\hat{g}_{-i}\right)$ as it is unaffected by ${\sf R}_{i}$ ’s announcement. Therefore, in order to maximize its own payoff, relay ${\sf R}_{i}$ aims to maximize the total utility of the society inclusive of itself. Since relay ${\sf R}_{i}$ cannot choose other relays’ announcements, it can only play its own part. That is, by truthfully announcing, $\hat{g}_{i}=g_{i}$ , it can ensure that ${\mathcal{O}}(g_{i},\hat{g}_{-i})$ will be chosen. Hence announcing truthfully is the best thing relay ${\sf R}_{i}$ can do. $\square$

Note that each relay node competing to be selected will have the same incentive to report its true CSI and the $K$ relays that can achieve the top $K$ secrecy rates will be selected which will eventually maximize the total payoff. Thus equilibrium is achieved under this condition.

Interestingly, the only additional task for implementing the proposed mechanism in relay selection, as opposed to conventional relay selection, is the calculation of the transfer payments, which involves simple mathematical operations. In return, the benefit is that the mechanism enforces the relays to reveal their true CSI. No additional signalling is needed since the node performing the optimization and/or relay selection can effectively implement the mechanism. A quantitative comparison of benefits has been provided in Example 1. As demonstrated in the example, if agent $4$ announces an exaggerated valuation, the total social welfare counts to $7$ as opposed to $19$ if all the agents would have announced their true valuations. Thus the VCG mechanism gives the incentives that if any of the agents announces untrue valuation, that may damage the total social benefit as well as its own utility.

Once the best relays are selected based on their reported channel information, the optimization of the transmit power and cooperative relay beamforming is conducted, which we discuss in the next section.

IV. Optimal Transmit Power and Relay Beamforming Design

In this section, we propose transmit power and cooperative relay beamforming optimization schemes assuming that full CSI of all the nodes is available. Although in some practical communication systems, obtaining the eavesdropper’s CSI can be very difficult (or even impossible), for the ease of exposition, we assume that the relays know their channels with the transmitters as well as the eavesdropper. This is a reasonable assumption for scenarios where the eavesdropper is an active user of the system, and the transmitter aims to provide different services to different types of users. For such active eavesdroppers, the CSI can be estimated from the eavesdropper’s transmission. Let us define $P_{{\rm b},i}\triangleq P_{{\rm h},i}-|f_{i}|^{2}\left(p_{{\rm s},1}|h_{1,i}|^{2}+p_{{\rm s},2}|h_{2,i}|^{2}+\sigma^{2}\right)$ as the net power to be stored in the battery of the $i$ th relay. The overall objective is to increase $C_{1}$ and $C_{2}$ as well as $P_{{\rm b},i}$ as much as possible while keeping $C_{\rm e}$ as small as possible under peak power constraints at the two transmitters as well as each relay node. Hence we formulate the following optimization problem

[TABLE]

Here $P_{\rm max}$ and $p_{\rm r}$ are the available power budgets at the two sources and each of the relay nodes, respectively. Note that the last term in (35a) indicates the saved power of the worst selected relay. In general, it may happen that $P_{{\rm b},i}$ is negative, which essentially means that the $i$ th selected relay may need to contribute additional power from its own storage in order to maintain its reported secrecy rate. However, the constraint (35d) ensures that each of the selected relays gets its appropriate payoff. To guarantee that the relay nodes do not need to use their own source of power, they may set $p_{\rm r}\leq u_{i}\left(\hat{g}_{i},\hat{g}_{-i}\right)$ . Then the constraints (35d) and (35d) jointly guarantee that the honest selected relays can harvest sufficient energy required for their transmission in the second phase. However, there is no guarantee that a dishonest relay will be able to harvest appropriate amount of energy since they likely have weaker fading channels than what they have reported. Since we assume that the selected relays transmit with sufficient power during the second phase such that their promised secrecy rates at two sources are maintained, only the honest relay nodes do not need to utilize their own source of power. Although $u_{i}\left(\hat{g}_{i},\hat{g}_{-i}\right)$ can assume any value in a general sense, we obtain $u_{i}\left(\hat{g}_{i},\hat{g}_{-i}\right)$ from (34) assuming $p_{{\rm s},1}=p_{{\rm s},2}=P_{\rm max}$ .

Note that the objective function in (35a) includes the product of three correlated Rayleigh quotients, which is neither convex, nor concave, and is in general very difficult to solve. However, a more tractable but suboptimal strategy for designing beamforming is to choose the beamforming vector lying in the null space of the eavesdropper’s channel in the second time slot. The corresponding beamforming optimization problem is to maximize the sum rate achieved at two sources instead of sum secrecy rate. Because we cannot cancel the information rate leakage to the eavesdropper during the first time slot, the impact of the eavesdropper’s achievable information rate on the secrecy sum rate should be considered when optimizing the beamforming vector as well as two source powers. As such, we can try to degrade the eavesdropper’s interception by constraining its maximum allowable information rate with a predetermined level $r_{\rm e}$ , which can help avoid dealing with the rate difference of concave functions in (35a). If the relay nodes choose the beamforming vector ${\bf f}$ lying in the null space of the eavesdropper’s equivalent channel vectors, then the information leackage in the second phase is completely eliminated, i.e., ${\bf f}^{H}{\bar{\bf h}}_{1,{\rm e}}={\bf f}^{H}{\bar{\bf h}}_{2,{\rm e}}=0$ so that the second row of ${\bf H}_{\rm e}$ in (14) can be eliminated. Thus $C_{\rm e}$ reduces to

[TABLE]

Introducing a real-valued slack variable $\nu$ , we reformulate problem (35) as

[TABLE]

where ${\bf f}={\bar{\bf H}}_{\rm e}^{\dagger}{\bar{\bf f}}$ , ${\bar{\bf f}}$ is any vector, ${\bar{\bf H}}_{\rm e}^{\dagger}$ is the projection matrix onto the null space of ${\bar{\bf H}}_{\rm e}\triangleq\left[{\bar{\bf h}}_{1,{\rm e}},{\bar{\bf h}}_{2,{\rm e}}\right]$ , the columns of which constitute the orthogonal basis for the null space of ${\bar{\bf H}}_{\rm e}$ . Note that from (37f), the transmit power of the $i$ th relay node can be expressed as $\left[{\bf f}{\bf f}^{H}\right]_{i,i}\left[{\bf R}_{\rm s}\right]_{i,i}$ with ${\bf R}_{\rm s}=p_{{\rm s},1}{\bf H}_{1,{\rm r}}{\bf H}_{1,{\rm r}}^{H}+p_{{\rm s},2}{\bf H}_{2,{\rm r}}{\bf H}_{2,{\rm r}}^{H}+\sigma^{2}{\bf I}_{K}$ . Also, for given $p_{{\rm s},1}$ and $p_{{\rm s},2}$ , we can see from (37) that (37f), (37f), and (37f) are irrelevant to ${\bf f}$ . However, the problem is still non-convex since the objective function is not concave. Hence we split the objective function and formulate the following relay beamforming optimization problem

[TABLE]

where $r_{0}$ is the objective value for the sum rate in (37a) and $\beta\in[0,1]$ is the rate splitting coefficient. The optimal solution of the problem can be found in two steps. First we solve problem (38) for a feasible $r_{0}$ to obtain ${\bf f}$ . Then we perform a one-dimensional search on $\beta$ to find the maximum $r_{0}$ for which problem (38) is feasible. The lower bound of the rate search is definitely [math]. However, to define the upper bound $r_{\rm max}$ , we first decouple the two-way relay channel into two one-way relay channels and obtain the rate $r_{i}$ of each one-way channel. Then the upper limit can be defined as $r_{\rm max}=2\times{\rm max}(r_{1},r_{2})$ . Let us now substitute ${\bf f}={\bar{\bf H}}_{\rm e}^{\dagger}{\bar{\bf f}}$ in (38) to obtain

[TABLE]

Problem (39) is a non-convex quadratically constrained quadratic programming (QCQP) problem which is $NP$ -hard in general. We reformulate problem (39) as follows:

[TABLE]

where $\eta_{1}\triangleq\sigma^{2}\left(2^{2\beta r_{0}}-1\right)/p_{{\rm s},2}$ , $\eta_{2}\triangleq\sigma^{2}\left(2^{2(1-\beta)r_{0}}-1\right)/p_{{\rm s},1}$ , $\sqrt{{\bf C}_{{\rm n},i}}$ is the element-wise square root of ${\bf C}_{{\rm n},i}$ , and ${\bar{\bf H}}_{\rm e}^{{\dagger}(i)}$ indicates the $i$ th row of ${\bar{\bf H}}_{\rm e}^{{\dagger}}$ . Since the constraints in (40) are expressed in terms of Euclidean vector norms, multiplying the optimal ${\bar{\bf f}}$ by an arbitrary phase shift $e^{j\phi}$ will not affect the constraints. Also, by definition, ${\bf h}_{2,1}$ and ${\bf h}_{1,2}$ yield identical numeric value. Therefore, ${\bar{\bf f}}^{H}{\bar{\bf H}}_{\rm e}^{{\dagger}H}{\bf h}_{i,j}$ can be considered as a real number, without loss of generality. Consequently, (40) can be rewritten as

[TABLE]

where $\tilde{\bf f}\triangleq\left[{\bar{\bf f}}^{T},1\right]^{T}$ , $\tilde{\bf h}_{i,j}^{H}=\left[\left({\bar{\bf H}}_{\rm e}^{{\dagger}H}{\bf h}_{i,j}\right)^{H},0\right]$ , ${\tilde{\bf h}}_{{\rm e},i}=\left[{\bar{\bf H}}_{\rm e}^{{\dagger}(i)},0\right]^{T}$ , and $\tilde{\bf C}_{{\rm n},i}=\left[\begin{array}[]{cc}\sqrt{{\bf C}_{{\rm n},i}}{\bar{\bf H}}_{\rm e}^{{\dagger}H}&{\bf 0}\\ {\bf 0}&1\end{array}\right]$ . Note that (41) is a standard SOCP problem which can be efficiently solved by interior point methods [32]. Once the optimal relay beamforming vector ${\bf f}$ is obtained, we formulate the following problem using the monotonic property of the $\log$ function to find the optimal $p_{{\rm s},1}$ and $p_{{\rm s},2}$ :

[TABLE]

where $\mu_{i}=\frac{{\bf f}^{H}{\bf h}_{i,j}{\bf h}_{i,j}^{H}{\bf f}}{\sigma^{2}\left(1+{\bf f}^{H}{\bf C}_{{\rm n},j}{\bf f}\right)},i,j=1,2,i\neq j$ . The problem (42) is convex for given $\rho_{i}$ and hence the globally optimal solution can be easily obtained using existing solvers [33]. Thus we update the relay beamforming vector ${\bf f}$ and the transmit powers $p_{{\rm s},1}$ and $p_{{\rm s},2}$ alternatingly. Since we solve a convex subproblem at each step of the alternating algorithm, the objective function can either increase or maintain, but cannot decrease at each step of the algorithm. A monotonic convergence follows directly from this observation. The algorithm is summarized in Table I.

-A Complexity of the Algorithm

We now focus on the computational complexity of the proposed optimization scheme. We analyze the complexity of the alternating algorithm step-by-step. Note that the relay beamforming optimization problem (41) involves only SOC constraints, and hence can be solved using standard interior-point methods (IPM) [34, Lecture 6]. Therefore, we can use the worst-case computation time of IPM to analyze the complexity of the proposed method. Now the overall complexity of the IPM for solving an SOCP problem containing $p$ constraints consists of two components:

a)

Iteration Complexity: The number of iterations required to reach an $\epsilon$ -accurate ( $\epsilon>0$ ) optimal solution is in the order of $\ln(1/\epsilon)\sqrt{\beta(\mathcal{K})}$ , where $\beta(\mathcal{K})=2p$ is known to be the barrier parameter.

b)

Per-Iteration Computation Cost: A system of $n$ linear equations is required to be solved in each iteration where $n$ is the number of decision variables. The computation tasks include the formation of the coefficient matrix $\bf H$ of the system of linear equations and the factorization of $\bf H$ . The cost of forming $\bf H$ sums on the order of $\kappa_{\rm for}=n\sum_{j=1}^{p}k_{j}^{2}$ , $k_{j}$ is the dimension of the $j$ th cone, while the cost of factorization is on the order of $\kappa_{\rm fac}=n^{3}$ [34].

Thus the overall computation cost for solving the problem using IPM is on the order of $\ln(1/\epsilon)\sqrt{\beta(\mathcal{K})}\\ \times(\kappa_{\rm for}+\kappa_{\rm fac})$ . Using these concepts, we can now analyze the computational complexity of problem (37). Note that the number of decision variables $n$ is on the order of $K$ (ignoring the slack variables). Now, the problem (37) has $p=(2K+2)$ SOC constraints. Thus the complexity of solving problem (37) is on the order of $4K\sqrt{(K+1)}\mathcal{O}(K)[(K+1)^{2}+K^{2}+1]\ln(1/\epsilon)$ .

In the next step of the algorithm, problem (42) is solve, which is a linear programming problem. Now the linear program (42) can be solved in polynomial time at a worst-case complexity of $\mathcal{O}\left(3^{3.5}(3K+3)^{2}\right)$ [35].

V. Simulation Results

In this section, we study the performance of the proposed mechanism design and joint source-relay optimization algorithm for a two-way relay system through numerical simulations. We simulate a flat Rayleigh fading environment where the channel coefficients are randomly generated as zero-mean and unit-variance complex Gaussian random variables. The noise variance $\sigma^{2}$ is assumed to be unity. For simplicity, the power splitting coefficient $\rho_{i},\forall i$ , is fixed at $0.5$ .

In the first few examples, we demonstrate the effectiveness of the VCG mechanism in self-enforcing truth-telling. Then we provide performance comparison of the proposed joint transmit power and cooperative relay beamforming optimization with some conventional schemes.

For the demonstration of the mechanism design examples, we assign randomly generated values $v_{i}(g_{i})$ instead of calculating $C_{i,{\rm s}},\forall i,$ which does not affect the relay selection mechanism. It is assumed that although relay $i$ does not know other relays’ reported valuation, it knows that every reported value $v_{-i}(g_{-i})$ obeys the probability density function $e^{-x_{i}}$ where the random variable $x_{i}\triangleq v_{-i}(g_{-i})$ , $x_{i}\in[0,\infty)$ and $\int_{0}^{+\infty}e^{-x_{i}}dx_{i}=1$ . For simplicity, it is assumed that the price paid per unit of secrecy rate is $\pi_{i}=1,~{}\forall i$ .

In Fig. 2, we illustrate how the VCG mechanism works using randomly generated true values of $x_{i}$ ’s as $\{1.1101,1.4321,0.4567,0.3690,0.8421\}$ where the mechanism is to select $K=3$ relays from $N=5$ alternatives. The payoff of each relay node is plotted versus reported $x_{i}$ values. Note that if all the relays report their true values, then ${\sf R}_{1}$ , ${\sf R}_{2}$ , and ${\sf R}_{5}$ will be selected and they get their maximum payoff at their true reported values of $1.1101$ , $1.4321$ , and $0.8421$ . It can be observed from Fig. 2 that both ${\sf R}_{1}$ , ${\sf R}_{2}$ , and ${\sf R}_{5}$ start receiving positive payoff only after their reported values exceed the highest of the unselected relays’ reported values since their selection is not guaranteed otherwise. Also, if either ${\sf R}_{3}$ or ${\sf R}_{4}$ reports a value higher than that of ${\sf R}_{5}$ ( $0.8421$ ), it will be selected instead of ${\sf R}_{5}$ . At that point, the selected relay gets a negative payment which indicates that it needs to use its own source of transmit power for relaying the signal, since it cannot harvest sufficient power due to a poorer actual channel. It is also evident from Fig. 2 that as long as a relay is not selected, it gets (or pays) nothing.

In the next example, we show the effect of exaggerated reported value by a particular relay ( ${\sf R}_{3}$ ) which is likely to be unselected based on its true channel information assuming that other relays report their true information. Results in Fig. 3 illustrate the fact that the exaggerated reported value of ${\sf R}_{3}$ damages not only its own payoff if selected, but also that of the other relay nodes, which essentially damages the overall system payoff. As discussed in Section III-A, this is due to the fact that the exaggerated reported value of ${\sf R}_{3}$ keeps a potential candidate ( ${\sf R}_{5}$ in this case) unselected, which results in a higher transfer payment of the selected relays. As soon as the reported value of ${\sf R}_{3}$ exceeds that of ${\sf R}_{5}$ , it is selected but receives a negative payoff. However, the payoff of ${\sf R}_{3}$ is always unaffected since there are always some higher reported values than that of ${\sf R}_{3}$ .

Note that the results in Figs. 2 and 3 represent the exact payoffs of the relay nodes without taking the probability of being selected into consideration. Hence the payoff of any relay was zero if unselected. However, relays may take the probability of being selected into consideration when deciding which value to report. That will essentially affect their expected payoff as well. In the next example, we intend to show that truth-telling is the best strategy for the relays through their expected payoff where we want to select $K=3$ relays from $N=5$ alternatives. Results in Fig. 4 show the expected payoff of the relays when their reported values follow negative exponential probability distribution assuming their true affordable secrecy rate of $\{1.1101,1.4321,0.4567,0.3690,0.8421\}$ . We consider a large number ( $10^{5}$ ) of sample values to calculate the average expected payoff of each relay node at any given reported value. It is now more clearly indicated in Fig. 4 that truth-telling is the dominant strategy in VCG mechanism. Any agent can expect its maximum payoff only when it reports its true channel information. We can also observe that the larger the true secrecy value of a relay node, the higher the expected payoff. Also, the maximum expected payoff of any relay node is actually less than $u_{i}\left(\hat{g}_{i}\right)$ which is because each selected relay node has to pay a mandatory transfer payment as a recovery for the social damage caused by its selection.

The above numerical examples reveal that the VCG mechanism gives the right incentive to the bidders in an auction to disclose their true valuation. Given the mechanism has been implemented perfectly, we now focus on the joint transmit power and cooperative relay beamforming optimization. In order to demonstrate the gain achieved by the proposed SOCP-based joint transmit power and relay beamforming algorithm, we compare the secrecy sum-rate performance of the proposed joint optimization algorithm with that of the relay-only optimization and the conventional randomization-guided semidefinite relaxation (SDR) schemes [36, 37] in the next example. In the relay-only optimization scheme, the two source nodes transmit at fixed power (not optimized). That is, we solve problem (41) with fixed $p_{{\rm s},1}=p_{{\rm s},2}=\frac{P_{\rm max}}{K+2}$ . Note that relay-only optimization is considered for the SDR scheme as well.

In Fig. 5, we compare the secrecy sum rate performance of the proposed algorithm (‘Joint opt.’ in the figure) with the relay-only optimization (‘Relay-only opt.’), and the SDR method of relay beamforming design followed by randomization technique (‘SDR approach’). In this example, we select $K=3$ and $4$ relays from a set of $N=8$ alternatives. Note that we initialize the algorithm in Section IV with $p_{{\rm s},1}=p_{{\rm s},2}=p_{\rm r}=\frac{P_{\rm max}}{K+2}$ and update the transmit powers and relay beamforming vector alternatingly. For updating the transmit powers, we set the tolerable information leakage threshold $r_{\rm e}=1$ (bps/Hz). Fig. 5 shows the performance improvement by the proposed joint optimization algorithm compared to the other two schemes. Since in the randomization approach, some of the constraints may be violated, the performance of the SDR algorithm is severely degraded. For example, at $P_{\rm max}=10$ dB, the proposed relay-only optimization algorithm achieves more than $1$ bps/Hz higher secrecy sum rate than the randomization approach.

Finally, we show the convergence of the proposed alternating algorithm by evaluating the number of iterations required to converge to an accuracy of $10^{-3}$ . We generated four random channel realizations (Channels- $1,2,3,4$ ) and solved problem (35). Fig. 6 shows the convergence of the secrecy sum rate maximization problem in different channel realizations with an initial $p_{{\rm s},1}=p_{{\rm s},2}=P_{\rm max}$ for $N=5$ and $K=2$ . It can be observed that the proposed algorithm achieves a fast convergence in various channel scenarios.

VI. Conclusions

In this paper, we considered two-way secret communications via energy harvesting relay nodes. In order to maximize the secrecy rate, the source nodes selected the most suitable relay nodes from the available alternatives. The selected relay nodes, in return, could harvest energy which is guaranteed at least to the minimum payoff level. A self-enforcing truth-telling mechanism design approach was adopted for the relay selection procedure that guarantees that the relays will not exaggerate their true information in order to be selected to gain illegal payoff. We then proposed a joint cooperative relay beamforming and transmission power optimization algorithm in order to maximize the achievable sum secrecy rate. Designing strategies for dedicated transmit energy beamforming can be an interesting future work.

Bibliography37

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. R. A. Khandaker and Y. Rong, “Joint transceiver optimization for multiuser MIMO relay communication systems,” IEEE Trans. Signal Process. , vol. 60, pp. 5977–5986, Nov. 2012.
2[2] A. Toding, M. R. A. Khandaker, and Y. Rong, “Joint source and relay optimization for parallel MIMO relay networks,” EURASIP J. Adv. Signal Process. , vol. 2012:174, Aug. 2012.
3[3] H.-M. Wang, Q. Yin, and X.-G. Xia, “Distributed beamforming for physical-layer security of two-way relay networks,” IEEE Trans. Signal Process. , vol. 60, pp. 3532–3545, July 2012.
4[4] H.-M. Wang, M. Luo, Q. Yin, and X.-G. Xia, “Hybrid cooperative beamforming and jamming for physical-layer security of two-way relay networks,” IEEE Trans. Inf. Forensics and Security , vol. 8, pp. 2007–2020, Dec. 2013.
5[5] B. Rankov and A. Wittneben, “Achievable rate regions for the two-way relay channel,” in Proc. IEEE ISIT , Seattle, USA, 9-14 July 2006, 1668-1672.
6[6] ——, “Spectral efficient protocols for half-duplex fading relay channels,” IEEE J. Sel. Areas Commun. , vol. 25, pp. 379–389, Feb. 2007.
7[7] M. R. A. Khandaker and K.-K. Wong, “Joint source and relay optimization for interference MIMO relay networks,” EURASIP J. Adv. Signal Process. , to appear, 2017.
8[8] E. Tekin and A. Yener, “The general Gaussian multiple-access and two-way wiretap channels: Achievable rates and cooperative jamming,” IEEE Trans. Inf. Theory , vol. 54, pp. 2735–2751, June 2008.