Capacity of Generalized Discrete-Memoryless Push-to-Talk Two-Way   Channels

Jian-Jia Weng; Fady Alajaji; and Tam\'as Linder

arXiv:1904.01473·cs.IT·February 1, 2021

Capacity of Generalized Discrete-Memoryless Push-to-Talk Two-Way Channels

Jian-Jia Weng, Fady Alajaji, and Tam\'as Linder

PDF

TL;DR

This paper extends Shannon's push-to-talk two-way channel model to include full-duplex and noisy reception, deriving the capacity region and demonstrating its properties under certain symmetry conditions.

Contribution

It introduces a generalized model of push-to-talk channels with full-duplex capability and characterizes its capacity region, which was not previously known.

Findings

01

Capacity region is the convex hull of at most 4 rate pairs.

02

Shannon's inner bound is tight under the symmetry property.

03

Examples illustrate different shapes of the capacity region.

Abstract

In this report, we generalize Shannon's push-to-talk two-way channel (PTT-TWC) by allowing reliable full-duplex transmission as well as noisy reception in the half-duplex (PTT) mode. Viewing a PTT-TWC as two state-dependent one-way channels, we introduce a channel symmetry property pertaining to the one-way channels. Shannon's TWC capacity inner bound is shown to be tight for the generalized model under this symmetry property. We also analytically derive the capacity region, which is shown to be the convex hull of (at most) 4 rate pairs. Examples that illustrate different shapes of the capacity region are given, and efficient transmission schemes are discussed via the examples.

Tables7

Table 1. TABLE I: The full and marginal transition matrices of Shannon’s PTT-TWC, where X j subscript 𝑋 𝑗 X_{j} and Y j subscript 𝑌 𝑗 Y_{j} denote user- j 𝑗 j ’s channel input and output, respectively, j = 1 , 2 𝑗 1 2 j=1,2 . The rows and columns are indexed by the channel inputs and outputs, respectively.

$(X_{1}, X_{2})$	$(0, 0)$	$(0, 1)$	$(1, 0)$	$(1, 1)$
$(0, 0)$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$
$(0, 1)$	$\frac{1}{2}$	$\frac{1}{2}$	$0$	$0$
$(0, 2)$	$0$	$0$	$\frac{1}{2}$	$\frac{1}{2}$
$(1, 0)$	$\frac{1}{2}$	$0$	$\frac{1}{2}$	$0$
$(1, 1)$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$
$(1, 2)$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$
$(2, 0)$	$0$	$\frac{1}{2}$	$0$	$\frac{1}{2}$
$(2, 1)$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$
$(2, 2)$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$

Table 2. (a) P Y 1 , Y 2 | X 1 , X 2 subscript 𝑃 subscript 𝑌 1 conditional subscript 𝑌 2 subscript 𝑋 1 subscript 𝑋 2 P_{Y_{1},Y_{2}|X_{1},X_{2}} [ 1 , Table I]

$(X_{1}, X_{2})$	$(0, 0)$	$(0, 1)$	$(1, 0)$	$(1, 1)$
$(0, 0)$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$
$(0, 1)$	$\frac{1}{2}$	$\frac{1}{2}$	$0$	$0$
$(0, 2)$	$0$	$0$	$\frac{1}{2}$	$\frac{1}{2}$
$(1, 0)$	$\frac{1}{2}$	$0$	$\frac{1}{2}$	$0$
$(1, 1)$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$
$(1, 2)$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$
$(2, 0)$	$0$	$\frac{1}{2}$	$0$	$\frac{1}{2}$
$(2, 1)$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$
$(2, 2)$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$	$\frac{1}{4}$

Table 3. (b) P Y 2 | X 1 , X 2 subscript 𝑃 conditional subscript 𝑌 2 subscript 𝑋 1 subscript 𝑋 2 P_{Y_{2}|X_{1},X_{2}}

$(X_{1}, X_{2})$	$0$	$1$
$(0, 0)$	$\frac{1}{2}$	$\frac{1}{2}$
$(1, 0)$	$1$	$0$
$(2, 0)$	$0$	$1$
$(0, 1)$	$\frac{1}{2}$	$\frac{1}{2}$
$(1, 1)$	$\frac{1}{2}$	$\frac{1}{2}$
$(2, 1)$	$\frac{1}{2}$	$\frac{1}{2}$
$(0, 2)$	$\frac{1}{2}$	$\frac{1}{2}$
$(1, 2)$	$\frac{1}{2}$	$\frac{1}{2}$
$(2, 2)$	$\frac{1}{2}$	$\frac{1}{2}$

Table 4. (c) P Y 1 | X 1 , X 2 subscript 𝑃 conditional subscript 𝑌 1 subscript 𝑋 1 subscript 𝑋 2 P_{Y_{1}|X_{1},X_{2}}

$(X_{1}, X_{2})$	$0$	$1$
$(0, 0)$	$\frac{1}{2}$	$\frac{1}{2}$
$(0, 1)$	$1$	$0$
$(0, 2)$	$0$	$1$
$(1, 0)$	$\frac{1}{2}$	$\frac{1}{2}$
$(1, 1)$	$\frac{1}{2}$	$\frac{1}{2}$
$(1, 2)$	$\frac{1}{2}$	$\frac{1}{2}$
$(2, 0)$	$\frac{1}{2}$	$\frac{1}{2}$
$(2, 1)$	$\frac{1}{2}$	$\frac{1}{2}$
$(2, 2)$	$\frac{1}{2}$	$\frac{1}{2}$

Table 5. TABLE II: Marginal transition matrices of a generalized PTT-TWC, where 0 ≤ a , b , c , d ≤ 2 3 formulae-sequence 0 𝑎 𝑏 𝑐 𝑑 2 3 0\leq a,b,c,d\leq\frac{2}{3} .

$(X_{1}, X_{2})$	$0$	$1$	$2$
$(0, 0)$	$\frac{1}{3}$	$\frac{1}{3}$	$\frac{1}{3}$
$(1, 0)$	$\frac{2}{3} - a$	$a$	$\frac{1}{3}$
$(2, 0)$	$a$	$\frac{2}{3} - a$	$\frac{1}{3}$
$(0, 1)$	$\frac{1}{3}$	$\frac{1}{3}$	$\frac{1}{3}$
$(1, 1)$	$\frac{2}{3} - b$	$b$	$\frac{1}{3}$
$(2, 1)$	$b$	$\frac{2}{3} - b$	$\frac{1}{3}$
$(0, 2)$	$\frac{1}{3}$	$\frac{1}{3}$	$\frac{1}{3}$
$(1, 2)$	$\frac{2}{3} - b$	$b$	$\frac{1}{3}$
$(2, 2)$	$b$	$\frac{2}{3} - b$	$\frac{1}{3}$

Table 6. (a) P Y 2 | X 1 , X 2 subscript 𝑃 conditional subscript 𝑌 2 subscript 𝑋 1 subscript 𝑋 2 P_{Y_{2}|X_{1},X_{2}}

$(X_{1}, X_{2})$	$0$	$1$	$2$
$(0, 0)$	$\frac{1}{3}$	$\frac{1}{3}$	$\frac{1}{3}$
$(1, 0)$	$\frac{2}{3} - a$	$a$	$\frac{1}{3}$
$(2, 0)$	$a$	$\frac{2}{3} - a$	$\frac{1}{3}$
$(0, 1)$	$\frac{1}{3}$	$\frac{1}{3}$	$\frac{1}{3}$
$(1, 1)$	$\frac{2}{3} - b$	$b$	$\frac{1}{3}$
$(2, 1)$	$b$	$\frac{2}{3} - b$	$\frac{1}{3}$
$(0, 2)$	$\frac{1}{3}$	$\frac{1}{3}$	$\frac{1}{3}$
$(1, 2)$	$\frac{2}{3} - b$	$b$	$\frac{1}{3}$
$(2, 2)$	$b$	$\frac{2}{3} - b$	$\frac{1}{3}$

Table 7. (b) P Y 1 | X 1 , X 2 subscript 𝑃 conditional subscript 𝑌 1 subscript 𝑋 1 subscript 𝑋 2 P_{Y_{1}|X_{1},X_{2}}

$(X_{1}, X_{2})$	$0$	$1$	$2$
$(0, 0)$	$\frac{1}{3}$	$\frac{1}{3}$	$\frac{1}{3}$
$(0, 1)$	$\frac{2}{3} - c$	$c$	$\frac{1}{3}$
$(0, 2)$	$c$	$\frac{2}{3} - c$	$\frac{1}{3}$
$(1, 0)$	$\frac{1}{3}$	$\frac{1}{3}$	$\frac{1}{3}$
$(1, 1)$	$\frac{2}{3} - d$	$d$	$\frac{1}{3}$
$(1, 2)$	$d$	$\frac{2}{3} - d$	$\frac{1}{3}$
$(2, 0)$	$\frac{1}{3}$	$\frac{1}{3}$	$\frac{1}{3}$
$(2, 1)$	$\frac{2}{3} - d$	$d$	$\frac{1}{3}$
$(2, 2)$	$d$	$\frac{2}{3} - d$	$\frac{1}{3}$

Equations36

\displaystyle\scalebox{1.0}{\mbox{$\displaystyle\mathcal{C}_{\text{I}}(P_{Y_{1},Y_{2}|X_{1},X_{2}})\triangleq\scalebox{1.0}{\mbox{$\displaystyle\overline{\text{co}}\left(\bigcup_{P_{X_{1}}P_{X_{2}}}\mathcal{R}(P_{X_{1}}P_{X_{2}},P_{Y_{1},Y_{2}|X_{1},X_{2}})\right)$}}$}},

\displaystyle\scalebox{1.0}{\mbox{$\displaystyle\mathcal{C}_{\text{I}}(P_{Y_{1},Y_{2}|X_{1},X_{2}})\triangleq\scalebox{1.0}{\mbox{$\displaystyle\overline{\text{co}}\left(\bigcup_{P_{X_{1}}P_{X_{2}}}\mathcal{R}(P_{X_{1}}P_{X_{2}},P_{Y_{1},Y_{2}|X_{1},X_{2}})\right)$}}$}},

\displaystyle\scalebox{1.0}{\mbox{$\displaystyle\mathcal{C}_{\text{O}}(P_{Y_{1},Y_{2}|X_{1},X_{2}})\triangleq\scalebox{1.0}{\mbox{$\displaystyle\bigcup_{P_{X_{1},X_{2}}}\mathcal{R}(P_{X_{1},X_{2}},P_{Y_{1},Y_{2}|X_{1},X_{2}})$}}$}},

\displaystyle\scalebox{1.0}{\mbox{$\displaystyle\mathcal{C}_{\text{O}}(P_{Y_{1},Y_{2}|X_{1},X_{2}})\triangleq\scalebox{1.0}{\mbox{$\displaystyle\bigcup_{P_{X_{1},X_{2}}}\mathcal{R}(P_{X_{1},X_{2}},P_{Y_{1},Y_{2}|X_{1},X_{2}})$}}$}},

\displaystyle[P_{Y_{2}|X_{1},X_{2}}(\cdot|\cdot,x_{2})]=\Bigl{(}\begin{smallmatrix}\bm{v}_{2}\\ \bm{Q}_{1,x_{2}}\\ \end{smallmatrix}\Bigr{)},

\displaystyle[P_{Y_{2}|X_{1},X_{2}}(\cdot|\cdot,x_{2})]=\Bigl{(}\begin{smallmatrix}\bm{v}_{2}\\ \bm{Q}_{1,x_{2}}\\ \end{smallmatrix}\Bigr{)},

\displaystyle[P_{Y_{1}|X_{1},X_{2}}(\cdot|x_{1},\cdot)]=\Bigl{(}\begin{smallmatrix}\bm{v}_{1}\\ \bm{Q}_{2,x_{1}}\\ \end{smallmatrix}\Bigr{)}.

\displaystyle[P_{Y_{1}|X_{1},X_{2}}(\cdot|x_{1},\cdot)]=\Bigl{(}\begin{smallmatrix}\bm{v}_{1}\\ \bm{Q}_{2,x_{1}}\\ \end{smallmatrix}\Bigr{)}.

I (X_{1}; Y_{2} ∣ X_{2})

I (X_{1}; Y_{2} ∣ X_{2})

\leq

I (X_{2}; Y_{1} ∣ X_{1})

I (X_{2}; Y_{1} ∣ X_{1})

\displaystyle P_{X_{1},X_{2}}(0){\cdot}\bm{R}^{*}_{1}+\Bigg{[}\sum_{x_{1}\neq 0}P_{X_{1}}(x_{1}){-}P_{X_{1},X_{2}}(x_{1},0)\Bigg{]}{\cdot}\bm{R}^{*}_{2}+

\displaystyle P_{X_{1},X_{2}}(0){\cdot}\bm{R}^{*}_{1}+\Bigg{[}\sum_{x_{1}\neq 0}P_{X_{1}}(x_{1}){-}P_{X_{1},X_{2}}(x_{1},0)\Bigg{]}{\cdot}\bm{R}^{*}_{2}+

[P_{X_{2}} (0) - P_{X_{1}, X_{2}} (0, 0)] \cdot R_{3}^{*} + [P_{X_{1}} (0) - P_{X_{1}, X_{2}} (0, 0)] \cdot R_{4}^{*} .

C_{j, 0} = 0.6667 > C_{j, x_{k}} = 0.1539

C_{j, 0} = 0.6667 > C_{j, x_{k}} = 0.1539

C_{1, 0} = 0.6667

C_{1, 0} = 0.6667

C_{2, 0} = 0.6667

C_{1, 0} = 0.2601

C_{1, 0} = 0.2601

C_{2, 0} = 0.6667

C_{1, 0} = 0.2601

C_{1, 0} = 0.2601

C_{2, 0} = 0.0791

I (X = x; Y) ≜ y \in Y \sum P_{Y ∣ X} (y ∣ x) \cdot lo g \frac{P _{Y ∣ X} ( y ∣ x )}{P _{Y} ( y )} .

I (X = x; Y) ≜ y \in Y \sum P_{Y ∣ X} (y ∣ x) \cdot lo g \frac{P _{Y ∣ X} ( y ∣ x )}{P _{Y} ( y )} .

P_{X}^{*} (x) = {0 \frac{1}{r - 1} if x = 0, otherwise .

P_{X}^{*} (x) = {0 \frac{1}{r - 1} if x = 0, otherwise .

I (X = 0; Y)

I (X = 0; Y)

\leq

P_{X}^{(1)} (x) = {α \frac{1 - α}{μ - 1} if x = 0, otherwise,

P_{X}^{(1)} (x) = {α \frac{1 - α}{μ - 1} if x = 0, otherwise,

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Capacity of Generalized Discrete-Memoryless Push-to-Talk Two-Way Channels

Jian-Jia Weng, Fady Alajaji, and Tamás Linder The authors are with the Department of Mathematics and Statistics, Queen’s University, Kingston, ON K7L 3N6, Canada (email: [email protected], [email protected], [email protected]).This work was supported in part by NSERC of Canada.

Abstract

In this report, we generalize Shannon’s push-to-talk two-way channel (PTT-TWC) by allowing reliable full-duplex transmission as well as noisy reception in the half-duplex (PTT) mode. Viewing a PTT-TWC as two state-dependent one-way channels, we introduce a channel symmetry property pertaining to the one-way channels. Shannon’s TWC capacity inner bound is shown to be tight for the generalized model under this symmetry property. We also analytically derive the capacity region, which is shown to be the convex hull of (at most) $4$ rate pairs. Examples that illustrate different shapes of the capacity region are given, and efficient transmission schemes are discussed via the examples.

Index Terms:

Network information theory, push-to-talk two-way channels, capacity region, time-sharing, channel symmetry.

I Introduction

Point-to-point two-way communication [1] as depicted in Fig. 1 allows two users to simultaneously exchange information over a shared channel. Ideally, this enables cooperation between users to jointly improve the reliability of transmission via interactive adaptive coding. However, how each user can effectively maximize its individual transmission rate over the shared channel and concurrently provide sufficient feedback to help the other user’s transmission is quite a challenging problem. Although in the past two decades increased attention has been given to two-way channels (TWCs) [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14], a single-letter characterization of the capacity region for general TWCs remains open. The aim of this report is to establish a capacity result for a generalized push-to-talk (PTT) TWC.

Let $X_{j}\in\{0,1,2\}$ and $Y_{j}\in\{0,1\}$ denote user- $j$ ’s channel input and output for $j=1,2$ , respectively. Shannon’s discrete-memoryless PTT-TWC (DM-PTT-TWC) [1] as shown in Table II(a) is a classic example where two-way simultaneous (i.e., full-duplex) transmission is completely unreliable and time-sharing between two one-way transmissions (i.e., half-duplex communication) is necessary to achieve capacity. As observed from the channel’s marginal transition matrices in Tables II(b) and II(c), user 1 can perfectly transmit a one-bit message to user $2$ only when the channel input of user $2$ is ‘0’, and vice versa. Let $R_{j}$ denote the transmission rate of user $j$ for $j=1,2$ . A simple time-sharing argument then gives the set of reliable transmission rate pairs $(R_{1},R_{2})=(\alpha,1-\alpha)$ , where $0\leq\alpha\leq 1$ . Since there is no other way to transmit information reliably, that set of rate pairs clearly constitutes the boundary of the capacity region and thus determines capacity.111A formal proof of this statement via the Lagrange multiplier method can be found in [2, Section 2.5.3].

Inspired by Shannon’s TWC setup, the PTT idea was extended to other multi-user channels such as PTT multiaccess channels [15, Problem 14.7], [16], switch-to-talk broadcast channels, and incompatible broadcast channels [17, Section V]. In [4], a capacity result was established for a DM-PTT network with more than two users.

In this report, we generalize Shannon’s PTT-TWC by making two-way simultaneous transmission useful. We also allow noisy reception in the half-duplex transmission and extend the channel input and output alphabets beyond ternary-input and binary-output. Viewing the PTT-TWC as two sets of one-way channels (one for each direction of transmission), we further introduce a channel symmetry property, which imposes on each transition matrix of the one-way channels a uniform structure, a weakly-symmetric structure[18], and a capacity constraint. Under this symmetry property, we analytically derive the capacity region for the generalized PTT-TWCs. We also illustrate the possible different shapes of the capacity region and discuss efficient transmission strategies via examples.

It is worth mentioning that a by-product of our derivation is a new way to show the tightness of Shannon’s capacity inner bound [1] which is complementary to prior methods in [7, 10, 12]. In fact, we find that none of these prior results imply that Shannon’s inner bound is tight for Shannon’s PTT-TWC (and for our general model under the symmetry property). We will discuss this issue later (in Section III).

The rest of this report is organized as follows. In Section II, a brief review on the general TWC and the proposed DM-PTT-TWC models is given. A capacity result for the proposed model is derived in Section III. Examples are presented and qualitatively assessed in Section IV, and conclusions are drawn in Section V.

II Preliminaries and Generalized

DM-PTT-TWC Model

II-A General DM-TWC Model

In point-to-point two-way communication, two users exchange messages $M_{1}$ and $M_{2}$ via $n$ channel uses. Messages $M_{1}$ and $M_{2}$ are assumed to be independent and uniformly distributed on the finite sets $\mathcal{M}_{1}\triangleq\{1,2,...,2^{\mathit{nR}_{1}}\}$ and $\mathcal{M}_{2}\triangleq\{1,2,...,2^{\mathit{nR}_{2}}\}$ , respectively, for some integers $\mathit{nR}_{1},\mathit{nR}_{2}\geq 0$ . Let $\mathcal{X}_{j}$ and $\mathcal{Y}_{j}$ be the channel input and output alphabets, respectively, for $j=1,2$ . For $i=1,2,\dots,n$ , let $X_{j,i}\in\mathcal{X}_{j}$ and $Y_{j,i}\in\mathcal{Y}_{j}$ denote the channel input and output of user $j$ at time $i$ , respectively. Given the channel transition probability $P_{Y_{1},Y_{2}|X_{1},X_{2}}$ , a TWC is said to be memoryless if $P_{Y_{1,i},Y_{2,i}|X_{1}^{i},X_{2}^{i},Y_{1}^{i-1},X_{2}^{i-1}}(y_{1,i},y_{2,i}|x_{1}^{i},x_{2}^{i},y_{1}^{i-1},y_{2}^{i-1})=P_{Y_{1},Y_{2}|X_{1},X_{2}}(y_{1,i},y_{2,i}|x_{1,i},x_{2,i})$ for all $i=1,2,\dots,n$ , where $x_{j}^{i}\triangleq(x_{j,1},x_{j,2},\dots,x_{j,i})$ and $y_{j}^{i-1}\triangleq(y_{j,1},y_{j,2},\dots,y_{j,i-1})$ . A channel code for a DM-TWC is defined as follows.

Definition 1.

An $(n,R_{1},R_{2})$ code for a DM-TWC consists of two message sets $\mathcal{M}_{1}=\{1,2,\dots,\allowbreak 2^{\mathit{nR}_{1}}\}$ and $\mathcal{M}_{2}=\{1,2,\dots,2^{\mathit{nR}_{2}}\}$ , two sequences of encoding functions $f_{1}^{n}\triangleq(f_{1,1},f_{1,2},\dots,f_{1,n})$ and $f_{2}^{n}\triangleq(f_{2,1},f_{2,2},\dots,f_{2,n})$ such that $X_{1,1}=f_{1,1}(M_{1})$ , $X_{2,1}=f_{2,1}(M_{2})$ , $X_{1,i}=f_{1,i}(M_{1},Y_{1}^{i-1})$ , and $X_{2,i}=f_{2,i}(M_{2},Y_{2}^{i-1})$ for $i=2,3,\dots,n$ , and two decoding functions $g_{1}$ and $g_{2}$ such that $\hat{M}_{2}=g_{1}(M_{1},Y_{1}^{n})$ and $\hat{M}_{1}=g_{2}(M_{2},Y_{2}^{n})$ .

When messages $M_{1}$ and $M_{2}$ are encoded via an $(n,R_{1},R_{2})$ channel code, the probability of decoding error is defined as $P^{(n)}_{\text{e}}(f_{1}^{n},f_{2}^{n},g_{1},g_{2})=\text{Pr}\{\hat{M}_{1}\neq M_{1}\ \text{or}\ \hat{M}_{2}\neq M_{2}\}.$

Definition 2.

A rate pair $(R_{1},R_{2})$ is said to be achievable if there exists a sequence of $(n,R_{1},R_{2})$ codes such that $\lim_{n\to\infty}P^{(n)}_{\text{e}}=0$ . The capacity region $\mathcal{C}$ is defined as the closure of all achievable rate pairs.

To date, a computable single-letter expression for the capacity region of general DM-TWCs has not been found. Capacity bounds such as [1, 19, 20, 21] still play crucial roles in studying transmission problems over DM-TWCs. Let $\mathcal{R}(P_{X_{1},X_{2}},P_{Y_{1},Y_{2}|X_{1},X_{2}})$ denote the set of rate pairs $(R_{1},R_{2})$ with $R_{1}\leq I(X_{1};Y_{2}|X_{2})$ and $R_{2}\leq I(X_{2};Y_{1}|X_{1})$ , where the joint distribution of all random variables is given by $P_{X_{1},X_{2},Y_{1},Y_{2}}=P_{X_{1},X_{2}}\cdot P_{Y_{1},Y_{2}|X_{1},X_{2}}$ . Shannon in [1] showed that the capacity region of a DM-TWC with transition probability $P_{Y_{1},Y_{2}|X_{1},X_{2}}$ is inner bounded by

[TABLE]

and outer bounded by

[TABLE]

where $\overline{\text{co}}$ denotes taking the closure of the convex hull. An alternative expression of $\mathcal{C}_{\text{I}}$ without the convex hull operation can be obtained by introducing an auxiliary random variable [22, Proposition 17.2].

In general, $\mathcal{C}_{\text{I}}$ and $\mathcal{C}_{\text{O}}$ do not coincide, but various sufficient conditions that imply the tightness of $\mathcal{C}_{\text{I}}$ have been proposed in [1, 10, 12]. However, these conditions only apply to DM-TWCs for which the convex hull operation is unnecessary in obtaining $\mathcal{C}_{\text{I}}$ , thus failing to determine the capacity region for channels requiring this operation, such as Shannon’s PTT-TWC. In this report, we address this issue for a generalized DM-PTT-TWC model.

II-B Generalized DM-PTT-TWC Model

For $j=1,2$ , let $\mathcal{X}_{j}\triangleq\{0,1,\dots,r_{j}-1\}$ and $\mathcal{Y}_{j}\triangleq\{0,1,\dots,s_{j}-1\}$ , where $r_{j}\geq 3$ and $s_{j}\geq 2$ (to avoid trivial cases). Without loss of generality, we set $X_{1}=0$ and $X_{2}=0$ as the signals for the “PTT mode”. For $j=1,2$ , let $\bm{v}_{j}$ denote the length- $s_{j}$ row vector with all entries equal to $1/s_{j}$ . Also, let $\bm{Q}_{j,x_{k}}$ denote a $(r_{j}-1)\times s_{k}$ channel transition matrix with capacity $C_{j,x_{k}}$ for $j,k=1,2$ with $j\neq k$ and $x_{k}\in\mathcal{X}_{k}$ . An $(r_{1},r_{2},s_{1},s_{2})$ generalized DM-PTT-TWC with transition probability $P_{Y_{1},Y_{2}|X_{1},X_{2}}$ is defined by imposing the following structure for the marginal channel transition matrices $[P_{Y_{j}|X_{1},X_{2}}(\cdot|\cdot,\cdot)]$ (where the rows and columns are indexed by the channel inputs and outputs, respectively): for all $x_{2}\in\mathcal{X}_{2}$ ,

[TABLE]

and for all $x_{1}\in\mathcal{X}_{1}$ ,

[TABLE]

We remark that the above structures do not imply the property $P_{Y_{1},Y_{2}|X_{1},X_{2}}=P_{Y_{1}|X_{1},X_{2}}\cdot P_{Y_{2}|X_{1},X_{2}}$ .

Unlike Shannon’s original PTT-TWC, our proposed model considers both perfect and noisy reception in the PTT mode and allows reliable full-duplex transmission. Shannon’s PTT-TWC can be recovered by setting $(r_{1},r_{2},s_{1},s_{2})=(3,3,2,2)$ , $\bm{Q}_{j,0}=\bm{I}_{2}$ , and $\bm{Q}_{j,1}=\bm{Q}_{j,2}=\frac{1}{2}\cdot\bm{1}_{2\times 2}$ for $j=1,2$ , where $\bm{I}_{2}$ and $\bm{1}_{2\times 2}$ denote the $2\times 2$ identity and all-one matrices, respectively, and the overall channel transition probability can be obtained as $P_{Y_{1},Y_{2}|X_{1},X_{2}}=P_{Y_{1}|X_{1},X_{2}}\cdot P_{Y_{2}|X_{1},X_{2}}$ .

III Capacity Region of Generalized DM-PTT-TWCs with a Symmetry Property

The capacity region of an $(r_{1},r_{2},s_{1},s_{2})$ DM-PTT-TWC is generally unknown. Below, we show that the capacity region can be analytically determined when the marginal channels exhibit the following symmetry property:

Channel Symmetry Property for Generalized PTT-TWCs: for $j,k=1,2$ with $j\neq k$ , $\bm{Q}_{j,x_{k}}$ ’s are weakly-symmetric222A channel is said to be weakly-symmetric if its transition matrix has identical column sums and its rows are permutations of each other [18, Section 7.2]; for such a channel, the mutual information is maximized by the uniform input distribution. We note that for more general symmetric transition matrices for which mutual information is maximized by the uniform input distribution (e.g. quasi-symmetric channels [23]), Theorem 1 does not necessarily hold. for all $x_{k}\in\mathcal{X}_{k}$ and $C_{j,x_{k}}=C_{j,1}$ for all $x_{k}\neq 0$ .

Letting $\mathbf{1}\{\cdot\}$ denote indicator function, and letting $P^{\text{U}_{0}}_{\mathcal{X}_{j}}$ denote the probability distribution that assigns zero probability mass to $X_{j}=0$ and is uniform over the set $\mathcal{X}_{j}\backslash\{0\}$ , $j=1,2$ , we define six rate pairs and their associated input distributions for the generalized PTT-TWC with the above symmetry property as follows:

•

$\bm{R}^{*}_{1}\triangleq(0,0)$ , $P_{X_{1},X_{2}}(x_{1},x_{2})=\mathbf{1}\{x_{1}=0\}\cdot\mathbf{1}\{x_{2}=0\}$ ;

•

$\bm{R}^{*}_{2}\triangleq(C_{1,1},C_{2,1})$ , $P_{X_{1},X_{2}}=P^{\text{U}_{0}}_{\mathcal{X}_{1}}\cdot P^{\text{U}_{0}}_{\mathcal{X}_{2}}$ ;

•

$\bm{R}^{*}_{3}\triangleq(C_{1,0},0)$ , $P_{X_{1},X_{2}}(x_{1},x_{2})=P^{\text{U}_{0}}_{\mathcal{X}_{1}}(x_{1})\cdot\mathbf{1}\{x_{2}=0\}$ ;

•

$\bm{R}^{*}_{4}\triangleq(0,C_{2,0})$ , $P_{X_{1},X_{2}}(x_{1},x_{2})=\mathbf{1}\{x_{1}=0\}\cdot P^{\text{U}_{0}}_{\mathcal{X}_{2}}(x_{2})$ ;

•

$\bm{R}^{*}_{5}\triangleq(C_{1,1},0)$ , $P_{X_{1},X_{2}}(x_{1},x_{2})=P^{\text{U}_{0}}_{\mathcal{X}_{1}}(x_{1})\cdot\mathbf{1}\{x_{2}=1\}$ ;

•

$\bm{R}^{*}_{6}\triangleq(0,C_{2,1})$ , $P_{X_{1},X_{2}}(x_{1},x_{2})=\mathbf{1}\{x_{1}=1\}\cdot P^{\text{U}_{0}}_{\mathcal{X}_{2}}(x_{2})$ .

Note that the $\bm{R}^{*}_{l}$ ’s are all attained via independent inputs.

Theorem 1.

For an $(r_{1},r_{2},s_{1},s_{2})$ DM-PTT-TWC that satisfies the above channel symmetry property, Shannon’s inner bound is tight and the capacity region can be determined by taking the convex hull of $\bm{R}^{*}_{1}$ , $\bm{R}^{*}_{2}$ , $\max(\bm{R}^{*}_{3},\bm{R}^{*}_{5})$ , and $\max(\bm{R}^{*}_{4},\bm{R}^{*}_{6})$ .333We set $\max(\bm{A},\bm{B})=\bm{B}$ iff $\bm{A}$ is upper-bounded component-wise by $\bm{B}$ .

The idea behind the proof of Theorem 1 is to show that any rate pair in Shannon’s outer bound region $\mathcal{C}_{\text{O}}$ can be upper-bounded component-wise by another rate pair that is a convex combination of the $\bm{R}^{*}_{l}$ ’s. More specifically, depending on the value of $C_{j,x_{k}}$ ’s, we can use the four rate pairs: $\bm{R}^{*}_{1}$ , $\bm{R}^{*}_{2}$ , $\max(\bm{R}^{*}_{3},\bm{R}^{*}_{5})$ , and $\max(\bm{R}^{*}_{4},\bm{R}^{*}_{6})$ , to upper-bound any rate pair in $\mathcal{C}_{\text{O}}$ and hence determine the capacity region. Here, we only prove the case where $\bm{R}^{*}_{3}=\max(\bm{R}^{*}_{3},\bm{R}^{*}_{5})$ and $\bm{R}^{*}_{4}=\max(\bm{R}^{*}_{4},\bm{R}^{*}_{6})$ . The same argument can be used to prove other cases, and hence the details are omitted.

Proof:

Given any $P_{X_{1},X_{2}}$ , we bound the associated rate pair $(I(X_{1};Y_{2}|X_{2}),I(X_{2};Y_{1}|X_{1}))$ as follows:

[TABLE]

where (3) follows from Lemma 3 in the Appendix and (3) holds since $C_{1,x_{2}}=C_{1,1}$ for all $x_{2}\neq 0$ . Similarly, we have

[TABLE]

Note that (3) and (5) and the fact that $\sum_{x_{2}\neq 0}(P_{X_{2}}(x_{2})-P_{X_{1},X_{2}}(0,x_{2}))=\sum_{x_{1}\neq 0}(P_{X_{1}}(x_{1})-P_{X_{1},X_{2}}(x_{1},0))$ imply that the pair $(I(X_{1};Y_{2}|X_{2}),I(X_{2};Y_{1}|X_{1}))$ is upper-bounded component-wise by

[TABLE]

Since the coefficients of the above four rate pairs sum to one, any rate pair in $\mathcal{C}_{\text{O}}$ is outer bounded by some convex combination of $\bm{R}^{*}_{1}$ , $\bm{R}^{*}_{2}$ , $\bm{R}^{*}_{3}$ , and $\bm{R}^{*}_{4}$ . Since the four rate pairs are achievable via independent inputs, we conclude that Shannon’s inner bound is tight. ∎

Clearly, the capacity region of Shannon’s PTT-TWC can be easily determined via Theorem 1 without using the time-sharing argument [1] or the Lagrange multiplier method [2].

Moreover, we note that (1) can be interpreted as the average amount of information sent over a set of state-dependent one-way channels $\{[P_{Y_{2}|X_{1},X_{2}}(\cdot|\cdot,x_{2})]{\mathrel{\mathop{\ordinarycolon}}}\ x_{2}\in\mathcal{X}_{2}\}$ , where the channel input, output, and state, correspond to $X_{1}$ , $Y_{2}$ , and $X_{2}$ , respectively. Thus, user-2’s input distribution $P_{X_{2}}$ not only carries its own message but also determines how often each one-way channel can be used for user 1. The same interpretation also applies to (5). Clearly, the best channel input distribution for one user may not create the most favorable one-way channel allocation for the other user, necessitating a rate trade-off between the two users’ transmissions.

Quantifying the trade-off is often the most involved part of determining the capacity region of general TWCs. The prior approach to tackle the problem is to exploit (when they exist) channel symmetry or invariance properties so that for any $P_{X_{1},X_{2}}=P_{X_{2}}\cdot P_{X_{1}|X_{2}}$ , one can always find a $\tilde{P}_{X_{1}}$ such that $\mathcal{R}(P_{X_{1},X_{2}},P_{Y_{1},Y_{2}|X_{1},X_{2}})\subseteq\mathcal{R}(\tilde{P}_{X_{1}}\cdot P_{X_{2}},P_{Y_{1},Y_{2}|X_{1},X_{2}})$ [1, 10, 12]. However, this approach fails here since such $\tilde{P}_{X_{1}}$ may not exist for each $P_{X_{1},X_{2}}$ . This observation can be illustrated via Shannon’s PTT-TWC as one can see that no single independent input distribution can achieve the rate pair $(R_{1},R_{2})=(\alpha,1-\alpha)$ , where $0<\alpha<1$ . It is thus of interest to exploit other symmetry property as the one presented at the beginning of the section that allows us to show $\mathcal{C}_{\text{O}}\subseteq\mathcal{C}_{\text{I}}$ directly.

IV Examples and Discussion

In the last section, we proved the tightness of Shannon’s inner bound for a class of generalized DM-PTT-TWCs. The capacity result in Theorem 1 suggests a way to use different state-dependent one-way channels to optimize bi-directional transmission rates. In what follows, we illustrate all possible shapes of the capacity region via examples and discuss the optimal transmission strategy behind each result.

Let $(r_{1},r_{2},s_{1},s_{2})=(3,3,3,3)$ . Consider the generalized PTT-TWC with the parameterized marginal transition matrices as shown in Table II and the following settings:

Setting 1: $(a,b,c,d)=(0,0.15,0,0.15)$ $\Rightarrow$

[TABLE]

for $j,k=1,2$ with $j\neq k$ and all $x_{k}\neq 0$ ;

Setting 2: $(a,b,c,d)=(0,0.05,0,0.01)$ $\Rightarrow$

[TABLE]

for all $x_{1}\neq 0$ and $x_{2}\neq 0$ ;

Setting 3: $(a,b,c,d)=(0.1,0,0,0.01)$ $\Rightarrow$

[TABLE]

for all $x_{1}\neq 0$ and $x_{2}\neq 0$ ;

Setting 4: $(a,b,c,d)=(0.1,0,0.2,0.05)$ $\Rightarrow$

[TABLE]

for all $x_{1}\neq 0$ and $x_{2}\neq 0$ .

Note that, unlike for Shannon’s original PTT-TWC, reliable full-duplex transmission is possible in the above settings since $C_{j,x_{k}}>0$ for all $j,k=1,2$ with $j\neq k$ and $x_{k}\in\mathcal{X}_{k}$ .

In Figures 2(a)-(d) (corresponding to Settings 1–4, respectively), the blue dots444In our computations, we discretized the standard 2-dimensional simplex to generate the input distributions for each user. The mutual information $I(X_{j};Y_{k}|X_{k})$ is then evaluated under the product of the discretized input distributions. A similar approach is used to obtain rate pairs in Shannon’s outer bound region. are the achievable rate pairs via independent inputs of the form: $P_{X_{1},X_{2}}=P_{X_{1}}\cdot P_{X_{2}}$ ; Shannon’s inner bound region $\mathcal{C}_{\text{I}}$ is then given by taking the convex hull of those rate pairs. Shannon’s outer bound $\mathcal{C}_{\text{O}}$ is obtained using a similar method, but the convex hull operation is not needed. We also depict the achievable rate region using the half-duplex transmission mode (via input symbol ‘[math]’). In all settings, we have that $\mathcal{C}_{\text{I}}=\mathcal{C}_{\text{O}}$ as expected.

In Figure 2(a), we first observe that the half-duplex transmission can attain the entire capacity region. Indeed, although full-duplex transmission is reliable, the large difference between $C_{j,0}$ and $C_{j,x_{k}}$ (for $x_{k}\neq 0$ ) limits the rates achievable via two-way simultaneous transmission and hence the half-duplex transmission is still optimal (in the sense of achieving capacity). Nevertheless, the benefit of full-duplex transmission can be made significant by increasing the value of $C_{j,x_{k}}$ for $x_{k}\neq 0$ . In Figure 2(b), we illustrate a situation where two-way simultaneous transmission achieves better rate pairs than using the half-duplex transmission.

Moreover, when the $C_{j,x_{k}}$ ’s ( $x_{k}\neq 0$ ) are much larger than $C_{j,0}$ , using $[P_{Y_{2}|X_{1},X_{2}}(\cdot|\cdot,0)]$ and $[P_{Y_{1}|X_{1},X_{2}}(\cdot|0,\cdot)]$ for information transmission becomes inefficient since they contribute very little to the overall transmission rates in (1) and (5). In this case, one should expect to abandon the (relatively) inefficient channels and use only the efficient ones. This is illustrated in Figs. 2(c) and 2(d). In an extreme case, such as Setting 4, the upper-right corner point of the capacity region is given by $\bm{R}^{*}_{2}=(0.6667,0.4105)=(C_{1,1},C_{2,1})$ , implying that both users shut down the state-dependent one-way channels $[P_{Y_{2}|X_{1},X_{2}}(\cdot|\cdot,0)]$ and $[P_{Y_{1}|X_{1},X_{2}}(\cdot|0,\cdot)]$ and only use the remaining channels for information exchange.

V Conclusions

We identified a channel symmetry property under which Shannon’s capacity inner bound is tight for a class of generalized DM-PTT-TWCs. This symmetry property differs from prior ones in that it necessitates the use of a time-sharing scheme to achieve capacity. Specifically, a time-sharing coding scheme that involves two independent transmissions is optimal. Viewing the generalized DM-PTT-TWC as two sets of one-way channels, we further observed that one-way channel components with (relatively) low capacity should be abandoned for efficient transmissions. Future research directions include finding a more general tightness condition for DM-TWCs, identifying the connections between different channel symmetry properties (in particular between the channel symmetry property introduced in this paper and the ones in [10] and [12]), and investigating the transmission of correlated sources over the generalized DM-PTT-TWCs.

The appendix establishes input-output mutual information results for one-way channels that are of the same type as the state-dependent one-way channels in the generalized PTT-TWC of Theorem 1. Let $\mathcal{X}=\{0,1,\dots,r-1\}$ and $\mathcal{Y}=\{0,1,\dots,s-1\}$ denote channel input and output alphabets, respectively, for some integers $r\geq 3$ and $s\geq 2$ . Suppose that the set of probability vectors $\{[P_{Y|X}(\cdot|x_{1})]\mathrel{\mathop{\ordinarycolon}}x_{1}\in\mathcal{X}\backslash\{0\}\}$ specifies a weakly-symmetric channel and $P_{Y|X}(y|0)=1/s$ for all $y\in\mathcal{Y}$ . The input-output mutual information for a specific channel input symbol $x\in\mathcal{X}$ is defined as

[TABLE]

The following results are needed in the proof of Theorem 1.

Lemma 2.

The capacity of the channel with the above properties is given by $C^{*}=\max_{P_{X}}I(X;Y)=\log s-H([P_{Y|X}(\cdot|1)])$ , where $H([P_{Y|X}(\cdot|1)])$ denotes the entropy of the probability vector $[P_{Y|X}(\cdot|1)]$ . The capacity-achieving input distribution is given by:

[TABLE]

Proof:

We apply the KKT condition for channel capacity [24, Theorem 4.5.1] to check the optimality of $P^{*}_{X}$ . Under $P^{*}_{X}$ , we first have that $I(X=x;Y)=\log s-H([P_{Y|X}(\cdot|1)])$ for $x\neq 0$ [18, Theorem 7.2.1] since $P_{X}^{*}$ is a uniform distribution when restricted to the input alphabet $\mathcal{X}\setminus\{0\}$ and the channel with the restricted inputs is weakly-symmetric. Moreover, for $x\neq 0$ , we have

[TABLE]

where $y^{\prime}\in\mathcal{Y}$ is arbitrary in (V) since $\sum_{x^{\prime}\neq 0}P_{Y|X}(y|x^{\prime})$ does not depend on $y$ and (V) holds since $H(Y|X=0)\leq\log s$ . Combining the above results then gives that $I(X=0;Y)\leq I(X=x;Y)$ for all $x\neq 0$ , thus implying the optimality of $P^{*}_{X}$ . Finally, we conclude that $C^{*}=\max_{P_{X}}I(X;Y)=I(X=x;Y)$ for any $x\neq 0$ by the KKT condition. ∎

Lemma 3.

For any $0\leq\alpha\leq 1$ , consider the following channel input distribution:

[TABLE]

Let $P^{(2)}_{X}$ denote any input distribution with $P^{(2)}_{X}(0)=\alpha$ . Then, we have that $I^{(2)}(X;Y)\leq I^{(1)}(X;Y)=(1-\alpha)\cdot C^{*}$ (here the superscript indicates which input distribution is used for evaluation).

Proof:

First, we have that $H^{(2)}(Y)\leq\log s=H^{(1)}(Y)$ . Also, since $H(Y|X=x)=H(Y|X=1)$ for all $x\neq 0$ due to the weakly-symmetric structure, one can easily conclude that $H^{(1)}(Y|X)=H^{(2)}(Y|X)$ . The above results then imply that $I^{(2)}(X;Y)\leq I^{(1)}(X;Y)$ . Moreover, a direct computation (with the result in Lemma 2) yields that $I^{(1)}(X;Y)=(1-\alpha)\cdot C^{*}$ , thereby completing the proof. ∎

Bibliography24

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] C. E. Shannon, “Two-way communication channels,” in Proc. 4th Berkeley Symp. Math. Stat. Probab. , 1961, pp. 611–644.
2[2] H. B. Meeuwissen, “Information theoretical aspects of two-way communication,” Ph.D. dissertation, TU Eindhoven, 1998.
3[3] G. Kramer, “Directed information for channels with feedback,” Ph.D. dissertation, Swiss Federal Institute of Technology Zurich, 1998.
4[4] ——, “Capacity results for the discrete memoryless network,” IEEE Trans. Inf. Theory , vol. 49, no. 1, pp. 4–21, Jan. 2003.
5[5] A. Maor and N. Merhav, “Two-way successively refined joint source-channel coding,” IEEE Trans. Inf. Theory , vol. 52, no. 4, pp. 1483–1494, Apr. 2006.
6[6] D. Gunduz, E. Erkip, A. Goldsmith, and H. V. Poor, “Source and channel coding for correlated sources over multiuser channels,” IEEE Trans. Inf. Theory , vol. 55, no. 9, pp. 3927–3944, Aug. 2009.
7[7] L. R. Varshney, “Two way communication over exponential family type channels,” in Proc. IEEE Int. Symp. Inf. Theory , 2013, pp. 2795–2799.
8[8] Z. Cheng and N. Devroye, “Two-way networks: when adaptation is useless,” IEEE Trans. Inf. Theory , vol. 60, no. 3, pp. 1793–1813, Mar. 2014.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Capacity of Generalized Discrete-Memoryless Push-to-Talk Two-Way Channels

Abstract

Index Terms:

I Introduction

II Preliminaries and Generalized

II-A General DM-TWC Model

Definition 1**.**

Definition 2**.**

II-B Generalized DM-PTT-TWC Model

III Capacity Region of Generalized DM-PTT-TWCs with a Symmetry Property

Theorem 1**.**

Proof:

IV Examples and Discussion

V Conclusions

Lemma 2**.**

Proof:

Lemma 3**.**

Proof:

Definition 1.

Definition 2.

Theorem 1.

Lemma 2.

Lemma 3.