Capacity of Generalized Discrete-Memoryless Push-to-Talk Two-Way Channels
Jian-Jia Weng, Fady Alajaji, and Tam\'as Linder

TL;DR
This paper extends Shannon's push-to-talk two-way channel model to include full-duplex and noisy reception, deriving the capacity region and demonstrating its properties under certain symmetry conditions.
Contribution
It introduces a generalized model of push-to-talk channels with full-duplex capability and characterizes its capacity region, which was not previously known.
Findings
Capacity region is the convex hull of at most 4 rate pairs.
Shannon's inner bound is tight under the symmetry property.
Examples illustrate different shapes of the capacity region.
Abstract
In this report, we generalize Shannon's push-to-talk two-way channel (PTT-TWC) by allowing reliable full-duplex transmission as well as noisy reception in the half-duplex (PTT) mode. Viewing a PTT-TWC as two state-dependent one-way channels, we introduce a channel symmetry property pertaining to the one-way channels. Shannon's TWC capacity inner bound is shown to be tight for the generalized model under this symmetry property. We also analytically derive the capacity region, which is shown to be the convex hull of (at most) 4 rate pairs. Examples that illustrate different shapes of the capacity region are given, and efficient transmission schemes are discussed via the examples.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Capacity of Generalized Discrete-Memoryless Push-to-Talk Two-Way Channels
Jian-Jia Weng, Fady Alajaji, and Tamás Linder The authors are with the Department of Mathematics and Statistics, Queen’s University, Kingston, ON K7L 3N6, Canada (email: [email protected], [email protected], [email protected]).This work was supported in part by NSERC of Canada.
Abstract
In this report, we generalize Shannon’s push-to-talk two-way channel (PTT-TWC) by allowing reliable full-duplex transmission as well as noisy reception in the half-duplex (PTT) mode. Viewing a PTT-TWC as two state-dependent one-way channels, we introduce a channel symmetry property pertaining to the one-way channels. Shannon’s TWC capacity inner bound is shown to be tight for the generalized model under this symmetry property. We also analytically derive the capacity region, which is shown to be the convex hull of (at most) rate pairs. Examples that illustrate different shapes of the capacity region are given, and efficient transmission schemes are discussed via the examples.
Index Terms:
Network information theory, push-to-talk two-way channels, capacity region, time-sharing, channel symmetry.
I Introduction
Point-to-point two-way communication [1] as depicted in Fig. 1 allows two users to simultaneously exchange information over a shared channel. Ideally, this enables cooperation between users to jointly improve the reliability of transmission via interactive adaptive coding. However, how each user can effectively maximize its individual transmission rate over the shared channel and concurrently provide sufficient feedback to help the other user’s transmission is quite a challenging problem. Although in the past two decades increased attention has been given to two-way channels (TWCs) [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14], a single-letter characterization of the capacity region for general TWCs remains open. The aim of this report is to establish a capacity result for a generalized push-to-talk (PTT) TWC.
Let and denote user-’s channel input and output for , respectively. Shannon’s discrete-memoryless PTT-TWC (DM-PTT-TWC) [1] as shown in Table II(a) is a classic example where two-way simultaneous (i.e., full-duplex) transmission is completely unreliable and time-sharing between two one-way transmissions (i.e., half-duplex communication) is necessary to achieve capacity. As observed from the channel’s marginal transition matrices in Tables II(b) and II(c), user 1 can perfectly transmit a one-bit message to user only when the channel input of user is ‘0’, and vice versa. Let denote the transmission rate of user for . A simple time-sharing argument then gives the set of reliable transmission rate pairs , where . Since there is no other way to transmit information reliably, that set of rate pairs clearly constitutes the boundary of the capacity region and thus determines capacity.111A formal proof of this statement via the Lagrange multiplier method can be found in [2, Section 2.5.3].
Inspired by Shannon’s TWC setup, the PTT idea was extended to other multi-user channels such as PTT multiaccess channels [15, Problem 14.7], [16], switch-to-talk broadcast channels, and incompatible broadcast channels [17, Section V]. In [4], a capacity result was established for a DM-PTT network with more than two users.
In this report, we generalize Shannon’s PTT-TWC by making two-way simultaneous transmission useful. We also allow noisy reception in the half-duplex transmission and extend the channel input and output alphabets beyond ternary-input and binary-output. Viewing the PTT-TWC as two sets of one-way channels (one for each direction of transmission), we further introduce a channel symmetry property, which imposes on each transition matrix of the one-way channels a uniform structure, a weakly-symmetric structure[18], and a capacity constraint. Under this symmetry property, we analytically derive the capacity region for the generalized PTT-TWCs. We also illustrate the possible different shapes of the capacity region and discuss efficient transmission strategies via examples.
It is worth mentioning that a by-product of our derivation is a new way to show the tightness of Shannon’s capacity inner bound [1] which is complementary to prior methods in [7, 10, 12]. In fact, we find that none of these prior results imply that Shannon’s inner bound is tight for Shannon’s PTT-TWC (and for our general model under the symmetry property). We will discuss this issue later (in Section III).
The rest of this report is organized as follows. In Section II, a brief review on the general TWC and the proposed DM-PTT-TWC models is given. A capacity result for the proposed model is derived in Section III. Examples are presented and qualitatively assessed in Section IV, and conclusions are drawn in Section V.
II Preliminaries and Generalized
DM-PTT-TWC Model
II-A General DM-TWC Model
In point-to-point two-way communication, two users exchange messages and via channel uses. Messages and are assumed to be independent and uniformly distributed on the finite sets and , respectively, for some integers . Let and be the channel input and output alphabets, respectively, for . For , let and denote the channel input and output of user at time , respectively. Given the channel transition probability , a TWC is said to be memoryless if for all , where and . A channel code for a DM-TWC is defined as follows.
Definition 1**.**
An code for a DM-TWC consists of two message sets and , two sequences of encoding functions and such that , , , and for , and two decoding functions and such that and .
When messages and are encoded via an channel code, the probability of decoding error is defined as
Definition 2**.**
A rate pair is said to be achievable if there exists a sequence of codes such that . The capacity region is defined as the closure of all achievable rate pairs.
To date, a computable single-letter expression for the capacity region of general DM-TWCs has not been found. Capacity bounds such as [1, 19, 20, 21] still play crucial roles in studying transmission problems over DM-TWCs. Let denote the set of rate pairs with and , where the joint distribution of all random variables is given by . Shannon in [1] showed that the capacity region of a DM-TWC with transition probability is inner bounded by
[TABLE]
and outer bounded by
[TABLE]
where denotes taking the closure of the convex hull. An alternative expression of without the convex hull operation can be obtained by introducing an auxiliary random variable [22, Proposition 17.2].
In general, and do not coincide, but various sufficient conditions that imply the tightness of have been proposed in [1, 10, 12]. However, these conditions only apply to DM-TWCs for which the convex hull operation is unnecessary in obtaining , thus failing to determine the capacity region for channels requiring this operation, such as Shannon’s PTT-TWC. In this report, we address this issue for a generalized DM-PTT-TWC model.
II-B Generalized DM-PTT-TWC Model
For , let and , where and (to avoid trivial cases). Without loss of generality, we set and as the signals for the “PTT mode”. For , let denote the length- row vector with all entries equal to . Also, let denote a channel transition matrix with capacity for with and . An generalized DM-PTT-TWC with transition probability is defined by imposing the following structure for the marginal channel transition matrices (where the rows and columns are indexed by the channel inputs and outputs, respectively): for all ,
[TABLE]
and for all ,
[TABLE]
We remark that the above structures do not imply the property .
Unlike Shannon’s original PTT-TWC, our proposed model considers both perfect and noisy reception in the PTT mode and allows reliable full-duplex transmission. Shannon’s PTT-TWC can be recovered by setting , , and for , where and denote the identity and all-one matrices, respectively, and the overall channel transition probability can be obtained as .
III Capacity Region of Generalized DM-PTT-TWCs with a Symmetry Property
The capacity region of an DM-PTT-TWC is generally unknown. Below, we show that the capacity region can be analytically determined when the marginal channels exhibit the following symmetry property:
Channel Symmetry Property for Generalized PTT-TWCs: for with , ’s are weakly-symmetric222A channel is said to be weakly-symmetric if its transition matrix has identical column sums and its rows are permutations of each other [18, Section 7.2]; for such a channel, the mutual information is maximized by the uniform input distribution. We note that for more general symmetric transition matrices for which mutual information is maximized by the uniform input distribution (e.g. quasi-symmetric channels [23]), Theorem 1 does not necessarily hold. for all and for all .
Letting denote indicator function, and letting denote the probability distribution that assigns zero probability mass to and is uniform over the set , , we define six rate pairs and their associated input distributions for the generalized PTT-TWC with the above symmetry property as follows:
- •
, ;
- •
, ;
- •
, ;
- •
, ;
- •
, ;
- •
, .
Note that the ’s are all attained via independent inputs.
Theorem 1**.**
For an DM-PTT-TWC that satisfies the above channel symmetry property, Shannon’s inner bound is tight and the capacity region can be determined by taking the convex hull of , , , and .333We set iff is upper-bounded component-wise by .
The idea behind the proof of Theorem 1 is to show that any rate pair in Shannon’s outer bound region can be upper-bounded component-wise by another rate pair that is a convex combination of the ’s. More specifically, depending on the value of ’s, we can use the four rate pairs: , , , and , to upper-bound any rate pair in and hence determine the capacity region. Here, we only prove the case where and . The same argument can be used to prove other cases, and hence the details are omitted.
Proof:
Given any , we bound the associated rate pair as follows:
[TABLE]
where (3) follows from Lemma 3 in the Appendix and (3) holds since for all . Similarly, we have
[TABLE]
Note that (3) and (5) and the fact that imply that the pair is upper-bounded component-wise by
[TABLE]
Since the coefficients of the above four rate pairs sum to one, any rate pair in is outer bounded by some convex combination of , , , and . Since the four rate pairs are achievable via independent inputs, we conclude that Shannon’s inner bound is tight. ∎
Clearly, the capacity region of Shannon’s PTT-TWC can be easily determined via Theorem 1 without using the time-sharing argument [1] or the Lagrange multiplier method [2].
Moreover, we note that (1) can be interpreted as the average amount of information sent over a set of state-dependent one-way channels , where the channel input, output, and state, correspond to , , and , respectively. Thus, user-2’s input distribution not only carries its own message but also determines how often each one-way channel can be used for user 1. The same interpretation also applies to (5). Clearly, the best channel input distribution for one user may not create the most favorable one-way channel allocation for the other user, necessitating a rate trade-off between the two users’ transmissions.
Quantifying the trade-off is often the most involved part of determining the capacity region of general TWCs. The prior approach to tackle the problem is to exploit (when they exist) channel symmetry or invariance properties so that for any , one can always find a such that [1, 10, 12]. However, this approach fails here since such may not exist for each . This observation can be illustrated via Shannon’s PTT-TWC as one can see that no single independent input distribution can achieve the rate pair , where . It is thus of interest to exploit other symmetry property as the one presented at the beginning of the section that allows us to show directly.
IV Examples and Discussion
In the last section, we proved the tightness of Shannon’s inner bound for a class of generalized DM-PTT-TWCs. The capacity result in Theorem 1 suggests a way to use different state-dependent one-way channels to optimize bi-directional transmission rates. In what follows, we illustrate all possible shapes of the capacity region via examples and discuss the optimal transmission strategy behind each result.
Let . Consider the generalized PTT-TWC with the parameterized marginal transition matrices as shown in Table II and the following settings:
Setting 1:
[TABLE]
for with and all ;
Setting 2:
[TABLE]
for all and ;
Setting 3:
[TABLE]
for all and ;
Setting 4:
[TABLE]
for all and .
Note that, unlike for Shannon’s original PTT-TWC, reliable full-duplex transmission is possible in the above settings since for all with and .
In Figures 2(a)-(d) (corresponding to Settings 1–4, respectively), the blue dots444In our computations, we discretized the standard 2-dimensional simplex to generate the input distributions for each user. The mutual information is then evaluated under the product of the discretized input distributions. A similar approach is used to obtain rate pairs in Shannon’s outer bound region. are the achievable rate pairs via independent inputs of the form: ; Shannon’s inner bound region is then given by taking the convex hull of those rate pairs. Shannon’s outer bound is obtained using a similar method, but the convex hull operation is not needed. We also depict the achievable rate region using the half-duplex transmission mode (via input symbol ‘[math]’). In all settings, we have that as expected.
In Figure 2(a), we first observe that the half-duplex transmission can attain the entire capacity region. Indeed, although full-duplex transmission is reliable, the large difference between and (for ) limits the rates achievable via two-way simultaneous transmission and hence the half-duplex transmission is still optimal (in the sense of achieving capacity). Nevertheless, the benefit of full-duplex transmission can be made significant by increasing the value of for . In Figure 2(b), we illustrate a situation where two-way simultaneous transmission achieves better rate pairs than using the half-duplex transmission.
Moreover, when the ’s () are much larger than , using and for information transmission becomes inefficient since they contribute very little to the overall transmission rates in (1) and (5). In this case, one should expect to abandon the (relatively) inefficient channels and use only the efficient ones. This is illustrated in Figs. 2(c) and 2(d). In an extreme case, such as Setting 4, the upper-right corner point of the capacity region is given by , implying that both users shut down the state-dependent one-way channels and and only use the remaining channels for information exchange.
V Conclusions
We identified a channel symmetry property under which Shannon’s capacity inner bound is tight for a class of generalized DM-PTT-TWCs. This symmetry property differs from prior ones in that it necessitates the use of a time-sharing scheme to achieve capacity. Specifically, a time-sharing coding scheme that involves two independent transmissions is optimal. Viewing the generalized DM-PTT-TWC as two sets of one-way channels, we further observed that one-way channel components with (relatively) low capacity should be abandoned for efficient transmissions. Future research directions include finding a more general tightness condition for DM-TWCs, identifying the connections between different channel symmetry properties (in particular between the channel symmetry property introduced in this paper and the ones in [10] and [12]), and investigating the transmission of correlated sources over the generalized DM-PTT-TWCs.
The appendix establishes input-output mutual information results for one-way channels that are of the same type as the state-dependent one-way channels in the generalized PTT-TWC of Theorem 1. Let and denote channel input and output alphabets, respectively, for some integers and . Suppose that the set of probability vectors specifies a weakly-symmetric channel and for all . The input-output mutual information for a specific channel input symbol is defined as
[TABLE]
The following results are needed in the proof of Theorem 1.
Lemma 2**.**
The capacity of the channel with the above properties is given by , where denotes the entropy of the probability vector . The capacity-achieving input distribution is given by:
[TABLE]
Proof:
We apply the KKT condition for channel capacity [24, Theorem 4.5.1] to check the optimality of . Under , we first have that for [18, Theorem 7.2.1] since is a uniform distribution when restricted to the input alphabet and the channel with the restricted inputs is weakly-symmetric. Moreover, for , we have
[TABLE]
where is arbitrary in (V) since does not depend on and (V) holds since . Combining the above results then gives that for all , thus implying the optimality of . Finally, we conclude that for any by the KKT condition. ∎
Lemma 3**.**
For any , consider the following channel input distribution:
[TABLE]
Let denote any input distribution with . Then, we have that (here the superscript indicates which input distribution is used for evaluation).
Proof:
First, we have that . Also, since for all due to the weakly-symmetric structure, one can easily conclude that . The above results then imply that . Moreover, a direct computation (with the result in Lemma 2) yields that , thereby completing the proof. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C. E. Shannon, “Two-way communication channels,” in Proc. 4th Berkeley Symp. Math. Stat. Probab. , 1961, pp. 611–644.
- 2[2] H. B. Meeuwissen, “Information theoretical aspects of two-way communication,” Ph.D. dissertation, TU Eindhoven, 1998.
- 3[3] G. Kramer, “Directed information for channels with feedback,” Ph.D. dissertation, Swiss Federal Institute of Technology Zurich, 1998.
- 4[4] ——, “Capacity results for the discrete memoryless network,” IEEE Trans. Inf. Theory , vol. 49, no. 1, pp. 4–21, Jan. 2003.
- 5[5] A. Maor and N. Merhav, “Two-way successively refined joint source-channel coding,” IEEE Trans. Inf. Theory , vol. 52, no. 4, pp. 1483–1494, Apr. 2006.
- 6[6] D. Gunduz, E. Erkip, A. Goldsmith, and H. V. Poor, “Source and channel coding for correlated sources over multiuser channels,” IEEE Trans. Inf. Theory , vol. 55, no. 9, pp. 3927–3944, Aug. 2009.
- 7[7] L. R. Varshney, “Two way communication over exponential family type channels,” in Proc. IEEE Int. Symp. Inf. Theory , 2013, pp. 2795–2799.
- 8[8] Z. Cheng and N. Devroye, “Two-way networks: when adaptation is useless,” IEEE Trans. Inf. Theory , vol. 60, no. 3, pp. 1793–1813, Mar. 2014.
