Strong Coordination over Noisy Channels: Is Separation Sufficient?
Sarah A. Obead, Badri N. Vellambi, J\"org Kliewer

TL;DR
This paper investigates how two agents can achieve strong coordination over noisy channels, proposing two novel coding schemes and analyzing their effectiveness in terms of coordination capacity.
Contribution
It introduces two new schemes for noisy strong coordination and derives inner bounds for the coordination capacity region, comparing their performance.
Findings
Joint scheme reduces local randomness needed at agent Y.
Separate scheme extracts randomness after decoding.
Joint scheme can outperform separate scheme in certain scenarios.
Abstract
We study the problem of strong coordination of actions of two agents and that communicate over a noisy communication channel such that the actions follow a given joint probability distribution. We propose two novel schemes for this noisy strong coordination problem, and derive inner bounds for the underlying strong coordination capacity region. The first scheme is a joint coordination-channel coding scheme that utilizes the randomness provided by the communication channel to reduce the local randomness required in generating the action sequence at agent . The second scheme exploits separate coordination and channel coding where local randomness is extracted from the channel after decoding. Finally, we present an example in which the joint scheme is able to outperform the separate scheme in terms of coordination rate.
Click any figure to enlarge with its caption.
Figure 1
Figure 1
Figure 2
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Strong Coordination over Noisy Channels:
Is Separation Sufficient?
Sarah A. Obead, Jörg Kliewer This work is supported by NSF grants CCF-1440014, CCF-1439465. Department of Electrical and Computer Engineering
New Jersey Institute of Technology
Newark, New Jersey 07102
Email: [email protected], [email protected]
Badri N. Vellambi
Research School of Computer Science
Australian National University
Acton, Australia 2601
Email: [email protected]
Abstract
We study the problem of strong coordination of actions of two agents and that communicate over a noisy communication channel such that the actions follow a given joint probability distribution. We propose two novel schemes for this noisy strong coordination problem, and derive inner bounds for the underlying strong coordination capacity region. The first scheme is a joint coordination-channel coding scheme that utilizes the randomness provided by the communication channel to reduce the local randomness required in generating the action sequence at agent . The second scheme exploits separate coordination and channel coding where local randomness is extracted from the channel after decoding. Finally, we present an example in which the joint scheme is able to outperform the separate scheme in terms of coordination rate.
I Introduction
The problem of communication-based coordination of multi-agent systems arises in numerous applications including mobile robotic networks, smart traffic control, and distributed computing such as distributed games and grid computing [1]. Several theoretical and applied studies on multi-agent coordination have targeted questions on how agents exchange information and how their actions can be correlated to achieve a desired overall behavior. Two types of coordination have been addressed in the literature – empirical coordination where the histogram of induced actions is required to be close to a prescribed target distribution, and strong coordination, where the induced sequence of joint actions of all the agents is required to be statistically close (i.e., nearly indistinguishable) from a chosen target probability mass function (pmf).
Recently, the capacity regions of several empirical and strong coordination network problems have been established [2, 1, 3, 4, 5, 6]. Bounds for the capacity region for the point-to-point case were obtained in [7] under the assumption that the nodes communicate in a bidirectional fashion in order to achieve coordination. A similar framework was adopted and improved in [8]. In [4, 9, 6], the authors addressed inner and outer bounds for the capacity region of a three-terminal network in the presence of a relay. The work of [4] was later extended in [10, 5] to derive a precise characterization of the strong coordination region for multi-hop networks. Starkly, the majority of the recent works on coordination have considered noise-free communication channels with the exception of two works: joint empirical coordination of the channel inputs/outputs of a noisy communication channel with source and reproduction sequences is considered in [11], and in [12], the notion of strong coordination is used to simulate a discrete memoryless channel via another channel.
In this work, we consider the point-to-point coordination setup illustrated in Fig. 1, where in contrast to [11] only source and reproduction sequences at two different nodes ( and ) are coordinated by means of a suitable communication scheme over a discrete memoryless channel (DMC).
Specifically, we propose two different novel achievable coding schemes for this noisy coordination scenario, and derive inner bounds to the underlying strong capacity region. The first scheme is a joint coordination channel coding scheme that utilizes randomness provided by the DMC to reduce the local randomness required in generating the action sequence at Node (see Fig. 1). The second scheme exploits separate coordination and channel coding where local randomness is extracted from the channel after decoding. Even though the proposed joint scheme is related to the scheme in [12], the presented scheme exhibits a significantly different codebook construction adapted to our coordination framework. Our scheme requires the quantification of the amount of common randomness shared by the two nodes as well as the local randomness at each of the two nodes. This is a feature that is absent from the analysis in [12]. Lastly, when the noisy channel and the correlation between to are both given by binary symmetric channels (BSCs), we study the effect of the capacity of the noisy channel on the sum rate of common and local randomness. We conclude this work by showing that the joint scheme outperforms the separate scheme in terms of the coordination rate in the high-capacity regime.
The remainder of the paper is organized as follows: Section II sets the notation. The problem of strong coordination over a noisy communication link is presented in Section III. We then derive achievability results for the noisy point-to-point coordination in Section IV for the joint scheme and in Section V for the separate scheme, respectively. In Section VI, we present numerical results for both schemes when the target joint distribution is described as a doubly binary symmetric source and the noisy channel is given by a BSC.
II Notation
Throughout the paper, we denote a discrete random variable with upper-case letters (e.g., ) and its realization with lower case letters (e.g., ), respectively. The alphabet size of the random variable is denoted as . We use to denote the finite sequence . The binary entropy function is denoted as , the indicator function by , and the counting function as . is the probability that the event occurs. The pmf of the discrete random variable is denoted as . However, we sometime use the lower case notation (e.g., ) to distinguish target pmfs or alternative definitions. We let denote the Kullback-Leibler divergence between two distributions and defined over an alphabet . denotes the set of -strongly letter-typical sequences of length . Finally, denotes the joint pmf of i.i.d. random variables .
III Problem Definition
The point-to-point coordination setup we consider in this work is depicted in Fig. 1. Node receives a sequence of actions specified by nature where is i.i.d. according to a pmf . Both nodes have access to shared randomness at rate bits/action from a common source, and each node possesses local randomness at rate , . Thus, in designing a block scheme to coordinate actions of the nodes, we assume , and , , and we wish to communicate a codeword over the rate-limited DMC to Node , where denotes the (appropriately selected) coordination message. The codeword is constructed based on the input action sequence , the local randomness at Node , and the common randomness . Node generates a sequence of actions based on the received codeword , common randomness , and local randomness . We assume that the common randomness is independent of the action specified at Node . A tuple is deemed achievable if for each , there exist and a (strong coordination) coding scheme such that the joint pmf of actions induced by this scheme and the -fold product111This is the joint pmf of i.i.d. copies of . of the desired joint pmf are close in total variation, i.e.,
[TABLE]
We now present the two achievable coordination schemes.
IV Joint Coordination Channel Coding
This scheme follows an approach similar to those in [1, 10, 4, 5] where coordination codes are designed based on allied channel resolvability problems [13]. The structure of the allied problem pertinent to the coordination problem at hand is given in Fig. 2. The aim of the allied problem is to generate symbols for two correlated sources and whose joint statistics is close to as defined by (1). To do so, we employ three independent and uniformly distributed messages , , and and two codebooks and as shown in Fig. 2. To define the two codebooks, consider auxiliary random variables and jointly correlated with as .
From this factorization it can be seen that the scheme consists of two reverse test channels and used to generate the sources from the codebooks. In particular, , i.e., the randomness of the DMC contributes to the randomized generation of .
Generating and from , , represents a complex channel resolvability problem with the following ingredients:
- •
Nested codebooks: Codebook of size is generated i.i.d. according to pmf , i.e., for all . Codebook is generated by randomly selecting for all .
- •
Encoding functions:
,
.
- •
Indices: are independent and uniformly distributed over , , and , respectively. These indices select the pair of codewords and from codebooks and .
- •
The selected codewords and are then passed through DMC at Node , while at Node , codeword is sent through DMC whose output is used to decode codeword and both are then passed through DMC to obtain .
Since the codewords are randomly chosen, the induced joint pmf of the generated actions and codeword indices in the allied problem is itself a random variable and depends on the random codebook. Given a realization of the codebooks
[TABLE]
the code-induced joint pmf of the actions and codeword indices in the allied problem is given by
[TABLE]
where denotes the pmf induced by the operation of decoding the index using the common randomness and the channel output at Node . Note that by denoting the decoding operation as a pmf, we can even incorporate randomized decoders. Note also that the indices for the -codeword that generate and sequences in (3) can be different since the decoding of the index at Node may fail. We are done if we accomplish the following tasks: (1) identify conditions on under which the code-induced pmf is close to the design pmf in the total variation sense; and (2) devise a strong coordination scheme by inverting the operation at Node . This will be done in following sections by subdividing the analysis of the allied problem.
IV-A Resolvability constraints
Assuming that the decoding of and the codeword occurs perfectly at Node , we see that the code-induced joint pmf induced by the scheme for the allied problem for a given realization of the codebook in (2) is
[TABLE]
The following result quantifies when the induced distribution in (4) is close to the -fold product of the design pmf .
Lemma 1** (Resolvability constraints).**
The total variation between the code-induced pmf in (4) and the desired pmf asymptotically vanishes, i.e., {\mathbb{E}_{\mathsf{C}}}\big{[}\left\lVert\check{P}_{X^{n}Y^{n}}-P^{\otimes n}_{XY}\right\rVert_{{\scriptscriptstyle TV}}\big{]}\rightarrow 0 as , if
[TABLE]
Note that here denotes the expectation over the random realization of the codebooks.
Proof.
In the following, we drop the subscripts from the pmfs for simplicity. Let , and choose . Consider the argument for \mathbb{E}_{\mathsf{C}}\big{[}\mathbb{D}(\check{P}_{X^{n}Y^{n}}||P^{\otimes n}_{XY})\big{]} shown at the top of the following page.
In this argument:
- (a)
follows from the law of iterated expectation. Note that we have used to denote the codewords corresponding to the indices , and to denote the codewords corresponding to the indices , respectively.
- (b)
follows from Jensen’s inequality.
- (c)
follows from dividing the inner summation over the indices into three subsets based on the indices from the outer summation.
- (d)
follows from taking the expectation within the subsets in (c) such that when
- –
: is conditionally independent of following the nature of the codebook construction (i.e., i.i.d. at random);
- –
: both codewords () are independent of regardless of the value of . As a result, the expected value of the induced distribution with respect to the input codebooks is the desired distribution [1].
- (e)
follows from
- –
: there is only one pair of codewords ;
- –
when while there are indices in the sum;
- –
: the number of the indices is at most
- (f)
results from splitting the outer summation: The first summation contains typical sequences and is bounded by using the probabilities of the typical set. The second summation contains the tuple of sequences when the pair of actions sequences and codewords are not -jointly typical (i.e., ). This sum is upper bounded following [4] with \mu_{XY}=\min_{x,y}\big{(}P_{XY}(x,y)\big{)}.
- (g)
following the Chernoff bound of the probability that a sequence is not strongly typical [14] where \mu_{XYAC}=\min_{x,y,a,c}\big{(}P_{XYAC}(x,y,a,c)\big{)}.
Consequently, the contribution of typical sequences can be made asymptotically small if
[TABLE]
while the second term converges to zero exponentially fast with [14]. Finally, by applying Pinsker’s inequality we have
[TABLE]
∎
Remark*.*
Given , , , satisfying (5) and (6), it follows from (IV-A) that there exist an and a random codebook realization for which the code-induced pmf between the indices and the pair of actions satisfies
[TABLE]
IV-B Decodability constraint
Since the operation at Node in Fig. 2 involves the decoding of and thus the codeword using and , the induced distribution of the scheme for the allied problem will not match that of (4) unless and until we ensure that the decoding succeeds with high probability as . The following lemma quantifies the necessary rate for this decoding to succeed asymptotically almost always.
Lemma 2** (Decodability constraint).**
Let be the output of a typicality-based decoder that uses common randomness to decode the index and the sequence from . If the rate for the index satisfies then,
- i.
\mathbb{E}_{\mathsf{C}}\big{[}\mathbb{P}[\hat{I}\neq I]\big{]}\rightarrow 0* as , where is the probability that the decoding fails for a realization of the random codebook, and* 2. ii.
\lim\limits_{n\rightarrow\infty}\mathbb{E}_{\mathsf{C}}\big{[}\lVert\check{P}_{X^{n}Y^{n}IJK}-\mathring{P}_{X^{n}Y^{n}IJK}\rVert_{{\scriptscriptstyle TV}}\big{]}=0.**
Proof.
We start the proof of i) by calculating the average probability of error, averaged over all codewords in the codebook and averaged over all random codebook realizations.
[TABLE]
where in (a) we have used the fact that the conditional probability of error is independent of the triple of indices due to the i.i.d. nature of the codebook construction. Also, due to the random construction and the properties of jointly typical set, we have
[TABLE]
We now continue the proof by constructing the sets for each and that Node will use to identify the transmitted index:
[TABLE]
The set consists of indices such that for a given common randomness index and channel realization , the sequences are jointly-typical. Assuming was realized, and if , then the decoding will be successful. The probability of this event is divided into two steps as follows:
• First, assuming was realized, for successful decoding, must be an element of . The probability of this event can be bounded as follows.
[TABLE]
where (a) follows from the properties of jointly typical sets.
• Next, assuming again that was realized, for successful decoding no index greater than or equal to must be an element of . The probability of this event can be bounded as follows:
[TABLE]
where (a) follows from the packing lemma [15], and (b) results if .
Then from (9), the claim in i) follows as given by
[TABLE]
Finally, the proof of ii) follows in a straightforward manner. If the previous two conditions are met, then and , where denotes the Kronecker delta. Consequently, from (3) and (4)
[TABLE]
∎
IV-C Independence constraint
We complete modifying the allied structure to mimic the original problem with a final step. By assumption, we have a natural independence between the action sequence and the common randomness . As a result, the joint distribution over and in the original problem is a product of the marginal distributions and . To mimic this behavior in the scheme for the allied problem, in Lemma 3 we artificially enforce independence by ensuring that the mutual information between and vanishes.
Lemma 3** (Independence constraint).**
Consider the scheme for the allied problem given in Fig. 2. Both and \mathbb{E}_{\mathsf{C}}\big{[}||\check{P}_{X^{n}J}-P^{\otimes n}_{X}P_{J}||_{{\scriptscriptstyle TV}}\big{]}\rightarrow 0 as if the code rates satisfy
[TABLE]
The proof of Lemma 3 builds on the results of Section IV-B and the proof of Lemma 1 of Section IV-A, resulting in
[TABLE]
Remark*.*
Given , , meeting (11) and (12), it follows from (IV-C) that there exist an and a random codebook realization for which the code-induced pmf between the common randomness and the actions of Node satisfies
[TABLE]
In the original problem of Fig. 1, the input action sequence and the index from the common randomness source are available and the - and -codewords are to be selected. Now, to devise a scheme for the strong coordination problem, we proceed as follows. We let Node choose indices and (and, consequently, the - and -codewords) from the realized and using the conditional distribution . The joint pmf of the actions and the indices is then given by
[TABLE]
Finally, we can argue that
[TABLE]
since the total variation between the marginal pmf and the design pmf can be bounded as
[TABLE]
where (a) follows from the triangle inequality; (b) follows from (3), (4), (15) and [3, Lemma V.1]; (c) follows from [3, Lemma V.2]. The terms in the RHS of (c) can be made vanishingly small provided the resolvability, decodability, and independence conditions are met. Thus, we are guaranteed that by meeting the five conditions of Lemmas 1-3, the scheme defined by (15) achieves strong coordination between Nodes and by communicating over the DMC . Note that since the operation at Nodes and amount to an index selection according to , and a generation of using the DMC , both operations are randomized. The last step is to derandomize the operations at Nodes and by viewing the corresponding local randomness as the source of randomness in these operations. This is detailed next.
IV-D Local randomness rates
At Node , local randomness is employed to randomize the selection of indices by synthesizing the channel whereas Node utilizes its local randomness to generate the action sequence by simulating the channel . Using the arguments in [5], we can argue that for any given realization of , the minimum rate of local randomness required for the probabilistic selection of indices can be derived by quantifying the number of and codewords (equivalently the pair of indices ) jointly typical with . Quantifying the list size as in [5] yields . At Node , the necessary local randomness for the generation of the action sequence is bounded by the channel simulation rate of DMC [16]. Thus, .
Moreover, one can always view a part of the common randomness as local randomness, which then allows us to incorporate the rate-transfer arguments given in [5, Lemma 2]. Combining the rate-transfer argument with the constraints in Lemmas 1-3, we obtain following inner bound to the strong coordination capacity region.
Theorem 1**.**
A tuple is achievable for the strong noisy communication setup in Fig. 1 if for some ,
[TABLE]
V Separate Coordination-Channel Coding Scheme with Randomness Extraction
As a basis for comparison, we will now introduce a separation-based scheme that involves randomness extraction. We first use a noiseless coordination code with the codebook to generate a message of rate . Such a code exists if and only if the rates satisfy [1]
[TABLE]
This coordination message is then communicated over the noisy channel using a rate- channel code over channel uses with codebook . Hence, , where . The probability of decoding error can be made vanishingly small if . Then, from the decoder output and the common randomness message we reconstruct the coordination sequence and pass it though a test channel to generate the action sequence at Node . Note that this separation scheme is constructed as a special case of the joint coordination-channel scheme of Fig. 2 by choosing and .
In the following, we restrict ourselves to additive-noise DMCs, i.e.,
[TABLE]
where is the noise random variable drawn from some finite field , and “” is the native addition operation in the field. To extract randomness, we exploit the additive nature of the channel to recover the realization of the channel noise from the decoded codeword. Thus, at the channel decoder output we obtain
[TABLE]
where is the channel output and the corresponding decoded channel codeword. We can then utilize a randomness extractor on to supplement the local randomness available at Node . The following lemma provides some guarantees with respect to the randomness extraction stage.
Lemma 4**.**
Consider the separation based scheme over a finite-field additive DMC. If and with , the following hold:
- i.
** 2. ii.
* and* 3. iii.
.
Proof.
Let be the probability of decoding error (i.e., and ). We first show the claim in i). From the channel coding theorem we obtain that . Consequently, from (18) and (19) will follow directly as .
Then, the claim in ii) is shown as follows
[TABLE]
where (a) follows from the chain rule of entropy; (b) follows from Fano’s inequality and the fact that ;
Finally, the claim in iii) is shown by the following chain of inequalities:
[TABLE]
where (a) follows from Fano’s inequality; (b) follows from , and as respectively. ∎
Now, similar to the joint scheme, we can quantify the local randomness at both nodes, apply the rate transfer lemma [5, Lemma 2], and set to facilitate comparison with the joint scheme from Section IV. The following theorem then describes an inner bound to the strong coordination region using the separate-based scheme with randomness extraction.
Theorem 2**.**
There exists an achievable separation based coordination-channel coding scheme for the strong setup in Fig 1 such that (1) is satisfied for if
[TABLE]
The proof follows in a straightforward way from the proofs of both Theorem 1 and Lemma 4 and is therefore omitted.
VI Example
In the following, we compare the performance of the joint scheme in Section IV and the separation-based scheme in Section V using a simple example. Specifically, we let be a Bernoulli- source, the communication channel be a binary symmetric channel with crossover probability (BSC()), and the conditional distribution be a BSC().
VI-A Basic separation scheme with randomness extraction
To derive the rate constraints for the basic separation scheme, we consider with (which is known to be optimal [3]), , and , , . Using this to obtain the mutual information terms in Theorem 2, we get
[TABLE]
After a round of Fourier-Motzkin elimination by using (21a)-(21c) in Theorem 2, we obtain the following constraints for the achievable region using the separation-based scheme with randomness extraction:
[TABLE]
Note that (22a) presents the achievable sum rate constraint for the total required randomness in the system.
VI-B Joint scheme
The rate constraints for the joint scheme are constructed in two stages. First, we derive the scheme for the codebook cardinalities and , an extension to larger is straightforward but more tedious (see Figs. 3 and 4)222Note that these cardinalities are not optimal. They are, however, analytically feasible and provide a good intuition about the performance of the scheme.. The joint scheme correlates the codebooks while ensuring that the decodability constraint (17e) is satisfied. To get the best tradeoff, we find the joint distribution that maximizes . For this is simply given by . Then, the distribution that produces the boundary of the strong coordination region for the joint scheme is formed by cascading two BSCs and another symmetric channel, yielding the Markov chain , with the channel transition matrices
[TABLE]
for some
Then, the mutual information terms in Theorem 1 can be expressed with as
[TABLE]
To find the minimum achievable sum rate we first perform Fourier-Motzkin elimination on the rate constraints in Theorem 1 and then minimize the information terms with respect to the parameters , , and as follows:
[TABLE]
VI-C Numerical results
Fig. 3 presents a comparison between the minimum randomness sum rate required to achieve coordination using the joint and the separate scheme with randomness extraction when the communication channel is given by BSC(). The target distribution is set as . The rates for the joint scheme are obtained by solving the optimization problem in (29). Similar results are obtained for the joint scheme with . For the separate scheme we choose such that to maximize the amount of extracted randomness. We also include the performance of the separate scheme without randomness extraction. As can be seen from Fig. 3, both the joint scheme and the separate scheme with randomness extraction provide the same sum rate for where . We also observe that for noisy channels the joint scheme approaches the performance of the separate scheme when the cardinality of is increased. In this regime, we let such that in order to maximize the amount of extracted randomness. This is done by selecting and associated with . However, it can be easily shown that for this does not ensure a target distribution of anymore. Therefore, the optimization over the parameters and now results in a larger sum rate as can be seen from Fig. 3. As increases further, the required total randomness of the joint scheme approaches the one for the basic separate scheme again.
Fig. 4 provides a comparison of the communication rate for both schemes. Note that the joint scheme provides significantly smaller rates than the separation scheme with randomness extraction for , independent of the cardinality of . Thus, in this regime joint coordination-channel coding provides an advantage in terms of communication cost and outperforms the separation-based scheme for the same amount of randomness injected into the system.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] P. W. Cuff, H. H. Permuter, and T. M. Cover, “Coordination capacity,” IEEE Trans. Inf. Theory , vol. 56, no. 9, pp. 4181–4206, 2010.
- 2[2] E. Soljanin, “Compressing quantum mixed-state sources by sending classical information,” IEEE Trans. Inf. Theory , vol. 48, no. 8, pp. 2263–2275, Aug. 2002.
- 3[3] P. Cuff, “Distributed channel synthesis,” IEEE Trans. Inf. Theory , vol. 59, no. 1, pp. 7071–7096, Nov. 2013.
- 4[4] M. R. Bloch and J. Kliewer, “Strong coordination over a three-terminal relay network,” in Proc. IEEE Information Theory Workshop , Hobart, Australia, Nov. 2014, pp. 646–650.
- 5[5] B. N. Vellambi, J. Kliewer, and M. R. Bloch, “Strong coordination over multi-hop line networks,” 2016. [Online]. Available: http://arxiv.org/abs/1602.09001
- 6[6] A. Bereyhi, M. Bahrami, M. Mirmohseni, and M. R. Aref, “Empirical coordination in a triangular multi-terminal network,” in Proc. IEEE Int. Sympos. on Inform. Theory , Istanbul, Turkey, 2013, pp. 2149–2153.
- 7[7] A. A. Gohari and V. Anantharam, “Generating dependent random variables over networks,” in Proc. IEEE Information Theory Workshop , Paraty, Brazil, Oct. 2011, pp. 698–702.
- 8[8] M. H. Yassaee, A. Gohari, and M. R. Aref, “Channel simulation via interactive communications,” IEEE Trans. Inf. Theory , vol. 61, no. 6, pp. 2964–2982, 2015.
