A Unified Framework for One-shot Achievability via the Poisson Matching Lemma
Cheuk Ting Li, Venkat Anantharam

TL;DR
This paper introduces the Poisson matching lemma, a new fundamental tool that simplifies and improves one-shot achievability bounds across various information theory problems, extending previous work on Poisson functional representation.
Contribution
The paper presents the Poisson matching lemma and demonstrates its broad applicability, providing improved one-shot bounds and simpler proofs in network information theory.
Findings
Improved one-shot bounds in most settings compared to previous results.
Simplified proofs by replacing packing and covering lemmas.
Extension of Poisson functional representation to fixed-length settings.
Abstract
We introduce a fundamental lemma called the Poisson matching lemma, and apply it to prove one-shot achievability results for various settings, namely channels with state information at the encoder, lossy source coding with side information at the decoder, joint source-channel coding, broadcast channels, distributed lossy source coding, multiple access channels, channel resolvability and wiretap channels. Our one-shot bounds improve upon the best known one-shot bounds in most of the aforementioned settings (except multiple access channels, channel resolvability and wiretap channels, where we recover bounds comparable to the best known bounds), with shorter proofs in some settings even when compared to the conventional asymptotic approach using typicality. The Poisson matching lemma replaces both the packing and covering lemmas, greatly simplifying the error analysis. This paper extends…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A Unified Framework for One-shot Achievability via the Poisson Matching
Lemma
Cheuk Ting Li and Venkat Anantharam
EECS, UC Berkeley, Berkeley, CA, USA
Email: [email protected], [email protected]
Abstract
We introduce a fundamental lemma called the Poisson matching lemma, and apply it to prove one-shot achievability results for various settings, namely channels with state information at the encoder, lossy source coding with side information at the decoder, joint source-channel coding, broadcast channels, distributed lossy source coding, multiple access channels, channel resolvability and wiretap channels. Our one-shot bounds improve upon the best known one-shot bounds in most of the aforementioned settings (except multiple access channels, channel resolvability and wiretap channels, where we recover bounds comparable to the best known bounds), with shorter proofs in some settings even when compared to the conventional asymptotic approach using typicality. The Poisson matching lemma replaces both the packing and covering lemmas, greatly simplifying the error analysis. This paper extends the work of Li and El Gamal on Poisson functional representation, which mainly considered variable-length source coding settings, whereas this paper studies fixed-length settings, and is not limited to source coding, showing that the Poisson functional representation is a viable alternative to typicality for most problems in network information theory.
I Introduction
The Poisson functional representation was introduced by Li and El Gamal [1] to prove the strong functional representation lemma: for any pair of random variables , there exists a random variable independent of such that is a function of , and . The lemma is applied to show various one-shot variable-length lossy source coding results, and a simple proof of the asymptotic achievability in the Gelfand-Pinsker theorem [2].
In this paper, we introduce the Poisson matching lemma, which gives a bound on the probability of mismatch between the Poisson functional representations applied on different distributions, and use it to prove one-shot achievability results for various settings, namely channels with state information at the encoder, lossy source coding with side information at the decoder, joint source-channel coding, broadcast channels, distributed lossy source coding, multiple access channels, channel resolvability and wiretap channels. The Poisson matching lemma can replace both the packing and covering lemmas (and generalizations such as the mutual covering lemma) in asymptotic typicality-based proofs. The one-shot bounds in this paper subsume the corresponding asymptotic achievability results by straightforward applications of the law of large numbers.
Various non-asymptotic alternatives to typicality have been proposed, e.g. one-shot packing and covering lemmas [3, 4], stochastic likelihood coder [5], likelihood encoder [6] and random binning [7]. However, these non-asymptotic approaches generally require more complex proofs than their asymptotic counterparts, whereas proofs using the Poisson matching lemma can be even simpler than asymptotic proofs.
Our approach is better than the conventional asymptotic approach using typicality (and previous one-shot results, e.g. [3, 5]), in the following ways:
We can give one-shot bounds stronger than the best known one-shot bounds in many settings discussed in this paper, with the exception of channel coding, multiple access channels, channel resolvability and wiretap channels, which are included for demonstration purposes, where we recover bounds comparable to the best known bounds. 2. 2.
Our proofs work for random variables in general Polish spaces. 3. 3.
To the best of our knowledge, for the achievability in the Gelfand-Pinsker theorem [2] (for channels with state information at the encoder) and the Wyner-Ziv theorem [8, 9] (for lossy source coding with side information at the decoder), our proofs are significantly shorter than all previous proofs (another short proof of the achievability in the Gelfand-Pinsker theorem is given in [1], though it is asymptotic). Using our approach, we can also greatly shorten the proof of the achievability of the dispersion in joint source-channel coding [10]. 4. 4.
Our proofs only use the Poisson matching lemma introduced in this paper, which replaces both the packing and covering lemmas in proofs using typicality. The Poisson matching lemma can also be used to prove a soft covering lemma. Hence the Poisson matching lemma can be the only tool needed to prove a wide range of results in network information theory. 5. 5.
Our analyses usually involve fewer (or no) uses of sub-codebooks and binning. As a result, we can reduce the number of error events and give sharper second-order bounds. For example:
- (a)
Conventional proofs of the Gelfand-Pinsker theorem involve one sub-codebook, giving an additional error event, whereas we do not use any sub-codebook. 2. (b)
Conventional proofs of the Wyner-Ziv theorem and the Berger-Tung inner bound [11, 12] (for distributed lossy source coding) use binning, giving additional error events, whereas we do not require binning. 3. (c)
Conventional proofs of Marton’s inner bound [13] (for broadcast channels) involve two sub-codebooks, whereas we use only one. 6. 6.
In our approach, the encoders and decoders are characterized using a common framework (the Poisson functional representation), which is noteworthy since the roles of an encoder and a decoder in an operational setting are very different, and their constructions usually have little in common in conventional approaches.
Notation
Throughout this paper, we assume that is to base 2 and the entropy is in bits. We write for .
The set of positive integers is denoted as . We use the notation: , and . The conditional information density is denoted as
[TABLE]
We consider to be defined only if .
For discrete , we write the probability mass function as . For continuous , we write the probability density function as . For a general random variable in a measurable space, we write its distribution as . The uniform distribution over a finite set is denoted as . The joint distribution of is written as . The degenerate distribution is denoted as . The conditional independence of and given is denoted as .
The Q-function and its inverse are denoted as and respectively. For positive semidefinite, define where and denotes entrywise comparison.
We assume that every random variable mentioned in this paper lies in a Polish space with its Borel -algebra, and all functions mentioned (e.g. distortion measures, the function in Theorem 2) are measurable. The Lebesgue measure over is denoted as . The Lebesgue measure restricted to the set is denoted as . For two measures over (a Polish space with its Borel -algebra) such that is absolutely continuous with respect to (denoted as ), the Radon-Nikodym derivative is written as
[TABLE]
If (but may not hold), we write
[TABLE]
which is [math] if , and is if and .
The total variation distance between two distributions over is denoted as .
II Poisson Matching Lemma
We first state the definition of Poisson functional representation in [1], with a different notation that allows the proofs to be written in a simpler and more intuitive manner.
Definition 1** (Poisson functional representation).**
Let be the points of a Poisson process with intensity measure on (where is a Polish space with its Borel -algebra, and is -finite). For a probability measure over , define
[TABLE]
where
[TABLE]
with arbitrary tie-breaking (a tie occurs with probability 0). We omit and only write if the Poisson process is clear from the context. If the Poisson process is instead of , then the Poisson functional representation is likewise denoted as . If is multivariate, and is a distribution over , the Poisson functional representation is denoted as . We write its components as .
Note that while is only uniquely defined up to a -null set, changing the value of on a -null set will only affect the values of on a null set with respect to the distribution of , since the probability that there exists on that -null set is zero. Therefore is uniquely defined up to a null set.
By the mapping theorem [14, 15] (also see Appendix A of [1]), we have . This is termed Poisson functional representation in [1] since it can be regarded as a construction for the functional representation lemma [16]. Consider the distribution . Let be the points of a Poisson process with intensity measure , independent of the process, and . Then . Hence we can express as a function of and (which is independent of ). This fact will be used repeatedly throughout the proofs in this paper.
For two different distributions and , and are coupled in such a way that occurs with a probability that can be bounded in terms of . We now present the core lemma of this paper. The proof is given in Appendix -A.
Lemma 1** (Poisson matching lemma).**
Let be the points of a Poisson process with intensity measure , and be probability measures on with . Then we have the following almost surely:
[TABLE]
where we write as in (1) (we do not require ). The right hand side of (2) is considered to be 1 if and .
The exact expression for the left hand side of (2) is in (16).
We usually do not apply the Poisson matching lemma on fixed , but rather on conditional distributions. The following conditional version of the Poisson matching lemma follows directly from applying the lemma on . The proof is given in Appendix -B for the sake of completeness.
Lemma 2** (Conditional Poisson matching lemma).**
Fix a distribution and a probability kernel (that is not necessarily ) satisfying almost surely. Let , and be the points of a Poisson process with intensity measure independent of . Let and (note that and ). Then we have the following almost surely:
[TABLE]
The condition that almost surely is satisfied, for example, when , , and . Note that since , we have , whereas may not be independent of , so may not follow the conditional distribution .
III One-shot Channel Coding
To demonstrate the application of the Poisson matching lemma, we apply it to recover a bound for one-shot channel coding in [5] (with a slight penalty of having instead of ). Upon observing , the encoder produces , which is sent through the channel . The decoder observes and recovers with error probability .
Proposition 1**.**
Fix any . There exists a code for the channel , with message , with average error probability
[TABLE]
if .
Proof:
Let be the points of a Poisson process with intensity measure (where is ) independent of . The encoding function is (i.e., ), and the decoding function is (i.e., ). Note that the encoding and decoding functions also depend on the common randomness , which will be fixed later. We have .
[TABLE]
where (a) is by the conditional Poisson matching lemma (Lemma 2) on (note that ). Therefore there exists a fixed such that conditioned on , the average probability of error is bounded by . ∎
Compared to the scheme in [5], we use the Poisson process to create a codebook, instead of the conventional i.i.d. random codebook in [5]. While the codewords for different ’s are still i.i.d., we attach a bias to each codeword. Our scheme does not use a stochastic decoder as in [5], but rather a biased maximum likelihood decoder where . In the following sections, we will demonstrate how our approach can lead to simpler proofs and sharper bounds compared to [5].
Using the generalized Poisson matching lemma that will be introduced in Section VII, we can prove the following bound. The proof is in Appendix -C.
Theorem 1**.**
Fix any . There exists a code for the channel , with message , with average error probability
[TABLE]
if .
Compare this to the dependence testing bound [17]:
[TABLE]
Theorem 1 is at least as strong (with a slight penalty of having instead of ) since
[TABLE]
Remark 1*.*
Apart from the dependence testing bound [17], there are other one-shot bounds for channel coding such as the random-coding union (RCU) bound and the bound in [17], which are tighter in certain situations (e.g. the RCU bound is suitable for error exponent analysis). The technique introduced in this paper is suitable for first and second order analysis, but does not seem to give tight error exponent bounds.
IV One-shot Coding for Channels with State Information at the Encoder
The one-shot coding setting for a channel with state information at the encoder is described as follows. Upon observing and , the encoder produces , which is sent through the channel with state . The decoder observes and recovers with error probability .
We show a one-shot version of the Gelfand-Pinsker theorem [2]. This is the first one-shot bound attaining the best known second order result in [18] (which considers a finite-blocklength, not one-shot scenario). Our bound is stronger than the one-shot bounds in [3, 5, 19] (in the second order), and significantly simpler to state and prove than all the aforementioned results. Unlike previous approaches, our proof does not require sub-codebooks.
Theorem 2**.**
Fix any and function . There exists a code for the channel with state distribution with message , with error probability
[TABLE]
if and , where .
Proof:
Let be the points of a Poisson process with intensity measure independent of . The encoding function is (let , ), and the decoding function is (i.e., ). Note that . We have
[TABLE]
where (a) is by the conditional Poisson matching lemma on (note that ). Therefore there exists a fixed attaining the desired bound. ∎
Compared to Theorem 3 in [3]:
[TABLE]
for any , , our result is strictly stronger since
[TABLE]
This is due to the fact that the Poisson matching lemma simultaneously replaces both the covering and the packing lemma, resulting in only one error event.
Next, we prove a second-order result. Fix . Let , . We apply Theorem 2 on uses of the memoryless channel with i.i.d. state sequence , and
[TABLE]
where is a constant that depends on . For , by the Berry-Esseen theorem [20, 21, 22], we have
[TABLE]
if we let be the constant given by the Berry-Esseen theorem. This coincides with the best known second order result in [18], which is stronger than the second order results implied by [3, 5, 19]. We bound as a single quantity, instead of bounding the two terms separately as in [3, 5, 19], resulting in a sharper second order bound.
V One-shot Lossy Source Coding with Side Information at the Decoder
The one-shot lossy source coding setting with side information at the decoder is described as follows. Upon observing , the encoder produces . The decoder observes and and recovers with probability of excess distortion , where is a distortion measure.
We show a one-shot version of the Wyner-Ziv theorem [8, 9]. Our bound is stronger than those in [3, 19], and significantly simpler to state and prove. Unlike previous approaches, our proof does not require binning.
Theorem 3**.**
Fix any and function . There exists a code for lossy source coding with source distribution , side information at the decoder given by , and message size , with probability of excess distortion
[TABLE]
if and , where .
Proof:
Let be the points of a Poisson process with intensity measure independent of , where is . The encoding function is (i.e., ), and the decoding function is (let , ). Also define , . Note that . We have
[TABLE]
where (a) is by the conditional Poisson matching lemma on (note that ). Therefore there exists a fixed attaining the desired bound. ∎
This reduces to lossy source coding (without side information) when . Note that the encoder is designed in the same way with or without side information. An encoder for lossy source coding is sufficient to achieve the bound in Theorem 3 even when side information is present. Binning is not required at the encoder.
Similar to the case in Section IV, it can be checked that our bound is stronger than that in Theorem 2 in [3]. Compared to Corollary 9 in [19]:
[TABLE]
for any , , our result is stronger since
[TABLE]
where the last inequality is due to
[TABLE]
by the AM-GM inequality for , (since the right hand side of (3) for it to be meaningful). We bound as a single quantity, instead of bounding the two terms separately, resulting in a sharper bound.
VI One-shot Joint Source-Channel Coding
The one-shot joint source-channel coding setting is described as follows. Upon observing the source symbol , the encoder produces , which is sent through the channel . The decoder observes and recovers with probability of excess distortion , where is a distortion measure.
We show a one-shot joint source-channel coding result that achieves the optimal dispersion in [10].
Theorem 4**.**
Fix any and . There exists a code for the source distribution and channel , with probability of excess distortion
[TABLE]
if , where , and .
Proof:
Let be the points of a Poisson process with intensity measure independent of . Let . Let be defined as
[TABLE]
The encoding function is (i.e., ). The decoding function is (i.e., ). Also define . We have .
[TABLE]
where (a) is by the conditional Poisson matching lemma on (note that ). Therefore there exists a fixed attaining the desired bound. ∎
Compare this to Theorem 7 in [10]:
[TABLE]
for any , . While neither of the bounds implies the other, our bound is at least within a factor of 2 from (4), since
[TABLE]
However, (4) does not imply a bound that is within a constant factor from our bound. Theorem 8 in [10] is obtained by substituting in (4):
[TABLE]
which is strictly weaker than our bound with an unbounded multiplicative gap (that tends to when the bound tends to 0). Hence our bound is stronger than Theorem 7 and 8 in [10] (ignoring constant multiplicative gaps). Also our proof is significantly shorter than that of Theorem 7 in [10].
Please refer to Appendix -D for the proof that Theorem 4 achieves the optimal dispersion.
VII Poisson Matching Lemma Beyond the First Index
The Poisson functional representation concerns the point with the smallest . We can generalize it to obtain a sequence ordered in ascending order of .
Definition 2** (Mapped Poisson process).**
Let be the points of a Poisson process with intensity measure on (where is a Polish space with its Borel -algebra, and is -finite). For a probability measure over , let be a sequence of distinct integers such that and is sorted in ascending order with arbitrary tie-breaking (a tie occurs with probability 0). For , define the *mapped Poisson process with respect to *as
[TABLE]
where
[TABLE]
[TABLE]
For probability measures over , define and as above. Define
[TABLE]
where the minimum is if such does not exist. We omit and only write , , if the Poisson process is clear from the context. Note that, with probability 1, we have either or . Also, for any , . Loosely speaking, can be regarded as “” (if there are no atoms in ), i.e., finding the -th point in the mapped Poisson process w.r.t. , then finding its index in the mapped Poisson process w.r.t. .
While is only uniquely defined up to a -null set, changing the value of on a -null set will only affect the values of on a null set with respect to the distribution of , since the probability that there exists in that -null set is zero. Therefore is uniquely defined up to a null set. The same is true for .
By the mapping theorem [14, 15] (also see Appendix A of [1]),
[TABLE]
is a Poisson process with intensity measure . Hence
[TABLE]
We present a generalized Poisson matching lemma concerning the indices beyond the first. The proof is given in Appendix -A.
Lemma 3** (Generalized Poisson matching lemma).**
Let be the points of a Poisson process with intensity measure on , and be probability measures over with . Fix any . Then we have the following almost surely:
[TABLE]
where we write as in (1) (we do not require ). As a result, we have the following almost surely: for all ,
[TABLE]
For , this can be slightly strengthened to
[TABLE]
For , this can be slightly strengthened to: for all ,
[TABLE]
The exact distribution of is given in (15).
Similar to Lemma 2, we can state a conditional version of the generalized Poisson matching lemma. The proof follows the same logic as Lemma 2 and is omitted.
Lemma 4** (Conditional generalized Poisson matching lemma).**
Fix a distribution and a probability kernel , satisfying and almost surely. Let , and be the points of a Poisson process with intensity measure independent of . Let and (note that and ). Then we have the following almost surely:
[TABLE]
and for all ,
[TABLE]
and
[TABLE]
If almost surely, then we also have the following almost surely: for all ,
[TABLE]
Remark 2*.*
We can use the generalized Poisson matching lemma to extend Proposition 1 to the list decoding setting with fixed list size . The decoder outputs the list . The error event becomes . The probability of error is bounded by .
VIII One-shot Coding for Broadcast Channels and Mutual Covering
The one-shot coding setting for the broadcast channel with common message is described as follows. Upon observing three independent messages , , the encoder produces , which is sent through the broadcast channel . Decoder observes and recovers and (). The error probability is .
We show a one-shot version of the inner bound in [23, Theorem 5] (which is shown to be equivalent to [24, Theorem 1] in [25]). The proof is given in Appendix -F.
Theorem 5**.**
Fix any and function . For any , there exists a code for the broadcast channel for independent messages , , with the error probability bounded by
[TABLE]
if all the information density terms are defined almost surely, where
[TABLE]
As a result, for ,
[TABLE]
The logarithmic terms and (or the last term in (6)) result in an penalty on the rate in the finite blocklength regime, and do not affect the second order result. Ignoring the last term in (6), the error event in (6) is a strict subset of those in [5, eqn (32)] and [4, eqn (49)]. This is because the error event in [5] is a superset of (6) by Fourier-Motzkin elimination on in the error event in [5], but the reverse is not true since Fourier-Motzkin elimination only guarantees the existence of a random variable for (that depends on the information density terms) satisfying the bounds, but must be a constant since it is a parameter of the code construction in [5].
Theorem 5 gives the following second order bound. Consider independent channel uses. Let for . By the multi-dimensional Berry-Esseen theorem [26] (using the notation in [5]), we have if there exists such that
[TABLE]
if , where is a constant that depends on , and , for , and
[TABLE]
To demonstrate the use of the generalized Poisson matching lemma in place of the mutual covering lemma, we prove a one-shot version of Marton’s inner bound without common message [13] (i.e., ). Our bound is stronger than that in [3] in the sense that our bound implies [3] (with a slight penalty of having instead of ), but [3] does not imply our bound. We also note that a finite-blocklength bound is given in [7]. Nevertheless, the analysis in [7] only works for discrete auxiliary random variables , and does not appear to yield a one-shot bound due to the use of typical sequences.
In the conventional mutual covering approach in [5, 4], sub-codebooks for both and are generated, whereas in our approach we generate a sub-codebook only for , and the codebook of adapts to the sub-codebook automatically, eliminating the need for a sub-codebook for .
Theorem 6**.**
Fix any and function . For any , there exists a code for the broadcast channel for independent private messages , , with the error probability bounded by
[TABLE]
if all the information density terms are defined, where .
Proof:
Let , be two independent Poisson processes with intensity measures and respectively, independent of .
The encoder would generate such that
[TABLE]
where , and is an intermediate list (which can be regarded as a sub-codebook). The term in (7) means that are i.i.d. , and . To accomplish this, the encoder computes for (which Poisson process we are referring to can be deduced from whether we are discussing or ), , and (where is derived from (7)), and outputs . It can be verified that (7) is satisfied.
The decoding functions are , . We have the following almost surely:
[TABLE]
where (a) is by (see Figure 1 middle), and (b) is by the conditional generalized Poisson matching lemma on , since , , and , which can be deduced from (7) and (see Figure 1 middle).
Also, almost surely,
[TABLE]
where (a) is by (see Figure 1 right), (b) is by the conditional Poisson matching lemma on , and (c) is because (the ’s not selected as ) are independent of , , and Jensen’s inequality. Hence,
[TABLE]
Therefore there exist fixed realizations of the Poisson processes attaining the desired bound. ∎
IX One-shot Distributed Lossy Source Coding
The one-shot distributed lossy source coding setting is described as follows. Let . Upon observing , encoder produces , . The decoder observes and recovers , with probability of excess distortion , where is a distortion measure for .
We show a one-shot version of the Berger-Tung inner bound [11, 12].
Theorem 7**.**
Fix any , and functions , . There exists a code for distributed lossy source coding with sources and message sizes , with probability of excess distortion
[TABLE]
if all the information density terms are defined, where . As a result, for ,
[TABLE]
The logarithmic term in (8) (or the last term in (9)) results in an penalty on the rate in the finite blocklength regime, and does not affect the second order result. Ignoring the last term in (9), the error event in (9) is a strict subset of that in [5, eqn (47)]. This is because the error event in [5] is a superset of (9) by Fourier-Motzkin elimination on in the error event in [5], but the reverse is not true since Fourier-Motzkin elimination only guarantees the existence of random variables for (that depend on the information density terms) satisfying the bounds, but must be constants since they are parameters of the code construction in [5].
We now prove the result. Unlike previous approaches, our proof does not require binning. The encoders are the same as those for point-to-point lossy source coding.
Proof:
Let , be two independent Poisson processes with intensity measures and respectively, independent of . The encoding functions are , (which Poisson process we are referring to can be deduced from whether we are discussing or ). Also define , , . For the decoding function, let for , where with , and , , . Note that .
Let (using the Poisson process ). By the conditional generalized Poisson matching lemma on (note that ), almost surely,
[TABLE]
Since is a function of and , we have . By the conditional Poisson matching lemma on (note that ), almost surely,
[TABLE]
where (a) is by Proposition 6, and (b) is by , (10) and Jensen’s inequality. By the conditional Poisson matching lemma on (note that ), and , almost surely,
[TABLE]
We have
[TABLE]
Therefore there exist fixed values of the Poisson processes attaining the desired bound.
For (9), if the event in (9) does not occur, by Proposition 6 with , , ,
[TABLE]
∎
Remark 3*.*
The reason for the logarithmic term is that we want to translate a bound on (given by the generalized Poisson matching lemma) into a bound on for some distribution over . Ideally, we wish , but this is impossible since the harmonic series diverges. Therefore we use a slow converging series instead, resulting in a logarithmic penalty.
If we use instead of in the proof, we can obtain the following bound for any :
[TABLE]
Compared to Theorem 7, this does not contain the logarithmic term, but requires optimizing over , and may give a worse second order result.
Another choice is to use instead of . We can obtain the following bound for any :
[TABLE]
which gives the same second order result as Theorem 7. Nevertheless, we prefer using which eliminates the need for a parameter at the decoder.
X One-shot Coding for Multiple Access Channels
The one-shot coding setting for the multiple access channel is described as follows. Upon observing ( independent), encoder produces , . The decoder observes the output of the channel and recovers . The error probability is .
We present a one-shot achievability result for the capacity region in [27, 28, 29]. While this result is slightly weaker than that in [3], we include it to illustrate the use of the generalized Poisson matching lemma in simultaneous decoding. Note that the logarithmic term results in an penalty on the rate in the finite blocklength regime, and does not affect the second order result.
Theorem 8**.**
Fix any . There exists a code for the multiple access channel for messages , , with the error probability bounded by
[TABLE]
if , where . As a result, for ,
[TABLE]
Proof:
Let , be two independent Poisson processes with intensity measures and respectively, independent of . The encoding functions are , (which Poisson process we are referring to can be deduced from whether we are discussing or ). For the decoding function, let for , where with , and .
Let (using the Poisson process ). By the conditional generalized Poisson matching lemma on (note that ), almost surely,
[TABLE]
Since is a function of and , we have . By the conditional Poisson matching lemma on (note that ), almost surely,
[TABLE]
where (a) is by Proposition 6, and (b) is by , (12) and Jensen’s inequality. By the conditional Poisson matching lemma on (note that ), almost surely,
[TABLE]
Therefore there exist fixed values of the Poisson processes attaining the desired bound.
For (11), if the event in (11) does not occur, by Proposition 6 with , , ,
[TABLE]
∎
Remark 4*.*
If we use instead of in the proof, we can obtain the following bound for any :
[TABLE]
Compared to Theorem 8, this does not contain the logarithmic term, but requires optimizing over , and may give a worse second order result.
Another choice is to use instead of . We can obtain the following bound for any :
[TABLE]
which gives the same second order result as Theorem 8. Nevertheless, we prefer using which eliminates the need for a parameter at the decoder.
XI One-shot Channel Resolvability and Soft Covering
The one-shot channel resolvability setting [30] is described as follows. Fix a channel and input distribution . Upon observing an integer , the encoder applies a deterministic mapping on to produce , which is sent through the channel and gives the output . The goal is to minimize the total variation distance between and (-marginal of ), i.e., .
We show a one-shot channel resolvability result using the the Poisson matching lemma. This result can also be regarded as a one-shot soft covering lemma [31].
Proposition 2**.**
Given channel and input distribution with . Let , then for any ,
[TABLE]
As a result, for any ,
[TABLE]
Hence there exists a code for channel resolvability satisfying the above bounds.
Proof:
Let be the points of a Poisson process with intensity measure . Let , (), and . Let , and for . We have
[TABLE]
where (a) is by the convexity of the total variation distance, and (b) is because almost surely (note that the summation ignores multiplicity of elements in ). For the first term, note that since is a function of , we have , and hence
[TABLE]
We have
[TABLE]
For the second term, by the conditional generalized Poisson matching lemma on ,
[TABLE]
Hence,
[TABLE]
For (14), substitute ,
[TABLE]
where (a) is because , and for , .
∎
Compare this to Theorem 2 in [32] (weakened by substituting ): for any ,
[TABLE]
If we assume and substitute in (14), we obtain the following slightly weaker bound (within a logarithmic gap from that in [32]):
[TABLE]
Nevertheless, the bound in [32] does not imply (13), so neither bound is stronger than the other.
The channel resolvability or soft covering bound in Proposition 2 can be applied to prove various secrecy and coordination results, e.g. one-shot coding for wiretap channels [33], one-shot channel synthesis [31], and one-shot distributed source simulation [34]. Hence these results can also be proved using the Poisson matching lemma alone. In the next section, we will prove a one-shot result for wiretap channels.
XII One-shot Coding for Wiretap Channels
The one-shot version of the wiretap channel setting [33] is described as follows. Upon observing , the encoder produces , which is sent through the broadcast channel . The legitimate decoder observes and recovers with error probability . The eavesdropper observes . Secrecy is measured by the total variation distance .
The following bound is a direct result of the generalized Poisson matching lemma and Proposition 2. It is included for demonstration purposes. See [32, 35, 36] for other one-shot bounds (that are not strictly stronger or weaker than ours).
Proposition 3**.**
Fix any . For any , , there exists a code for the wiretap channel , with message , with average error probability and secrecy measure satisfying
[TABLE]
if and .
Proof:
Let be the points of a Poisson process with intensity measure independent of . Let independent of . The encoder computes and generates . The decoder recovers . We have . By the conditional generalized Poisson matching lemma on (note that ),
[TABLE]
For the secrecy measure,
[TABLE]
where (a) is by the convexity of total variation distance, and (b) is by Proposition 2 since for any . Therefore there exists a fixed set of points for satisfying the desired bound. ∎
XIII Strong Functional Representation Lemma and Noncausal Sampling
The generalized Poisson matching lemma can be applied to give a slight improvement on the constant in the strong functional representation lemma in [1], and hence improves on the variable-length channel simulation result in [37], and the result on minimax remote prediction with a communication constraint in [38]. It also gives an achievability bound on the moments for the noncausal sampling setting in [39].
Proposition 4**.**
Let be the points of a Poisson process with intensity measure over , and be probability measures over with . For any , concave nondecreasing, we have
[TABLE]
i.e., dominates in the second order. As a result, let
[TABLE]
be the upper concave envelope of , then
[TABLE]
In particular,
[TABLE]
and for ,
[TABLE]
where is the Rényi divergence.
Proof:
For concave nondecreasing, we have
[TABLE]
where (a) is by Jensen’s inequality, and (b) is by the generalized Poisson matching lemma. For any such that for ,
[TABLE]
For , , and hence
[TABLE]
For , , is concave, and hence
[TABLE]
∎
Consider the setting in the strong functional representation lemma [1]: given , we want to find a random variable independent of such that is a function of , and is minimized. Take . Applying Proposition 4 on , , we obtain
[TABLE]
Using Proposition 4 in [1],
[TABLE]
The constant is smaller than that in [1]:
[TABLE]
XIV Conclusions and Discussion
In this paper, we introduced a simple yet versatile approach to achievability proofs via the Poisson matching lemma. By reducing the uses of sub-codebooks and binning, we improved upon existing one-shot bounds on channels with state information at the encoder, lossy source coding with side information at the decoder, broadcast channels, and distributed lossy source coding. The Poisson matching lemma can replace the packing lemma, covering lemma and soft covering lemma to be the only tool needed to prove a wide range of results in network information theory.
In the proofs, random variables (e.g. the channel input and message in channel coding settings, the source and description in source coding settings, the channel output in channel resolvability) are regarded as points in a Poisson process. The Poisson functional representation is applied to map the Poisson process to give the correct conditional distribution. Viewing every random variable in the operational setting as a Poisson process gives a simple, unified and systematic approach to code constructions.
A possible extension is to generalize the Poisson functional representation to the multivariate case. In the proof of Marton’s inner bound for broadcast channels, we had two independent Poisson processes for and respectively. We first used the process for to obtain a list of values for , then used the list to index into the process for . A more symmetric approach where we select together (similar to the conventional mutual covering approach) using a multivariate version of the Poisson functional representation may be possible. Similarly, for distributed lossy source coding and the multiple access channel, it may be possible to decode both sources/messages simultaneously. While it can be argued that the gain we obtained in broadcast channels and distributed lossy source coding over conventional approaches comes from the asymmetry of our construction (our bounds are asymmetric unlike previous bounds), a symmetric treatment that does not result in a looser bound may be developed in the future.
XV Acknowledgements
The authors acknowledge support from the NSF grants CNS-1527846, CCF-1618145, the NSF Science & Technology Center grant CCF-0939370 (Science of Information), and the William and Flora Hewlett Foundation supported Center for Long Term Cybersecurity at Berkeley.
-A Proof of Lemmas 1 and 3
We first prove Lemma 3. For notational simplicity, we use to denote that is the set of points of a Poisson process with intensity measure (the ordering of the points is ignored). Let , . Let . Let be the points where . By the mapping theorem [14, 15] on the mapping
[TABLE]
we have (where denotes restricted to the set ), and hence (the points in with ) is independent of (the points in with ).
Condition on and unless otherwise stated. Assume (which happens almost surely since ) and (otherwise the inequalities in the lemmas trivially hold). Recall that by definition. It is straightforward to check that independent of independent of , the uniform distribution over the ordered simplex (i.e., has the same distribution as i.i.d. points following sorted in ascending order of the second coordinate). We have
[TABLE]
where
[TABLE]
Due to the aforementioned independence between , and , we have . For , since , conditioning on does not affect the distribution of , and hence follows the Poisson distribution with rate
[TABLE]
For , since , follows the Poisson distribution with rate
[TABLE]
Hence follows the Poisson distribution with rate
[TABLE]
For , since has the same distribution as i.i.d. points following sorted in ascending order of the second coordinate, follows the binomial distribution with number of trials and success probability
[TABLE]
Conditioned on (without conditioning on ), we have , and . Hence, conditioned on , the distribution of is
[TABLE]
i.e., the sum of a negative binomial random variable and an independent binomial random variable. The mean is
[TABLE]
Also,
[TABLE]
For ,
[TABLE]
-B Proof of the Conditional Poisson Matching Lemma
The conditional Poisson matching lemma is intuitively obvious. The Poisson matching lemma can be equivalently stated as: for any probability measures , the following holds for -almost all :
[TABLE]
where is the conditional distribution of the Poisson process given . Intuitively, we can consider the Poisson matching lemma to be a statement with 3 parameters (ignore the almost-all condition on for the moment). Since the statement holds for (almost) any , it also holds for any random choice of . In particular, it holds for , where , which gives the conditional Poisson matching lemma. Note that the probability in the conditional Poisson matching lemma is conditional on , where , and hence conditioning on has the same effect on as conditioning on the parameters .
We now prove the conditional Poisson matching lemma rigorously. Let be the probability space for , the points of a Poisson process with intensity measure on (let be the Borel -algebra of ). The Poisson matching lemma can be equivalently stated as: for any probability measures , and a regular conditional probability distribution (RCPD) of conditioned on (i.e., is a probability kernel, and for any , , where denotes the preimage of under , note that ), then we have
[TABLE]
for -almost all .
Consider the conditional Poisson matching lemma. We have the following for -almost all :
[TABLE]
where (a) holds for -almost all due to , and (b) is by (17) with , which holds for -almost all , and hence holds for -almost all . We now check that satisfies the RCPD condition for -almost all . Since , we have for -almost all . Since , we have for -almost all . Hence the following conditions are satisfied for -almost all :
[TABLE]
[TABLE]
For any satisfying (18) and (19), we have the following: for all , ,
[TABLE]
where (a) is by (18), and (b), (c) are by (19).
-C Proof of Theorem 1
Let be the points of a Poisson process with intensity measure independent of . The encoding function is (i.e., ), and the decoding function is . We have ,
[TABLE]
where (a) is by the definition of , (b) is by the conditional generalized Poisson matching lemma on , and (c) is by and Jensen’s inequality. Therefore there exists a fixed attaining the desired bound.
A noteworthy property of this construction is that both the encoder and the decoder do not require knowledge of . The code can transmit any integer with error probability , assuming unlimited common randomness between the encoder and the decoder.
-D Dispersion of Joint Source-Channel Coding
We show a second order result for joint source-channel coding using Theorem 4 that coincides with the optimal dispersion in [10]. Consider an i.i.d. source sequence of length , separable distortion measure , and uses of the memoryless channel . Let attain the infimum of the rate-distortion function
[TABLE]
The -tilted information [40] is defined as
[TABLE]
where (the unconditional -marginal of ), and (the derivative exists if the infimum in is achieved by a unique [40]). We invoke a lemma in [40]:
Lemma 5** ([40], Lemma 2).**
If the following conditions hold:
- •
,
- •
the infimum in is achieved by a unique ,
- •
there exists a finite set such that , and
- •
* (computed assuming independent),*
then there exist constants such that for ,
[TABLE]
where , and .
We now show a second order result.
Proposition 5**.**
Fix , , . We have if the conditions in Lemma 5 are satisfied, , and
[TABLE]
where , , , and is a constant that depends on and the distribution of .
Proof:
We have
[TABLE]
where (a) is by Theorem 4, (b) is by Lemma 5, and (c) is by the Berry-Esseen theorem [20, 21, 22] if we let be a constant given by the Berry-Esseen theorem. ∎
This coincides with the optimal dispersion in [10]. Although this is not a self-contained proof (it requires the lemma in [40] for the dispersion of lossy source coding), it shows how we can obtain the achievability of the dispersion in joint source-channel coding from a result on the dispersion of lossy source coding with little additional effort, using the Poisson matching lemma. This proof is considerably simpler than that in [10].
-E Properties of
Let , , where such that . Note that is convex. It can be checked numerically that . We prove a useful inequality about .
Proposition 6**.**
For any , , we have
[TABLE]
Moreover, if , , and , then
[TABLE]
Proof:
Write for the inverse function of . Since
[TABLE]
we have
[TABLE]
By the convexity of ,
[TABLE]
If , , and ,
[TABLE]
where the last inequality follows from considering whether is positive or negative, and the inequality . ∎
-F *Proof of Theorem 5 for Broadcast Channel with Common
Message*
The parameters correspond to rate splitting. We can split into and , and treat as part of to be decoded by both decoders. Although and may not be uniformly distributed, we can apply a random cyclic shift to such that (and hence are also uniform), and condition on a fixed shift at the end. Also can be split similarly. Therefore we assume without loss of generality.
Let , , be three independent Poisson processes with intensity measures , and respectively, independent of .
The encoder would generate such that
[TABLE]
where , and is an intermediate list (which can be regarded as a sub-codebook). The term in (20) means that are conditionally i.i.d. given , and . To accomplish this, the encoder computes , for ,
(which Poisson process we are referring to can be deduced from whether we are discussing , or ), (where is derived from (20)), and outputs . It can be verified that (20) is satisfied.
For the decoding function at the decoder , let for , where with .
Let (using the Poisson process ). By the conditional generalized Poisson matching lemma on , almost surely,
[TABLE]
By (20), , , and , we have and (see Figure (2) middle). Hence by the conditional generalized Poisson matching lemma on , almost surely,
[TABLE]
where (a) is due to (see Figure 2 middle), (b) is due to the aforementioned application of the conditional generalized Poisson matching lemma, (c) is by Proposition 6, and (d) is due to (21) and (see Figure 2 middle).
Also, since and (see Figure 2 right), by the conditional Poisson matching lemma on , almost surely,
[TABLE]
where (a) is due to (see Figure 2 right), (b) is due to the aforementioned application of the conditional Poisson matching lemma, (c) is by the same arguments as in the proof of Theorem 6, (d) is by Proposition 6, and (e) is due to (21) and (see Figure 2 right). Hence
[TABLE]
where , B=\bigl{(}\log((\mathsf{L}_{2}\mathsf{J}^{-1}2^{\iota_{U_{1};U_{2}|U_{0}}(U_{1};U_{2}|U_{0})-\iota_{U_{2},Y_{2}|U_{0}}(U_{2};Y_{2}|U_{0})}+\mathsf{L}_{2}(1-\mathsf{J}^{-1})2^{-\iota_{U_{2},Y_{2}|U_{0}}(U_{2};Y_{2}|U_{0})})^{-1}+1)+1\bigr{)}^{2}.
For (6), if the event in (6) does not occur, by Proposition 6,
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C. T. Li and A. El Gamal, “Strong functional representation lemma and applications to coding theorems,” IEEE Transactions on Information Theory , vol. 64, no. 11, pp. 6967–6978, Nov 2018.
- 2[2] S. I. Gel’fand and M. S. Pinsker, “Coding for channel with random parameters,” Probl. Contr. and Inf. Theory , vol. 9, no. 1, pp. 19–31, 1980.
- 3[3] S. Verdú, “Non-asymptotic achievability bounds in multiuser information theory,” in Communication, Control, and Computing (Allerton), 2012 50th Annual Allerton Conference on , Oct 2012, pp. 1–8.
- 4[4] J. Liu, P. Cuff, and S. Verdú, “One-shot mutual covering lemma and Marton’s inner bound with a common message,” in 2015 IEEE International Symposium on Information Theory (ISIT) , June 2015, pp. 1457–1461.
- 5[5] M. H. Yassaee, M. R. Aref, and A. Gohari, “A technique for deriving one-shot achievability results in network information theory,” in 2013 IEEE International Symposium on Information Theory , July 2013, pp. 1287–1291.
- 6[6] E. C. Song, P. Cuff, and H. V. Poor, “The likelihood encoder for lossy compression,” IEEE Transactions on Information Theory , vol. 62, no. 4, pp. 1836–1849, 2016.
- 7[7] M. H. Yassaee, M. R. Aref, and A. Gohari, “Non-asymptotic output statistics of random binning and its applications,” in 2013 IEEE International Symposium on Information Theory , July 2013, pp. 1849–1853.
- 8[8] A. D. Wyner and J. Ziv, “The rate-distortion function for source coding with side information at the decoder,” IEEE Transactions on Information Theory , vol. 22, no. 1, pp. 1–10, January 1976.
