Exponential Source/Channel Duality
Sergey Tridenski, Ram Zamir

TL;DR
This paper establishes a duality between source and channel coding in the exponential regime, deriving exact exponents for various coding scenarios and showing their interrelations through a unified framework.
Contribution
It introduces a novel source/channel duality in the exponential regime and derives exact exponents for multiple coding schemes, unifying source and channel coding error analysis.
Findings
Exact exponents for lossy source coding success and failure.
Unified derivation of channel decoding error exponents from source coding exponents.
Validation of Forney's bounds and derivation of the optimal erasure/list decoding exponent.
Abstract
We propose a source/channel duality in the exponential regime, where success/failure in source coding parallels error/correctness in channel coding, and a distortion constraint becomes a log-likelihood ratio (LLR) threshold. We establish this duality by first deriving exact exponents for lossy coding of a memoryless source P, at distortion D, for a general i.i.d. codebook distribution Q, for both encoding success (R < R(P,Q,D)) and failure (R > R(P,Q,D)). We then turn to maximum likelihood (ML) decoding over a memoryless channel P with an i.i.d. input Q, and show that if we substitute P=QP, Q=Q, and D=0 under the LLR distortion measure, then the exact exponents for decoding-error (R < I(Q, P)) and strict correct-decoding (R > I(Q, P)) follow as special cases of the exponents for source encoding success/failure, respectively. Moreover, by letting the threshold D take general values, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Exponential Source/Channel Duality
Sergey Tridenski
EE - Systems Department
Tel Aviv University, Israel
Email: [email protected]
Ram Zamir
EE - Systems Department
Tel Aviv University, Israel
Email: [email protected]
Abstract
We propose a source/channel duality in the exponential regime, where success/failure in source coding parallels error/correctness in channel coding, and a distortion constraint becomes a log-likelihood ratio (LLR) threshold. We establish this duality by first deriving exact exponents for lossy coding of a memoryless source , at distortion , for a general i.i.d. codebook distribution , for both encoding success () and failure (). We then turn to maximum likelihood (ML) decoding over a memoryless channel with an i.i.d. input , and show that if we substitute , , and under the LLR distortion measure, then the exact exponents for decoding-error () and strict correct-decoding () follow as special cases of the exponents for source encoding success/failure, respectively. Moreover, by letting the threshold take general values, the exact random-coding exponents for erasure () and list decoding () under the simplified Forney decoder are obtained. Finally, we derive the exact random-coding exponent for Forney’s optimum tradeoff erasure/list decoder, and show that at the erasure regime it coincides with Forney’s lower bound and with the simplified decoder exponent. 111This research was supported in part by the Israel Academy of Science, grant # 676/15.
Index Terms:
erasure/list decoding, random coding exponents, correct-decoding exponent, source coding exponents.
I Introduction
The aim of our study is to define an analogy between lossy source coding and coding through a noisy channel, allowing to translate the terms and results between the two branches. We consider an analogy, in which encoding for sources corresponds to decoding for channels, encoding success translates to decoding error, and a source translates to a channel input-output pair. Channel coding, in the random coding setting with a fixed generating distribution , emerges as a special case of lossy source coding. Although other analogies may be possible, the proposed analogy requires minimal generalization on the source coding part. Generalization of the channel decoder, on the other hand, leads to a broader correspondence between the two branches, such that correct-decoding event for channels (which becomes a rare event for a sufficiently large codebook) translates to encoding failure for sources.
In other works on the source/channel duality, the rate-distortion function of a DMS
[TABLE]
and the capacity of a DMC
[TABLE]
are related directly via introduction of a cost constraint on the channel input [1], [2], [3], or using covering/packing duality [4]. In order to look at the similarities and the differences of the expressions for the RDF and capacity more closely, let us rewrite them in a unified fashion, with the help of the Kullback-Leibler divergence , as follows:
[TABLE]
[TABLE]
where in the case of the capacity we use a particular distortion measure, defined by the LLR as
[TABLE]
Note, that the distortion constraint in the capacity case (2) is . More concisely, the expressions (1) and (2) may be written with the help of the function , defined as the inner in (1), [5], which represents the rate in lossy coding of a source using an i.i.d. codebook under a distortion constraint :
[TABLE]
The expression for the capacity (2), or (5), follows by our new results, and it can also be shown directly, by the method of Lagrange multipliers, that, for the distortion measure (3), , where is the mutual information. Obviously, and are two mathematically different problems and it is difficult to relate between them. On the other hand, for a given , i.e. before minimization/maximization over , the expression for channels in (2) is just a special case of the expression for sources in (1) or (4). Therefore, in this work we focus on the source/channel analogy for a given . In the rest of the paper, plays the role of a generating distribution of an i.i.d. random codebook.
We extend the described analogy to the framework of random coding exponents. In our terminology, the source encoding failure is the same as the source coding error defined in [6]. We derive asymptotically-exact exponents of source encoding failure and success, which are similar in form to the best lossy source coding exponents given in [6] and [7, p. 158], respectively, but correspond to i.i.d. random code ensembles generated according to an arbitrary (not necessarily optimal) . Such ensembles prove to be useful for adaptive source and channel coding [5], [8].
Next, we modify the ML channel decoder with a threshold . The resulting decoder can be identified as a simplified erasure/list decoder [9, eq. 11a], suggested by Forney as an approximation to his optimum tradeoff decoder [9, eq. 11]. Exact random coding exponents for the simplified decoder, via the source/channel analogy, then become corollaries of our source coding exponents, where Gallager’s random coding error exponent of the ML decoder is obtained as a special case for .
The fixed composition version of the random coding error exponent for the simplified decoder was derived recently in [10]. In comparison, the i.i.d. random coding error exponent, derived here, can be expressed similarly to Forney’s random coding bound for the optimum tradeoff decoder [9, eq. 24], and therefore can be easily compared to it. We show that the exact i.i.d. random coding exponent of the simplified decoder coincides with Forney’s lower bound on the random coding exponent [9, eq. 24] for . It follows, that Forney’s lower bound is tight for random codes, also with respect to the optimum tradeoff decoder, for .
Finally, we derive an exact random coding exponent for Forney’s optimum tradeoff decoder [9, eq. 11] for all values of the threshold . The resulting expression is also similar to the original Forney’s random coding bound [9, eq. 24] and we show directly that they coincide for the threshold . This proves a conjecture in [11], and also extends/improves the results in [12], [11] for list decoding.
In what follows, we present our results for sources, then translate them to the results for channels. We discuss briefly the relation of the new channel exponents to the error/correct exponents of the ML decoder. Finally, we present our result for Forney’s optimum tradeoff decoder. In the remainder of the paper, a selected proof is given. The rest of the proofs can be found in [13].
II Exponents for sources
We assume that the source alphabet and the reproduction alphabet are finite. Assume also an additive distortion measure, of arbitrary sign, , such that the distortion in a source-reproduction pair of length , is given by . For an arbitrary distortion constraint and a distribution over , let us define a function
[TABLE]
where . If the set is empty, then . For brevity, we define also the following function
[TABLE]
Consider a reproduction codebook of random codewords of length , generated i.i.d. according to the distribution . Let X be a random source sequence of length from the DMS . Let us define encoding success as an event
[TABLE]
Then, our results for encoding success exponent can be formulated as follows:
Definition 1 (Implicit formula):
[TABLE]
Note that this expression is zero for .
Theorem 1** (Explicit formula)**
[TABLE]
Theorem 2** (Encoding success exponent)**
[TABLE]
except possibly for , when the right-hand side is a lower bound.**
Let us define encoding failure as a complementary event . Then, our results for encoding failure exponent can be formulated as follows:
Definition 2 (Implicit formula):
[TABLE]
Note that this expression is zero for and is considered if .
We give an explicit formula, which does not always coincide with the implicit formula (12) for all , but gives the best convex () lower bound for (12) as a function of , for sufficiently lax distortion constraint :
Theorem 3** (Explicit formula)**
For distortion constraint ,**
[TABLE]
For , the right-hand side expression gives zero, which is strictly lower than , if .**
Note that the above explicit formula is similar to the lower bound on the failure exponent given in [3] (except here it is without maximization over and pertains to the random code ensemble of distribution ). Our result is also more specific about the relationship (convex envelope) between the lower bound and the true exponent, given by (12) according to
Theorem 4** (Encoding failure exponent)**
[TABLE]
with the possible exception of points of discontinuity of the function .**
III Exponents for channels
We assume a DMC with finite input and output alphabets and . For simplicity, we assume also that for any the channel probability . Consider a codebook of random codewords of length , generated i.i.d. according to a distribution over . Without loss of generality, assume that message is transmitted. Let Y be a response, of length , of the DMC to the input . Let us define decoding error as an event:
[TABLE]
corresponding to a simplified erasure/list decoder [9, eq. 11a]. Observe, from comparison of the events (8) and (15), that the latter can be seen as a special case of the former. In (15), the channel input-output pair pays a role analogous to the source sequence in (8), and the incorrect codeword plays a role analogous to the reproduction sequence in (8). In the proposed analogy, the reproduction alphabet is the alphabet of incorrect codewords, which is , and the alphabet of the source is the product alphabet of the channel input-output pair . We make the following substitutions:
[TABLE]
Definition (7) now acquires a specific form
[TABLE]
Note, that the minimal distortion now depends on the support of the distribution :
[TABLE]
The results for decoding error follow as simple corollaries of the results for encoding success. The definition of the implicit expression for decoding error exponent parallels Definition 1:
[TABLE]
where is defined with , as in (2). Note, that is zero for .
Corollary 1** (Explicit formula)**
[TABLE]
Corollary 2** (Decoding error exponent)**
[TABLE]
except possibly for , given by (21), when the right-hand side is a lower bound.**
The best random coding exponent is given by
Theorem 5** (Maximal decoding error exponent)**
[TABLE]
for all .**
This result can be contrasted with the fixed composition exponent [10, eq. 29], and (together with the explicit form (23)) can be easily compared with Forney’s random coding bound [9, eq. 24].
Similarly, the results for the correct decoding event follow as simple corollaries of the results for encoding failure. The definition of the implicit expression for correct decoding exponent parallels Definition 2:
[TABLE]
The superscript ∗ serves to indicate that this exponent is different from the correct decoding exponent of the ML decoder, for , as here the receiver declares an error also when there is only equality in (15), i.e. no tie-breaking. This distinction is important in the case of the correct-decoding exponent, but not in the case of the decoding error exponent.
The following explicit formula gives the best convex () lower bound for (26) as a function of , for nonnegative distortion constraint :
Corollary 3** (Explicit formula)**
For distortion constraint ,**
[TABLE]
For , the right-hand side expression gives zero, which is strictly lower than , if .**
Corollary 4** (Correct decoding exponent)**
[TABLE]
with the possible exception of points of discontinuity of the function .**
IV Relation to the exponents of the ML decoder
The maximum likelihood decoder has the same error exponent as the decoder (15) with . The Gallager exponent [14] is obtained from the explicit formula (23) with .
On the other hand, the correct-decoding exponent of the ML decoder is given by
[TABLE]
where the equality () is shown in [13], (31) is another implicit formula, which is more convenient to derive (as well as convenient for the derivation of (30)), and (32) is the fixed composition exponent of Dueck and Körner [15]. Note, that the correct-decoding exponent of the ML decoder (29) is different from the corresponding exponent of the decoder (15), (26) with . The difference is the result of the tie-breaking, the ML decoder performs. Without tie-breaking, the decoding can be termed as strict. The two nondecreasing curves, given by (29) and (26), with , both as a function of , coincide for slopes . Then, for greater , the correct decoding exponent of the ML decoder continues to increase linearly, with constant slope , while the exponent of the strict decoder (26), with , eventually becomes .
V Random coding error exponent of
the erasure/list optimum tradeoff decoder
In [9, eq. 11] the decoding error event, given that message is transmitted, is defined as
[TABLE]
which is different than the error event of the simplified decoder (15). We derive an exact i.i.d. random coding error exponent for this decoder, for all values of the threshold . The result is given by the minimum of two terms. One of the terms is given by (23) and corresponds to the error exponent of the source/channel duality decoder, and the other term is very similar, defined as
[TABLE]
Theorem 6** (Error exponent for optimum tradeoff decoder)**
[TABLE]
for all , with the possible exception of points with , and points with (for ), where is a finite set, where still**
[TABLE]
with and given explicitly by (23) and (34).**
For comparison, the random coding lower bound given by [9, eq. 24], can be written, without maximization over , in our present terms (with in place of and a differently defined , not scaled by ) as
[TABLE]
Observe, that indeed
[TABLE]
The next lemma shows, that the exact exponents (25), (35), and Forney’s lower bound, given by the maximum of (36) over , coincide for (erasure regime).
Lemma 1
For
[TABLE]
Proof:
[TABLE]
where () holds by Hölder’s inequality for and . We conclude, that
[TABLE]
∎
As for the case (list decoding regime), we note that the exponent (25) becomes for , and the exponent (35) becomes for , while Forney’s lower bound stays finite. For details see [13].
VI A selected proof
Here we derive a lower bound on the encoding failure exponent, which, together with an upper bound, derived in [13], results in Theorem 4. Due to the lack of space, all the other proofs are deferred to [13].
The derivation uses a generic auxiliary lemma:
Lemma 2
Let , . If , with , then**
[TABLE]
We use the method fo types [7], with notation , for types and type classes, and , for conditional types and type classes, respectively. We upper-bound the probability of encoding failure as follows:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Explanation of steps: () holds for sufficiently large , when
[TABLE]
with
[TABLE]
() holds for
[TABLE]
where
[TABLE]
() holds by Lemma 2 for
[TABLE]
() uses the upper bound on the probability of a type.
() Let denote the conditional distribution, achieving for some . This implies
[TABLE]
Let denote a quantized version of the conditional distribution with variable precision 1/\big{(}n{P\mathstrut}_{\bf x}(x)\big{)}, i.e. a set of types with denominators , such that the joint distribution is a type with denominator . Observe, that the differences between and do not exceed . Therefore, since the divergence, as a function of , has bounded derivatives, and also the distortion measure is bounded, for any there exists large enough, such that the quantized distribution satisfies
[TABLE]
The last inequality implies
[TABLE]
The relations (44), (43), (42) together give
[TABLE]
This explains (). () uses the definition
[TABLE]
() is bounded from below by defined in (12). We conclude from (38), (39), (40):
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Gupta and S. Verdú, “Operational duality between lossy compression and channel coding,” IEEE Trans. on Information Theory , vol. 57, no. 6, pp. 3171–3179, Jun. 2011.
- 2[2] S. Pradhan, J. Chou, and K. Ramchandran, “Duality between source coding and channel coding and its extension to the side information case,” IEEE Trans. on Information Theory , vol. 49, no. 5, pp. 1181–1203, May 2003.
- 3[3] R. Blahut, “Hypothesis testing and Information Theory,” IEEE Trans. on Information Theory , vol. 20, no. 4, pp. 405–417, Jul. 1974.
- 4[4] S. Jana and R. Blahut, “Insight into source/channel duality and more based on an intuitive covering/packing lemma,” in IEEE International Symposium on Information Theory , Jul. 9-14 2006, pp. 2329–2333.
- 5[5] R. Zamir and K. Rose, “Natural type selection in adaptive lossy compression,” IEEE Trans. on Information Theory , vol. 47, no. 1, pp. 99–111, Jan. 2001.
- 6[6] K. Marton, “Error exponent for source coding with a fidelity criterion,” IEEE Trans. on Information Theory , vol. 20, no. 2, pp. 197–199, Mar. 1974.
- 7[7] I. Csiszár and J. Körner, Information Theory: Coding Theorems for Discrete Memoryless Systems . Academic Press, 1981.
- 8[8] S. Tridenski and R. Zamir, “Stochastic interpretation for the Arimoto algorithm,” in IEEE Information Theory Workshop , Apr. 2015.
