Coding for Arbitrarily Varying Remote Sources
Amitalok J. Budkuley, Bikash Kumar Dey, Vinod M. Prabhakaran

TL;DR
This paper investigates the limits of lossy source coding over an arbitrarily varying channel controlled by an adversary, providing bounds and exact characterizations of the rate distortion function in certain cases.
Contribution
It introduces bounds and exact solutions for the adversarial rate distortion problem in remote source coding with side information.
Findings
Derived upper and lower bounds on the adversarial rate distortion function.
Identified special cases where bounds coincide, giving exact rate distortion characterizations.
Abstract
We study a lossy source coding problem for a memoryless remote source. The source data is broadcast over an arbitrarily varying channel (AVC) controlled by an adversary. One output of the AVC is received as input at the encoder, and another output is received as side information at the decoder. The adversary is assumed to know the source data non-causally, and can employ randomized jamming strategies arbitrarily correlated to the source data. The decoder reconstructs the source data from the encoded message and the side information. We prove upper and lower bounds on the adversarial rate distortion function for the source under randomized coding. Furthermore, we present some interesting special cases of our general setup where the above bounds coincide, and thus, provide their complete rate distortion function characterization.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Coding for Arbitrarily Varying Remote Sources
Amitalok J. Budkuley2 , Bikash Kumar Dey and Vinod M. Prabhakaran 2This work was done when Amitalok J. Budkuley was with the Dept. of Electrical Engineering at IIT Bombay, Mumbai-India.
Emails: [email protected], [email protected], [email protected]
Abstract
We study a lossy source coding problem for a memoryless remote source. The source data is broadcast over an arbitrarily varying channel (AVC) controlled by an adversary. One output of the AVC is received as input at the encoder, and another output is received as side information at the decoder. The adversary is assumed to know the source data non-causally, and can employ randomized jamming strategies arbitrarily correlated to the source data. The decoder reconstructs the source data from the encoded message and the side information. We prove upper and lower bounds on the adversarial rate distortion function for the source under randomized coding. Furthermore, we present some interesting special cases of our general setup where the above bounds coincide, and thus, provide their complete rate distortion function characterization.
I Introduction
Consider the communication scenario depicted in Fig. 1.
A memoryless source outputs an independent and identically distributed (i.i.d.) data vector , which is broadcast over a memoryless channel . Apart from the source, this channel has an input from an adversary, and it has two outputs and . The output is fed to the source encoder which encodes it into a message . The decoder receives the other output and the message , and wants to reconstruct under an average distortion criterion. The adversary knows non-causally, and is allowed to employ randomized vector jamming strategies arbitrarily correlated with it, thereby inducing an arbitrarily varying channel (AVC) [1]. As is common in AVC-related channel coding problems, we first undertake a study of this setup under randomized coding in this paper, where we assume that the encoder-decoder share an unbounded amount of randomness , unknown to the adversary [1]. We prove a maximin lower bound and a minimax upper bound for the rate distortion function for this arbitrarily varying remote source under randomized coding.
In standard source coding scenarios involving noisy observations (e.g. noisy source coding [2] or source coding with side information [3]), the noise statistics are known a priori. In our setup, however, the jamming signal of the malicious adversary renders these statistics completely arbitrary and unknown, thereby making its analysis considerably more challenging. Furthermore, as depicted in Fig. 1, the jamming noise controls the observations as well as . Thus, the adversary in our problem can jointly degrade the compression as well as the decoding/estimation phases of communication.
Lossy source coding has been studied extensively since the seminal work by Shannon [4], and the field has subsequently been advanced in many directions (cf. [5, 6]). Apart from noisy source coding [2] and source coding with side information [3], some of the other prominent directions related to this work include source coding under several distortion measures [7] and universal source coding [8]. Particularly relevant are the compound and universal coding problem formulations which have appeared for classical coding, noisy/indirect coding, coding under several distortion measures, and coding with side information (cf. [9, 7, 10], and some of the references therein). Our problem also has a direct connection to universal noisy source coding problems which present a wider set of challenges (cf. [7]) compared to their noise-free counterparts. Another closely related model is that of an arbitrarily varying source (AVS) introduced in [11]. This model is further studied under variable rate codes in [12]. Inspired by an adversary capable of switching among different sources, Berger [13] introduced a different AVS. In his problem, a multiplexer with inputs from several memoryless sources with a common alphabet and a single output, feeds data to the encoder. The multiplexer is controlled by a strictly causal switching adversary. An extension of results under adversaries with causal as well as non-causal knowledge of the data has subsequently appeared in [14].
The rest of the paper is organized as follows. In Section II, we first introduce the notation and problem setup. We state our main result in Section III. The proof of our main result is presented in Section IV. Finally, we discuss some implications of our work, and make concluding remarks in Section V.
II Notation and Problem Setup
II-A Notation and Preliminaries
We denote random variables by upper case letters (e.g. ), the values they take by lower case letters (e.g. ) and their alphabets by calligraphic letters (e.g. ). We use boldface notation to denote random vectors (e.g. ) and their values (e.g. ). Here, the vectors are of length (e.g. ), where is the block length of operation. Let us also denote and as well as and . We use the (denoted by ) norm for vectors. For a set , let be the set of all probability distributions on . Similarly, let be the set of all conditional distributions of a random variable with alphabet conditioned on another random variable with alphabet . For two random variables and , we denote the marginal distribution of obtained from the joint distribution by . Distributions corresponding to strategies adopted by the adversary are denoted by instead of for clarity. The set of all conditional distributions is specifically denoted by . In cases where the subscripts are clear from the context, we sometimes omit them to keep the notation simple.
Deterministic functions will be denoted in lowercase (e.g. ). We denote a type of by . Given sequences , , we denote by the type of , by the joint type of and by the conditional type of given . For , the set of -typical set of sequences for a distribution is In addition, for a joint distribution and , the conditionally typical set of sequences, conditioned on , is defined as
II-B The Problem Setup
Refer the communication setup depicted in Fig. 1. Let , , , and denote finite sets. Consider an i.i.d. source with distribution and alphabet . We assume without loss of generality that , . A length- block of data is sent over a noisy AVC. This channel has two inputs and and two outputs and , and its behaviour is given by the memoryless distribution . In Fig. 1, the two inputs and are from the source and the jamming adversary respectively. The output is available at the encoder and is available at the decoder. We assume that the adversary knows non-causally. Given inputs and , we observe and with probability given by
[TABLE]
The encoder compresses and transmits a message losslessly to the decoder. Using and the available side information , the decoder outputs an estimate . The quality of the estimate is measured in terms of the average per-letter distortion where denotes a single-letter distortion measure with .
An deterministic code of block length and rate is a pair of mappings, consisting of the encoder map and the decoder map The encoder sends the message to the decoder over an error free channel. An randomized code of block length and rate is a random variable which takes values in the set of deterministic codes. We denote by the encoder and decoder for this randomized code. This forms the shared randomness . The message sent is . For this randomized code, the average distortion is given by
[TABLE]
where the expectation is over the shared randomness , the source, the channel and the adversary’s jamming action. Given a target distortion , a rate is achievable if for any there exists an such that for every there exists an randomized code with the resulting average distortion . We define the rate distortion function as the infimum of all achievable rates. Our aim is to determine the rate distortion function .
III The Main Result
Recall that denotes the set of all conditional distributions of given . For any distribution , the system model gives the single-letter joint distribution . Let
[TABLE]
Here is the minimax average distortion when both and are available at the decoder, while is the minimax distortion when the decoder has access to only the side information (Please see discussion in Sec. IV-A). Let be an auxiliary random variable with a finite alphabet and conditional distribution , such that forms a Markov chain. The joint distribution of is then given by . We now define the following:
[TABLE]
where the minimization is over and such that . Clearly, we may restrict the cardinality of to which is the number of possible functions from to .
[TABLE]
where the minimization is over and such that for the specified . Here, we may restrict the cardinality of to ; this cardinality bound follows in a manner similar to [3]. We next state our main result.
Theorem 1**.**
The adversarial rate distortion function for the arbitrarily varying remote source problem in Fig. 1 under randomized coding satisfies
[TABLE]
Our setup can be considered to be an “arbitrarily varying remote” version of the Wyner-Ziv setup [3], where both the input to the encoder as well as the side-information are corrupted by the adversary. The setup gives two interesting special cases by limiting the adversary’s control to either one of these (i.e., or ). If the adversary controls only , i.e., , then the order of maximum and minimum can be interchanged. This is a consequence of the convexity-concavity properties of . Specifically, is concave in and convex111In order to have a convex domain, we need to rewrite the minimization as a minimization only over where the alphabet of is the set of Shannon strategies at the decoder; see [15] for details. in . We can now use the Minimax theorem [16] to conclude that the minimax and the maximin are equal. Similarly, if the adversary controls only , that is, when only the side-information is arbitrarily varying (), then again one can show that is convex and concave in and respectively. Hence, the maximum and minimum can be interchanged. In both these special cases, the upper bound and the lower bound in Theorem 1 match, and they give a characterization of the optimum rate.
IV Proof of Theorem 1
IV-A Achievability
We present an outline of the achievability proof. The detailed proof can be found in Appendix A. Observe that if , then we can estimate using an estimator based solely on the side information . Thus, for , .
Let us now assume that . We fix an arbitrary and , and prove the achievability of the rate
[TABLE]
where the equality follows from the Markov chain . We rewrite this rate as 222Here we indicate as a function of only as is fixed in our discussion of achievability. For the same reason, we indicate only as a function of , as , and are fixed in our discussion.
[TABLE]
where we have . Note that is a function of . Here, we write to mean that . We have used a function such that for , and if . The existence of such a function follows from the uniform continuity of as a function of for fixed . Now interchanging the maximizations, we get
[TABLE]
Now for every type , we define
[TABLE]
Code Construction:
- •
We will now describe the generation of a random code. We assume that both the encoder and decoder share the ensemble of all possible such codes, and they jointly select a code at random from this ensemble using their shared randomness . This is equivalent to generating the code randomly and then sharing it between the encoder and the decoder.
- •
For each type , we generate vectors i.i.d. , where , to form the codebook . The codebook is randomly partitioned into bins.
- •
The randomly generated code containing the list of binned codebooks for each is shared between the encoder and the decoder.
Encoder operations:
- •
The encoder, upon observing a vector , computes its type . It checks if there is at least one codeword in which is jointly typical with with respect to (w.r.t.) the joint distribution . The encoder then sends and the bin index of such a codeword in , selecting one uniformly at random if there is more than one possibility.
- •
Since there are at most a polynomial number of types, for large enough , the rate required to convey is at most . So, the rate of the full message is bounded as
[TABLE]
Decoder operations:
- •
The decoder knows and the bin index sent by the encoder; it also knows as the side information. The decoder identifies the set of conditional types
[TABLE]
such that the resulting -marginal distribution is close to
- •
The decoder then checks within the bin if there is a codeword such that is jointly typical w.r.t. the distribution for some type . If there is a unique such codeword , then it chooses that codeword, otherwise it chooses an arbitrary codeword from the bin. Using this codeword and , it then outputs , where , .
Average distortion analysis:
- •
We first analyse the error probability in decoding the right codeword . A decoding error can occur due to three possibilities:
Encoder does not find any codeword that is jointly typical with w.r.t. . The probability that there is no such codeword in is exponentially small (by covering lemma) since . 2. 2.
Let us assume that the encoder succeeded in finding a suitable codeword . For this correct codeword and the actual conditional type instantiated by the adversary, we will argue that will satisfy the decoding condition with high probability (w.h.p.) 333All our w.h.p. statements hold under “except for an exponentially small probability.”. First, w.h.p. is typical w.r.t. , i.e., is “close” to . In that case, is one of the conditional types considered by the decoder for the code associated with . Secondly, w.h.p. is jointly typical w.r.t. and so it is also jointly typical w.r.t. the distribution (though with a bigger slack). Now, using a version of the refined Markov lemma [17, Lemma 5], it follows that w.h.p., is jointly typical w.r.t. . In particular, is jointly typical w.r.t. . 3. 3.
Now, let us consider all the wrong codewords in the bin. For any type , the probability that at least one of the wrong codewords will be jointly typical with w.r.t. is exponentially small due to the choice of (by packing lemma). By taking union bound over all (at most polynomial number of) types in , the probability that any of them will be jointly typical with w.r.t. for any such is exponentially small.
- •
We now note that if is jointly typical w.r.t. , then is jointly typical w.r.t. , and thus, is jointly typical. Finally, the average distortion is bounded using the typical average lemma.
Remark 2**.**
We have taken the code and binning rates (see (11) and (14)) such that their difference is more than the term in (10). A crucial feature of our achievability scheme is the choice of in (14), which motivated the expression of as in (10). We now explain the insight behind this choice of . It is worth noting that instead of taking the rate as the minimum value of over all such that , we have taken it to be the minimum over all such that is “close” to . Firstly, a part of our proof relies on bounding the probability of error by union bounding over the (polynomial number of) conditional types (in ) that the decoder considers. For the union bound to work, the decoder cannot consider every conditional distribution , which gives the right , as the number of such distributions can be infinite. Secondly, specially since the decoder only searches over the conditional types and not every conditional distribution , it may not find any conditional type that gives exactly as the marginal on . Thirdly, our proof argument relies on the fact that the instantiated conditional type is considered by the decoder, i.e., it is in . However is only guaranteed (w.h.p.) to be close to , and this is the reason for defining and with a slack in the resulting marginal on .
IV-B The proof of the lower bound
We will prove now that any achievable rate is lower bounded by the maximin lower bound in (4). We consider . Consider an randomized code which achieves an average distortion of , i.e., the code is such that
[TABLE]
under any jamming distribution . In particular, it satisfies the distortion constraint under the i.i.d. jamming distribution with
[TABLE]
Under this jamming distribution, , , form an i.i.d. sequence with joint distribution given by , where , . Let us define
[TABLE]
where the minimization is over , such that under the given . It then follows using a similar argument as in the converse for the Wyner-Ziv problem [3] that (see Appendix C for details)
[TABLE]
Hence, by the continuity of in (Lemma 18 in Appendix C),
[TABLE]
Since this is true for any , we have the lower bound.
V Conclusion
In this paper, we studied a setup of lossy source coding for an arbitrarily varying remote source with side-information. As a natural first step, we gave upper and lower bounds for the rate-distortion function for the randomized coding setup. The proof of achievability employed novel techniques. We also presented interesting special cases of our setup, and completely characterized their rate distortion function. The deterministic coding version is open and is under current investigation.
Acknowledgment
This work was supported in part by Bharti Centre for Communication, IIT Bombay and in part by Information Technology Research Academy (ITRA), Government of India under grant ITRA/15(64)/Mobile/USEAADWN/01. In addition, Amitalok J. Budkuley, Bikash K. Dey and Vinod M. Prabhakaran were supported in part by RGC’s GRF grants 14208315 and 14313116, the Department of Science & Technology, Government of India under a grant SB/S3/EECE/057/2013, and the Ramanujan Fellowship respectively.
Appendix A Proof of Achievability
In this detailed proof of achievability, we begin with the description of our randomized coding scheme.
Code Construction:
- •
As discussed in the outline, the random code is a list of individual codes for every type . This list of codes is shared as the common randomness between the encoder-decoder.
- •
For a fixed type , our code is a binned codebook comprising vectors , where and . Here and are as given in (11) and (14) respectively, and . Every codeword is chosen i.i.d. , where . There are bins indexed by , with each bin containing codewords indexed by . Let denote the bin with index . Thus, our code is the list containing .
Encoding:
- •
Given input , the encoder determines its type to identify . In , it finds a codeword , where and , such that
[TABLE]
Here is a fixed constant (the choice of is indicated in Lemma 4)444Here is a function of , such that as and it is such that (A) holds.. This implies that and are jointly typical according to the distribution . If no such is found, then the encoder chooses . If more than one satisfying (18) exist, then the encoder chooses one uniformly at random from amongst them. Let denote the chosen codeword.
- •
The encoder transmits and the bin index losslessly to the decoder.
Decoding:
- •
Let the bin index and side information be received at the decoder. In addition, the decoder knows the type of the encoder’s input , and so the code used by the encoder.
- •
For some fixed parameter (the choice of is indicated in Lemma 5), the decoder determines the set of codewords
[TABLE]
Here .
- •
If contains exactly one codeword, then the decoder chooses it. Otherwise it chooses . Let the chosen codeword be .
- •
The decoder outputs , where .
Average distortion analysis:
We first analyse the error in decoding the codeword chosen by the encoder. The decoder makes an error if one or more of the following events occur.
[TABLE]
Then, using the union bound we can express the probability of decoding error by
[TABLE]
We will show that for every there exists small enough such that as . We first make the following obvious claim.
Claim 3**.**
Let be generated i.i.d. . Then, with probability at least , .
Let us define this “good” event as . We now state the following lemma which guarantees that the first term in (20) is vanishingly small.
Lemma 4**.**
Under the event , there exist , where as , such that the encoder finds a codeword with probability at least such that .
The proof of this lemma follows from the covering lemma [18, Lemma 3.3]. Note that this lemma specifies the parameter which appears in the definition of the encoder in (18). This lemma implies as . Our next lemma addresses the remaining two terms in the RHS of (20).
Lemma 5**.**
Let the codeword chosen be (where ) and let the output on the channel be . Then,
- (a)
there exists , where as , such that except for an exponentially small probability, . 2. (b)
there exists , where as , such that
[TABLE]
The proof of this lemma can be found in Appendix B. This lemma specifies the parameter which appears in the the decoder operation in (19). Lemma 5 implies that as . Hence, we can conclude that as .
We now get a bound on the average distortion. Toward this, we first make the following claim.
Claim 6**.**
There exists , where as , such that .
Proof:
By Claim 15 in App. B, with high probability, is -typical according to the joint distribution . As is a deterministic function of , it follows by the conditional typicality lemma (see Lemma 9 in Appendix B) that with probability at least , the tuple is -typical, and hence is -typical, where . This completes the proof. ∎
We now show that the average distortion for the code can be made arbitrarily close to . Let . From Claim 6, we know that as . Then,
[TABLE]
Recall that . In addition, from the typical average lemma we know that , where and as . Thus,
[TABLE]
As as , we choose a large enough and a small enough to get . This implies that the average distortion can be made arbitrarily close to . We have, thus, shown that for any , the rate is achievable. This completes the proof of achievability.
Appendix B Proof of Lemma 5
Let us define . Consider the “good” encoder event . We now state and prove some necessary claims.
Claim 7**.**
Let be generated i.i.d. . Then, with probability at least , .
Let us define this “good” event as .
Claim 8**.**
Let be a pair of vectors where . Then, .
Let us denote the event that is jointly typical w.r.t. as . By the above claim, we have .
Lemma 9** (Conditional typicality lemma).**
Let and be generated from using the memoryless distribution . Then,
[TABLE]
Proof:
We need to show that
[TABLE]
is exponentially small for all . We consider two cases.
Case I: . As , this implies that Then, ,
[TABLE]
Thus, for such , .
Case II: . Using Chernoff-Hoeffding’s theorem [19, Theorem 1] for each , we have
[TABLE]
Now, it can be easily checked that and together imply
[TABLE]
Hence, (22) follows by taking union bound over all . ∎
Claim 10**.**
With probability at least , are jointly -typical according to the distribution .
The proof of this result follows from Lemma 9. We now consider this “good” event , where .
Claim 11**.**
Under the event , is -typical w.r.t. , where as . That is, .
The proof is straightforward, and hence, omitted. The above claim implies that, except for an exponentially small probability, the decoder considers the conditional type for decoding.
Claim 12**.**
Under and , are jointly -typical according to the distribution , where and as .
Proof:
Note that
[TABLE]
where . ∎
Claim 13**.**
There exists , where as , such that ,
[TABLE]
where is computed with the distribution .
Proof:
We have two cases.
Case 1: When . Then we note that
[TABLE]
where as . Since , and , using [11, Lemma 2.7], we get
[TABLE]
Together, the above two equations imply
[TABLE]
By defining , we get
[TABLE]
Case II: When . For such a , the encoder outputs it only if and there is no codeword which is jointly typical with w.r.t. . Thus,
[TABLE]
where .
Combining the two cases, and taking , the lemma follows. ∎
Lemma 14** **(Refined Markov Lemma [17] 555In the refined
Markov lemma presented in [17], condition (b) also has a lower bound on . However, the lower bound is not used in the proof given in [17], and hence, can be removed. Here, we state this lemma without any lower bound. We also note that condition (a) and the upper bound on the probability of a typical sequence imply that probability of too many typical sequences can not be too small; and so some essence of the lower bound in condition (b) is already implied by these. Thus, it is not surprising that the lower bound is not needed for the lemma to hold. ).
Suppose is a Markov chain, i.e., . Let and be such that
- (a)
, where , 2. (b)
for every ,
[TABLE]
for some , where as .
Then, there exists , where as , such that
[TABLE]
*Here and does not depend on , , or but does depend on , and . Further, the function does not depend on , or . *
We now use the above lemma to prove the following claim.
Claim 15**.**
There exists , where as , such that except for a small probability, is jointly -typical w.r.t. .
Proof:
Let us assume that is true. Now we use the refined Markov lemma (Lemma 14) on the Markov chain . Then, by Claims 12 and 13, is chosen such that both conditions (a) and (b) in Lemma 14 are satisfied. Thus, the claim follows. ∎
We define this “good” event as .
Claim 16**.**
There exists , where as , such that except for an exponentially small probability, .
Proof:
Consider the event . Under this event, are -typical w.r.t. , where . Thus, the claim follows from Claim 15. ∎
This completes the proof of the first part of the lemma. The proof of the second part directly follows from the following claim.
Claim 17**.**
There exists , where as , such that
[TABLE]
Proof:
Note that the codewords are independently generated, and hence, and are independent. Consider a fixed conditional type , and let the resulting distribution . Then,
[TABLE]
for some as . This follows from the packing lemma [18, Lemma 3.1]. By taking the union bound over all conditional types (the number of such types is at most polynomial in ), we get
[TABLE]
∎
This completes the proof of the lemma.
Appendix C
Here we prove (17). We first state the following useful lemma.
Lemma 18**.**
For a fixed , is a non-decreasing, convex and continuous function of .
Proof:
The proof of Lemma 18 can be given using similar arguments as in the proof of the same statement about the rate-distortion function for the Wyner-Ziv problem [3]. We provide the proof below for completeness.
To prove that is a non-increasing function of , note that the minimization in the definition of is over the set }. So, for , the corresponding domains of minimization satisfy . Thus .
To prove the convexity of as a function of , note that the minimization over and can be rewritten as a minimization over only in a manner similar to [15]. Then, the alphabet of the auxiliary random variable is the set of ‘Shannon strategies’. To establish the convexity, we will first show that for given and for fixed and , is convex in . Toward this, consider the joint distribution . For fixed , we know that is a convex function of [20]. Now , where the last equality follows from the Markov chain . As , and are fixed, it follows that is fixed, and hence, is a convex function of .
Now consider two distortion values , such that and achieve the values and respectively. Let us define the convex combinations and . Note that the other factors of the joint distribution are fixed here. So the average distortion is linear in , and thus, is a feasible distribution for . Thus
[TABLE]
Here follows from the convexity of w.r.t. . This proves the convexity of w.r.t. .
Finally, the continuity of follows from its convexity [21]. ∎
In the following, for the given code and the i.i.d. jamming distribution , we denote . Now we have
[TABLE]
Under the memoryless jamming strategy of the adversary given in (15), and are independent, conditioned on . This gives us . By defining , we have . For , note first that from (15), it follows that . The minimization in (c) can hence be taken over pairs such that under . We get from the fact that the distortion constraint in the minimization being linear in allows us to replace it with a function of as in the definition of in (16). Finally, and follow respectively from the convexity and non-decreasing nature of (from Lemma 18) where we note that . This establishes (17).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Lapidoth and P. Narayan, “Reliable communication under channel uncertainty,” IEEE Trans. Inform. Theory , vol. 44, pp. 2148–2177, 1998.
- 2[2] R. Dobrushin and B. Tsybakov, “Information transmission with additional noise,” IRE Trans. Inform. Theory , vol. 8, pp. 293–304, September 1962.
- 3[3] A. D. Wyner and J. Ziv, “The rate-distortion function for source coding with side information at the decoder,” IEEE Trans. Inform. Theory , vol. 22, pp. 1–10, January 1976.
- 4[4] C. E. Shannon, “Coding theorems for a discrete source with a fidelity criterion,” IRE Nat. Conv. Rec. , vol. 7, pp. 142–163, 1959.
- 5[5] J. C. Kieffer, “A survey of the theory of source coding with a fidelity criterion,” IEEE Trans. Inform. Theory , vol. 39, pp. 1473–1490, September 1993.
- 6[6] T. Berger and J. D. Gibson, “Lossy source coding,” IEEE Trans. Inform. Theory , vol. 44, pp. 2693–2723, October 1998.
- 7[7] A. Dembo and T. Weissman, “The minimax distortion redundancy in noisy source coding,” IEEE Trans. Inform. Theory , vol. 49, pp. 3020–3030, November 2003.
- 8[8] D. L. Neuhoff, R. M. Gray, and L. D. Davisson, “Fixed rate universal block source coding with a fidelity criterion,” IEEE Trans. Inform. Theory , vol. 21, pp. 511–523, September 1975.
