Exponential Strong Converse for Successive Refinement with Causal Decoder Side Information
Lin Zhou, Alfred Hero

TL;DR
This paper establishes an exponential strong converse theorem for the $k$-user successive refinement problem with causal decoder side information, showing that outside the rate-distortion region, the error probability approaches one exponentially fast.
Contribution
It extends the strong converse theorem to the $k$-user case with causal side information, generalizing previous two-user results and unifying special cases.
Findings
Exponential decay of success probability outside the rate-distortion region.
Extension of the strong converse to the $k$-user successive refinement problem.
Corollary for the case of $k=1$, recovering the El Gamal and Weissman result.
Abstract
We consider the -user successive refinement problem with causal decoder side information and derive an exponential strong converse theorem. The rate-distortion region for the problem can be derived as a straightforward extension of the two-user case by Maor and Merhav (2008). We show that for any rate-distortion tuple outside the rate-distortion region of the -user successive refinement problem with causal decoder side information, the joint excess-distortion probability approaches one exponentially fast. Our proof follows by judiciously adapting the recently proposed strong converse technique by Oohama using the information spectrum method, the variational form of the rate-distortion region and H\"older's inequality. The lossy source coding problem with causal decoder side information considered by El Gamal and Weissman is a special case () of the current problem. Therefore,âŠ
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Exponential Strong Converse for Successive Refinement with Causal Decoder Side Information
Lin Zhou and Alfred Hero
Department of EECS
University of Michigan
Emails: [email protected]; [email protected]
Abstract
We consider the -user successive refinement problem with causal decoder side information and derive an exponential strong converse theorem. The rate-distortion region for the problem can be derived as a straightforward extension of the two-user case by Maor and Merhav (2008). We show that for any rate-distortion tuple outside the rate-distortion region of the -user successive refinement problem with causal decoder side information, the joint excess-distortion probability approaches one exponentially fast. Our proof follows by judiciously adapting the recently proposed strong converse technique by Oohama using the information spectrum method, the variational form of the rate-distortion region and Hölderâs inequality. The lossy source coding problem with causal decoder side information considered by El Gamal and Weissman is a special case () of the current problem. Therefore, the exponential strong converse theorem for the El Gamal and Weissman problem follows as a corollary of our result.
I Introduction
We consider the -user successive refinement problem with causal decoder side information shown in Figure 1, which we refer to as the -user causal successive refinement problem. The decoders aim to recover the source sequence based on the encoded symbols and causally available private side information sequences. Specifically, given the source sequence , each encoder where compresses into a codeword . At time , for each , the -th user aims to recover the -th source symbol using the codewords from encoders , the side information up to time and a decoding function , i.e., . Finally, at time , for all , the -th user outputs the source estimate which, under a distortion measure , is required to be less than or equal to a specified distortion level .
This problem was first considered by Maor and Merhav in [1] who fully characterized the rate-distortion region for the two-user version. Maor and Merhav showed that, unlike the case with non-causal side information [2], no special structure e.g., degradedness, is required between the side information and . However, Maor and Merhav only presented a weak converse in [1]. In this paper, we strengthen the result in [1] by providing an exponential strong converse theorem for the -user causal successive refinement problem, which states that the joint excess-distortion probability approaches one exponentially fast if the rate-distortion tuple falls outside the rate-distortion region.
I-A Related Works
We first briefly summarize existing works on the successive refinement problem. The successive refinement problem was first considered by Equitz and Cover [3] and by Koshelev [4] who considered necessary and sufficient conditions for a source-distortion triple to be successively refinable. Rimoldi [5] fully characterized the rate-distortion region of the successive refinement problem under the joint excess-distortion probability criterion while Kanlis and Narayan [6] derived the excess-distortion exponent in the same setting. The second-order asymptotic analysis of No and Weissman [7], which provides approximations to finite blocklength performance and implies strong converse theorems, was derived under the marginal excess-distortion probabilities criteria. This analysis was extended to the joint excess-distortion probability criterion by Zhou, Tan and Motani [8]. Other frameworks for successive refinement decoding include [9, 10, 11, 12].
The study of source coding with causal decoder side information was initiated by Weissman and El Gamal in [13] where they derived the rate-distortion function for the lossy source coding problem with causal side information at the decoders (i.e., , see also [14, Chapter 11.2]). Subsequently, Timo and Vellambi [15] characterized the rate-distortion regions of the Gu-Effros two-hop network [16] and the Gray-Wyner problem [17] with causal decoder side information; Maor and Merhav [18] derived the rate-distortion region for the successive refinement of the Heegard-Berger problem [19] with causal side information available at the decoders; Chia and Weissman [20] considered the cascade and triangular source coding problem with causal decoder side information. However, to the best of our knowledge, no strong converse theorems exist for these problems.
As the information spectrum method will be used in this paper to derive an exponential strong converse theorem for the causal successive refinement problem, we briefly summarize the previous applications of this method to network information theory problems. In [21, 22, 23], Oohama used this method to derive exponential strong converses for the lossless source coding problem with one-helper [24, 25] (i.e., the Wyner-Ahlswede-Körner (WAK) problem), the asymmetric broadcast channel problem [26], and the Wyner-Ziv problem [27] respectively. Furthermore, Oohamaâs information spectrum method was also used to derive exponential strong converse theorems for content identification with lossy recovery [28] by Zhou, Tan, Yu and Motani [29] and for Wynerâs common information problem under the total variation distance measure [30] by Yu and Tan [31].
I-B Main Contribution and Challenges
We consider the -user causal successive refinement problem and present an exponential strong converse theorem. For given rates and blocklength, define the joint excess-distortion probability as the probability that either decoder incurs a distortion level greater than the specified distortion level (see (3)) and define the non-excess-distortion probability as the probability that all decoders satisfy the specified distortion levels (see (24)). Our proof proceeds as follows. First, we derive a non-asymptotic converse (finite blocklength upper) bound on the non-excess-distortion probability of any code for the -user causal successive refinement problem using the information spectrum method. Subsequently, by using CramĂ©râs inequality and the variational formulation of the rate-distortion region, we show that the non-excess-distortion probability decays exponentially fast to zero as the blocklength tends to infinity if the rate-distortion tuple falls outside the rate-distortion region of the causal successive refinement problem.
As far as we are aware, this paper is the first to establish a strong converse theorem for any lossy source coding problem with causal decoder side information. Furthermore, our methods can be used to derive exponential strong converse theorems for other lossy source coding problems with causal decoder side information discussed in Section I-A. In particular, since the lossy source coding problems with causal decoder side information in [13, 1] are special cases of the -user causal successive refinement problem, the exponential strong converse theorems for the problems in [13, 1] follow as a corollary of our result.
In order to establish the strong converse in this paper, we must overcome several major technical challenges. The main difficulty lies in the fact that for the causal successive refinement problem, the side information is available to the decoder causally instead of non-causally. This causal nature of the side information makes the design of the decoder much more complicated and involved, which complicates the analysis of the excess-distortion probability. We find that classical strong converse techniques like the image size characterization [32] and the perturbation approach [33] cannot lead to a strong converse theorem due to the above-mentioned difficulty. However, it is possible that other approaches different from ours can be used to obtain a strong converse theorem for the current problem. For example, it is interesting to explore whether two recently proposed strong converse techniques in [34, 35] can be used for this purpose considering the fact that the methods in [34, 35] have been successfully applied to problems including the Wyner-Ziv problem [27] and the Wyner-Ahlswede-Körner (WAK) problem [24, 25].
II Problem Formulation and Existing Results
Notation
Random variables and their realizations are in upper (e.g., ) and lower case (e.g., ) respectively. Sets are denoted in calligraphic font (e.g., ). We use to denote the complement of and use to denote a random vector of length . Furthermore, given any , we use to denote . We use and to denote the set of positive real numbers and integers respectively. Given two integers and , we use to denote the set of all integers between and and use to denote . The set of all probability distributions on is denoted as and the set of all conditional probability distributions from to is denoted as . For information-theoretic quantities such as entropy and mutual information, we follow the notation in [32]. In particular, when the joint distribution of is , we use and interchangeably.
II-A Problem Formulation
Let be a fixed finite integer and let be a joint probability mass function (pmf) on the finite alphabet with its marginals denoted in the customary way, e.g., , . Throughout the paper, we consider memoryless sources , which are generated i.i.d. according to . Let be the alphabet of the reproduced source symbol for user . Recall the encoder-decoder system model for the -user causal successive refinement problem in Figure 1.
A formal definition of a code for the causal successive refinement problem is as follows.
Definition 1**.**
An -code for the causal successive refinement problem consists of
- âą
* encoding functions*
[TABLE]
- âą
and decoding functions: for each
[TABLE]
For , let be a distortion measure. Given the source sequence and a reproduced version , we measure the distortion between them using the additive distortion measure . To evaluate the performance of an -code for the causal successive refinement problem, given distortion specified levels , we consider the following joint excess-distortion probability
[TABLE]
For ease of notation, throughout the paper, we use to denote , to denote and to denote .
Given , the -rate-distortion region for the -user causal successive refinement problem is defined as follows.
Definition 2**.**
Given any , a rate-distortion tuple is said to be -achievable if there exists a sequence of -codes such that
[TABLE]
The closure of the set of all -achievable rate-distortion tuples is called the -rate-distortion region and is denoted as .
Note that in Definition 2, is the sum rate of the first decoders. Using Definition 2, the rate-distortion region for the problem is defined as
[TABLE]
II-B Existing Results
For the two-user causal successive refinement problem, the rate-distortion region was fully characterized by Maor and Merhav [1, Theorem 1]. With slight generalization, the result can be extended to -user case.
For , let be a random variable taking values in a finite alphabet . For simplicity, throughout the paper, we let
[TABLE]
and let be a particular realization of and its alphabet set, respectively.
Define the following set of joint distributions:
[TABLE]
Given any joint distribution , define the following set of rate-distortion tuples
[TABLE]
Maor and Merhav [1] defined the following information theoretical sets of rate-distortion tuples
[TABLE]
Theorem 1**.**
The rate-distortion region for the causal successive refinement problem satisfies
[TABLE]
We remark that in [1], Maor and Merhav considered the average distortion criterion for , i.e.,
[TABLE]
instead of the vanishing joint excess-distortion probability criterion (see (6)) in Definition 2. However, with slight modification to the proof of [1], it can be verified (see Appendix -A) that the rate-distortion region under the vanishing joint excess-distortion probability criterion, is identical to the rate-distortion region derived by Maor and Merhav under the average distortion criterion.
Theorem 1 implies that if a rate-distortion tuple falls outside the rate-distortion region, i.e., , then the excess-distortion probability is bounded away from zero. We strengthen the converse proof of Theorem 1 by showing that if , then the excess-distortion probability approaches one exponentially fast as the blocklength tends to infinity.
III Main Results
III-A Preliminaries
In this subsection, we present necessary definitions and a key lemma before stating our main result.
Define the following set of distributions
[TABLE]
Given any such that
[TABLE]
for any , define the following linear combination of log likelihoods
[TABLE]
Given any and any , define the minus cumulant generating function of as
[TABLE]
Furthermore, define the minimal minus cumulant generating function over distributions in as
[TABLE]
Finally, given any rate-distortion tuple , define
[TABLE]
With the above definitions, we have the following lemma establishing the properties of the exponent function .
Lemma 2**.**
The following claims hold.
- (i)
For any rate-distortion tuple outside the rate-distortion region, i.e., , we have
[TABLE] 2. (ii)
For any rate-distortion tuple inside the rate-distortion region, i.e., , we have
[TABLE]
The proof of Lemma 2 is inspired by [23, Property 4], [29, Lemma 2] and is given in Section V. As will be shown in Theorem 3, the exponent function is a lower bound on the exponent of the probability of non-excess-distortion probability for the -user causal successive refinement problem. Thus, Claim (i) in Lemma 2 is crucial to establish the exponential strong converse theorem which states that the excess-distortion probability (see (3)) approaches one exponentially fast with respect to the blocklength of the source sequences.
III-B Main Result
Define the probability of non-excess-distortion as
[TABLE]
Theorem 3**.**
Given any -code for the -user causal successive refinement problem such that
[TABLE]
we have the following non-asymptotic upper bound on the probability of non-excess-distortion
[TABLE]
The proof of Theorem 3 is given in Section IV. Several remarks are in order.
First, our result is non-asymptotic, i.e., the bound in (26) holds for any . In order to prove Theorem 3, we adapt the recently proposed strong converse technique by Oohama [23] to analyze the probability of non-excess-distortion probability. We first obtain a non-asymptotic upper bound using the information spectrum of log-likelihoods involved in the definition of (see (16)) and then apply CramĂ©râs bound on large deviations (see e.g., [29, Lemma 13]) to obtain an exponential type non-asymptotic upper bound. Subsequently, we apply the recursive method [23] and proceed similarly as in [29] to obtain the desired result. Our method can also be used to establish similar results for other source coding problems with causal decoder side information [15, 20, 18].
Second, we believe that classical strong converse techniques including the image size characterization [32] and the perturbation approach [33] cannot lead to the strong converse theorem for the causal successive refinement problem. The main obstacle is that the side information is available causally and thus complicates the decoding analysis significantly.
Invoking Lemma 2 and Theorem 3, we conclude that the exponent on the right hand side of (26) is positive if and only if the rate-distortion tuple is outside the rate-distortion region, which implies the following exponential strong converse theorem.
Theorem 4**.**
For any sequence of -codes satisfying the rate constraints in (25), given any distortion levels , we have that if , then the probability of correct decoding decays exponentially fast to zero as the blocklength of the source sequences tends to infinity.
As a result of Theorem 4, we conclude that for every , the -rate distortion region (see Definition 2) satisfies that
[TABLE]
i.e., strong converse holds for the -user causal successive refinement problem. Using the strong converse theorem and Martonâs change-of-measure technique [36], similarly to [29, Theorem 5], we can also derive an upper bound on the exponent of the excess-distortion probability. Furthermore, applying the one-shot techniques in [37], we can also establish a non-asymptotic achievability bound. Applying the Berry-Esseen theorem to the achievability bound and analyzing the non-asymptotic converse bound in Theorem 3, similarly to [23], we conclude that the backoff from the rate-distortion region at finite blocklength scales on the order of . However, nailing down the exact second-order asymptotics [38, 39] is challenging and is left for future work.
Our main results in Lemma 2, Theorems 3 and 4 can be specialized to the settings in [13, 1] with and respectively.
IV Proof of the Non-Asymptotic Converse Bound (Theorem 3)
IV-A Preliminaries
Given any -code with encoding functions and and decoding functions , we define the following induced conditional distributions: for each ,
[TABLE]
For simplicity, in the following, we let
[TABLE]
and let be a particular realization and the alphabet of respectively. With above definitions, we have that the distribution satisfies that for any ,
[TABLE]
In the remaining part of this section, all distributions denoted by are induced by the joint distribution .
For simplicity, given any , we use to denote and we use to denote . Similarly, we use and . For each , let auxiliary random variables be and for all . Note that as a function of , the Markov chain holds under . Throughout the paper, for each , we let
[TABLE]
and let be a particular realization and the alphabet of , respectively.
For each , let be arbitrary distributions where and . Given any positive real number , define the following subsets of :
[TABLE]
IV-B Proof Steps
Lemma 5**.**
For any -code satisfying (25), given any distortion levels , we have
[TABLE]
The proof of Lemma 5 is given in Appendix -B and divided into two steps. First, we derive a -letter non-asymptotic upper bound which holds for certain arbitrary -letter auxiliary distributions. Subsequently, we single-letterize the derived bound by proper choice of auxiliary distributions and careful decomposition of induced distributions of .
For simplicity, in the following, we will use to denote and use to denote . Given any and any satisfying (15), define
[TABLE]
Furthermore, given any non-negative real number , define
[TABLE]
Recall the definition of in (19). Using CramĂ©râs bound [29, Lemma 13], we obtain the following non-asymptotic exponential type upper bound on the probability of non-excess-distortion, whose proof is given in in Appendix -D.
Lemma 6**.**
For any -code satisfying the conditions in Lemma 5, given any distortion levels , we have
[TABLE]
Furthermore, let
[TABLE]
Given any such that
[TABLE]
let
[TABLE]
then we have
[TABLE]
The following lemma which relates with (recall (18)) is crucial.
Lemma 7**.**
Given any satisfying (15) and (45), we have that for defined in (46),
[TABLE]
The proof of Lemma 7 is given in Appendix -E. In the proof of Lemma 7, we apply Hölderâs inequality and the recursive method used in [23].
Combining Lemmas 6 and 7, we conclude that for any -code satisfying the conditions in Lemma 5 and for any , given any satisfying (45), we have
[TABLE]
where (50) follows from the definitions of in (19), in (46) and the results in (47), and (51) follows from the definition of in (21).
V Proof of Properties of Strong Converse Exponent: Proof of Lemma 2
V-A Alternative Expressions of the Rate-Distortion Region
In this section, we present several definitions and an alternative characterization of the rate-distortion region using the supporting hyperplanes, which facilitate the proof of Lemma 2.
Define the following set of joint distributions
[TABLE]
Recall the definition of in (10). Define
[TABLE]
Furthermore, let be the following set of joint distributions
[TABLE]
Given any satisfying (15), define the following linear combination of achievable rate-distortion tuples
[TABLE]
Recall the definition of in (19). Finally, let be the following collection of rate-distortion tuples
[TABLE]
Recall the definitions of in (7) and in (11). Similarly to [23, Properties 2 and 3], one can show that the rate-distortion region for the -user causal successive refinement problem remains unchanged even if one uses stochastic decoding functions. Furthermore, the rate-distortion region has the following alternative characterization using supporting hyperplanes.
Lemma 8**.**
The rate-distortion region for the causal successive refinement problem satisfies
[TABLE]
V-B Proof of Claim (i)
Recall that we use (see (8)) to denote the collection of random variables and use similarly to denote a realization of and its alphabet, respectively. For any (recall (55)), any satisfying (15) and any , for any , define
[TABLE]
For simplicity, we let
[TABLE]
Furthermore, paralleling (18) to (21) and recalling (19), let
[TABLE]
For subsequent analysis, define the following tilted distribution
[TABLE]
Finally, define the following dispersion function
[TABLE]
Note that is positive and finite.
The proof of Claim (i) in Lemma 2 is completed by the following lemma which relates with .
Lemma 9**.**
The following claims hold.
- (i)
For any rate-distortion tuple ,
[TABLE] 2. (ii)
For any rate-distortion tuple outside the rate-distortion region, i.e., , we have that for some ,
[TABLE]
The proof of Lemma 9 is inspired by [23, 29] and given in Appendix -F. In order to prove Lemma 9, we use the alternative characterizations of the rate-distortion region in Lemma 8 and analyze the connections between the two exponent functions and .
V-C Proof of Claim (ii)
If a rate-distortion tuple falls inside the rate-distortion region, i.e., , then there exists a distribution (see (55)) such that for any satisfying (15), we have
[TABLE]
Recall the definition of in (17). From simple calculation, we have that
[TABLE]
Combining (70) to (72), by applying Taylor expansions, we have that for any ,
[TABLE]
Using the definition of in (18), we conclude that
[TABLE]
where (74) follows since (recall (14)), (75) follows from the result in (73), (76) follows from the definitions of in (17) and in (55), and (77) follows from the result in (69).
Using the definition of in (21) and the result in (77), we conclude that for any ,
[TABLE]
The proof of Claim (ii) is completed by noting that
[TABLE]
VI Conclusion
We considered the -user causal successive refinement problem [1] and established an exponential strong converse theorem using the strong converse techniques proposed by Oohama [23]. Our work appears to be the first to derive a strong converse theorem for any source coding problem with causal decoder side information. The methods we adopted can also be used to obtain exponential strong converse theorems for other source coding problems with causal decoder side information. This paper further illustrates the usefulness and generality of Oohamaâs information spectrum method in deriving exponential strong converse theorems. We believe that using Oohamaâs techniques [23], the strong converse theorem for channel coding with causal state information [40, 41] can also be established.
There are several natural future research directions. In Theorem 3, we presented only an lower bound on the strong converse exponent. It would be worthwhile to obtain an exact strong converse exponent and thus characterize the exact speed at which the probability of non-excess-distortion decays exponentially fast with respect to the blocklength of source sequences when the rate-distortion tuple falls outside the rate-distortion region. Furthermore, one can explore whether the methods in this paper can be used to establish strong converse theorems for causal successive refinement under the logarithmic loss [42, 43], which corresponds to soft decoding of each source symbol. Finally, one can also explore extensions to continuous alphabet by considering Gaussian memoryless sources under bounded distortion measures and derive second-order asymptotics [44, 45, 46, 39, 47] for the causal successive refinement problem.
-A Proof of Theorem 1
Replacing (6) with Definition 2, we can define the -rate-distortion region under the average distortion criterion. Furthermore, let
[TABLE]
Maor and Merhav [1] showed that for ,
[TABLE]
Actually, in [1, Section VII], in order to prove that , it was already shown that . Furthermore, it is straightforward to show that the above results hold for any finite . Thus, to prove Theorem 1, it suffices to show
[TABLE]
For this purpose, given any , let
[TABLE]
From the problem formulation, we know that for all . Now consider any rate-distortion tuple , then we have (4) to (6). Therefore, for any ,
[TABLE]
As a result, we have . Therefore, we have shown that .
-B Proof of Lemma 5
Recall the definition of and in (30). Given any and , let be arbitrary distributions. For simplicity, given each , we use to denote and use to denote where . Similarly we use and .
Given any positive real number , define the following sets:
[TABLE]
Then we have the following non-asymptotic upper bound on the probability of non-excess-distortion.
Lemma 10**.**
Given any -code satisfying (25) and any distortion levels , we have
[TABLE]
The proof of Lemma 10 is given in Appendix -C.
In the remaining of this subsection, we single-letterize the bound in Lemma 10. Recall that given any , we use to denote .Recalling that the distributions starting with are all induced by the joint distribution in (31) and using the choice of auxiliary random variables , we have
[TABLE]
[TABLE]
where (95) follows from the Markov chain , (98) follows from the Markov chain , (101) follows from the Markov chain , and (104) follows from the Markov chain .
Furthermore, recall that for , are arbitrary distributions where and . Note that Lemma 10 holds for arbitrary choices of distributions where and . The proof of Lemma 5 is completed by using Lemma 10 with the following choices of auxiliary distributions and noting that :
[TABLE]
-C Proof of Lemma 10
Recall the definition of the probability of non-excess-distortion in (24) and the definitions of sets in (86) to (92). For any -code, we have that
[TABLE]
where (113) follows from the union bound and the fact that for any two sets and . The proof of Lemma 10 is completed by showing that
[TABLE]
In the remaining of this subsection, we show that (114) holds. Recall the joint distribution of in (31). In the following, when we use a (conditional) distribution starting with , we mean that the (conditional) distribution is induced by the joint distribution in (31).
Using the definition of in (86), we have
[TABLE]
Similarly to (116), we have that
[TABLE]
Furthermore, using the definition of in (89) and the union bound, we have that
[TABLE]
Furthermore, using the definition of in (90), we obtain that
[TABLE]
where (128) follows since for all , and (130) follows since .
Using the definition of in (91) and the union bound similarly to (126), we have
[TABLE]
where (132) follows since for all and (134) follows since .
-D Proof of Lemma 6
For any satisfying (15), for , define (cf. (33) to (36)) and for , define
[TABLE]
Furthermore, let
[TABLE]
Using Lemma 5 and definitions in (135) to (138), we obtain that
[TABLE]
where (141) follows from CramĂ©râs bound in [29, Lemma 13] and (142) follows from the definition of in (42).
Choose such that
[TABLE]
i.e.,
[TABLE]
The proof of Lemma 6 is completed by combining (142) and (144).
-E Proof of Lemma 7
Recall that for each , we use to denote and use similarly.
Recall that the auxiliary random variables are chosen as and for all . Using the definition of in (41), we define
[TABLE]
Recall the joint distribution of in (31). For each , define
[TABLE]
Combining (42) and (148), we have
[TABLE]
Furthermore, similar to [23, Lemma 5], we obtain the following lemma, which is critical in the proof of Lemma 7.
Lemma 11**.**
For each ,
[TABLE]
Furthermore, for each , define
[TABLE]
Using Lemma 11 and (153), we have that for each ,
[TABLE]
Recall that the auxiliary distributions can be arbitrary distributions. Following the recursive method in [23], for each , we choose such that
[TABLE]
Let , where and , be induced by . Using the definition of in (145), we define
[TABLE]
In the following, for simplicity, we let . Combining (153) and (154), we obtain that for each ,
[TABLE]
where (160) follows Hölderâs inequality, (161) follows from the definitions of in (17) and in (156), (162) follows from the result in (46) and (47), and (164) follows from the definition of in (18) and the fact it is sufficient to consider distributions with cardinality bounds and for the optimization problem in (163) (the proof of this fact is similar to [23, Property 4(a)] and thus omitted).
The proof of Lemma 7 is completed by combining (151) and (164).
-F Proof of Lemma 9
-F1 Proof of Claim (i)
For any (see (14)), let (see (55)) be chosen such that and for all .
In the following, we drop the subscript of distributions when there is no confusion. For any satisfying (15) and
[TABLE]
using the definition of in (17), we obtain
[TABLE]
where (167) follows since i) with our choice of , we have
[TABLE]
and ii) the following equality holds
[TABLE]
(170) follows from Hölderâs inequality, (171) follows from the concavity of for and the choice of which ensures for all , (173) follows by applying Hölderâs inequality and recalling that , and (174) follows from the definition of in (60).
Therefore, for any satisfying (15) and (165), using the definition of in (18) and the result in (174), we have that
[TABLE]
Recalling the definition of in (21) and using the result in (177), we have
[TABLE]
where (181) follows since
[TABLE]
(182) follows by choosing and (183) follows from the definition of in (64).
-F2 Proof of Claim (ii)
Recall the definitions of in (59) and in (65). By simple calculation, one can verify that
[TABLE]
Applying Taylor expansion to at around and combining (186), (187), we have that for any and any , there exists such that
[TABLE]
where (189) follows from the definitions in (59), (65) and (66).
Using the definitions in (59), (62), (56) and the result in (189), we have that for any ,
[TABLE]
For any rate-distortion tuple outside the rate-distortion region, i.e., , from Lemma 8, we conclude that there exists satisfying (15) such that for some positive
[TABLE]
Using the definition of in (64), we have
[TABLE]
where (195) follows from the results in (191), (192) and the inequality
[TABLE]
resulting from the constraints that satisfying (15) and .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Maor and N. Merhav, âOn successive refinement with causal side information at the decoders,â IEEE Trans. Inf. Theory , vol. 54, no. 1, pp. 332â343, 2008.
- 2[2] C. Tian and S. N. Diggavi, âOn multistage successive refinement for WynerâZiv source coding with degraded side informations,â IEEE Trans. Inf. Theory , vol. 53, no. 8, pp. 2946â2960, 2007.
- 3[3] W. H. Equitz and T. M. Cover, âSuccessive refinement of information,â IEEE Trans. Inf. Theory , vol. 37, no. 2, pp. 269â275, 1991.
- 4[4] V. Koshelev, âEstimation of mean error for a discrete successive-approximation scheme,â Probl. Pered. Informat. , vol. 17, no. 3, pp. 20â33, 1981.
- 5[5] B. Rimoldi, âSuccessive refinement of information: characterization of the achievable rates,â IEEE Trans. Inf. Theory , vol. 40, no. 1, pp. 253â259, 1994.
- 6[6] A. Kanlis and P. Narayan, âError exponents for successive refinement by partitioning,â IEEE Trans. Inf. Theory , vol. 42, no. 1, pp. 275â282, 1996.
- 7[7] A. No, A. Ingber, and T. Weissman, âStrong successive refinability and rate-distortion-complexity tradeoff,â IEEE Trans. Inf. Theory , vol. 62, no. 6, pp. 3618â3635, 2016.
- 8[8] L. Zhou, V. Y. F. Tan, and M. Motani, âSecond-order and moderate deviation asymptotics for successive refinement,â IEEE Trans. Inf. Theory , vol. 63, no. 5, pp. 2896â2921, 2017.
