Optimal quantization for piecewise uniform distributions
Joseph Rosenblatt, Mrinal Kanti Roychowdhury

TL;DR
This paper develops a general method for optimal quantization of distributions and applies it to piecewise uniform distributions, providing explicit solutions for both finite and infinite piece cases.
Contribution
It introduces a unified approach to quantization using ergodic maps and applies it to derive explicit optimal sets for piecewise uniform distributions.
Findings
Explicit optimal sets of n-means for finite pieces
Asymptotic optimal quantization errors for all n
Difference in approach between finite and infinite piece distributions
Abstract
Quantization for a probability distribution refers to the idea of estimating a given probability by a discrete probability supported by a finite number of points. In this paper, firstly a general approach to this process is outlined using independent random variables and ergodic maps; these give asymptotically the optimal sets of -means and the th quantization errors for all positive integers . Secondly two piecewise uniform distributions are considered on : one with infinite number of pieces and one with finite number of pieces. For these two probability measures, we describe the optimal sets of -means and the th quantization errors for all . It is seen that for a uniform distribution with infinite number of pieces to determine the optimal sets of -means for one needs to know an optimal set of -means, but for a uniform…
| canonical sequence | canonical sequence | canonical sequence | |||
|---|---|---|---|---|---|
| 2 | {1, 1} | 21 | {12, 5, 2, 1, 1} | 40 | {24, 9, 4, 1, 1, 1} |
| 3 | {1, 1, 1} | 22 | {13, 5, 2, 1, 1} | 41 | {25, 9, 4, 1, 1, 1} |
| 4 | {2, 1, 1} | 23 | {14, 5, 2, 1, 1} | 42 | {25, 10, 4, 1, 1, 1} |
| 5 | {3, 1, 1} | 24 | {14, 6, 2, 1, 1} | 43 | {25, 10, 4, 2, 1, 1} |
| 6 | {3, 1, 1, 1} | 25 | {15, 6, 2, 1, 1} | 44 | {26, 10, 4, 2, 1, 1} |
| 7 | {4, 1, 1, 1} | 26 | {16, 6, 2, 1, 1} | 45 | {27, 10, 4, 2, 1, 1} |
| 8 | {4, 2, 1, 1} | 27 | {17, 6, 2, 1, 1} | 46 | {27, 11, 4, 2, 1, 1} |
| 9 | {5, 2, 1, 1} | 28 | {17, 6, 3, 1, 1} | 47 | {28, 11, 4, 2, 1, 1} |
| 10 | {6, 2, 1, 1} | 29 | {17, 7, 3, 1, 1} | 48 | {29, 11, 4, 2, 1, 1} |
| 11 | {6, 3, 1, 1} | 30 | {18, 7, 3, 1, 1} | 49 | {30, 11, 4, 2, 1, 1} |
| 12 | {7, 3, 1, 1} | 31 | {19, 7, 3, 1, 1} | 50 | {30, 12, 4, 2, 1, 1} |
| 13 | {8, 3, 1, 1} | 32 | {20, 7, 3, 1, 1} | 51 | {31, 12, 4, 2, 1, 1} |
| 14 | {8, 3, 1, 1, 1} | 33 | {20, 8, 3, 1, 1} | 52 | {31, 12, 5, 2, 1, 1} |
| 15 | {9, 3, 1, 1, 1} | 34 | {21, 8, 3, 1, 1} | 53 | {32, 12, 5, 2, 1, 1} |
| 16 | {9, 4, 1, 1, 1} | 35 | {21, 8, 3, 1, 1, 1} | 54 | {33, 12, 5, 2, 1, 1} |
| 17 | {10, 4, 1, 1, 1} | 36 | {22, 8, 3, 1, 1, 1} | 55 | {33, 13, 5, 2, 1, 1} |
| 18 | { 10, 4, 2, 1, 1} | 37 | {22, 9, 3, 1, 1, 1} | 56 | {34, 13, 5, 2, 1, 1} |
| 19 | {11, 4, 2, 1, 1} | 38 | {23, 9, 3, 1, 1, 1} | 57 | {35, 13, 5, 2, 1, 1} |
| 20 | { 12, 4, 2, 1, 1} | 39 | {24, 9, 3, 1, 1, 1} | 58 | {35, 14, 5, 2, 1, 1} |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
To appear, Uniform Distribution Theory
Optimal quantization for piecewise uniform distributions
Joseph Rosenblatt
Department of Mathematical Sciences
Indiana University-Purdue University Indianapolis
402 N. Blackford Street
Indianapolis, IN 46202-3217, USA.
and
Mrinal Kanti Roychowdhury
School of Mathematical and Statistical Sciences
University of Texas Rio Grande Valley
1201 West University Drive
Edinburg, TX 78539-2999, USA.
Abstract.
Quantization for a probability distribution refers to the idea of estimating a given probability by a discrete probability supported by a finite number of points. In this paper, firstly a general approach to this process is outlined using independent random variables and ergodic maps; these give asymptotically the optimal sets of -means and the th quantization errors for all positive integers . Secondly two piecewise uniform distributions are considered on : one with infinite number of pieces and one with finite number of pieces. For these two probability measures, we describe the optimal sets of -means and the th quantization errors for all . It is seen that for a uniform distribution with infinite number of pieces to determine the optimal sets of -means for one needs to know an optimal set of -means, but for a uniform distribution with finite number of pieces one can directly determine the optimal sets of -means and the th quantization errors for all .
Key words and phrases:
Optimal quantizers, quantization error, uniform distribution
2010 Mathematics Subject Classification:
60Exx, 94A34.
The research of the second author was supported by U.S. National Security Agency (NSA) Grant H98230-14-1-0320
1. Introduction
Quantization is the process of converting a continuous analog signal into a digital signal of discrete levels, or converting a digital signal of levels into another digital signal of levels, where . It is essential when analog quantities are represented, processed, stored, or transmitted by a digital system, or when data compression is required. It is a classic and still very active research topic in source coding and information theory. It has broad application in engineering and technology, for example in signal processing and data compression (see [GG, GN, Z]). For mathematical treatment of quantization one is referred to Graf and Luschgy’s book (see [GL]). For most recent work on quantization for uniform distributions interested readers can see [DR, R]. Let denote a Borel probability measure on and let denote the Euclidean norm on for any . Then, the th quantization error for (of order ) is defined by
[TABLE]
where the infimum is taken over all subsets of with card for . We assume that to make sure that there is a set for which the infimum occurs (see [AW, GKL, GL, GL2]). Such a set for which the infimum occurs and contains no more than -points is called an optimal set of -means and the elements of an optimal set are called optimal quantizers. Let be the largest open subset of for which . Then, is called the support of , and is denoted by . Notice that if is finite, i.e., if for some positive integer , then for all . On the other hand, if the support of is countable, or if is a continuous probability measure, then an optimal set of -means contains exactly -elements, i.e., for all (also see [GL]). For a finite set , by we denote the set of all elements in which are nearest to among all the elements in , i.e.,
[TABLE]
is called the Voronoi region generated by . On the other hand, the set is called the Voronoi diagram or Voronoi tessellation of with respect to the set . Let us now state the following proposition (see [GG, GL]).
Proposition 1.1**.**
Let be an optimal set of -means with respect to a probability distribution , , and be the Voronoi region generated by . Then, for every ,
* , , and .*
Notice that for , implies that the point is the conditional expectation of the random variable given that takes values in the Voronoi region . In [DR], Dettmann and Roychowdhury considered a uniform distribution on an equilateral triangle, and investigated the optimal sets of -means and the th quantization errors for the uniform distribution for all . In this direction one can also see [R]. In this paper, in Section 2 we describe some general approaches to construct asymptotically optimal -means that are highly worth considering, and it seems that they have not been looked at in the applied or theoretical literature on quantization. Then, after some preliminaries in Section 3, and in Section 4, we analyze optimality for a piecewise uniform distribution with infinitely many pieces on the real line, and in Section 5, we analyze optimality for a piecewise uniform distribution with finitely many pieces. For the uniform distribution with infinitely many pieces, in Lemma 4.1 and Lemma 4.2, we first determine the optimal sets of -means and the th quantization errors for and . Then, we prove Proposition 4.3, Proposition 4.4, Proposition 4.6 and Proposition 4.7, which help us to give the definition Definition 4.8 of a canonical sequence. With the help of the canonical sequences, in Theorem 4.14, we give an induction formula to determine the optimal sets of -means and the th quantization errors for all . We also give a tabular representation of several canonical sequences. For the uniform distribution with finitely many pieces, described in Section 5, one can directly determine the optimal sets of -means and the th quantization error for any , induction formula is not needed in this case.
2. The General Setting
We are interested in explicit sequences that are optimal -means, or asymptotically optimal -means, for given probability measures. In later sections of this article, explicit -means will be derived for piecewise uniform measures in a couple of different scenarios. For now, as a way of framing issues with and motivating that work, we want to consider some simple ways of generating discrete finite sets of points that can possibly be asymptotically optimal -means, if not optimal ones, and get some control on the rate that the distortion error tends to zero.
The methods we consider here are both random models with uncorrelated variables and dynamical models in which there can be correlation of the outputs. Each has advantages over the other. They also have advantages over carrying out the detailed, hard work needed to construct explicit optimal -means with the trade-off being that one generally obtains only asymptotically optimal results.
For concreteness, we keep this introductory discussion limited to the interval in Lebesgue measure. We are interested in easy methods of obtaining a sequence such that for all , is as small as possible. The classical case is with . Indeed, it is also reasonable to consider the unaveraged error itself. Given a choice of , we would like to know the exact rate at which the distortion error tends to zero, and compare that with the optimal distortion error rate.
2.1. IID Models
Consider a method of randomly generating -means for this simplest case of uniform measure on the interval modulo one. We take to be IID random variables with uniform distribution. We actually are taking with as the model underlying probability space , but we will suppress the dependence on if it will not create confusion.
The naive approach would be to estimate how many terms are needed so that each interval for , contains at least one point, with high probability. This will guarantee that the quantization error is no larger than , a common estimate for the optimal quantization error. It is easiest to consider the probability of the complementary case: there is some such that no term is in . This probability is for each such . So an estimate for the entire scope of the possibility is . Taking as a real variable would give for large , . Hence, with probability , each contains some . This gives the estimate for the quantization error with this probability. Asymptotically, this translates to taking and then as a real variable to derive the same estimate with probability as . This only gives convergence in distribution as goes to , but a simple increase in growth of can guarantee an almost sure result. Note: instead of the optimal distortion error of , this approach is giving a somewhat worse estimate .
However, we can do better. Consider the probability . It is easy to see that this is . So scaling of the distortion error by results in convergence in distribution to the distribution function , one can also compute expectations, and other moments. For example,
[TABLE]
Going further than this distributional convergence is not going to be possible because of the Hewitt-Savage Theorem [HS]. It shows that if this sequence converges a.e. or even just in measure, then the limit function would be a constant. The distributional convergence shows that this is not possible.
But if we also integrate with respect to instead of , then there is a.s. convergence to a computable constant. That is, there is a non-zero constant such that for a.e. , converges to as . This is not a difficult calculation, if we use estimates of the series of variances for this distortion rate. This convergence, indeed the distributional convergence above, shows that the random -means are asymptotically optimal. For details of the calculations in greater generality, see Cohort [PC]. This article contains other interesting results related to a.s. convergence of the random proxy for optimal -means and conclusions that follow about the asymptotic optimality of the random -means.
The quantization process is closely related to the discrepancy estimates for the random sequence . See Kuipers and Niederreiter [KN], especially the chapter notes, for a wealth of background information and references on discrepancy. We again take our interval modulo one, but we suppress this in the notation for simplicity.
Definition 2.2**.**
Given a sequence in , the discrepancy is defined by
[TABLE]
The smaller discrepancy is defined by
[TABLE]
It is easy to see that .
Now if , then for any interval of length , there must be some with . So too. Hence, we have the useful basic estimate:
Lemma 2.3**.**
.
Thus, the following result of K-L Chung [C] gives an upper bound on the distortion error.
Theorem 2.4**.**
For a.e. ,
[TABLE]
However, the actual distortion error rate here is likely to be faster. That is, if we take , then some experimentation with estimates suggested that for a.e. . Indeed, this is the case. It was perhaps first proved by Lévy [L]. But many sophisticated extension of this have been achieved, many under the title or order statistics. See for example the article by Deheuvels [D].
If the measure that we are quantizing is not uniform, then we need to adjust the placement of the random variables . The obvious approach is to just take to be IID with distribution given by the fixed probability measure . Notice that then we would under some general assumptions have the empirical measures converging weakly to . The result of Theorem 7.5 in Graf and Luschgy [GL] shows that our random empirical measure would not be asymptotically optimal except in the case of uniform measure. However, given an absolutely continuous measure , with a regular density function , we could choose the to be distributed according to the law . Then we would not only get a good estimate for the quantization error, but we would also have the empirical measures converging weakly to itself. See Graf and Luschgy [GL] discussion following Theorem 7.5.
2.5. Ergodic and Diophantine Models
Consider a dynamical systems approach to asymptotically optimal -means. For this model, we take an ergodic, measure-preserving mapping of . For a fixed , let . What can we say about the rate that tends to zero for arbitrary , and at least a.e. ? Also, is there better stabilization of this if we instead consider the mean behavior ? This is the stationary version of the IID case above, where correlation of the -means is being allowed.
So far we know some things, but not enough about this variation on possible asymptotically optimal -means. Results in this direction will appear in future work. But it is clear that the ergodicity is not needed for the most important property in obtaining asymptotically optimal -means. What ergodicity implies is that for a.e. , the orbit is dense in . This is all that is needed for to converge to zero for . What then happens if instead we take as our map a minimal map of ? The same property would hold for all points. That is, if we have a minimal map of a compact, metric space , in place of , then also tends to zero for arbitrary and . In any such case, it is in general not clear how to obtain a rate for the distortion error, or specific information about the distribution of the -means that are resulting. This type of issue is why the specific details presented in this article in Section 4 and Section 5 are so useful. Concrete, completely described optimal -means are worth a great deal in any applied, or theoretical, quantization process.
We might also consider a relative of the dynamical systems approach: a Diophantine method. Now we take for all , where is some irrational number and denotes the fraction in such that for some integer . We know that is uniformly distributed in and moreover there is an estimate on the discrepancy that holds for a.e. that comes from classical facts about continued fractions and Diophantine approximation. The estimate gives for a.e. and for all , for large enough . But then if , we must have for any interval with , there is some with . This then gives the discrete set with a quantization error no larger than . Again, we can translate this to real values by taking asymptotically to achieve this quantization error . It is not as good as the optimal one that would be . Despite the fact that the discrepancy estimate here is better than for the one in the IID case, the unaveraged distortion error is not as good as what one can obtain in the IID case. The virtue of the Diophantine result is that it is explicit.
What we are observing is that the same approach to over-estimating the distortion error that was used in the random approach will work for this Diophantine approach, replacing the iterated logarithm method of Chung with the theorem of Khinchin [K]. See also Kuipers and Niederreiter [KN] again. To be more exact, Khinchin’s theorem says for any non-decreasing such that , for a.e. , one has for the sequence
[TABLE]
But just as it proved to be the case in the IID model, using discrepancy for the Diophantine model, to over estimate the Diophantine model distortion error, seems likely to give too large an estimate. For example, see the results in Graham and Van Lint [GVL]. This article not only shows that there is a necessary spread in the distortion rate, but it shows that the optimal behavior for the Diophantine model is with that have bounded terms in the simple continued fraction expansion. For these, the distortion error is on the order of the optimal distortion error i.e. . What is not shown in [GVL], and seems missing in the literature, is a metric result that gives optimal control on the distortion rate for a.e. .
So it is possible that the dynamical system result or the Diophantine result can be improved by a couple of different approaches. One approach is to not consider the random input value, but take a specific very good value of , actually the Golden Mean. As mentioned above, this is what is considered in Graham and Van Lint [GVL]. See also Motta, Shipman, and Springer [MSS] where optimal transitivity is studied to limit the gaps in the sequence. Another approach would be to use bounded remainder sets so that the discrepancy error can be perhaps better controlled. See both Haynes, Kelly, and Koivusalo [HKK]; and Haynes and Koivusalo [HK].
In addition, we conjecture the following relationships between the asymptotic results from dynamical models and the optimal results that follow in later sections of this paper. Indeed, let be either the dynamical system or Diophantine construction above. Let be an optimal set of -means. While the unaveraged distortion rate is not going to be as good as the optimal distortion rate, averaging seems to have a very strong impact (as is shown in the IID case by Cohort [C]). We conjecture though that for every constant , when is sufficiently large,
[TABLE]
This result would show that the optimal -means are certainly better than either the random or dynamical approach to quantization. On the other hand, we also see that there may be lots of examples such that for every constant , when is sufficiently large,
[TABLE]
This would mean that the optimal -means are not better as far as the asymptotic behavior of the associated distortion rates are concerned, and that the random or dynamical system approaches give asymptotically optimal -means.
We summarize what has been demonstrated in this section, Section 2. Both the random and the dynamical approaches to quantization give fairly good quantization, but as we will see they do not give as good a quantization error as is possible using optimal quantization. This fact alone should help to motivate why we want to have explicitly optimal -means. To accomplish this, in the later sections of this paper we take some care to describe completely how to get optimal -means in a number of different contexts.
3. Notation and Some Facts
Let be a piecewise uniform distribution with infinitely many pieces on the real line with probability density function (pdf) given by
[TABLE]
In the sequel we will write and , where . For , by and , we denote the left and right end points of the interval , respectively, i.e., and .
Lemma 3.1**.**
Let and represent the expected value and the variance of a random variable with distribution . Then, and .
Proof.
We have
[TABLE]
and thus the lemma is yielded. ∎
Note 3.2**.**
Lemma 3.1 implies that the optimal set of one-mean is and the corresponding quantization error is . Let . By we denote the restriction of the probability measure on the interval , i.e., , in other words, for any Borel subset of we have . Similarly, write to denote the restriction of the probability measure on . For a probability distribution , by , we denote an optimal set of -means for . For a Borel subset of , by , it is meant the quantization error (or distortion measure) contributed by on the set with respect to the probability distribution . If nothing is mentioned within a parenthesis, by and , it is meant an optimal set of -means and the th quantization error with respect to the probability distribution .
Lemma 3.3**.**
For , let and denote the expectations of the random variables with distributions and , respectively. Then,
[TABLE]
Proof.
By the definition of the conditional expectation, we have
[TABLE]
[TABLE]
implying , and thus the lemma is yielded. ∎
Remark 3.4**.**
Lemma 3.3 implies that , , , and . can also be calculated in the following way:
[TABLE]
Proposition 3.5**.**
Let . Then, the set is a unique optimal set of -means for , i.e., . Moreover,
[TABLE]
Proof.
Since is uniformly distributed on , the boundaries of the Voronoi regions of an optimal set of -means will divide the interval into equal subintervals, i.e., the boundaries of the Voronoi regions are given by
[TABLE]
This implies that an optimal set of -means for is unique, and it consists of the midpoints of the boundaries of the Voronoi regions, i.e., the optimal set of -means for is given by for any . Then, the th quantization error for due to the set on is given by
[TABLE]
which after simplification implies . Again, , and so,
[TABLE]
which upon simplification yields . Thus, the proof of the proposition is complete. ∎
In the following section, we investigate the optimal sets of -means for . Once the optimal sets of -means are known the corresponding quantization error can easily be calculated.
4. Optimal Sets of -Means for
In this section, we first determine the optimal sets of -means for and .
Lemma 4.1**.**
Let be an optimal set of two-means such that . Then, and , and the corresponding quantization error is .
Proof.
Consider the set of two points . The distortion error due to the set is given by
[TABLE]
Since is the quantization error for two-means, we have . Let be an optimal set of two-means such that . Since the optimal quantizers are the expected values of their own Voronoi regions, we have . If , then
[TABLE]
which leads to a contradiction. So, we can assume that . If , then
[TABLE]
which leads to another contradiction. So, we can assume that . Since and , we have yielding the fact that the Voronoi region of does not contain any point from and the Voronoi region does not contain any point from . This implies that and , and the corresponding quantization error is , which is the lemma. ∎
Lemma 4.2**.**
Let be an optimal set of three-means such that . Then, , , , and the corresponding quantization error is .
Proof.
Consider the set of three points . The distortion error due to the set is given by
[TABLE]
Since is the quantization error for three-means, we have . Let be an optimal set of three-means. Since the optimal quantizers are the expected values of their own Voronoi regions we have . If , then
[TABLE]
which leads to a contradiction. So, we can assume that , and then the Voronoi region of does not contain any point from . If it does, then we must have implying , which gives a contradiction. Thus, we see that . Suppose that . The following two cases can arise:
Case 1. Voronoi region of contains points from .
Then, implying . First, assume that , and then
[TABLE]
which is a contradiction. Next, assume that . Then, . Also, notice that , and so, we have
[TABLE]
which leads to a contradiction.
Case 2. Voronoi region of does not contain any point from .
Then, as , we have
[TABLE]
which yields a contradiction.
Thus, by Case 1 and Case 2, we can assume that . We now show that -almost surely the Voronoi region of does not contain any point from . For the sake of contradiction assume that the Voronoi region of contains points from . Then, the distortion error contributed by and on the set is given by
[TABLE]
which is minimum when and . Then, notice that , i.e., -almost surely the Voronoi region of does not contain any point from . This implies the fact that and . Suppose that . Then,
[TABLE]
which is a contradiction. So, we can assume that . Then, the Voronoi region of does not contain any point from . If it does, then we must have implying , which yields a contradiction as . Thus, we have and . Moreover, we have seen . Then, by (1), the quantization error is . This completes the proof of the lemma. ∎
Proposition 4.3**.**
Let and let be an optimal set of -means. Then,
* and ;*
* does not contain any point from the open interval ;*
* the Voronoi region of any point in does not contain any point from , and the Voronoi region of any point in does not contain any point from .*
Proof.
By Lemma 4.1 and Lemma 4.2, the proposition is true for . We now show that the proposition is true for all . Consider the set of four points . The distortion error due to the set is given by
[TABLE]
Since is the quantization error for -means with , we have . Let be an optimal set of -means. If , then which is a contradiction. If , then
[TABLE]
which leads to another contradiction. Thus, and , which completes the proof of .
To prove and , let . Then, . We need to show that . For the sake of contradiction, assume that . If , then implying and so, which yields a contradiction. Next, suppose that . Then, implying , and so, which gives a contradiction. So, we can assume that , i.e., does not contain any point from the open interval , which yields .
If the Voronoi region of contains points from , we must have implying , which is a contradiction. Similarly, if the Voronoi region of any point in contains points from , we will arrive at a contradiction. Thus, is yielded, and this completes the proof of the proposition. ∎
Proposition 4.4**.**
Let be an optimal set of -means for . Then, and .
Proof.
As shown in the proof of Proposition 4.3, since is the quantization error for -means for , we have . By Proposition 4.3, we have and . First, we show that . Suppose that . Then, as , we have
[TABLE]
which leads to a contradiction. So, we can assume that . Next, suppose that . Then, as , we have
[TABLE]
which leads to another contradiction. Thus, the proof of the proposition is complete. ∎
Remark 4.5**.**
From Proposition 4.4, it follows that if is an optimal set of four-means, then and .
Proposition 4.6**.**
Let be an optimal set of -means for such that for some and . Then,
* and ;*
* does not contain any point from the open interval ;*
* the Voronoi region of any point in does not contain any point from and the Voronoi region of any point in does not contain any point from .*
Proof.
To prove the proposition it is enough to prove it for , and then inductively the proposition will follow for all . Fix . Suppose that . By Lemma 4.2, it is clear that the proposition is true for . We now prove that the proposition is true for . Let be an optimal set of -means for any . Let be the quantization error contributed by the set in the region . Let be a set such that . The distortion error due to the set is given by
[TABLE]
and so, . Suppose that does not contain any point from . Since by Proposition 4.3, the Voronoi region of any point in does not contain any point from , we have
[TABLE]
which leads to a contradiction. So, we can assume that . Suppose that . Then, , and so,
[TABLE]
which gives another contradiction. Therefore, , i.e., is proved.
To prove we proceed as follows: If , then as Lemma 4.1, it can be proved that . Since and , in this case we see that . If , then as Lemma 4.2, it can be proved that
[TABLE]
implying the fact that . We now assume that , then as mentioned in Remark 4.5, in this case, we can also prove that and , in fact, we have implying , and the corresponding quantization error, by Proposition 3.5, is given by
[TABLE]
Next, assume that . Then, we must have . Let implying . Suppose that . The following cases can arise:
Case 1. .
Then, implying , and so,
[TABLE]
which is contradiction.
Case 2.
Then, implying , and so,
[TABLE]
which gives a contradiction.
Thus, , which completes the proof of . The proof of is similar to the proof of in Proposition 4.3. Hence, the proposition is yielded. ∎
Proposition 4.7**.**
Let be an optimal set of -means for . Then, there exists a positive integer such that for all , and . Write and . Then, and , with
[TABLE]
Proof.
Proposition 4.3 says that if is an optimal set of -means for , then , , and does not contain any point from the open interval . Proposition 4.6 says that if for some , then and . Moreover, does not take any point from the open interval . Thus, by Induction Principle, we can say that if is an optimal set of -means for , then there exists a positive integer such that for all and .
For a given , write and . Since the Voronoi region of any point in does not contain any point from , and , we must have . Again, are disjoint for and does not contain any point from the open intervals for . This implies the fact that and , and so,
[TABLE]
Thus, the proof of the proposition is complete. ∎
Definition 4.8**.**
Let for be the positive integers as defined in Proposition 4.7. Then, we call the sequence a canonical sequence of order or just a canonical sequence. Notice that once a canonical sequence of order is known the corresponding optimal set of -means can easily be determined and vice versa. Let be a canonical sequence and with . Then, the sequence is called a subblock of the canonical sequence .
The canonical sequence has the following property.
Lemma 4.9**.**
Let be a canonical sequence for . Then, .
Proof.
Let be an optimal set of -means, and be the canonical sequence associated with . Take any . Let . Notice that is constant if remains fixed. The distortion error in the intervals and is given by
[TABLE]
which is minimum if , where for any positive real number , by it is meant that is the positive integer nearest to . Then, notice that implies , and if then yielding . By Proposition 4.7, it follows that , and thus, the lemma is yielded. ∎
Remark 4.10**.**
From Table 1, we see that is a canonical sequence, where , and . Take , then . Thus, we see that the canonical sequence violates the statement as mentioned in the proof of Lemma 4.9. But, such a canonical sequence does not occur frequently, and it does not violate the statement of Lemma 4.9. Putting and in the expression (4), we see that it is minimum if , which is the value that occurs in the canonical sequence . Hence, if and are known, using the expression (4) one can exactly determine .
We now give the following example.
Example 4.11**.**
By Lemma 4.2, for , we have implying and , and . Here the canonical sequence is . By Proposition 4.7,
[TABLE]
and so, by Proposition 3.5, , which is the quantization error for three-means obtained in Lemma 4.2.
The following lemma gives some more properties of canonical sequences.
Lemma 4.12**.**
Let and . Then, a canonical sequence of order is unique, and each subblock of a canonical sequence is also a canonical sequence.
Proof.
Let be a canonical sequence of order . For the sake of contradiction assume that is another canonical sequence of order . Then, we must have indices such that , but and . Putting in the expression similar to (4), we can uniquely determine and . Similarly, putting , we can uniquely determine and . Since , we will have and . Similarly, implies and . Thus, we see that and yield a contradiction to our assumption that . Therefore, we can assume that the canonical sequence of order is unique, which completes the proof of . To prove , we proceed as follows: Let be the canonical sequence of order . It is enough to show that is the canonical sequence of order . For the sake of contradiction, assume that is the canonical sequence of order . Since a canonical sequence of a given order is unique, if we calculate the quantization error, we must have
[TABLE]
which contradicts the fact that is the canonical sequence of order . Hence, every subblock of a canonical sequence is also a canonical sequence. ∎
Lemma 4.13**.**
Let be the canonical sequence of order for and . Then, the canonical sequence of order will be either for some , or .
Proof.
We prove the lemma by induction. By Lemma 4.1 and Lemma 4.2, the canonical sequences of order two and three are and , respectively. Again, by Remark 4.5, it can be seen that the canonical sequence of order four is . Thus, we see that the lemma is true for and . Let be a positive integer such that the lemma is true for all positive integers , where . We will show that the lemma is also true for . Let be the canonical sequence of order implying that the optimal set contains elements from and one element from . Then, the optimal set contains exactly one or two elements from . Assume that contains two elements from . Since is the only subblock of order two, the canonical sequence of order is . Again, as and the canonical sequence of order is unique, we must have . Thus, in this case the lemma is true. Now, assume that contains only one element from . In this case the canonical sequence of order is . We need to show that for exactly one , and for all other . First, assume that . Then, both and are canonical sequences of order and respectively. Since , and we assumed that the lemma is true for all positive integers , we have for exactly one , and for all other , which combined with yields that the lemma is true for . If , then as both and are canonical sequences of the same order, we have , which combined with yields that the lemma is true for . We now show that can not be any integer other than or . For the sake of contradiction, assume that for some . Then, is the canonical sequence of order , and is the canonical sequence of order . Since we assumed that the lemma is true for all positive integers , we must have for at least one . Without any loss of generality, assume that and then for some , and so, , which by an expression similar to (4) implies that and yielding a contradiction. Similarly, we can show that if for any , a contradiction arises. Thus, the lemma is true for if it is true for all positive integers . Hence, by the principle of Mathematical Induction the proof of the lemma is complete. ∎
We are now ready to state and prove the following theorem which gives the optimal set of -means whenever the optimal set of -means is known.
Theorem 4.14**.**
Let be the canonical sequence for an optimal set of -means for some . Construct the sequence such that
[TABLE]
for . For , set
[TABLE]
Write . If for some , then the sequence is the canonical sequence which gives an optimal set of -means. If , then is the canonical sequence which gives an optimal set of -means.
Proof.
By Lemma 4.1, we see that is the canonical sequence for an optimal set of two-means and is the canonical sequence for an optimal set of three-means. In fact, for the canonical sequence , we have and implying . Thus, we see that the theorem is true if . Let us now assume that is the canonical sequence for an optimal set of -means for . Then, using the hypothesis of the theorem, and Lemma 4.13, the proof of the theorem is complete. ∎
Remark 4.15**.**
Using Theorem 4.14, we obtain Table 1 which gives a list of canonical sequences of order for . Notice that for any positive integer , , to obtain the canonical sequence of order one needs to know the canonical sequence of order . A closed formula to obtain the canonical sequence of any order is still not known. On the other hand, in the following section, we show that for a piecewise uniform distribution with finitely many pieces we can easily determine the optimal sets of -means and the th quantization errors for all , see Note 5.10.
5. Optimal Quantization for Uniform Distribution with Finitely Many Pieces
Most of the notations and basic definitions used in this section are same as they are described in Section 3. Write , and . Let be a piecewise uniform distribution on the real line with probability density function (pdf) given by
[TABLE]
Lemma 5.1**.**
Let and represent the expected value and the variance of a random variable with distribution . Then, and .
Proof.
We have
[TABLE]
and thus the lemma is yielded. ∎
Lemma 5.2**.**
For , let denote the expectations of the random variable with distributions . Then,
[TABLE]
Proof.
By the definition of the conditional expectation, we have
[TABLE]
we can obtain . Hence, the lemma is yielded. ∎
The following proposition is similar to Proposition 3.5.
Proposition 5.3**.**
Let . Then, the set is a unique optimal set of -means for , i.e., . Similarly, and . Moreover,
[TABLE]
The following two lemmas are similar to Lemma 4.1 and Lemma 4.2.
Lemma 5.4**.**
Let be an optimal set of two-means such that . Then, and , and the corresponding quantization error is .
Lemma 5.5**.**
Let be an optimal set of three-means such that . Then, , , , and the corresponding quantization error is .
Lemma 5.6**.**
Let be an optimal set of four-means such that . Then, , , , , and the corresponding quantization error is .
Proof.
Consider the set of four points . The distortion error due to the set is given by
[TABLE]
implying .
Let be an optimal set of four-means. Since optimal quantizers are the expected values of their own Voronoi regions, we have . If , then
[TABLE]
which leads to a contradiction, so we can assume that . Suppose that . Then, the distortion error contributed by and on the set is given by
[TABLE]
which is minimum when , and the minimum value is , which is a contradiction. So, we can assume that . If , then
[TABLE]
which leads to a contradiction. So, we can assume that . Suppose that . Then, implying
[TABLE]
which is a contradiction. So, we can assume that . Now, if the Voronoi region of contains points from , we must have implying , and so,
[TABLE]
which yields a contradiction. Thus, we can assume that the Voronoi region of does not contain any point from implying . If , then
[TABLE]
which gives a contradiction. So, we can assume that . We now show that the Voronoi region of does not contain any point from . If it does, then
[TABLE]
which is minimum if . Notice that yielding the fact that -almost surely the Voronoi region of does not contain any point from implying . Thus, we see that , , and and the corresponding quantization error is given by , which completes the proof of the lemma. ∎
Proposition 5.7**.**
Let and let be an optimal set of -means. Then,
* for all ;*
* does not contain any point from the open intervals and ;*
* the Voronoi region of any point in does not contain any point from for .*
Proof.
From Lemma 5.5 and Lemma 5.6, it follows that the proposition is true for . We now prove that the proposition is true for . Consider the set of five points . The distortion error due to the set is given by
[TABLE]
implying . Since is the quantization error for -means for all , we have . Let be an optimal set of five-means. Since optimal quantizers are the expected values of their own Voronoi regions, we have . If , then
[TABLE]
which leads to a contradiction, so we can assume that , i.e., . If , then
[TABLE]
which is a contradiction. So, yielding . Let . Then, . We now show that does not contain any point from the open interval . For the sake of contradiction assume that contain a point from the open interval . The following two cases can arise:
Case 1. .
Then, implying , and so,
[TABLE]
which is a contradiction.
Case 2. .
Then, implying , and so,
[TABLE]
which leads to a contradiction.
By Case 1 and Case 2, we can assume that does not contain any point from the open interval . If , then
[TABLE]
which is a contradiction. So, we can assume that implying . If the Voronoi region of any point in contains points from , then we must have implying , which is a contradiction. If the Voronoi region of any point in contains points from , then we must have implying , which gives another contradiction. Hence, the Voronoi region of any point in does not contain any point from , and the Voronoi region of any point in does not contain any point from .
We now show that does not contain any point from the open interval . Since does not contain any point from and the Voronoi region of any point in does not contain any point from , and the Voronoi region of any point in does not contain any point from , we have
[TABLE]
Let be the quantization error contributed by the set in the region . Since and , if , then does not contain any point from . Assume that . Consider the set of three points . Since,
[TABLE]
we have If contains a point from , we must have . Suppose that . Then, implying . Now, notice that
[TABLE]
which is minimum if , and then , which contradicts the fact that . So, we can assume that is not true. Reflecting the situation with respect to the point , we can show that is also not true. Therefore, if , the set does not contain any point from . Next, assume that for some positive integer . Let . Then, . We need to show that . Consider the set of four points . Since is the quantization error for -means for , we have
[TABLE]
For the sake of contradiction, assume that . The following two cases can arise:
Case A. .
Then, implying , and so,
[TABLE]
implying , which is a contradiction.
Case B. .
Reflecting the situation in Case A with respect to the point , in this case, we can also show that a contradiction arises.
Hence, by Case A and Case B, we can assume that does not contain any point from the open interval , i.e., . If the Voronoi region of any point in contains points from , then we must have implying , which contradicts the fact that . If the Voronoi region of any point in contains points from , then we must have implying , which gives another contradiction. Hence, the Voronoi region of any point in does not contain any point from , and the Voronoi region of any point in does not contain any point from . Thus, the proof of the proposition is complete. ∎
Due to Proposition 5.7, we are now ready to state and prove the following proposition, which helps us to determine the optimal sets of -means and the th quantization errors for all as stated in the subsequent notes.
Proposition 5.8**.**
Let be an optimal set of -means for . Write and for . Then, and , with
[TABLE]
Proof.
If is not an optimal set of -means with respect to the probability distribution , we must have another set with cardinality which will give smaller distortion error with respect to than the distortion error due to the set . This will contradict the fact that is an optimal set of -means with respect to the probability distribution . Since are disjoint for and does not contain any point from the open intervals and , we have and , and so,
[TABLE]
Thus, the proof of the proposition is complete. ∎
Note 5.9**.**
Since represents the th quantization error for any , if for some positive integer , the expression \frac{1}{3888}\Big{(}\frac{1}{n_{2}^{2}}+\frac{1}{n_{3}^{2}}\Big{)} is minimum if and . Thus, we see that if for some positive integer , then , and if for some positive integer , then either and or and . Moreover, writing , or in (3), it can be seen that for any positive integer . Thus, we see that unlike the uniform distribution with infinitely many pieces, described in the previous section, the optimal sets of -means for the uniform distribution with finitely many pieces for all are not unique: if is an odd number then there are two different optimal sets of -means, and if is an even number then the optimal set of -means is unique.
In the following note we describe how to determine the optimal sets of -means and the th quantization errors for all .
Note 5.10**.**
To determine an optimal set of -means for any positive integer , we need to know , and as described in Proposition 5.8. Notice that for any , , we can easily determine , and by minimizing the following function:
[TABLE]
subject to the constraint . Once and are known, then by Proposition 5.3, using the following formula we can determine the corresponding optimal set of -means:
[TABLE]
For example: If , then , or and the corresponding quantization error is . If , then and the corresponding quantization error is , etc.
Acknowledgments We thank B. Pittel for useful facts about random quantizations and suggestions for some possible asymptotic results. We also would like to thank the referee whose questions and suggestions have been very important in improving this article, both in terms of its content and its citations.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AW] E.F. Abaya and G.L. Wise, Some remarks on the existence of optimal quantizers , Statistics & Probability Letters, Volume 2, Issue 6, December 1984, Pages 349-351.
- 2[C] K-L Chung, An estimate concerning the Kolmogoroff limits distribution , Transactions of the AMS 67 (1949) 36-50.
- 3[PC] P. Cohort, Limit theorems for random normalized distortion , Annals Applied Probability, 14 (2004), no. 1, 118-143.
- 4[D] P. Deheuvels, Strong bounds for multidimensional spacings , Z Wash. verw. Gebiete 64 (1983) 411-424.
- 5[DR] C.P. Dettmann and M.K. Roychowdhury, Quantization for uniform distributions on equilateral triangles , Real Analysis Exchange, Vol. 42(1), 2017, pp. 149-166.
- 6[GG] A. Gersho and R.M. Gray, Vector quantization and signal compression , Kluwer Academy publishers: Boston, 1992.
- 7[GL] S. Graf and H. Luschgy, Foundations of quantization for probability distributions , Lecture Notes in Mathematics 1730, Springer, Berlin, 2000.
- 8[GVL] R. Graham and J. H. Van Lint, On the distribution of n θ 𝑛 𝜃 n\theta modulo 1 1 1 , Canadian Journal Math, 20 (1968) 1020-1024.
