Optimal quantization for piecewise uniform distributions

Joseph Rosenblatt; Mrinal Kanti Roychowdhury

arXiv:1701.04160·math.PR·January 26, 2022

Optimal quantization for piecewise uniform distributions

Joseph Rosenblatt, Mrinal Kanti Roychowdhury

PDF

TL;DR

This paper develops a general method for optimal quantization of distributions and applies it to piecewise uniform distributions, providing explicit solutions for both finite and infinite piece cases.

Contribution

It introduces a unified approach to quantization using ergodic maps and applies it to derive explicit optimal sets for piecewise uniform distributions.

Findings

01

Explicit optimal sets of n-means for finite pieces

02

Asymptotic optimal quantization errors for all n

03

Difference in approach between finite and infinite piece distributions

Abstract

Quantization for a probability distribution refers to the idea of estimating a given probability by a discrete probability supported by a finite number of points. In this paper, firstly a general approach to this process is outlined using independent random variables and ergodic maps; these give asymptotically the optimal sets of $n$ -means and the $n$ th quantization errors for all positive integers $n$ . Secondly two piecewise uniform distributions are considered on $R$ : one with infinite number of pieces and one with finite number of pieces. For these two probability measures, we describe the optimal sets of $n$ -means and the $n$ th quantization errors for all $n \in N$ . It is seen that for a uniform distribution with infinite number of pieces to determine the optimal sets of $n$ -means for $n \geq 2$ one needs to know an optimal set of $(n - 1)$ -means, but for a uniform…

Tables1

Table 1. Table 1. List of canonical sequences for the optimal sets α n subscript 𝛼 𝑛 \alpha_{n} in the range 2 ≤ n ≤ 58 2 𝑛 58 2\leq n\leq 58 .

$n$	canonical sequence	$n$	canonical sequence	$n$	canonical sequence
2	{1, 1}	21	{12, 5, 2, 1, 1}	40	{24, 9, 4, 1, 1, 1}
3	{1, 1, 1}	22	{13, 5, 2, 1, 1}	41	{25, 9, 4, 1, 1, 1}
4	{2, 1, 1}	23	{14, 5, 2, 1, 1}	42	{25, 10, 4, 1, 1, 1}
5	{3, 1, 1}	24	{14, 6, 2, 1, 1}	43	{25, 10, 4, 2, 1, 1}
6	{3, 1, 1, 1}	25	{15, 6, 2, 1, 1}	44	{26, 10, 4, 2, 1, 1}
7	{4, 1, 1, 1}	26	{16, 6, 2, 1, 1}	45	{27, 10, 4, 2, 1, 1}
8	{4, 2, 1, 1}	27	{17, 6, 2, 1, 1}	46	{27, 11, 4, 2, 1, 1}
9	{5, 2, 1, 1}	28	{17, 6, 3, 1, 1}	47	{28, 11, 4, 2, 1, 1}
10	{6, 2, 1, 1}	29	{17, 7, 3, 1, 1}	48	{29, 11, 4, 2, 1, 1}
11	{6, 3, 1, 1}	30	{18, 7, 3, 1, 1}	49	{30, 11, 4, 2, 1, 1}
12	{7, 3, 1, 1}	31	{19, 7, 3, 1, 1}	50	{30, 12, 4, 2, 1, 1}
13	{8, 3, 1, 1}	32	{20, 7, 3, 1, 1}	51	{31, 12, 4, 2, 1, 1}
14	{8, 3, 1, 1, 1}	33	{20, 8, 3, 1, 1}	52	{31, 12, 5, 2, 1, 1}
15	{9, 3, 1, 1, 1}	34	{21, 8, 3, 1, 1}	53	{32, 12, 5, 2, 1, 1}
16	{9, 4, 1, 1, 1}	35	{21, 8, 3, 1, 1, 1}	54	{33, 12, 5, 2, 1, 1}
17	{10, 4, 1, 1, 1}	36	{22, 8, 3, 1, 1, 1}	55	{33, 13, 5, 2, 1, 1}
18	{ 10, 4, 2, 1, 1}	37	{22, 9, 3, 1, 1, 1}	56	{34, 13, 5, 2, 1, 1}
19	{11, 4, 2, 1, 1}	38	{23, 9, 3, 1, 1, 1}	57	{35, 13, 5, 2, 1, 1}
20	{ 12, 4, 2, 1, 1}	39	{24, 9, 3, 1, 1, 1}	58	{35, 14, 5, 2, 1, 1}

Equations162

V_{n}:=V_{n}(P)=\inf\Big{\{}\int\min_{a\in\alpha}\|x-a\|^{2}dP(x):\alpha\subset\mathbb{R}^{d},\text{ card}(\alpha)\leq n\Big{\}},

V_{n}:=V_{n}(P)=\inf\Big{\{}\int\min_{a\in\alpha}\|x-a\|^{2}dP(x):\alpha\subset\mathbb{R}^{d},\text{ card}(\alpha)\leq n\Big{\}},

M (a ∣ α) = {x \in R^{d} : ∥ x - a ∥ = b \in α min ∥ x - b ∥} .

M (a ∣ α) = {x \in R^{d} : ∥ x - a ∥ = b \in α min ∥ x - b ∥} .

\int_{Ω} n 1 \leq k \leq n min ∣ x - β (k, ω) ∣ d P (ω)

\int_{Ω} n 1 \leq k \leq n min ∣ x - β (k, ω) ∣ d P (ω)

= \int_{0}^{\infty} P ({ω : n 1 \leq k \leq n min ∣ x - β (k, ω) ∣ \geq t}) d t = 0 \int n /2 (1 - 2 t / n)^{n} d t = \frac{n}{2 ( n + 1 )} .

D_{n} (β) = sup {∣ \frac{1}{n} k = 1 \sum n 1_{[x, y)} (β (k)) - (y - x) ∣ : 0 \leq x < y < 1} .

D_{n} (β) = sup {∣ \frac{1}{n} k = 1 \sum n 1_{[x, y)} (β (k)) - (y - x) ∣ : 0 \leq x < y < 1} .

D_{n}^{*} (β) = sup {∣ \frac{1}{n} k = 1 \sum n 1_{[0, y)} (β (k)) - y ∣ : 0 \leq y < 1} .

D_{n}^{*} (β) = sup {∣ \frac{1}{n} k = 1 \sum n 1_{[0, y)} (β (k)) - y ∣ : 0 \leq y < 1} .

n \to \infty lim sup \frac{2 n D _{n}^{*} ( β ( ω ))}{ln ln ( n )} = 1.

n \to \infty lim sup \frac{2 n D _{n}^{*} ( β ( ω ))}{ln ln ( n )} = 1.

n D_{n} (β (θ)) = O (ln (n) g (ln ln (n))) .

n D_{n} (β (θ)) = O (ln (n) g (ln ln (n))) .

K + \int_{0}^{1} 1 \leq k \leq n min ∣ x - α_{n} (k) ∣^{2} d x \leq \int_{0}^{1} 1 \leq k \leq n min ∣ x - β (k) ∣^{2} d x .

K + \int_{0}^{1} 1 \leq k \leq n min ∣ x - α_{n} (k) ∣^{2} d x \leq \int_{0}^{1} 1 \leq k \leq n min ∣ x - β (k) ∣^{2} d x .

R \int_{0}^{1} 1 \leq k \leq n min ∣ x - α_{n} (k) ∣^{2} d x \geq \int_{0}^{1} 1 \leq k \leq n min ∣ x - β (k) ∣^{2} d x .

R \int_{0}^{1} 1 \leq k \leq n min ∣ x - α_{n} (k) ∣^{2} d x \geq \int_{0}^{1} 1 \leq k \leq n min ∣ x - β (k) ∣^{2} d x .

f(x)=\left\{\begin{array}[]{ccc}(\frac{3}{2})^{n}&\text{ if }1-\frac{1}{3^{n-1}}\leq x\leq 1-\frac{2}{3^{n}}\text{ for }n\in\mathbb{N},\\ 0&\text{ otherwise}.\end{array}\right.

f(x)=\left\{\begin{array}[]{ccc}(\frac{3}{2})^{n}&\text{ if }1-\frac{1}{3^{n-1}}\leq x\leq 1-\frac{2}{3^{n}}\text{ for }n\in\mathbb{N},\\ 0&\text{ otherwise}.\end{array}\right.

E (P) = n = 1 \sum \infty \int_{J_{n}} x d P = \frac{1}{2}, and V (P) = n = 1 \sum \infty \int_{J_{n}} (x - \frac{1}{2})^{2} d P = \frac{25}{204},

E (P) = n = 1 \sum \infty \int_{J_{n}} x d P = \frac{1}{2}, and V (P) = n = 1 \sum \infty \int_{J_{n}} (x - \frac{1}{2})^{2} d P = \frac{25}{204},

E (P (\cdot ∣ J_{k})) = 1 - \frac{5}{2} \frac{1}{3 ^{k}} and E (P (\cdot ∣ J_{(k, \infty)})) = 1 - \frac{1}{2} \frac{1}{3 ^{k}} .

E (P (\cdot ∣ J_{k})) = 1 - \frac{5}{2} \frac{1}{3 ^{k}} and E (P (\cdot ∣ J_{(k, \infty)})) = 1 - \frac{1}{2} \frac{1}{3 ^{k}} .

E (P (\cdot ∣ J_{k})) = \int_{J_{k}} x d P (\cdot ∣ J_{k}) = \frac{1}{P ( J _{k} )} \int_{J_{k}} x d P = 2^{k} \int_{J_{k}} (\frac{3}{2})^{k} x d x = 1 - \frac{5}{2} \frac{1}{3 ^{k}}, and

E (P (\cdot ∣ J_{k})) = \int_{J_{k}} x d P (\cdot ∣ J_{k}) = \frac{1}{P ( J _{k} )} \int_{J_{k}} x d P = 2^{k} \int_{J_{k}} (\frac{3}{2})^{k} x d x = 1 - \frac{5}{2} \frac{1}{3 ^{k}}, and

E (P (\cdot ∣ J_{(k, \infty)}))

E (P (\cdot ∣ J_{(k, \infty)}))

E (P (\cdot ∣ J_{(k, \infty)})) = \frac{1}{P ( J _{(k, \infty)} )} j = k + 1 \sum \infty P (J_{j}) E (P (\cdot ∣ J_{j})) = 2^{k} j = k + 1 \sum \infty \frac{1}{2 ^{j}} (1 - \frac{5}{2} \frac{1}{3 ^{j}}) = 1 - \frac{1}{2} \frac{1}{3 ^{k}} .

E (P (\cdot ∣ J_{(k, \infty)})) = \frac{1}{P ( J _{(k, \infty)} )} j = k + 1 \sum \infty P (J_{j}) E (P (\cdot ∣ J_{j})) = 2^{k} j = k + 1 \sum \infty \frac{1}{2 ^{j}} (1 - \frac{5}{2} \frac{1}{3 ^{j}}) = 1 - \frac{1}{2} \frac{1}{3 ^{k}} .

V (P, α_{n} (P (\cdot ∣ J_{k})), J_{k}) = \frac{1}{n ^{2}} \frac{1}{12} \frac{1}{1 8 ^{k}} and V (P, α_{1} (P (\cdot ∣ J_{(k, \infty)})), J_{(k, \infty)}) = \frac{25}{204} \frac{1}{1 8 ^{k}} .

V (P, α_{n} (P (\cdot ∣ J_{k})), J_{k}) = \frac{1}{n ^{2}} \frac{1}{12} \frac{1}{1 8 ^{k}} and V (P, α_{1} (P (\cdot ∣ J_{(k, \infty)})), J_{(k, \infty)}) = \frac{25}{204} \frac{1}{1 8 ^{k}} .

{1 - \frac{1}{3 ^{k - 1}}, 1 - \frac{1}{3 ^{k - 1}} + \frac{1}{n} \frac{1}{3 ^{k}}, 1 - \frac{1}{3 ^{k - 1}} + \frac{2}{n} \frac{1}{3 ^{k}}, \dots, 1 - \frac{1}{3 ^{k - 1}} + \frac{n - 1}{n} \frac{1}{3 ^{k}}, 1 - \frac{2}{3 ^{k}}} .

{1 - \frac{1}{3 ^{k - 1}}, 1 - \frac{1}{3 ^{k - 1}} + \frac{1}{n} \frac{1}{3 ^{k}}, 1 - \frac{1}{3 ^{k - 1}} + \frac{2}{n} \frac{1}{3 ^{k}}, \dots, 1 - \frac{1}{3 ^{k - 1}} + \frac{n - 1}{n} \frac{1}{3 ^{k}}, 1 - \frac{2}{3 ^{k}}} .

V (P, α_{n} (P (\cdot ∣ J_{k})), J_{k}) = n \times (the quantization error in each Voronoi region)

V (P, α_{n} (P (\cdot ∣ J_{k})), J_{k}) = n \times (the quantization error in each Voronoi region)

\displaystyle=n\Big{(}\int_{[1-\frac{1}{3^{k-1}},1-\frac{1}{3^{k-1}}+\frac{1}{n}\frac{1}{3^{k}}]}\Big{(}\frac{3}{2}\Big{)}^{k}\Big{(}x-(1-\frac{1}{3^{k-1}}+\frac{1}{2n}\frac{1}{3^{k}})\Big{)}^{2}dx\Big{)},

V (P, α_{1} (P (\cdot ∣ J_{(k, \infty)})), J_{(k, \infty)}) = n = k + 1 \sum \infty \int_{J_{n}} (x - (1 - \frac{1}{2} \frac{1}{3 ^{k}}))^{2} d P = n = k + 1 \sum \infty \int_{J_{n}} (\frac{3}{2})^{n} (x - (1 - \frac{1}{2} \frac{1}{3 ^{k}}))^{2} d P,

V (P, α_{1} (P (\cdot ∣ J_{(k, \infty)})), J_{(k, \infty)}) = n = k + 1 \sum \infty \int_{J_{n}} (x - (1 - \frac{1}{2} \frac{1}{3 ^{k}}))^{2} d P = n = k + 1 \sum \infty \int_{J_{n}} (\frac{3}{2})^{n} (x - (1 - \frac{1}{2} \frac{1}{3 ^{k}}))^{2} d P,

\int a \in β min (x - a)^{2} d P = \int_{J_{1}} (x - \frac{1}{6})^{2} d P + n = 2 \sum \infty \int_{J_{n}} (x - \frac{5}{6})^{2} d P = \frac{7}{612} .

\int a \in β min (x - a)^{2} d P = \int_{J_{1}} (x - \frac{1}{6})^{2} d P + n = 2 \sum \infty \int_{J_{n}} (x - \frac{5}{6})^{2} d P = \frac{7}{612} .

V_{2} \geq \int_{J_{1}} (x - \frac{1}{3})^{2} d P = \frac{1}{54} = 0.0185185 > V_{2},

V_{2} \geq \int_{J_{1}} (x - \frac{1}{3})^{2} d P = \frac{1}{54} = 0.0185185 > V_{2},

V_{2} \geq n = 2 \sum \infty \int_{J_{n}} (x - \frac{2}{3})^{2} d P = \frac{19}{918} = 0.0206972 > V_{2},

V_{2} \geq n = 2 \sum \infty \int_{J_{n}} (x - \frac{2}{3})^{2} d P = \frac{19}{918} = 0.0206972 > V_{2},

\int_{J_{1}} (x - \frac{1}{6})^{2} d P + \int_{J_{2}} (x - \frac{13}{18})^{2} d P + \int_{J_{(2, \infty)}} (x - \frac{17}{18})^{2} d P = \frac{29}{5508} = 0.00526507.

\int_{J_{1}} (x - \frac{1}{6})^{2} d P + \int_{J_{2}} (x - \frac{13}{18})^{2} d P + \int_{J_{(2, \infty)}} (x - \frac{17}{18})^{2} d P = \frac{29}{5508} = 0.00526507.

V_{3} \geq \int_{J_{1}} (x - \frac{1}{3})^{2} d P = \frac{1}{54} = 0.0185185 > V_{3},

V_{3} \geq \int_{J_{1}} (x - \frac{1}{3})^{2} d P = \frac{1}{54} = 0.0185185 > V_{3},

V_{3} \geq \int_{J_{2}} (x - \frac{5}{6})^{2} d P + n = 3 \sum \infty \int_{J_{(2, \infty)}} (x - \frac{31}{36})^{2} d P = \frac{481}{88128} = 0.00545797 > V_{3},

V_{3} \geq \int_{J_{2}} (x - \frac{5}{6})^{2} d P + n = 3 \sum \infty \int_{J_{(2, \infty)}} (x - \frac{31}{36})^{2} d P = \frac{481}{88128} = 0.00545797 > V_{3},

V_{3}

V_{3}

= \frac{15431}{1410048} = 0.0109436 > V_{3},

V_{3} \geq \int_{J_{(1, \infty)}} (x - \frac{5}{6})^{2} d P = \frac{25}{3672} = 0.00680828 > V_{3},

V_{3} \geq \int_{J_{(1, \infty)}} (x - \frac{5}{6})^{2} d P = \frac{25}{3672} = 0.00680828 > V_{3},

\int_{[0, \frac{a _{1} + a _{2}}{2}]} (x - a_{1})^{2} d P + \int_{[\frac{a _{1} + a _{2}}{2}, \frac{1}{3}]} (x - a_{2})^{2} d P = \frac{3 a _{1}^{3}}{8} + \frac{3}{8} a_{2} a_{1}^{2} - \frac{3}{8} a_{2}^{2} a_{1} + \frac{a _{2}^{2}}{2} - \frac{a _{2}}{6} - \frac{3 a _{2}^{3}}{8} + \frac{1}{54},

\int_{[0, \frac{a _{1} + a _{2}}{2}]} (x - a_{1})^{2} d P + \int_{[\frac{a _{1} + a _{2}}{2}, \frac{1}{3}]} (x - a_{2})^{2} d P = \frac{3 a _{1}^{3}}{8} + \frac{3}{8} a_{2} a_{1}^{2} - \frac{3}{8} a_{2}^{2} a_{1} + \frac{a _{2}^{2}}{2} - \frac{a _{2}}{6} - \frac{3 a _{2}^{3}}{8} + \frac{1}{54},

V_{3} \geq \int_{J_{1}} (x - \frac{1}{6})^{2} d P + \int_{J_{2}} (x - \frac{7}{9})^{2} d P = \frac{11}{1944} = 0.00565844 > V_{3},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

To appear, Uniform Distribution Theory

Optimal quantization for piecewise uniform distributions

Joseph Rosenblatt

Department of Mathematical Sciences

Indiana University-Purdue University Indianapolis

402 N. Blackford Street

Indianapolis, IN 46202-3217, USA.

[email protected]

and

Mrinal Kanti Roychowdhury

School of Mathematical and Statistical Sciences

University of Texas Rio Grande Valley

1201 West University Drive

Edinburg, TX 78539-2999, USA.

[email protected]

Abstract.

Quantization for a probability distribution refers to the idea of estimating a given probability by a discrete probability supported by a finite number of points. In this paper, firstly a general approach to this process is outlined using independent random variables and ergodic maps; these give asymptotically the optimal sets of $n$ -means and the $n$ th quantization errors for all positive integers $n$ . Secondly two piecewise uniform distributions are considered on $\mathbb{R}$ : one with infinite number of pieces and one with finite number of pieces. For these two probability measures, we describe the optimal sets of $n$ -means and the $n$ th quantization errors for all $n\in\mathbb{N}$ . It is seen that for a uniform distribution with infinite number of pieces to determine the optimal sets of $n$ -means for $n\geq 2$ one needs to know an optimal set of $(n-1)$ -means, but for a uniform distribution with finite number of pieces one can directly determine the optimal sets of $n$ -means and the $n$ th quantization errors for all $n\in\mathbb{N}$ .

Key words and phrases:

Optimal quantizers, quantization error, uniform distribution

2010 Mathematics Subject Classification:

60Exx, 94A34.

The research of the second author was supported by U.S. National Security Agency (NSA) Grant H98230-14-1-0320

1. Introduction

Quantization is the process of converting a continuous analog signal into a digital signal of $k$ discrete levels, or converting a digital signal of $n$ levels into another digital signal of $k$ levels, where $k<n$ . It is essential when analog quantities are represented, processed, stored, or transmitted by a digital system, or when data compression is required. It is a classic and still very active research topic in source coding and information theory. It has broad application in engineering and technology, for example in signal processing and data compression (see [GG, GN, Z]). For mathematical treatment of quantization one is referred to Graf and Luschgy’s book (see [GL]). For most recent work on quantization for uniform distributions interested readers can see [DR, R]. Let $P$ denote a Borel probability measure on $\mathbb{R}^{d}$ and let $\|\cdot\|$ denote the Euclidean norm on $\mathbb{R}^{d}$ for any $d\geq 1$ . Then, the $n$ th quantization error for $P$ (of order $2$ ) is defined by

[TABLE]

where the infimum is taken over all subsets $\alpha$ of $\mathbb{R}^{d}$ with card $(\alpha)\leq n$ for $n\geq 1$ . We assume that $\int\|x\|^{2}dP(x)<\infty$ to make sure that there is a set $\alpha$ for which the infimum occurs (see [AW, GKL, GL, GL2]). Such a set $\alpha$ for which the infimum occurs and contains no more than $n$ -points is called an optimal set of $n$ -means and the elements of an optimal set are called optimal quantizers. Let $U$ be the largest open subset of $\mathbb{R}^{d}$ for which $P(U)=0$ . Then, $\mathbb{R}^{d}\setminus U$ is called the support of $P$ , and is denoted by $\text{supp}(P)$ . Notice that if $\text{supp}(P)$ is finite, i.e., if $\text{card}(\text{supp}(P))=N$ for some positive integer $N$ , then $V_{n}(P)=0$ for all $n\geq N$ . On the other hand, if the support of $P$ is countable, or if $P$ is a continuous probability measure, then an optimal set of $n$ -means contains exactly $n$ -elements, i.e., $V_{n}(P)>V_{n+1}(P)$ for all $n\in\mathbb{N}$ (also see [GL]). For a finite set $\alpha\subset\mathbb{R}^{d}$ , by $M(a|\alpha)$ we denote the set of all elements in $\mathbb{R}^{d}$ which are nearest to $a$ among all the elements in $\alpha$ , i.e.,

[TABLE]

$M(a|\alpha)$ is called the Voronoi region generated by $a\in\alpha$ . On the other hand, the set $\{M(a|\alpha):a\in\alpha\}$ is called the Voronoi diagram or Voronoi tessellation of $\mathbb{R}^{d}$ with respect to the set $\alpha$ . Let us now state the following proposition (see [GG, GL]).

Proposition 1.1.

Let $\alpha$ be an optimal set of $n$ -means with respect to a probability distribution $P$ , $a\in\alpha$ , and $M(a|\alpha)$ be the Voronoi region generated by $a\in\alpha$ . Then, for every $a\in\alpha$ ,

$(i)$ * $P(M(a|\alpha))>0$ , $(ii)$ $P(\partial M(a|\alpha))=0$ , and $(iii)$ $a=E(X:X\in M(a|\alpha))$ .*

Notice that for $a\in\alpha$ , $a=E(X:X\in M(a|\alpha))$ implies that the point $a$ is the conditional expectation of the random variable $X$ given that $X$ takes values in the Voronoi region $M(a|\alpha)$ . In [DR], Dettmann and Roychowdhury considered a uniform distribution on an equilateral triangle, and investigated the optimal sets of $n$ -means and the $n$ th quantization errors for the uniform distribution for all $n\geq 2$ . In this direction one can also see [R]. In this paper, in Section 2 we describe some general approaches to construct asymptotically optimal $n$ -means that are highly worth considering, and it seems that they have not been looked at in the applied or theoretical literature on quantization. Then, after some preliminaries in Section 3, and in Section 4, we analyze optimality for a piecewise uniform distribution with infinitely many pieces on the real line, and in Section 5, we analyze optimality for a piecewise uniform distribution with finitely many pieces. For the uniform distribution with infinitely many pieces, in Lemma 4.1 and Lemma 4.2, we first determine the optimal sets of $n$ -means and the $n$ th quantization errors for $n=2$ and $n=3$ . Then, we prove Proposition 4.3, Proposition 4.4, Proposition 4.6 and Proposition 4.7, which help us to give the definition Definition 4.8 of a canonical sequence. With the help of the canonical sequences, in Theorem 4.14, we give an induction formula to determine the optimal sets of $n$ -means and the $n$ th quantization errors for all $n\geq 2$ . We also give a tabular representation of several canonical sequences. For the uniform distribution with finitely many pieces, described in Section 5, one can directly determine the optimal sets of $n$ -means and the $n$ th quantization error for any $n\in\mathbb{N}$ , induction formula is not needed in this case.

2. The General Setting

We are interested in explicit sequences that are optimal $n$ -means, or asymptotically optimal $n$ -means, for given probability measures. In later sections of this article, explicit $n$ -means will be derived for piecewise uniform measures in a couple of different scenarios. For now, as a way of framing issues with and motivating that work, we want to consider some simple ways of generating discrete finite sets of points that can possibly be asymptotically optimal $n$ -means, if not optimal ones, and get some control on the rate that the distortion error tends to zero.

The methods we consider here are both random models with uncorrelated variables and dynamical models in which there can be correlation of the outputs. Each has advantages over the other. They also have advantages over carrying out the detailed, hard work needed to construct explicit optimal $n$ -means with the trade-off being that one generally obtains only asymptotically optimal results.

For concreteness, we keep this introductory discussion limited to the interval $[0,1)\mod 1$ in Lebesgue measure. We are interested in easy methods of obtaining a sequence $(\beta(k):k\geq 1)$ such that for all $n$ , $\int_{0}^{1}\min\limits_{1\leq k\leq n}|x-\beta(k)|^{r}\,dx$ is as small as possible. The classical case is with $r=2$ . Indeed, it is also reasonable to consider the unaveraged error $\min\limits_{1\leq k\leq n}|x-\beta(k)|$ itself. Given a choice of $(\beta(k):k\geq 1)$ , we would like to know the exact rate at which the distortion error tends to zero, and compare that with the optimal distortion error rate.

2.1. IID Models

Consider a method of randomly generating $n$ -means for this simplest case of uniform measure on the interval $[0,1)$ modulo one. We take $\boldsymbol{\beta}=(\beta(k):k\geq 1)$ to be IID random variables with uniform distribution. We actually are taking $\beta(k,\omega)$ with $\omega\in\Omega$ as the model underlying probability space $(\Omega,P)$ , but we will suppress the dependence on $\omega$ if it will not create confusion.

The naive approach would be to estimate how many terms $(\beta(1),\dots,\beta(n))$ are needed so that each interval $I_{j}=[j/M,(j+1)/M),$ for $j=0,\dots,M-1$ , contains at least one point, with high probability. This will guarantee that the quantization error $\int_{0}^{1}\min\limits_{1\leq k\leq n}|x-\beta(k)|^{2}\,dx$ is no larger than $M\int_{0}^{1/M}x^{2}\,dx=1/3M^{2}$ , a common estimate for the optimal quantization error. It is easiest to consider the probability of the complementary case: there is some $I_{j}$ such that no term $\beta(k),k=1,\dots,n$ is in $I_{j}$ . This probability is $(1-\frac{1}{M})^{n}$ for each such $j$ . So an estimate for the entire scope of the possibility is $M(1-\frac{1}{M})^{n}$ . Taking $M=n/\ln(n)$ as a real variable would give for large $n$ , $M(1-\frac{1}{M})^{n}\sim 1/\ln(n)$ . Hence, with probability $1-1/\ln(n)$ , each $I_{j}$ contains some $\beta(k),1\leq k\leq n$ . This gives the estimate $1/3M^{2}=\ln^{2}(n)/2n^{2}$ for the quantization error with this probability. Asymptotically, this translates to taking $M\geq 1$ and then $n=M\ln(M)$ as a real variable to derive the same estimate with probability $1-1/\ln M\asymp 1-1/n$ as $n\to\infty$ . This only gives convergence in distribution as $n$ goes to $\infty$ , but a simple increase in growth of $M$ can guarantee an almost sure result. Note: instead of the optimal distortion error of $C/M^{2}\ln^{2}(M)$ , this approach is giving a somewhat worse estimate $C/M^{2}$ .

However, we can do better. Consider the probability $P(\{\omega:n\min\limits_{1\leq k\leq n}|x-\beta(k,\omega)|\geq t\})$ . It is easy to see that this is $(1-\frac{2t}{n})^{n}$ . So scaling of the distortion error by $n$ results in convergence in distribution to the distribution function $d(t)=1-e^{-2t},t\geq 0$ , one can also compute expectations, and other moments. For example,

[TABLE]

Going further than this distributional convergence is not going to be possible because of the Hewitt-Savage Theorem [HS]. It shows that if this sequence converges a.e. or even just in measure, then the limit function would be a constant. The distributional convergence shows that this is not possible.

But if we also integrate with respect to $x$ instead of $\omega$ , then there is a.s. convergence to a computable constant. That is, there is a non-zero constant $C$ such that for a.e. $\omega$ , $\int_{0}^{1}n\min\limits_{1\leq k\leq n}|x-\beta(k,\omega)|\,dx$ converges to $C$ as $n\to\infty$ . This is not a difficult calculation, if we use estimates of the series of variances for this distortion rate. This convergence, indeed the distributional convergence above, shows that the random $n$ -means are asymptotically optimal. For details of the calculations in greater generality, see Cohort [PC]. This article contains other interesting results related to a.s. convergence of the random proxy for optimal $n$ -means and conclusions that follow about the asymptotic optimality of the random $n$ -means.

The quantization process is closely related to the discrepancy estimates for the random sequence $(\beta(k,\omega))$ . See Kuipers and Niederreiter [KN], especially the chapter notes, for a wealth of background information and references on discrepancy. We again take our interval modulo one, but we suppress this in the notation for simplicity.

Definition 2.2.

Given a sequence $\boldsymbol{\beta}=(\beta(k):k\geq 1)$ in $[0,1)$ , the discrepancy $D_{n}(\boldsymbol{\beta})$ is defined by

[TABLE]

The smaller discrepancy $D_{n}^{*}(\boldsymbol{\beta})$ is defined by

[TABLE]

It is easy to see that $D_{n}^{*}\leq D_{n}\leq 2D_{n}^{*}$ .

Now if $D_{n}<1/M$ , then for any interval $I$ of length $1/M$ , there must be some $\beta_{k}\in I$ with $k\leq n$ . So $\min\limits_{1\leq k\leq n}|x-\beta(k)|\leq 1/M$ too. Hence, we have the useful basic estimate:

Lemma 2.3.

$\min\limits_{1\leq k\leq n}|x-\beta(k)|\leq D_{n}(\boldsymbol{\beta})$ .

Thus, the following result of K-L Chung [C] gives an upper bound on the distortion error.

Theorem 2.4.

For a.e. $\omega$ ,

[TABLE]

However, the actual distortion error rate here is likely to be faster. That is, if we take $d_{n}(\boldsymbol{\beta}(\omega))=\min\limits_{1\leq k\leq n}|x-\beta(k,\omega)|$ , then some experimentation with estimates suggested that $\limsup\limits_{n\to\infty}\frac{nd_{n}(\boldsymbol{\beta}(\omega))}{\ln n}<\infty$ for a.e. $\omega$ . Indeed, this is the case. It was perhaps first proved by Lévy [L]. But many sophisticated extension of this have been achieved, many under the title or order statistics. See for example the article by Deheuvels [D].

If the measure that we are quantizing is not uniform, then we need to adjust the placement of the random variables $(\beta(k):k\geq 1)$ . The obvious approach is to just take $\beta(k)$ to be IID with distribution given by the fixed probability measure $\nu$ . Notice that then we would under some general assumptions have the empirical measures $\frac{1}{n}\sum\limits_{k=1}^{n}\delta_{\beta(k)}$ converging weakly to $\nu$ . The result of Theorem 7.5 in Graf and Luschgy [GL] shows that our random empirical measure would not be asymptotically optimal except in the case of uniform measure. However, given an absolutely continuous measure $d\nu=hd\lambda$ , with a regular density function $h$ , we could choose the $\beta(k)$ to be distributed according to the law $h^{3}d\lambda$ . Then we would not only get a good estimate for the quantization error, but we would also have the empirical measures converging weakly to $hd\lambda=d\nu$ itself. See Graf and Luschgy [GL] discussion following Theorem 7.5.

2.5. Ergodic and Diophantine Models

Consider a dynamical systems approach to asymptotically optimal $n$ -means. For this model, we take an ergodic, measure-preserving mapping $\tau$ of $[0,1]\mod 1$ . For a fixed $y\in[0,1]$ , let $\beta(k,y)=\tau^{k}(y)$ . What can we say about the rate that $\min\limits_{1\leq k\leq n}|x-\beta(k,y)|$ tends to zero for arbitrary $x$ , and at least a.e. $y$ ? Also, is there better stabilization of this if we instead consider the mean behavior $\int_{0}^{1}\min\limits_{1\leq k\leq n}|x-\beta(k,y)|^{2}\,dx$ ? This is the stationary version of the IID case above, where correlation of the $n$ -means is being allowed.

So far we know some things, but not enough about this variation on possible asymptotically optimal $n$ -means. Results in this direction will appear in future work. But it is clear that the ergodicity is not needed for the most important property in obtaining asymptotically optimal $n$ -means. What ergodicity implies is that for a.e. $y$ , the orbit $(\tau^{k}(y):k\geq 1)$ is dense in $[0,1]$ . This is all that is needed for $\min\limits_{1\leq k\leq n}|x-\beta(k,y)|$ to converge to zero for $x$ . What then happens if instead we take as our map a minimal map of $[0,1]$ ? The same property would hold for all points. That is, if we have a minimal map $\tau$ of a compact, metric space $(X,d_{X})$ , in place of $[0,1]$ , then $\min\limits_{1\leq k\leq n}d_{X}(x,\tau^{k}(y))$ also tends to zero for arbitrary $x$ and $y$ . In any such case, it is in general not clear how to obtain a rate for the distortion error, or specific information about the distribution of the $n$ -means that are resulting. This type of issue is why the specific details presented in this article in Section 4 and Section 5 are so useful. Concrete, completely described optimal $n$ -means are worth a great deal in any applied, or theoretical, quantization process.

We might also consider a relative of the dynamical systems approach: a Diophantine method. Now we take $\beta(k,\theta)=\{k\theta\}$ for all $k\geq 1$ , where $\theta$ is some irrational number and $\{t\}$ denotes the fraction in $[0,1)$ such that $t=\{t\}+k$ for some integer $k$ . We know that $\boldsymbol{\beta}(\theta)=(\beta(k,\theta):k\geq 1)$ is uniformly distributed in $[0,1]$ and moreover there is an estimate on the discrepancy $D_{n}(\boldsymbol{\beta}(\theta))$ that holds for a.e. $\theta$ that comes from classical facts about continued fractions and Diophantine approximation. The estimate gives for a.e. $\theta$ and for all $\delta>0$ , $D_{n}(\boldsymbol{\beta}(\theta))\leq\ln((n)^{1+\delta}/n$ for large enough $n$ . But then if $D_{n}(\boldsymbol{\beta}(\theta))<\frac{1}{M}$ , we must have for any interval $I\subset[0,1]$ with $|I|=\frac{1}{M}$ , there is some $k\theta\in I$ with $1\leq k\leq n$ . This then gives the discrete set $\{\beta(k):1\leq k\leq n\}$ with a quantization error no larger than $1/3M^{2}$ . Again, we can translate this to real values by taking $n=M\ln^{1+\delta}(M)$ asymptotically to achieve this quantization error $C/M^{2}$ . It is not as good as the optimal one that would be $C/M^{2}\ln^{2+2\delta}(M)$ . Despite the fact that the discrepancy estimate here is better than for the one in the IID case, the unaveraged distortion error is not as good as what one can obtain in the IID case. The virtue of the Diophantine result is that it is explicit.

What we are observing is that the same approach to over-estimating the distortion error that was used in the random approach will work for this Diophantine approach, replacing the iterated logarithm method of Chung with the theorem of Khinchin [K]. See also Kuipers and Niederreiter [KN] again. To be more exact, Khinchin’s theorem says for any non-decreasing $g$ such that $\sum\limits_{n=1}^{\infty}\frac{1}{g(n)}<\infty$ , for a.e. $\theta$ , one has for the sequence ${\boldsymbol{\beta}(\theta)}=(k\theta\mod 1:k\geq 1)$

[TABLE]

But just as it proved to be the case in the IID model, using discrepancy for the Diophantine model, to over estimate the Diophantine model distortion error, seems likely to give too large an estimate. For example, see the results in Graham and Van Lint [GVL]. This article not only shows that there is a necessary spread in the distortion rate, but it shows that the optimal behavior for the Diophantine model is with $\theta$ that have bounded terms in the simple continued fraction expansion. For these, the distortion error is on the order of the optimal distortion error i.e. $d_{n}(\theta)=O(1/n)$ . What is not shown in [GVL], and seems missing in the literature, is a metric result that gives optimal control on the distortion rate for a.e. $\theta$ .

So it is possible that the dynamical system result or the Diophantine result can be improved by a couple of different approaches. One approach is to not consider the random input value, but take a specific very good value of $\theta$ , actually the Golden Mean. As mentioned above, this is what is considered in Graham and Van Lint [GVL]. See also Motta, Shipman, and Springer [MSS] where optimal transitivity is studied to limit the gaps in the sequence. Another approach would be to use bounded remainder sets so that the discrepancy error can be perhaps better controlled. See both Haynes, Kelly, and Koivusalo [HKK]; and Haynes and Koivusalo [HK].

In addition, we conjecture the following relationships between the asymptotic results from dynamical models and the optimal results that follow in later sections of this paper. Indeed, let $(\beta(k):1\leq k\leq n)$ be either the dynamical system or Diophantine construction above. Let $(\alpha_{n}(k):1\leq k\leq n)$ be an optimal set of $n$ -means. While the unaveraged distortion rate is not going to be as good as the optimal distortion rate, averaging seems to have a very strong impact (as is shown in the IID case by Cohort [C]). We conjecture though that for every constant $K$ , when $n$ is sufficiently large,

[TABLE]

This result would show that the optimal $n$ -means are certainly better than either the random or dynamical approach to quantization. On the other hand, we also see that there may be lots of examples such that for every constant $R>1$ , when $n$ is sufficiently large,

[TABLE]

This would mean that the optimal $n$ -means are not better as far as the asymptotic behavior of the associated distortion rates are concerned, and that the random or dynamical system approaches give asymptotically optimal $n$ -means.

We summarize what has been demonstrated in this section, Section 2. Both the random and the dynamical approaches to quantization give fairly good quantization, but as we will see they do not give as good a quantization error as is possible using optimal quantization. This fact alone should help to motivate why we want to have explicitly optimal $n$ -means. To accomplish this, in the later sections of this paper we take some care to describe completely how to get optimal $n$ -means in a number of different contexts.

3. Notation and Some Facts

Let $P$ be a piecewise uniform distribution with infinitely many pieces on the real line with probability density function (pdf) $f$ given by

[TABLE]

In the sequel we will write $J_{n}:=[1-\frac{1}{3^{n-1}},1-\frac{2}{3^{n}}]$ and $J_{(n,\infty)}:=\mathop{\cup}\limits_{j=n+1}^{\infty}J_{j}$ , where $n\in\mathbb{N}$ . For $n\in\mathbb{N}$ , by $J_{n}(0)$ and $J_{n}(1)$ , we denote the left and right end points of the interval $J_{n}$ , respectively, i.e., $J_{n}(0)=1-\frac{1}{3^{n-1}}$ and $J_{n}(1)=1-\frac{2}{3^{n}}$ .

Lemma 3.1.

Let $E(P)$ and $V(P)$ represent the expected value and the variance of a random variable $X$ with distribution $P$ . Then, $E(P)=\frac{1}{2}$ and $V(P)=\frac{25}{204}$ .

Proof.

We have

[TABLE]

and thus the lemma is yielded. ∎

Note 3.2.

Lemma 3.1 implies that the optimal set of one-mean is $\{\frac{1}{2}\}$ and the corresponding quantization error is $\frac{25}{204}$ . Let $k\in\mathbb{N}$ . By $P(\cdot|J_{k})$ we denote the restriction of the probability measure $P$ on the interval $J_{k}$ , i.e., $P(\cdot|J_{k})=P(\cdot\cap J_{k})/P(J_{k})$ , in other words, for any Borel subset $B$ of $J_{k}$ we have $P(B|J_{k})=\frac{P(B\cap J_{k})}{P(J_{k})}$ . Similarly, write $P(\cdot|J_{(k,\infty)})$ to denote the restriction of the probability measure $P$ on $J_{(k,\infty)}$ . For a probability distribution $Q$ , by $\alpha_{n}(Q)$ , we denote an optimal set of $n$ -means for $Q$ . For a Borel subset $B$ of $\mathbb{R}$ , by $V(P,\alpha_{n}(Q),B)$ , it is meant the quantization error (or distortion measure) contributed by $\alpha_{n}(Q)$ on the set $B$ with respect to the probability distribution $P$ . If nothing is mentioned within a parenthesis, by $\alpha_{n}$ and $V_{n}$ , it is meant an optimal set of $n$ -means and the $n$ th quantization error with respect to the probability distribution $P$ .

Lemma 3.3.

For $k\in\mathbb{N}$ , let $E(P(\cdot|J_{k}))$ and $E(P(\cdot|J_{(k,\infty)}))$ denote the expectations of the random variables with distributions $P(\cdot|J_{k})$ and $P(\cdot|J_{(k,\infty)})$ , respectively. Then,

[TABLE]

Proof.

By the definition of the conditional expectation, we have

[TABLE]

implying $E(P(\cdot|J_{(k,\infty)}))=1-\frac{1}{2}\frac{1}{3^{k}}$ , and thus the lemma is yielded. ∎

Remark 3.4.

Lemma 3.3 implies that $\alpha_{1}(P(\cdot|J_{k}))=\{1-\frac{5}{2}\frac{1}{3^{k}}\}$ , $\alpha_{1}(P(\cdot|J_{(k,\infty)}))=\{1-\frac{1}{2}\frac{1}{3^{k}}\}$ , $E(P(\cdot|J_{k}))=\frac{1}{2}(J_{k}(0)+J_{k}(1))$ , and $E(P(\cdot|J_{(k,\infty)}))=\frac{1}{2}(J_{k+1}(1)+J_{k+2}(0))$ . $E(P(\cdot|J_{(k,\infty)}))$ can also be calculated in the following way:

[TABLE]

Proposition 3.5.

Let $k,n\in\mathbb{N}$ . Then, the set $\{1-\frac{1}{3^{k-1}}+\frac{2i-1}{2n}\frac{1}{3^{k}}:1\leq i\leq n\}$ is a unique optimal set of $n$ -means for $P(\cdot|J_{k})$ , i.e., $\alpha_{n}(P(\cdot|J_{k}))=\{1-\frac{1}{3^{k-1}}+\frac{2i-1}{2n}\frac{1}{3^{k}}:1\leq i\leq n\}$ . Moreover,

[TABLE]

Proof.

Since $P(\cdot|J_{k})$ is uniformly distributed on $J_{k}$ , the boundaries of the Voronoi regions of an optimal set of $n$ -means will divide the interval $[1-\frac{1}{3^{k-1}},1-\frac{2}{3^{k}}]$ into $n$ equal subintervals, i.e., the boundaries of the Voronoi regions are given by

[TABLE]

This implies that an optimal set of $n$ -means for $P(\cdot|J_{k})$ is unique, and it consists of the midpoints of the boundaries of the Voronoi regions, i.e., the optimal set of $n$ -means for $P(\cdot|J_{k})$ is given by $\{1-\frac{1}{3^{k-1}}+\frac{2i-1}{2n}\frac{1}{3^{k}}:1\leq i\leq n\}$ for any $n\geq 1$ . Then, the $n$ th quantization error for $P$ due to the set $\alpha_{n}(P(\cdot|J_{k}))$ on $J_{k}$ is given by

[TABLE]

which after simplification implies $V(P,\alpha_{n}(P(\cdot|J_{k})),J_{k})=\frac{1}{n^{2}}\frac{1}{12}\frac{1}{18^{k}}$ . Again, $E(P(\cdot|J_{(k,\infty)}))=1-\frac{1}{2}\frac{1}{3^{k}}$ , and so,

[TABLE]

which upon simplification yields $V(P,\alpha_{1}(P(\cdot|J_{(k,\infty)})),J_{(k,\infty)})=\frac{25}{204}\frac{1}{18^{k}}$ . Thus, the proof of the proposition is complete. ∎

In the following section, we investigate the optimal sets of $n$ -means for $n\geq 2$ . Once the optimal sets of $n$ -means are known the corresponding quantization error can easily be calculated.

4. Optimal Sets of $n$ -Means for $n\geq 2$

In this section, we first determine the optimal sets of $n$ -means for $n=2$ and $n=3$ .

Lemma 4.1.

Let $\alpha:=\{a_{1},a_{2}\}$ be an optimal set of two-means such that $a_{1}<a_{2}$ . Then, $a_{1}=\frac{1}{6}$ and $a_{2}=\frac{5}{6}$ , and the corresponding quantization error is $V_{2}=\frac{7}{612}$ .

Proof.

Consider the set of two points $\beta:=\{\frac{1}{6},\frac{5}{6}\}$ . The distortion error due to the set $\beta$ is given by

[TABLE]

Since $V_{2}$ is the quantization error for two-means, we have $V_{2}\leq\frac{7}{612}=0.0114379$ . Let $\alpha:=\{a_{1},a_{2}\}$ be an optimal set of two-means such that $a_{1}<a_{2}$ . Since the optimal quantizers are the expected values of their own Voronoi regions, we have $0<a_{1}<a_{2}<1$ . If $\frac{1}{3}\leq a_{1}$ , then

[TABLE]

which leads to a contradiction. So, we can assume that $a_{1}<\frac{1}{3}$ . If $a_{2}<\frac{2}{3}$ , then

[TABLE]

which leads to another contradiction. So, we can assume that $\frac{2}{3}<a_{2}$ . Since $0<a_{1}<\frac{1}{3}$ and $\frac{2}{3}<a_{2}<1$ , we have $\frac{1}{3}<\frac{1}{2}(a_{1}+a_{2})<\frac{2}{3}$ yielding the fact that the Voronoi region of $a_{1}$ does not contain any point from $J_{(1,\infty)}$ and the Voronoi region $a_{2}$ does not contain any point from $J_{1}$ . This implies that $a_{1}=E(X:X\in J_{1})=\frac{1}{6}$ and $a_{2}=E(X:X\in J_{(1,\infty)})=\frac{5}{6}$ , and the corresponding quantization error is $V_{2}=\frac{7}{612}$ , which is the lemma. ∎

Lemma 4.2.

Let $\alpha:=\{a_{1},a_{2},a_{3}\}$ be an optimal set of three-means such that $a_{1}<a_{2}<a_{3}$ . Then, $a_{1}=\frac{1}{6}$ , $a_{2}=\frac{13}{18}$ , $a_{3}=\frac{17}{18}$ , and the corresponding quantization error is $V_{3}=\frac{29}{5508}$ .

Proof.

Consider the set of three points $\beta:=\{\frac{1}{6},\frac{13}{18},\frac{17}{18}\}$ . The distortion error due to the set $\beta$ is given by

[TABLE]

Since $V_{3}$ is the quantization error for three-means, we have $V_{3}\leq 0.00526507$ . Let $\alpha:=\{a_{1}<a_{2}<a_{3}\}$ be an optimal set of three-means. Since the optimal quantizers are the expected values of their own Voronoi regions we have $0<a_{1}<a_{2}<a_{3}<1$ . If $\frac{1}{3}\leq a_{1}$ , then

[TABLE]

which leads to a contradiction. So, we can assume that $a_{1}<\frac{1}{3}$ , and then the Voronoi region of $a_{1}$ does not contain any point from $J_{(1,\infty)}$ . If it does, then we must have $\frac{1}{2}(a_{1}+a_{2})>\frac{2}{3}$ implying $a_{2}>\frac{4}{3}-a_{1}\geq\frac{4}{3}-\frac{1}{3}=1$ , which gives a contradiction. Thus, we see that $a_{1}\leq E(X:X\in J_{1})=\frac{1}{6}$ . Suppose that $a_{2}<\frac{1}{2}$ . The following two cases can arise:

Case 1. Voronoi region of $a_{2}$ contains points from $J_{(1,\infty)}$ .

Then, $\frac{1}{2}(a_{2}+a_{3})>\frac{2}{3}$ implying $a_{3}>\frac{4}{3}-a_{2}\geq\frac{4}{3}-\frac{1}{2}=\frac{5}{6}$ . First, assume that $\frac{5}{6}<a_{3}\leq\frac{31}{36}<J_{3}(0)$ , and then

[TABLE]

which is a contradiction. Next, assume that $\frac{31}{36}\leq a_{3}$ . Then, $\frac{1}{2}(\frac{2}{3}+\frac{31}{36})=\frac{55}{72}$ . Also, notice that $E(X:X\in J_{(2,\infty)})=\frac{17}{18}$ , and so, we have

[TABLE]

which leads to a contradiction.

Case 2. Voronoi region of $a_{2}$ does not contain any point from $J_{(1,\infty)}$ .

Then, as $E(X:X\in J_{(1,\infty)})=\frac{5}{6}$ , we have

[TABLE]

which yields a contradiction.

Thus, by Case 1 and Case 2, we can assume that $\frac{1}{2}\leq a_{2}$ . We now show that $P$ -almost surely the Voronoi region of $a_{2}$ does not contain any point from $J_{1}$ . For the sake of contradiction assume that the Voronoi region of $a_{2}$ contains points from $J_{1}$ . Then, the distortion error contributed by $a_{1}$ and $a_{2}$ on the set $J_{1}$ is given by

[TABLE]

which is minimum when $a_{1}=\frac{1}{6}$ and $a_{2}=\frac{1}{2}$ . Then, notice that $\frac{1}{2}(a_{1}+a_{2})=\frac{1}{3}$ , i.e., $P$ -almost surely the Voronoi region of $a_{2}$ does not contain any point from $J_{1}$ . This implies the fact that $a_{1}=E(X:X\in J_{1})=\frac{1}{6}$ and $\frac{2}{3}\leq a_{2}$ . Suppose that $\frac{7}{9}\leq a_{2}$ . Then,

[TABLE]

which is a contradiction. So, we can assume that $\frac{2}{3}\leq a_{2}<\frac{7}{9}$ . Then, the Voronoi region of $a_{2}$ does not contain any point from $J_{(2,\infty)}$ . If it does, then we must have $\frac{1}{2}(a_{2}+a_{3})>\frac{8}{9}$ implying $a_{3}>\frac{16}{9}-a_{2}\geq\frac{16}{9}-\frac{7}{9}=1$ , which yields a contradiction as $a_{3}<1$ . Thus, we have $a_{2}=E(X:X\in J_{2})=\frac{13}{18}$ and $a_{3}=E(X:X\in J_{(2,\infty)})=\frac{17}{18}$ . Moreover, we have seen $a_{1}=\frac{1}{6}$ . Then, by (1), the quantization error is $V_{3}=\frac{29}{5508}$ . This completes the proof of the lemma. ∎

Proposition 4.3.

Let $n\geq 2$ and let $\alpha_{n}$ be an optimal set of $n$ -means. Then,

$(i)$ * $\alpha_{n}\cap J_{1}\neq\emptyset$ and $\alpha_{n}\cap[J_{2}(0),1]\neq\emptyset$ ;*

$(ii)$ * $\alpha_{n}$ does not contain any point from the open interval $(J_{1}(1),J_{2}(0)))$ ;*

$(iii)$ * the Voronoi region of any point in $\alpha_{n}\cap J_{1}$ does not contain any point from $[J_{2}(0),1]$ , and the Voronoi region of any point in $\alpha_{n}\cap[J_{2}(0),1]$ does not contain any point from $J_{1}$ .*

Proof.

By Lemma 4.1 and Lemma 4.2, the proposition is true for $n=2,3$ . We now show that the proposition is true for all $n\geq 4$ . Consider the set of four points $\beta:=\{\frac{1}{12},\frac{1}{4},\frac{13}{18},\frac{17}{18}\}$ . The distortion error due to the set $\beta$ is given by

[TABLE]

Since $V_{n}$ is the quantization error for $n$ -means with $n\geq 4$ , we have $V_{n}\leq V_{4}\leq\frac{79}{44064}=0.00179285$ . Let $\alpha_{n}:=\{0<a_{1}<a_{2}<\cdots<a_{n}<1\}$ be an optimal set of $n$ -means. If $\frac{1}{3}<a_{1}$ , then $V_{n}\geq\int_{J_{1}}(x-\frac{1}{3})^{2}dP=\frac{1}{54}=0.0185185>V_{n},$ which is a contradiction. If $a_{n}<J_{2}(0)=\frac{2}{3}$ , then

[TABLE]

which leads to another contradiction. Thus, $\alpha_{n}\cap J_{1}\neq\emptyset$ and $\alpha_{n}\cap[J_{2}(0),1]\neq\emptyset$ , which completes the proof of $(i)$ .

To prove $(ii)$ and $(iii)$ , let $j:=\max\{i:a_{i}\leq\frac{1}{3}\}$ . Then, $a_{j}\leq\frac{1}{3}$ . We need to show that $\frac{2}{3}\leq a_{j+1}$ . For the sake of contradiction, assume that $\frac{1}{3}<a_{j+1}<\frac{2}{3}$ . If $\frac{1}{3}<a_{j+1}\leq\frac{1}{2}$ , then $\frac{1}{2}(a_{j+1}+a_{j+2})>\frac{2}{3}$ implying $a_{j+2}>\frac{4}{3}-a_{j+1}\geq\frac{4}{3}-\frac{1}{2}=\frac{5}{6}>\frac{7}{9}$ and so, $V_{n}\geq\int_{J_{2}}(x-\frac{5}{6})^{2}dP=\frac{13}{3888}=0.00334362>V_{n},$ which yields a contradiction. Next, suppose that $\frac{1}{2}\leq a_{j+1}<\frac{2}{3}$ . Then, $\frac{1}{2}(a_{j}+a_{j+1})<\frac{1}{3}$ implying $a_{j}<\frac{2}{3}-a_{j+1}\leq\frac{2}{3}-\frac{1}{2}=\frac{1}{6}$ , and so, $V_{n}\geq\int_{[\frac{1}{6},\frac{1}{3}]}(x-\frac{1}{6})^{2}dP=\frac{1}{432}=0.00231481>V_{n},$ which gives a contradiction. So, we can assume that $a_{j}\leq\frac{1}{3}<\frac{2}{3}\leq a_{j+1}$ , i.e., $\alpha_{n}$ does not contain any point from the open interval $(J_{1}(1),J_{2}(0))$ , which yields $(ii)$ .

If the Voronoi region of $a_{j}$ contains points from $[J_{2}(0),1]$ , we must have $\frac{1}{2}(a_{j}+a_{j+1})>\frac{2}{3}$ implying $a_{j+1}\geq\frac{4}{3}-a_{j}=\frac{4}{3}-\frac{1}{3}=1$ , which is a contradiction. Similarly, if the Voronoi region of any point in $\alpha_{n}\cap[J_{2}(0),1]$ contains points from $J_{1}$ , we will arrive at a contradiction. Thus, $(iii)$ is yielded, and this completes the proof of the proposition. ∎

Proposition 4.4.

Let $\alpha_{n}$ be an optimal set of $n$ -means for $n\geq 4$ . Then, $\text{card}(\alpha_{n}\cap J_{1})\geq 2$ and $\text{card}(\alpha_{n}\cap[J_{2}(0),1])\geq 2$ .

Proof.

As shown in the proof of Proposition 4.3, since $V_{n}$ is the quantization error for $n$ -means for $n\geq 4$ , we have $V_{n}\leq V_{4}\leq\frac{79}{44064}=0.00179285$ . By Proposition 4.3, we have $\text{card}(\alpha_{n}\cap J_{1})\geq 1$ and $\text{card}(\alpha_{n}\cap[J_{2}(0),1])\geq 1$ . First, we show that $\text{card}(\alpha_{n}\cap[J_{2}(0),1])\geq 2$ . Suppose that $\text{card}(\alpha_{n}\cap[J_{2}(0),1])=1$ . Then, as $E(P(\cdot|J_{(1,\infty)}))=\frac{5}{6}$ , we have

[TABLE]

which leads to a contradiction. So, we can assume that $\text{card}(\alpha_{n}\cap[S_{2}(0),1])\geq 2$ . Next, suppose that $\text{card}(\alpha_{n}\cap J_{1})=1$ . Then, as $E(P(\cdot|J_{1}))=\frac{1}{6}$ , we have

[TABLE]

which leads to another contradiction. Thus, the proof of the proposition is complete. ∎

Remark 4.5.

From Proposition 4.4, it follows that if $\alpha_{n}$ is an optimal set of four-means, then $\text{card}(\alpha_{n}\cap J_{1})=2$ and $\text{card}(\alpha_{n}\cap[J_{2}(0),1])=2$ .

Proposition 4.6.

Let $\alpha_{n}$ be an optimal set of $n$ -means for $P$ such that $\text{card}(\alpha_{n}\cap[J_{k+1}(0),1])\geq 2$ for some $k\in\mathbb{N}$ and $n\in\mathbb{N}$ . Then,

$(i)$ * $\alpha_{n}\cap J_{k+1}\neq\emptyset$ and $\alpha_{n}\cap[J_{k+2}(0),1]\neq\emptyset$ ;*

$(ii)$ * $\alpha_{n}$ does not contain any point from the open interval $(J_{k+1}(1),J_{k+2}(0))$ ;*

$(iii)$ * the Voronoi region of any point in $\alpha_{n}\cap J_{k+1}$ does not contain any point from $[J_{k+2}(0),1]$ and the Voronoi region of any point in $\alpha_{n}\cap[J_{k+2}(0),1]$ does not contain any point from $J_{k+1}$ .*

Proof.

To prove the proposition it is enough to prove it for $k=1$ , and then inductively the proposition will follow for all $k\geq 2$ . Fix $k=1$ . Suppose that $\text{card}(\alpha_{n}\cap[J_{2}(0),1])\geq 2$ . By Lemma 4.2, it is clear that the proposition is true for $n=3$ . We now prove that the proposition is true for $n\geq 4$ . Let $\alpha_{n}:=\{0<a_{1}<a_{2}<\cdots<a_{n}<1\}$ be an optimal set of $n$ -means for any $n\geq 4$ . Let $V(P,\alpha_{n}\cap[J_{2}(0),1])$ be the quantization error contributed by the set $\alpha_{n}\cap[J_{2}(0),1]$ in the region $[J_{2}(0),1]$ . Let $\beta$ be a set such that $\beta:=\{\frac{1}{12},\frac{1}{4},\frac{13}{18},\frac{17}{18}\}$ . The distortion error due to the set $\beta\cap[J_{2}(0),1]:=\{\frac{13}{18},\frac{17}{18}\}$ is given by

[TABLE]

and so, $V(P,\alpha_{n}\cap[J_{2}(0),1])\leq 0.000635439$ . Suppose that $\alpha_{n}$ does not contain any point from $J_{2}$ . Since by Proposition 4.3, the Voronoi region of any point in $\alpha_{n}\cap J_{1}$ does not contain any point from $[J_{2}(0),1]$ , we have

[TABLE]

which leads to a contradiction. So, we can assume that $\alpha_{n}\cap J_{2}\neq\emptyset$ . Suppose that $\alpha_{n}\cap[J_{3}(0),1]=\emptyset$ . Then, $a_{n}<J_{3}(0)=\frac{8}{9}$ , and so,

[TABLE]

which gives another contradiction. Therefore, $\alpha_{n}\cap[J_{3}(0),1]\neq\emptyset$ , i.e., $(i)$ is proved.

To prove $(ii)$ we proceed as follows: If $\text{card}(\alpha_{n}\cap[J_{2}(0),1])=2$ , then as Lemma 4.1, it can be proved that $\alpha_{n}\cap[J_{2}(0),1]=\{E(P(\cdot|J_{2})),E(P(\cdot|J_{(2,\infty)}))\}=\{\frac{13}{18},\frac{17}{18}\}$ . Since $\frac{13}{18}\in J_{2}$ and $J_{3}(1)=\frac{8}{9}<\frac{17}{18}$ , in this case we see that $\alpha_{n}\cap(J_{2}(1),J_{3}(0))=\emptyset$ . If $\text{card}(\alpha_{n}\cap[J_{2}(0),1])=3$ , then as Lemma 4.2, it can be proved that

[TABLE]

implying the fact that $\alpha_{n}\cap(J_{2}(1),J_{3}(0))=\emptyset$ . We now assume that $\text{card}(\alpha_{n}\cap[J_{2}(0),1])=4$ , then as mentioned in Remark 4.5, in this case, we can also prove that $\text{card}(\alpha_{n}\cap J_{2})=2$ and $\text{card}(\alpha_{n}\cap[J_{3}(0),1])=2$ , in fact, we have $\text{card}(\alpha_{n}\cap[J_{2}(0),1])=\{\frac{25}{36},\frac{3}{4},\frac{49}{54},\frac{53}{54}\}$ implying $\alpha_{n}\cap(J_{2}(1),J_{3}(0))=\emptyset$ , and the corresponding quantization error, by Proposition 3.5, is given by

[TABLE]

Next, assume that $\text{card}(\alpha_{n}\cap[J_{2}(0),1])\geq 4$ . Then, we must have $V(P,\alpha_{n}\cap[J_{2}(0),1])\leq\frac{79}{793152}=0.0000996026$ . Let $j:=\max\{i:a_{i}\leq J_{2}(1)\text{ for }1\leq i\leq n\}$ implying $a_{j}\leq\frac{7}{9}=J_{2}(1)$ . Suppose that $\frac{7}{9}<a_{j+1}<\frac{8}{9}$ . The following cases can arise:

Case 1. $\frac{7}{9}<a_{j+1}<\frac{5}{6}$ .

Then, $\frac{1}{2}(a_{j+1}+a_{j+2})>\frac{8}{9}$ implying $a_{j+2}>\frac{16}{9}-a_{j+1}\geq\frac{16}{9}-\frac{5}{6}=\frac{17}{18}>J_{3}(1)$ , and so,

[TABLE]

which is contradiction.

Case 2. $\frac{5}{6}\leq a_{j+1}<\frac{8}{9}.$

Then, $\frac{1}{2}(a_{j}+a_{j+1})<\frac{7}{9}$ implying $a_{j}<\frac{14}{9}-a_{j+1}\leq\frac{14}{9}-\frac{5}{6}=\frac{13}{18}$ , and so,

[TABLE]

which gives a contradiction.

Thus, $\alpha_{n}\cap(J_{2}(1),J_{3}(0))=\emptyset$ , which completes the proof of $(ii)$ . The proof of $(iii)$ is similar to the proof of $(iii)$ in Proposition 4.3. Hence, the proposition is yielded. ∎

Proposition 4.7.

Let $\alpha_{n}$ be an optimal set of $n$ -means for $n\geq 2$ . Then, there exists a positive integer $k:=k(n)$ such that $\alpha_{n}\cap J_{j}\neq\emptyset$ for all $1\leq j\leq k$ , and $\text{card}(\alpha_{n}\cap[J_{k+1}(0),1])=1$ . Write $\alpha_{n,j}:=\alpha_{n}\cap J_{j}$ and $n_{j}:=\text{card}(\alpha_{n,j})$ . Then, $\alpha_{n,j}=\alpha_{n_{j}}(P(\cdot|J_{j}))$ and $n=\sum_{j=1}^{k}n_{j}+1$ , with

[TABLE]

Proof.

Proposition 4.3 says that if $\alpha_{n}$ is an optimal set of $n$ -means for $n\geq 2$ , then $\alpha_{n}\cap J_{1}\neq\emptyset$ , $\alpha_{n}\cap[J_{2}(0),1]\neq\emptyset$ , and $\alpha_{n}$ does not contain any point from the open interval $(J_{1}(1),J_{2}(0))$ . Proposition 4.6 says that if $\text{card}(\alpha_{n}\cap[J_{k+1}(0),1])\geq 2$ for some $k\in\mathbb{N}$ , then $\alpha_{n}\cap J_{k+1}\neq\emptyset$ and $\alpha_{n}\cap[J_{k+2}(0),1]\neq\emptyset$ . Moreover, $\alpha_{n}$ does not take any point from the open interval $(J_{k+1}(1),J_{k+2}(0))$ . Thus, by Induction Principle, we can say that if $\alpha_{n}$ is an optimal set of $n$ -means for $n\geq 2$ , then there exists a positive integer $k$ such that $\alpha_{n}\cap J_{j}\neq\emptyset$ for all $1\leq j\leq k$ and $\text{card}(\alpha_{n}\cap[J_{k+1}(0),1])=1$ .

For a given $n\geq 2$ , write $\alpha_{n,j}:=\alpha_{n}\cap J_{j}$ and $n_{j}:=\text{card}(\alpha_{n,j})$ . Since the Voronoi region of any point in $\alpha_{n,j}$ does not contain any point from $J_{1},J_{2},\cdots,J_{j-1}$ , and $J_{(j,\infty)}$ , we must have $\alpha_{n,j}=\alpha_{n_{j}}(P(\cdot|J_{j}))$ . Again, $\alpha_{n,j}$ are disjoint for $1\leq j\leq k$ and $\alpha_{n}$ does not contain any point from the open intervals $(J_{\ell}(1),J_{\ell+1}(0))$ for $1\leq\ell\leq k$ . This implies the fact that $\alpha_{n}=\mathop{\cup}\limits_{j=1}^{k}\alpha_{n,j}\cup\{\alpha_{1}(P(\cdot|J_{(k,\infty)}))\}$ and $n=n_{1}+n_{2}+\cdots+n_{k}+1$ , and so,

[TABLE]

Thus, the proof of the proposition is complete. ∎

Definition 4.8.

Let $n_{j}$ for $1\leq j\leq k$ be the positive integers as defined in Proposition 4.7. Then, we call the sequence $\{n_{1},n_{2},\cdots,n_{k},1\}$ a canonical sequence of order $n$ or just a canonical sequence. Notice that once a canonical sequence of order $n$ is known the corresponding optimal set of $n$ -means can easily be determined and vice versa. Let $\{n_{1},n_{2},\cdots,n_{k},1\}$ be a canonical sequence and $m\in\mathbb{N}$ with $1\leq m\leq k$ . Then, the sequence $\{n_{m},n_{m+1},\cdots,n_{k},1\}$ is called a subblock of the canonical sequence $\{n_{1},n_{2},\cdots,n_{k},1\}$ .

The canonical sequence has the following property.

Lemma 4.9.

Let $\{n_{1},n_{2},\cdots,n_{k},1\}$ be a canonical sequence for $k\geq 2$ . Then, $n_{1}>n_{2}>n_{3}>\cdots>n_{k-1}\geq n_{k}=1$ .

Proof.

Let $\alpha_{n}$ be an optimal set of $n$ -means, and $\{n_{1},n_{2},\cdots,n_{k},1\}$ be the canonical sequence associated with $\alpha_{n}$ . Take any $1\leq i<k$ . Let $n_{i}+n_{i+1}=m$ . Notice that $m$ is constant if $i$ remains fixed. The distortion error in the intervals $J_{i}$ and $J_{i+1}$ is given by

[TABLE]

which is minimum if $n_{i}\approx\frac{1}{19}(18\,m-3\sqrt[3]{12}\,m+\sqrt[3]{18}\,m)$ , where for any positive real number $x$ , by $n_{i}\approx x$ it is meant that $n_{i}$ is the positive integer nearest to $x$ . Then, notice that $m=2$ implies $n_{i}=n_{i+1}=1$ , and if $m\geq 3$ then $n_{i}>\frac{m}{2}$ yielding $n_{i}>n_{i+1}$ . By Proposition 4.7, it follows that $n_{k}=1$ , and thus, the lemma is yielded. ∎

Remark 4.10.

From Table 1, we see that $\{6,3,1,1\}$ is a canonical sequence, where $n_{1}=6$ , $n_{2}=3$ and $n_{3}=1$ . Take $m=n_{1}+n_{2}=9$ , then $\frac{1}{19}(18\,m-3\sqrt[3]{12}\,m+\sqrt[3]{18}\,m)=6.51432\approx 7\neq n_{1}$ . Thus, we see that the canonical sequence $\{6,3,1,1\}$ violates the statement $n_{i}\approx\frac{1}{19}(18\,m-3\sqrt[3]{12}\,m+\sqrt[3]{18}\,m)$ as mentioned in the proof of Lemma 4.9. But, such a canonical sequence does not occur frequently, and it does not violate the statement of Lemma 4.9. Putting $i=1$ and $m=9$ in the expression (4), we see that it is minimum if $n_{1}=6$ , which is the value that occurs in the canonical sequence $\{6,3,1,1\}$ . Hence, if $m$ and $i$ are known, using the expression (4) one can exactly determine $n_{i}$ .

We now give the following example.

Example 4.11.

By Lemma 4.2, for $n=3$ , we have $\alpha_{3}=\{\frac{1}{6},\frac{13}{18},\frac{17}{18}\}$ implying $\alpha_{3,1}=\{\frac{1}{6}\}$ and $\alpha_{3,2}=\{\frac{13}{18}\}$ , and $\alpha_{1}(P(\cdot|J_{(2,\infty)}))=\{\frac{17}{18}\}$ . Here the canonical sequence is $\{1,1,1\}$ . By Proposition 4.7,

[TABLE]

and so, by Proposition 3.5, $V_{3}=\frac{1}{1^{2}}\frac{1}{12}\frac{1}{18}+\frac{1}{1^{2}}\frac{1}{12}\frac{1}{18^{2}}+\frac{25}{204}\frac{1}{18^{2}}=\frac{29}{5508}$ , which is the quantization error for three-means obtained in Lemma 4.2.

The following lemma gives some more properties of canonical sequences.

Lemma 4.12.

Let $n\in\mathbb{N}$ and $n\geq 2$ . Then, $(i)$ a canonical sequence of order $n$ is unique, and $(ii)$ each subblock of a canonical sequence is also a canonical sequence.

Proof.

Let $\{n_{1},n_{2},\cdots,n_{k},1\}$ be a canonical sequence of order $n$ . For the sake of contradiction assume that $\{n_{1}^{\prime},n_{2}^{\prime},\cdots,n_{k}^{\prime},1\}$ is another canonical sequence of order $n$ . Then, we must have indices $i_{1},i_{2},i_{3}$ such that $n_{i_{2}}\neq n_{i_{2}}^{\prime}$ , but $n_{i_{1}}+n_{i_{2}}>n_{i_{1}}^{\prime}+n_{i_{2}}^{\prime}$ and $n_{i_{2}}+n_{i_{3}}<n_{i_{2}}^{\prime}+n_{i_{3}}^{\prime}$ . Putting $m=n_{i_{1}}+n_{i_{2}}$ in the expression similar to (4), we can uniquely determine $n_{i_{1}}$ and $n_{i_{2}}$ . Similarly, putting $m=n_{i_{1}}^{\prime}+n_{i_{2}}^{\prime}$ , we can uniquely determine $n_{i_{1}}^{\prime}$ and $n_{i_{2}}^{\prime}$ . Since $n_{i_{1}}+n_{i_{2}}>n_{i_{1}}^{\prime}+n_{i_{2}}^{\prime}$ , we will have $n_{i_{1}}\geq n_{i_{1}}^{\prime}$ and $n_{i_{2}}\geq n_{i_{2}}^{\prime}$ . Similarly, $n_{i_{2}}+n_{i_{3}}<n_{i_{2}}^{\prime}+n_{i_{3}}^{\prime}$ implies $n_{i_{2}}\leq n_{i_{2}}^{\prime}$ and $n_{i_{3}}\leq n_{i_{3}}^{\prime}$ . Thus, we see that $n_{i_{2}}\geq n_{i_{2}}^{\prime}$ and $n_{i_{2}}\leq n_{i_{2}}^{\prime}$ yield a contradiction to our assumption that $n_{i_{2}}\neq n_{i_{2}}^{\prime}$ . Therefore, we can assume that the canonical sequence of order $n$ is unique, which completes the proof of $(i)$ . To prove $(ii)$ , we proceed as follows: Let $\{n_{1},n_{2},\cdots,n_{k},1\}$ be the canonical sequence of order $n$ . It is enough to show that $\{n_{2},n_{3},\cdots,n_{k},1\}$ is the canonical sequence of order $n-n_{1}$ . For the sake of contradiction, assume that $\{n_{2}^{\prime},n_{3}^{\prime},\cdots,n_{k}^{\prime},1\}$ is the canonical sequence of order $n-n_{1}$ . Since a canonical sequence of a given order is unique, if we calculate the quantization error, we must have

[TABLE]

which contradicts the fact that $\{n_{1},n_{2},\cdots,n_{k},1\}$ is the canonical sequence of order $n$ . Hence, every subblock of a canonical sequence is also a canonical sequence. ∎

Lemma 4.13.

Let $\{n_{1},n_{2},\cdots,n_{k},1\}$ be the canonical sequence of order $n$ for $n\in\mathbb{N}$ and $n\geq 2$ . Then, the canonical sequence of order $(n+1)$ will be either $\{n_{1},n_{2},\cdots,n_{i-2},n_{i-1},n_{i}+1,n_{i+1},\cdots,n_{k-1},n_{k},1\}$ for some $1\leq i\leq k-1$ , or $\{n_{1},n_{2},\cdots,n_{k},1,1\}$ .

Proof.

We prove the lemma by induction. By Lemma 4.1 and Lemma 4.2, the canonical sequences of order two and three are $\{1,1\}$ and $\{1,1,1\}$ , respectively. Again, by Remark 4.5, it can be seen that the canonical sequence of order four is $\{2,1,1\}$ . Thus, we see that the lemma is true for $n=2$ and $n=3$ . Let $N\geq 4$ be a positive integer such that the lemma is true for all positive integers $n$ , where $2\leq n\leq N-1$ . We will show that the lemma is also true for $n=N$ . Let $\{n_{1},n_{2},\cdots,n_{k},1\}$ be the canonical sequence of order $N$ implying that the optimal set $\alpha_{N}$ contains $n_{1}+n_{2}+\cdots+n_{k}$ elements from $J_{1}\cup J_{2}\cup\cdots\cup J_{k}$ and one element from $J_{(k,\infty)}$ . Then, the optimal set $\alpha_{N+1}$ contains exactly one or two elements from $J_{(k,\infty)}$ . Assume that $\alpha_{N+1}$ contains two elements from $J_{(k,\infty)}$ . Since $\{1,1\}$ is the only subblock of order two, the canonical sequence of order $(N+1)$ is $\{m_{1},m_{2},\cdots,m_{k},1,1\}$ . Again, as $m_{1}+m_{2}+\cdots+m_{k}=n_{1}+n_{2}+\cdots+n_{k}=N-1$ and the canonical sequence of order $N$ is unique, we must have $m_{1}=n_{1},\,m_{2}=n_{2},\,\cdots,\,m_{k}=n_{k}$ . Thus, in this case the lemma is true. Now, assume that $\alpha_{N+1}$ contains only one element from $J_{(k,\infty)}$ . In this case the canonical sequence of order $(N+1)$ is $\{m_{1},m_{2},\cdots,m_{k},1\}$ . We need to show that $m_{j}=n_{j}+1$ for exactly one $1\leq j\leq k$ , and $m_{j}=n_{j}$ for all other $1\leq j\leq k$ . First, assume that $m_{1}=n_{1}$ . Then, both $\{m_{2},m_{3},\cdots,m_{k},1\}$ and $\{n_{2},n_{3},\cdots,n_{k},1\}$ are canonical sequences of order $N+1-m_{1}$ and $N-n_{1}$ respectively. Since $(N+1-m_{1})-(N-n_{1})=1$ , and we assumed that the lemma is true for all positive integers $n\leq N-1$ , we have $m_{j}=n_{j}+1$ for exactly one $2\leq j\leq k$ , and $m_{j}=n_{j}$ for all other $2\leq j\leq k$ , which combined with $m_{1}=n_{1}$ yields that the lemma is true for $n=N$ . If $m_{1}=n_{1}+1$ , then as both $\{m_{2},m_{3},\cdots,m_{k},1\}$ and $\{n_{2},n_{3},\cdots,n_{k},1\}$ are canonical sequences of the same order, we have $m_{2}=n_{2},m_{3}=n_{3},\cdots,m_{k}=n_{k}$ , which combined with $m_{1}=n_{1}+1$ yields that the lemma is true for $n=N$ . We now show that $m_{1}$ can not be any integer other than $n_{1}$ or $n_{1}+1$ . For the sake of contradiction, assume that $m_{1}=n_{1}+k$ for some $k\geq 2$ . Then, $\{m_{2},\cdots,m_{k},1\}$ is the canonical sequence of order $N+1-m_{1}=N+1-(n_{1}+k)=N-n_{1}-(k-1)$ , and $\{n_{2},n_{3},\cdots,n_{k},1\}$ is the canonical sequence of order $N-n_{1}$ . Since we assumed that the lemma is true for all positive integers $n\leq N-1$ , we must have $n_{j}>m_{j}$ for at least one $2\leq j\leq k$ . Without any loss of generality, assume that $n_{2}>m_{2}$ and then $n_{2}=m_{2}+\ell$ for some $1\leq\ell\leq(k-1)$ , and so, $m_{1}+m_{2}=n_{1}+n_{2}+(k-\ell)>n_{1}+n_{2}$ , which by an expression similar to (4) implies that $m_{1}\geq n_{1}$ and $m_{2}\geq n_{2}$ yielding a contradiction. Similarly, we can show that if $m_{1}=n_{1}-k$ for any $k\in\mathbb{N}$ , a contradiction arises. Thus, the lemma is true for $n=N$ if it is true for all positive integers $n\leq N-1$ . Hence, by the principle of Mathematical Induction the proof of the lemma is complete. ∎

We are now ready to state and prove the following theorem which gives the optimal set of $(n+1)$ -means whenever the optimal set of $n$ -means is known.

Theorem 4.14.

Let $\{n_{1},n_{2},\cdots,n_{k},1\}$ be the canonical sequence for an optimal set of $n$ -means for some $n\in\mathbb{N}$ . Construct the sequence $\{A(i)\}_{i=1}^{k}$ such that

[TABLE]

for $1\leq i\leq k$ . For $1\leq i\leq k$ , set

[TABLE]

Write $V_{\min}:=\min\{\min\{V(A(j)):1\leq j\leq k\},V(\infty)\}$ . If $V_{\min}:=V(A(m))$ for some $1\leq m\leq k$ , then the sequence $\{n_{1},n_{2},\cdots,n_{m-1},n_{m}+1,n_{m+1},\cdots,n_{k},1\}$ is the canonical sequence which gives an optimal set of $(n+1)$ -means. If $V_{\min}=V(\infty)$ , then $\{n_{1},n_{2},\cdots,n_{k},1,1\}$ is the canonical sequence which gives an optimal set of $(n+1)$ -means.

Proof.

By Lemma 4.1, we see that $\{1,1\}$ is the canonical sequence for an optimal set of two-means and $\{1,1,1\}$ is the canonical sequence for an optimal set of three-means. In fact, for the canonical sequence $\{1,1\}$ , we have $V(A(1))=\frac{1}{2^{2}}\frac{1}{12}\frac{1}{18}+\frac{25}{204}\frac{1}{18}=\frac{13}{1632}$ and $V(\infty)=\frac{1}{1^{2}}\frac{1}{12}\frac{1}{18}+\frac{1}{1^{2}}\frac{1}{12}\frac{1}{18^{2}}+\frac{25}{204}\frac{1}{18^{2}}=\frac{29}{5508}$ implying $V(\infty)<V(A(1))$ . Thus, we see that the theorem is true if $k=1$ . Let us now assume that $\{n_{1},n_{2},\cdots,n_{k},1\}$ is the canonical sequence for an optimal set of $n$ -means for $n\in\mathbb{N}$ . Then, using the hypothesis of the theorem, and Lemma 4.13, the proof of the theorem is complete. ∎

Remark 4.15.

Using Theorem 4.14, we obtain Table 1 which gives a list of canonical sequences of order $n$ for $2\leq n\leq 58$ . Notice that for any positive integer $n\in\mathbb{N}$ , $n\geq 2$ , to obtain the canonical sequence of order $(n+1)$ one needs to know the canonical sequence of order $n$ . A closed formula to obtain the canonical sequence of any order $n\in\mathbb{N}$ is still not known. On the other hand, in the following section, we show that for a piecewise uniform distribution with finitely many pieces we can easily determine the optimal sets of $n$ -means and the $n$ th quantization errors for all $n\in\mathbb{N}$ , see Note 5.10.

5. Optimal Quantization for Uniform Distribution with Finitely Many Pieces

Most of the notations and basic definitions used in this section are same as they are described in Section 3. Write $J_{1}=[0,\frac{1}{3}]$ , $J_{2}=[\frac{2}{3},\frac{7}{9}]$ and $J_{3}=[\frac{8}{9},1]$ . Let $P$ be a piecewise uniform distribution on the real line with probability density function (pdf) $f(x)$ given by

[TABLE]

Lemma 5.1.

Let $E(P)$ and $V(P)$ represent the expected value and the variance of a random variable $X$ with distribution $P$ . Then, $E(P)=\frac{1}{2}$ and $V(P)=\frac{119}{972}$ .

Proof.

We have

[TABLE]

and thus the lemma is yielded. ∎

Lemma 5.2.

For $k=1,2,3$ , let $E(P(\cdot|J_{k}))$ denote the expectations of the random variable $X$ with distributions $P(\cdot|J_{k})$ . Then,

[TABLE]

Proof.

By the definition of the conditional expectation, we have

[TABLE]

we can obtain $E(P(\cdot|J_{2}))=\frac{13}{18}\text{ and }E(P(\cdot|J_{3}))=\frac{17}{18}$ . Hence, the lemma is yielded. ∎

The following proposition is similar to Proposition 3.5.

Proposition 5.3.

Let $n\in\mathbb{N}$ . Then, the set $\{\frac{2i-1}{2n}\frac{1}{3}:1\leq i\leq n\}$ is a unique optimal set of $n$ -means for $P(\cdot|J_{1})$ , i.e., $\alpha_{n}(P(\cdot|J_{1}))=\{\frac{2i-1}{2n}\frac{1}{3}:1\leq i\leq n\}$ . Similarly, $\alpha_{n}(P(\cdot|J_{2}))=\{\frac{2}{3}+\frac{2i-1}{2n}\frac{1}{9}:1\leq i\leq n\}$ and $\alpha_{n}(P(\cdot|J_{3}))=\{\frac{8}{9}+\frac{2i-1}{2n}\frac{1}{9}:1\leq i\leq n\}$ . Moreover,

[TABLE]

The following two lemmas are similar to Lemma 4.1 and Lemma 4.2.

Lemma 5.4.

Let $\alpha:=\{a_{1},a_{2}\}$ be an optimal set of two-means such that $a_{1}<a_{2}$ . Then, $a_{1}=\frac{1}{6}$ and $a_{2}=\frac{5}{6}$ , and the corresponding quantization error is $V_{2}=\frac{11}{972}$ .

Lemma 5.5.

Let $\alpha:=\{a_{1},a_{2},a_{3}\}$ be an optimal set of three-means such that $a_{1}<a_{2}<a_{3}$ . Then, $a_{1}=\frac{1}{6}$ , $a_{2}=\frac{13}{18}$ , $a_{3}=\frac{17}{18}$ , and the corresponding quantization error is $V_{3}=\frac{5}{972}$ .

Lemma 5.6.

Let $\alpha:=\{a_{1},a_{2},a_{3},a_{4}\}$ be an optimal set of four-means such that $a_{1}<a_{2}<a_{3}<a_{4}$ . Then, $a_{1}=\frac{1}{12}$ , $a_{2}=\frac{1}{4}$ , $a_{3}=\frac{13}{18}$ , $a_{4}=\frac{17}{18}$ , and the corresponding quantization error is $V_{4}=\frac{13}{7776}$ .

Proof.

Consider the set of four points $\beta:=\{\frac{1}{12},\frac{1}{4},\frac{13}{18},\frac{17}{18}\}$ . The distortion error due to the set $\beta$ is given by

[TABLE]

implying $V_{4}\leq\frac{13}{7776}=0.00167181$ .

Let $\alpha:=\{a_{1}<a_{2}<a_{3}<a_{4}\}$ be an optimal set of four-means. Since optimal quantizers are the expected values of their own Voronoi regions, we have $0<a_{1}<a_{2}<a_{3}<a_{4}<1$ . If $\frac{1}{3}\leq a_{1}$ , then

[TABLE]

which leads to a contradiction, so we can assume that $a_{1}<\frac{1}{3}$ . Suppose that $\frac{1}{3}\leq a_{2}$ . Then, the distortion error contributed by $a_{1}$ and $a_{2}$ on the set $J_{1}$ is given by

[TABLE]

which is minimum when $a_{1}=\frac{1}{9}$ , and the minimum value is $\frac{1}{486}=0.00205761>V_{4}$ , which is a contradiction. So, we can assume that $0<a_{1}<a_{2}<\frac{1}{3}$ . If $a_{4}\leq\frac{5}{6}$ , then

[TABLE]

which leads to a contradiction. So, we can assume that $\frac{5}{6}<a_{4}$ . Suppose that $a_{3}\leq\frac{1}{2}$ . Then, $\frac{1}{2}(\frac{1}{2}+\frac{5}{6})=\frac{2}{3}$ implying

[TABLE]

which is a contradiction. So, we can assume that $\frac{1}{2}<a_{3}$ . Now, if the Voronoi region of $a_{3}$ contains points from $J_{1}$ , we must have $\frac{1}{2}(a_{2}+a_{3})<\frac{1}{3}$ implying $a_{2}<\frac{2}{3}-a_{3}\leq\frac{2}{3}-\frac{1}{2}=\frac{1}{6}$ , and so,

[TABLE]

which yields a contradiction. Thus, we can assume that the Voronoi region of $a_{3}$ does not contain any point from $J_{1}$ implying $\frac{2}{3}<a_{3}$ . If $\frac{7}{9}\leq a_{3}$ , then

[TABLE]

which gives a contradiction. So, we can assume that $\frac{2}{3}<a_{3}<\frac{7}{9}$ . We now show that the Voronoi region of $a_{4}$ does not contain any point from $J_{2}$ . If it does, then

[TABLE]

which is minimum if $a_{3}=\frac{13}{18}$ . Notice that $\frac{1}{2}(\frac{13}{18}+\frac{5}{6})=\frac{7}{9}$ yielding the fact that $P$ -almost surely the Voronoi region of $a_{4}$ does not contain any point from $J_{2}$ implying $\frac{8}{9}<a_{4}$ . Thus, we see that $a_{1}=\frac{1}{12}$ , $a_{2}=\frac{1}{4}$ , $a_{3}=\frac{13}{18}$ and $a_{4}=\frac{17}{18}$ and the corresponding quantization error is given by $V_{4}=\frac{13}{7776}$ , which completes the proof of the lemma. ∎

Proposition 5.7.

Let $n\geq 3$ and let $\alpha_{n}$ be an optimal set of $n$ -means. Then,

$(i)$ * $\alpha_{n}\cap J_{i}\neq\emptyset$ for all $1\leq i\leq 3$ ;*

$(ii)$ * $\alpha_{n}$ does not contain any point from the open intervals $(\frac{1}{3},\frac{2}{3})$ and $(\frac{7}{9},\frac{8}{9})$ ;*

$(iii)$ * the Voronoi region of any point in $\alpha_{n}\cap J_{i}$ does not contain any point from $J_{j}$ for $1\leq i\neq j\leq 3$ .*

Proof.

From Lemma 5.5 and Lemma 5.6, it follows that the proposition is true for $n=3,4$ . We now prove that the proposition is true for $n\geq 5$ . Consider the set of five points $\beta:=\{\frac{1}{18},\frac{1}{6},\frac{5}{18},\frac{13}{18},\frac{17}{18}\}$ . The distortion error due to the set $\beta$ is given by

[TABLE]

implying $V_{5}\leq\frac{1}{972}=0.00102881$ . Since $V_{n}$ is the quantization error for $n$ -means for all $n\geq 5$ , we have $V_{n}\leq V_{5}\leq 0.00102881$ . Let $\alpha:=\{a_{1}<a_{2}<a_{3}<a_{4}<a_{5}\}$ be an optimal set of five-means. Since optimal quantizers are the expected values of their own Voronoi regions, we have $0<a_{1}<a_{2}<a_{3}<a_{4}<a_{5}<1$ . If $\frac{1}{3}\leq a_{1}$ , then

[TABLE]

which leads to a contradiction, so we can assume that $a_{1}<\frac{1}{3}$ , i.e., $\alpha_{n}\cap J_{1}\neq\emptyset$ . If $a_{n}\leq\frac{8}{9}$ , then

[TABLE]

which is a contradiction. So, $\frac{8}{9}<a_{n}$ yielding $\alpha_{n}\cap J_{3}\neq\emptyset$ . Let $j=\max\{i:a_{i}<\frac{2}{3}\}$ . Then, $a_{j}<\frac{2}{3}$ . We now show that $\alpha_{n}$ does not contain any point from the open interval $(\frac{1}{3},\frac{2}{3})$ . For the sake of contradiction assume that $\alpha_{n}$ contain a point from the open interval $(\frac{1}{3},\frac{2}{3})$ . The following two cases can arise:

Case 1. $\frac{1}{2}\leq a_{j}<\frac{2}{3}$ .

Then, $\frac{1}{2}(a_{j-1}+a_{j})<\frac{1}{3}$ implying $a_{j-1}<\frac{2}{3}-a_{j}\leq\frac{2}{3}-\frac{1}{2}=\frac{1}{6}$ , and so,

[TABLE]

which is a contradiction.

Case 2. $\frac{1}{3}<a_{j}\leq\frac{1}{2}$ .

Then, $\frac{1}{2}(a_{j}+a_{j+1})>\frac{2}{3}$ implying $a_{j+1}>\frac{4}{3}-a_{j}\geq\frac{4}{3}-\frac{1}{2}=\frac{5}{6}>\frac{7}{9}$ , and so,

[TABLE]

which leads to a contradiction.

By Case 1 and Case 2, we can assume that $\alpha_{n}$ does not contain any point from the open interval $(\frac{1}{3},\frac{2}{3})$ . If $\frac{7}{9}\leq a_{j+1}$ , then

[TABLE]

which is a contradiction. So, we can assume that $a_{j+1}<\frac{7}{9}$ implying $\alpha_{n}\cap J_{2}\neq\emptyset$ . If the Voronoi region of any point in $\alpha_{n}\cap J_{2}$ contains points from $J_{1}$ , then we must have $\frac{1}{2}(a_{j}+a_{j+1})<\frac{1}{3}$ implying $a_{j}<\frac{2}{3}-a_{j+1}\leq\frac{2}{3}-\frac{2}{3}=0$ , which is a contradiction. If the Voronoi region of any point in $\alpha_{n}\cap J_{1}$ contains points from $J_{2}$ , then we must have $\frac{1}{2}(a_{j}+a_{j+1})>\frac{2}{3}$ implying $a_{j+1}>\frac{4}{3}-a_{j}\geq\frac{4}{3}-\frac{1}{3}=1$ , which gives another contradiction. Hence, the Voronoi region of any point in $\alpha_{n}\cap J_{2}$ does not contain any point from $J_{1}$ , and the Voronoi region of any point in $\alpha_{n}\cap J_{1}$ does not contain any point from $J_{2}$ .

We now show that $\alpha_{n}$ does not contain any point from the open interval $(\frac{7}{9},\frac{8}{9})$ . Since $\alpha_{n}$ does not contain any point from $(\frac{1}{3},\frac{2}{3})$ and the Voronoi region of any point in $\alpha_{n}\cap J_{2}$ does not contain any point from $J_{1}$ , and the Voronoi region of any point in $\alpha_{n}\cap J_{1}$ does not contain any point from $J_{2}$ , we have

[TABLE]

Let $V(P,\alpha_{n}\cap[\frac{2}{3},1])$ be the quantization error contributed by the set $\alpha_{n}\cap[\frac{2}{3},1]$ in the region $[\frac{2}{3},1]$ . Since $\alpha_{n}\cap J_{2}\neq\emptyset$ and $\alpha_{n}\cap J_{3}\neq\emptyset$ , if $\text{card}(\alpha_{n}\cap[\frac{2}{3},1])=2$ , then $\alpha_{n}$ does not contain any point from $(\frac{7}{9},\frac{8}{9})$ . Assume that $\text{card}(\alpha_{n}\cap[\frac{2}{3},1])=3$ . Consider the set of three points $\gamma=\{\frac{25}{36},\frac{3}{4},\frac{17}{18}\}$ . Since,

[TABLE]

we have $V(P,\alpha_{n}\cap[\frac{2}{3},1])\leq\frac{5}{15552}=0.000321502.$ If $\alpha_{n}$ contains a point from $(\frac{7}{9},\frac{8}{9})$ , we must have $\frac{7}{9}<a_{n-1}<\frac{8}{9}$ . Suppose that $\frac{5}{6}\leq a_{n-1}<\frac{8}{9}$ . Then, $\frac{1}{2}(a_{n-2}+a_{n-1})<\frac{7}{9}$ implying $a_{n-2}<\frac{14}{9}-a_{n-1}\leq\frac{14}{9}-\frac{5}{6}=\frac{13}{18}$ . Now, notice that

[TABLE]

which is minimum if $a_{n-2}=\frac{13}{18}$ , and then $\frac{1}{2}(a_{n-2}+a_{n-1})\geq\frac{1}{2}(\frac{13}{18}+\frac{5}{6})=\frac{7}{9}$ , which contradicts the fact that $\frac{1}{2}(a_{n-2}+a_{n-1})<\frac{7}{9}$ . So, we can assume that $\frac{5}{6}\leq a_{n-1}<\frac{8}{9}$ is not true. Reflecting the situation with respect to the point $\frac{5}{6}$ , we can show that $\frac{7}{9}<a_{n-1}\leq\frac{5}{6}$ is also not true. Therefore, if $\text{card}(\alpha_{n}\cap[\frac{2}{3},1])=3$ , the set $\alpha_{n}$ does not contain any point from $(\frac{7}{9},\frac{8}{9})$ . Next, assume that $\text{card}(\alpha_{n}\cap[\frac{2}{3},1])=m$ for some positive integer $m\geq 4$ . Let $k=\max\{i:a_{i}<\frac{8}{9}\}$ . Then, $a_{k}<\frac{8}{9}$ . We need to show that $a_{k}\leq\frac{7}{9}$ . Consider the set of four points $\delta:=\{\frac{25}{36},\frac{3}{4},\frac{11}{12},\frac{35}{36}\}$ . Since $V(P,\alpha_{n}\cap[\frac{2}{3},1])$ is the quantization error for $m$ -means for $m\geq 4$ , we have

[TABLE]

For the sake of contradiction, assume that $\frac{7}{9}<a_{k}<\frac{8}{9}$ . The following two cases can arise:

Case A. $\frac{5}{6}\leq a_{k}<\frac{8}{9}$ .

Then, $\frac{1}{2}(a_{k-1}+a_{k})<\frac{7}{9}$ implying $a_{k-1}<\frac{14}{9}-a_{k}=\frac{14}{9}-\frac{5}{6}=\frac{13}{18}$ , and so,

[TABLE]

implying $V(P,\alpha_{n}\cap[\frac{2}{3},1])>\frac{1}{7776}=V(P,\alpha_{n}\cap[\frac{2}{3},1])$ , which is a contradiction.

Case B. $\frac{7}{9}<a_{k}\leq\frac{5}{6}$ .

Reflecting the situation in Case A with respect to the point $\frac{5}{6}$ , in this case, we can also show that a contradiction arises.

Hence, by Case A and Case B, we can assume that $\alpha_{n}$ does not contain any point from the open interval $(\frac{7}{9},\frac{8}{9})$ , i.e., $a_{k}\leq\frac{7}{9}$ . If the Voronoi region of any point in $\alpha_{n}\cap J_{3}$ contains points from $J_{2}$ , then we must have $\frac{1}{2}(a_{k}+a_{k+1})<\frac{7}{9}$ implying $a_{k}<\frac{14}{9}-a_{k+1}\leq\frac{14}{9}-\frac{8}{9}=\frac{2}{3}$ , which contradicts the fact that $\alpha_{n}\cap J_{2}\neq\emptyset$ . If the Voronoi region of any point in $\alpha_{n}\cap J_{2}$ contains points from $J_{3}$ , then we must have $\frac{1}{2}(a_{k}+a_{k+1})>\frac{8}{9}$ implying $a_{k+1}>\frac{16}{9}-a_{k}\geq\frac{16}{9}-\frac{7}{9}=1$ , which gives another contradiction. Hence, the Voronoi region of any point in $\alpha_{n}\cap J_{3}$ does not contain any point from $J_{2}$ , and the Voronoi region of any point in $\alpha_{n}\cap J_{2}$ does not contain any point from $J_{3}$ . Thus, the proof of the proposition is complete. ∎

Due to Proposition 5.7, we are now ready to state and prove the following proposition, which helps us to determine the optimal sets of $n$ -means and the $n$ th quantization errors for all $n\geq 3$ as stated in the subsequent notes.

Proposition 5.8.

Let $\alpha_{n}$ be an optimal set of $n$ -means for $n\geq 3$ . Write $\alpha_{n,j}:=\alpha_{n}\cap J_{j}$ and $n_{j}:=\text{card}(\alpha_{n,j})$ for $1\leq j\leq 3$ . Then, $\alpha_{n,j}=\alpha_{n_{j}}(P(\cdot|J_{j}))$ and $n=n_{1}+n_{2}+n_{3}$ , with

[TABLE]

Proof.

If $\alpha_{n,j}$ is not an optimal set of $n_{j}$ -means with respect to the probability distribution $P(\cdot|J_{j})$ , we must have another set $\alpha_{n,j}^{\prime}$ with cardinality $n_{j}$ which will give smaller distortion error with respect to $P(\cdot|J_{j})$ than the distortion error due to the set $\alpha_{n,j}$ . This will contradict the fact that $\alpha_{n}$ is an optimal set of $n$ -means with respect to the probability distribution $P$ . Since $\alpha_{n,j}$ are disjoint for $1\leq j\leq 3$ and $\alpha_{n}$ does not contain any point from the open intervals $(\frac{1}{3},\frac{2}{3})$ and $(\frac{7}{9},\frac{8}{9})$ , we have $\alpha_{n}=\alpha_{n,1}\cup\alpha_{n,2}\cup\alpha_{n,3}$ and $n=n_{1}+n_{2}+n_{3}$ , and so,

[TABLE]

Thus, the proof of the proposition is complete. ∎

Note 5.9.

Since $V_{n}$ represents the $n$ th quantization error for any $n\in\mathbb{N}$ , if $n_{2}+n_{3}=m$ for some positive integer $m$ , the expression $\frac{1}{3888}\Big{(}\frac{1}{n_{2}^{2}}+\frac{1}{n_{3}^{2}}\Big{)}$ is minimum if $n_{2}\approx\frac{m}{2}$ and $n_{3}\approx\frac{m}{2}$ . Thus, we see that if $m=2k$ for some positive integer $k$ , then $n_{2}=n_{3}=k$ , and if $m=2k+1$ for some positive integer $k$ , then either $(n_{2}=k+1$ and $n_{3}=k)$ or $(n_{2}=k$ and $n_{3}=k+1)$ . Moreover, writing $n_{2}=n_{3}$ , or $n_{2}=n_{3}+1$ in (3), it can be seen that $n_{1}\geq\frac{n}{2}$ for any positive integer $n\geq 4$ . Thus, we see that unlike the uniform distribution with infinitely many pieces, described in the previous section, the optimal sets of $n$ -means for the uniform distribution with finitely many pieces for all $n\in\mathbb{N}$ are not unique: if $n_{2}+n_{3}$ is an odd number then there are two different optimal sets of $n$ -means, and if $n_{2}+n_{3}$ is an even number then the optimal set of $n$ -means is unique.

In the following note we describe how to determine the optimal sets of $n$ -means and the $n$ th quantization errors for all $n\geq 3$ .

Note 5.10.

To determine an optimal set of $n$ -means for any positive integer $n\geq 3$ , we need to know $n_{1}$ , $n_{2}$ and $n_{3}$ as described in Proposition 5.8. Notice that for any $n\in\mathbb{N}$ , $n\geq 3$ , we can easily determine $n_{1}$ , $n_{2}$ and $n_{3}$ by minimizing the following function:

[TABLE]

subject to the constraint $n_{1}+n_{2}+n_{3}=n$ . Once $n_{1},n_{2}$ and $n_{3}$ are known, then by Proposition 5.3, using the following formula we can determine the corresponding optimal set of $n$ -means:

[TABLE]

For example: If $n=7$ , then $\{n_{1}=4,\,n_{2}=2,\,n_{3}=1\}$ , or $\{n_{1}=4,\,n_{2}=1,\,n_{3}=2\}$ and the corresponding quantization error is $\frac{19}{31104}$ . If $n=100$ , then $\{n_{1}=56,\,n_{2}=n_{3}=22\}$ and the corresponding quantization error is $\frac{1873}{737662464}$ , etc.

Acknowledgments We thank B. Pittel for useful facts about random quantizations and suggestions for some possible asymptotic results. We also would like to thank the referee whose questions and suggestions have been very important in improving this article, both in terms of its content and its citations.

Bibliography20

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[AW] E.F. Abaya and G.L. Wise, Some remarks on the existence of optimal quantizers , Statistics & Probability Letters, Volume 2, Issue 6, December 1984, Pages 349-351.
2[C] K-L Chung, An estimate concerning the Kolmogoroff limits distribution , Transactions of the AMS 67 (1949) 36-50.
3[PC] P. Cohort, Limit theorems for random normalized distortion , Annals Applied Probability, 14 (2004), no. 1, 118-143.
4[D] P. Deheuvels, Strong bounds for multidimensional spacings , Z Wash. verw. Gebiete 64 (1983) 411-424.
5[DR] C.P. Dettmann and M.K. Roychowdhury, Quantization for uniform distributions on equilateral triangles , Real Analysis Exchange, Vol. 42(1), 2017, pp. 149-166.
6[GG] A. Gersho and R.M. Gray, Vector quantization and signal compression , Kluwer Academy publishers: Boston, 1992.
7[GL] S. Graf and H. Luschgy, Foundations of quantization for probability distributions , Lecture Notes in Mathematics 1730, Springer, Berlin, 2000.
8[GVL] R. Graham and J. H. Van Lint, On the distribution of n θ 𝑛 𝜃 n\theta modulo 1 1 1 , Canadian Journal Math, 20 (1968) 1020-1024.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Optimal quantization for piecewise uniform distributions

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

Proposition 1.1**.**

2. The General Setting

2.1. IID Models

Definition 2.2**.**

Lemma 2.3**.**

Theorem 2.4**.**

2.5. Ergodic and Diophantine Models

3. Notation and Some Facts

Lemma 3.1**.**

Proof.

Note 3.2**.**

Lemma 3.3**.**

Proof.

Remark 3.4**.**

Proposition 3.5**.**

Proof.

4. Optimal Sets of nnn-Means for n≥2n\geq 2n≥2

Lemma 4.1**.**

Proof.

Lemma 4.2**.**

Proof.

Proposition 4.3**.**

Proof.

Proposition 4.4**.**

Proof.

Remark 4.5**.**

Proposition 4.6**.**

Proof.

Proposition 4.7**.**

Proof.

Definition 4.8**.**

Lemma 4.9**.**

Proof.

Remark 4.10**.**

Example 4.11**.**

Lemma 4.12**.**

Proof.

Lemma 4.13**.**

Proof.

Theorem 4.14**.**

Proof.

Remark 4.15**.**

5. Optimal Quantization for Uniform Distribution with Finitely Many Pieces

Lemma 5.1**.**

Proof.

Lemma 5.2**.**

Proof.

Proposition 5.3**.**

Lemma 5.4**.**

Lemma 5.5**.**

Lemma 5.6**.**

Proof.

Proposition 5.7**.**

Proof.

Proposition 5.8**.**

Proof.

Note 5.9**.**

Note 5.10**.**

Proposition 1.1.

Definition 2.2.

Lemma 2.3.

Theorem 2.4.

Lemma 3.1.

Note 3.2.

Lemma 3.3.

Remark 3.4.

Proposition 3.5.

4. Optimal Sets of $n$ -Means for $n\geq 2$

Lemma 4.1.

Lemma 4.2.

Proposition 4.3.

Proposition 4.4.

Remark 4.5.

Proposition 4.6.

Proposition 4.7.

Definition 4.8.

Lemma 4.9.

Remark 4.10.

Example 4.11.

Lemma 4.12.

Lemma 4.13.

Theorem 4.14.

Remark 4.15.

Lemma 5.1.

Lemma 5.2.

Proposition 5.3.

Lemma 5.4.

Lemma 5.5.

Lemma 5.6.

Proposition 5.7.

Proposition 5.8.

Note 5.9.

Note 5.10.