Comparison-limited Vector Quantization
Joseph Chataignon, Stefano Rini

TL;DR
This paper introduces a new vector quantization architecture constrained by the number of comparators, not output cardinality, and proposes an algorithm to optimize its configuration for minimal distortion.
Contribution
It presents the first algorithm for optimal configuration of comparator-based vector quantizers and evaluates its performance for specific source distributions.
Findings
Algorithm effectively minimizes distortion for given comparator constraints.
Performance evaluated for uniform and Gaussian sources.
New architecture suitable for low-cost, energy-efficient A2D conversion.
Abstract
A variation of the classic vector quantization problem is considered, in which the analog-to-digital (A2D) conversion is not constrained by the cardinality of the output but rather by the number of comparators available for quantization. More specifically, we consider the scenario in which a vector quantizer of dimension d is comprised of k comparators, each receiving a linear combination of the inputs and producing zero/one when this signal is above/below a threshold. Given a distribution of the inputs and a distortion criterion, the value of the linear combinations and thresholds are to be configured so as to minimize the distortion between the quantizer input and its reconstruction. This vector quantizer architecture naturally arises in many A2D conversion scenarios in which the quantizer's cost and energy consumption are severely restricted. For this novel vector quantizerâŠ
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Comparison-limited Vector Quantization
Joseph Chataignon
TĂ©lĂ©com Saint-Ătienne
Université Jean Monnet, France
ââ
Stefano Rini
Department of Electrical and Computer Engineering
National Chiao Tung University, Taiwan
Abstract
A variation of the classic vector quantization problem is considered, in which the analog-to-digital (A2D) conversion is not constrained by the cardinality of the output but rather by the number of comparators available for quantization. More specifically, we consider the scenario in which a vector quantizer of dimension is comprised of comparators, each receiving a linear combination of the inputs and producing zero/one when this signal is above/below a threshold. Given a distribution of the inputs and a distortion criterion, the value of the linear combinations and thresholds are to be configured so as to minimize the distortion between the quantizer input and its reconstruction. This vector quantizer architecture naturally arises in many A2D conversion scenarios in which the quantizerâs cost and energy consumption are severely restricted. For this novel vector quantizer architecture, we propose an algorithm to determine the optimal configuration and provide the first performance evaluation for the case of uniform and Gaussian sources.
I Introduction
Quantization, that is transforming continuous amplitude values into discrete ones so as to minimize a prescribed distortion measure subject to a output cardinality constraint, is one of the fundamental signal processing operations. As such, quantization has been studied in a number of contexts and a vast amount of results have been derived for this problem. In this paper we consider a variation of this problem, so far neglected in the literature, that is relevant in the design of low-cost, energy-efficient quantizers. Vector quantizers are typically manufactured using op-amp comparators that obtain a linear combinations of the quantizer inputs and a bias and produce a zero/one voltage whether the comparison between these two signals is positive/negative. Generally speaking, comparators are components with high power consumption and manufacturing cost: for this reason it is reasonable to evaluate the cost of a quantizer in terms of the number of comparators it requires. Let us consider the case of a two dimensional quantizer () in which each dimension is quantized with a rate of 1.5 bits-per-sample (). When quantizing a generic source, the three discretization points are separated in Voronoi regions as in Fig. 1.a so that the number of comparators required is . More generally, since every two reconstruction points are separated by an edge of the Voronoi region, a vector quantizer require comparators. This scaling of the quantizer cost is generally valid for low-rate quantizer, since in this scenario the number of neighbors of each reconstruction point is large. Given that this quantization regime is naturally associated with low cost devices, the cost scaling seems to be particularly disadvantageous. A natural question that naturally arises is whether a better scaling of the quantizer cost can be attained. To address this question, note that number of comparators equals the total number of hyperplane segments in the Voronoi regions. Accordingly, the best scaling is attained when the hyperplanes induce the largest number of partitions of the space of dimension . After some geometrical consideration, one realizes that for the case of and , the largest number of partition is indeed and corresponds to the configuration in Fig. 1.b. It is now apparent that there exists a large gap in the optimal quantizer design whether one considers a constraint on the number of points used in reconstruction or the number of comparators employed by the quantizer
Relevant Results
The idea of accounting for the number of comparators required for A2D conversion emerges from the work in [1]: here the authors investigate the capacity of a MIMO channel with output quantization constraint, i.e. the MIMO channel in which the channel output is processed at the receiver using a finite number of one-bit threshold quantizers. The channel model of [1] is relevant in mm-wave communications which allows for a large number of receive antennas, while the number of A2D conversion modules remains small due to limitation in the energy and costs of RF modules. Building on an idea of [2], a connection between combinatorial geometry and the MIMO channel with output quantization constraints of [1] is drawn in [3]. In particular, in [3] it is shown that each quantizer can be interpreted as an hyperplane bisecting the transmitter signal space; for this reason the largest rates are attained by the configuration allowing for the largest number of partitions induced by the set of hyperplanes. To the best of our knowledge, the problem of hardware limited quantization has so far only being considered in [4]. The difference between our approach and that of [4] is in focusing on vector quantization and considering the comparator limitations on the vector, rather than scalar, quantizer input.
Contribution
In the following, we define comparison limited vector quantization problem in its full generality as a variation of the classical vector quantization problem. We also provide a first algorithm for the quantizer design: although not optimal, this first approach investigates the combinatorial geometric aspects of the optimal quantization design problem. Numerical evaluations are provided for the case of an iid Gaussian and uniform source. The performance of the proposed quantizer is compared with the classic Max-Lloyd quantizer design [5, 6].
Paper Organization
The paper is organized as follows: Sec. II presents the vector quantization model, Sec. III introduces the proposed design algorithm. Sec. IV provides relevant numerical evaluations. Finally, Sec. V concludes the paper.
Notation
In the remainder of the paper, all logarithms are taken in base two. With we indicate a sequence of elements from with length . The notation indicates the substring of . The function returns a vector with values in which equals the sign of each entry of the vector . Random vector are indicated as . The set is indicated as .
II Comparison-limited vector quantizer model
We consider the source quantization scenario in Fig. 2: the source sequence with support . The source sequence is parsed in super-symbols of dimension with where is referred to as the dimension of the vector quantizer. The comparator obtains a linear combination of the each super-symbol and produces the signal as
[TABLE]
for . The value is is referred to as the resolution of the vector quantizer; , are fixed and known. The outputs of the quantizers in (1) is more conveniently expressed in vector for as
[TABLE]
where is such that the row corresponds to the vector in in (1). Similarly, has the entry equal to and . The set is referred to as the configuration of the linear combiner. The supersymbol is provided to a source encoder that produces a bit-restricted representation of the quantizersâ output as where referred to as the rate of the quantizer through the source encoding mapping
[TABLE]
The message is provided to a source decoder which produces a reconstruction of the source super-symbol , with , thorough the source decoding mapping
[TABLE]
The performance of the vector quantizer is evaluated through a distortion measure
[TABLE]
for which is assumed non-decreasing in . For a given configuration of the linear combiners , source encoder/decoder mappings /, and given a distortion measure between input and reconstruction sequence , the performance of the quantizer is evaluated as
[TABLE]
The optimal quantizer performance for the distortion , dimension , resolution and rate is obtained as
[TABLE]
where the infimization is over all linear combiner configurations and source encoder/decoder mappings.
Comparison-limited distortion-rate function
Let , then the comparison-limited distortion-rate function is defined as
[TABLE]
for in (7), that is is the minimum distortion attainable as the quantizer dimension grows to infinity while the message support grows as and the number of quantizers as . One can also show that in (8) for correspond to the classical rate distortion function [7, Ch. 13].
Remark 1
The above vector quantizer architecture formulation is rather general: in the remainder we consider only the case in which in (7), i.e. . This corresponds to the scenario in which the communication rate between the source encoder and the decoder in (2) unbounded.
Some combinatorial notions
In the following, we utilize some simple combinatorial concepts which we briefly introduce here.
A hyperplane arrangement is a finite set of affine hyperplanes in for some . A hyperplane arrangement can be expressed as where is obtained by letting each row correspond to and defining . A plane arrangement is said to be in General Position (GP) if and only if every sub-matrix of has non zero determinant [8]. An hyperplane arrangement induces a partition of the space in a number of regions.
Lemma II.1
A hyperplane arrangement of size in divides into at most
[TABLE]
regions. Hyperplanes in GP divide the space in regions.
We see from Lem. II.1 that the largest number of reconstruction points for is .
III Design algorithm
In the following section, we propose an algorithm to numerically determine the optimal linear combiner configuration and source reconstruction attaining . For simplicity we assume the case of iid sources to be reconstructed under Mean Squared Error (MSE) distortion, i.e.
[TABLE]
In this scenario, one can show that (7) simplifies as
[TABLE]
where and for in (11) are iid distributed, so that the subscript can be dropped. 111That is, every super-symbol is quantized in the same manner, regardless of .
Similarly to the classic Max-Lloyd algorithm, the optimal quantizer design for the model in Sec. II can be divided into two optimization steps to be iterated until convergence: (i) the optimization of the set of reconstruction points for a given combiner configuration and (ii) the optimization of the combiner configuration for a given set of set of reconstruction points. The optimization step (i) is rather straightforward as the reconstruction points are chosen as the centroids of the regions induced by the hyperplane arrangement in [9]. The optimization step (ii) is rather more involved and we propose two methods for this optimization: a global configuration update and a local one as described in Algorithm 1. In the global configuration update, all hyperplanes are randomly perturbed with a perturbation of variance decreasing with the iteration number. In the local configuration update, a hyperplane is selected at random and its position is optimally determined so as to minimize the MSE of the reconstruction points separated by such hyperplane. One of these two methods is selected at random at every iteration, with the probability to use the global configuration update exponentially decreasing over iterations.
The reasoning between local and global update is as follows: consider a lower triangular matrix of size and let the element in position in equal to the index of an hyperplane separating and if such reconstruction point exists and zero otherwise. 222Note that some hyperplane arrangements induce less that regions: we assume that there exists a natural numbering of the possible regions. The matrix can be though of as one among a finite number of ways in which hyperplanes separate the reconstruction points. In this view, the local update maximizes the quantizer performance in a given value of . The global update, instead, allows âhyperplanes to jump over centroidsâ, resulting in a different matrix .
A crucial step in step (ii) of the design algorithm is the evaluation of the MSE for a given hyperplane configuration and reconstruction points. Given numerical precision limitations, the MSE evaluation has to be approximated using numerical integration methods and particle filters as in Algorithm 2. More specifically, random points are generated and assigned to the corresponding reconstruction point until a minimum number of points per region is attained.
Note that an approach similar to that of Algorithm 2 is used is step (i) of the algorithm when estimating the centroid of each region induced by the hyperplane arrangement, as this evaluation also require numerical integration.
By alternating the optimization step (i) and step (ii) through the MSE approximation approach in Algorithm 2, the algorithm converges to a numerical solution. Upon multiple random restarts of the algorithm, convergence to multiple local minima is sometimes observed. These minimal values arises either from a limitation in the precision of the numerical integration or by a local minimum in the quantizer configuration.
An example of convergence to multiple local minima upon multiple random initializations is shown in Fig. 3: While the arrangement in Fig. 3.a has 6 centroids, the one in Fig 3.b has 7 but both attain similar performance in the case of a standard Gaussian distribution. From a high level perspective, though, one observes that the two configurations are rather distant and the proposed algorithm is not able to converge to the better solution in Fig 3.b starting from the configuration in Fig 3.a.
Remark 2
We conjecture that the number of possible matrices that describe how reconstruction points are obtained from the hyperplane configuration grows only polynomially with and . If this assumption were true, it would be computationally feasible to consider all possible matrices in order to avoid local minima. Unfortunately, we are currently unable to prove such conjecture.
IV Simulation results
In this section we present the quantization performance for the quantizer in Sec. II for a configuration obtained using the algorithm in Sec. III for the case of (i) standard Gaussian and (ii) unitary uniform distribution. In both instances, the performance is compared to that attainable using a classic quantizer designed using the classic Max-Lloyd algorithm with the same number of reconstruction points.
IV-A Gaussian distribution
As one could expect, the quantizer obtained performs slightly worse than an optimal quantizer obtained by Max-Lloydâs algorithm with the same number of reconstruction points. However, when comparing the number of hyperplanes used instead of the number of reconstruction points, it performs better than Max-Lloydâs quantizer, as we can see on Fig. 4. The ratio of this algorithmâs quantizerâs MSE over Max-Lloydâs quantizerâs is 0.64 for a configuration with 5 hyperplanes.
A result that may be surprising is that the hyperplanes do not necessarily form the maximum number of regions. Interestingly, they often arrange to make less regions with similar probabilities whether the starting configuration has many regions or not, which turns out to perform relatively well and sometimes better than configurations with more regions.
IV-B Uniform distribution
The results obtained with a uniform distribution are similar to the ones obtained with a Gaussian distribution. Again, the quantizer obtained by the algorithm performs worse than Max-Lloydâs quantizer with the same number of reconstruction points, but better than Max-Lloydâs quantizer with the same number of hyperplanes. This is shown in Fig. 4. The ratio of the MSE of the quantizer obtained over Max-Lloydâs quantizerâs is 0.72 for a configuration with 5 hyperplanes.
Because of the square shape of the distribution support, the hyperplanes are even more likely to form rectangular regions than with the Gaussian distribution, as Fig. 5 illustrates.
V Conclusion
In this paper, a novel paradigm for vector quantization is considered. In this paradigm, the performance of the quantizer is not limited by the number of reconstruction point as in the classic Max-Lloyd quantizer but rather by the number of comparisons necessary to determine the quantizer output. In particular, we consider the case in which a vector quantizer is comprised comparators with receive a linear combination of the quantizer input plus a constant and output the sign of received signal. Given a distribution of the quantizer input, and a distortion measure between source and reconstruction, we consider the problem of optimally determining the linear combination and constant coefficient so that the distortion between source and reconstruction is minimized. We propose a first algorithm for this optimization problem and apply such algorithm to the case of mean squared error distortion and Gaussian and uniform iid sources. In both cases, the performance is compared to that of the Max-Lloyd quantizer.
A number of research directions remain open from this new vector quantizer architecture. In particular, we are investigating the optimal performance attainable in the limit of infinitely long vector quantizer in which the number of available comparators and bits available to represent the quantizer inputs both grow to infinity at a given constant ratio . This limit should result in a rather interesting generalization of the classic distortion-rate function.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Rini, L. Barletta, Y. C. Eldar, and E. Erkip, âA general framework for mimo receivers with low-resolution quantization,â in 2017 IEEE Information Theory Workshop (ITW) . IEEE, 2017, pp. 599â603.
- 2[2] J. Mo and R. W. Heath, âHigh snr capacity of millimeter wave mimo systems with one-bit quantization,â in 2014 Information Theory and Applications Workshop (ITA) . IEEE, 2014, pp. 1â5.
- 3[3] A. Khalili, S. Rini, L. Barletta, E. Erkip, and Y. C. Eldar, âOn mimo channel capacity with output quantization constraints,â in 2018 IEEE International Symposium on Information Theory (ISIT) . IEEE, 2018, pp. 1355â1359.
- 4[4] N. Shlezinger, Y. C. Eldar, and M. R. Rodrigues, âHardware-limited task-based quantization,â ar Xiv preprint ar Xiv:1807.08305 , 2018.
- 5[5] S. Max, âQuantizing for minimum distortion,â IEEE transactions on information theory , vol. 6, no. 1, pp. 7â12, 1960.
- 6[6] S. Lloyd, âLeast squares quantization in pcm,â IEEE transactions on information theory , vol. 28, no. 2, pp. 129â137, 1982.
- 7[7] T. M. Cover and J. A. Thomas, Elements of information theory . John Wiley & Sons, 2012.
- 8[8] T. Cover, âGeometrical and statistical properties of systems of linear inequalities with applications in pattern recognition,â IEEE Trans. Electron. Comput. , vol. EC-14, no.3, pp. 326â334, Jun. 1965.
