On the smallest singular value of multivariate Vandermonde matrices with clustered nodes
Stefan Kunis, Dominik Nagel

TL;DR
This paper establishes bounds on the smallest singular value of multivariate Vandermonde matrices with clustered nodes on the complex unit circle, revealing how clustering affects matrix stability and invertibility.
Contribution
It provides the first comprehensive bounds for the smallest singular value in multivariate Vandermonde matrices with clustered nodes, including sharp constants and geometric dependencies.
Findings
Lower bounds for singular values with clustered nodes
Upper bounds that match the univariate case
Dependence of singular value on cluster geometry
Abstract
We prove lower bounds for the smallest singular value of rectangular, multivariate Vandermonde matrices with nodes on the complex unit circle. The nodes are ``off the grid'', groups of nodes cluster, and the studied minimal singular value is bounded below by the product of inverted distances of a node to all other nodes in the specific cluster. By providing also upper bounds for the smallest singular value, this completely settles the univariate case and pairs of nodes in the multivariate case, both including reasonable sharp constants. For larger clusters, we show that the smallest singular value depends also on the geometric configuration within a cluster.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the smallest singular value of multivariate Vandermonde matrices with clustered nodes
Stefan Kunis111 Osnabrück University, Institute of Mathematics {skunis,dnagel}@uos.de Dominik Nagel111 Osnabrück University, Institute of Mathematics {skunis,dnagel}@uos.de
Abstract
We prove lower bounds for the smallest singular value of rectangular, multivariate Vandermonde matrices with nodes on the complex unit circle. The nodes are “off the grid”, groups of nodes cluster, and the studied minimal singular value is bounded below by the product of inverted distances of a node to all other nodes in the specific cluster. By providing also upper bounds for the smallest singular value, this completely settles the univariate case and pairs of nodes in the multivariate case, both including reasonable sharp constants. For larger clusters, we show that the smallest singular value depends also on the geometric configuration within a cluster.
Key words and phrases: Vandermonde matrix, colliding nodes, cluster, condition number, restricted Fourier matrices, frequency analysis, super resolution.
2010 AMS Mathematics Subject Classification : 15A18, 65T40, 42A15.
1 Introduction
Vandermonde matrices appear e.g. in the stability analysis of super-resolution algorithms like Prony’s method [6, 12], the matrix pencil method [11, 19], the ESPRIT algorithm [22, 21, 16], and the MUSIC algorithm [23, 17]. We are interested in the case of nodes on the complex unit circle and a large polynomial degree, the matrices then generalize the classical discrete Fourier matrices to non-equispaced nodes and the involved polynomial degree is also called bandwidth. If all nodes are well-separated, bounds on the condition number are established for example in [5, 14, 19, 2, 8] for the univariate case and in [14, 12] at least partially for the multivariate case. For node sets with distances of which some are below the inverse bandwidth, the behavior of the smallest singular value is subject of current research. The seminal paper [9] coined the term (inverse) super-resolution factor for the product of the bandwidth and the minimal separation of the nodes. For nodes on a grid, the results in [9, 7] imply that the smallest singular value is at most as small as the inverse super-resolution factor raised to the power of if the super-resolution factor is greater than . More recently, the practically relevant situation of clustered nodes was studied in [20, 1, 15, 3, 13, 4, 8]. In the univariate case and for different setups, all of these refinements are able to replace the exponent by the smaller number , where denotes the number of nodes that are in the largest cluster of nodes.
Here, we refine the proof technique developed in the second version of [15] and extend it to arbitrary dimensions. In contrast to [15], we only use the information on the biggest cluster size, minimal separation between clusters and a the worst case cluster complexity (or a minimal separation between nodes) instead of taking the structure of each cluster into account. In summary, our contributions are:
- i)
a refined analysis of the univariate case, cf. [15], eliminating the dependence on the total number of nodes, weakening a technical condition on the cluster separation, and improving constants, mainly by
- (a)
a geometric packing argument and 2. (b)
an improved estimate of Dirichlet kernels and Lagrange-like basis functions; 2. ii)
a multidimensional generalization, including
- (a)
a quantitative estimate for the well-separated case, 2. (b)
a sharp estimate for pair clusters in higher dimensions, and 3. (c)
an example on the limitations for larger clusters in higher dimensions.
The outline of this paper is as follows: Section 2 fixes notation, states the problem and gives some definitions. Furthermore, we generalize the so-called robust duality lemma from the second version of [15] to the multivariate case. In Section 3, we introduce some auxiliary functions which are used to prove our main results in Section 4. Additionally, we give examples with specified parameters, present implications of our result for special node configurations like pair clusters and well separated nodes, and compare them with existing results. In Section 5, upper bounds on the smallest singular value for the univariate case and for pair clusters in higher dimensions are presented - these match the lower bounds from our main theorem. Furthermore, an example of a triple cluster in two dimensions is given which shows that geometric properties beyond pairwise distances are needed for understanding the multivariate case. Finally, in Section 6 numerical experiments are presented that support statements and comparisons from preceding sections.
2 Preliminaries
Definition 2.1** (Setting).**
We denote the component of a vector by bracketing and setting a subscript, unless its components are defined differently. Let be a given dimension and a set of points. The corresponding nodes are given by , where denotes the complex unit circle. We identify the unit interval with the unit circle and therefore, we do not make a difference between the and and call them both nodes. Throughout the paper, denotes the euclidean norm for vectors and also its induced norm for matrices, and analogously the max-norm. Let be a degree, set and assume . We are interested in the multivariate, rectangular Vandermonde matrix
[TABLE]
and its smallest singular value
[TABLE]
The following lemma builds the core of the proof technique developed in the second version of [15] which we adapt here to the multivariate setting.
Lemma 2.2** (Robust duality, cf. [15, v2, Prop. 2]).**
Let and be given as in Definition 2.1. If for any unit norm vector , and , there exists a trigonometric polynomial of max-degree at most , i.e.,
[TABLE]
such that for each , then
[TABLE]
Proof.
Define the discrete measure . Its Fourier coefficients are given by
[TABLE]
On the one hand, using the interpolation property of and the lower triangular inequality of the absolute value, we have
[TABLE]
and on the other hand, using , the Cauchy–Schwarz inequality and Parseval’s identity, we have
[TABLE]
∎
The advantage of that lemma is, if is a unit norm vector such that , it suffices to construct a function almost interpolating the values of in order to provide a lower bound.
The following definition is similar to the ‘localized clumps’ model from the second version of [15]. We did some renaming in terms of [3] and use a normalization by rather than .
Definition 2.3** (Geometry of nodes).**
The wrap-around distance between two nodes is defined by
[TABLE]
- i)
A subset of nodes is called cluster if it is contained in a cube of length . For two clusters , we define
[TABLE] 2. ii)
The node set is called a clustered node configuration with clusters if it can be written as
[TABLE]
where the are clusters and the (normalized) minimal cluster separation fulfills
[TABLE]
We order and denote the cardinality of the biggest cluster by . In passing, we note that the node set is called well separated with normalized separation if . Moreover, we define the partitioning of into shells by
[TABLE] 3. iii)
The cluster complexity is defined by
[TABLE]
and finally, we define the (normalized) minimal separation
[TABLE]
Remark 2.4**.**
(Geometry of nodes) With the notation of Definition 2.3, we note that
- i)
the inequality for implies
[TABLE]
A higher order approximation is given in the second version of **[15]**,
[TABLE] 2. ii)
A necessary condition on for the existence of a clustered node configuration with clusters is , with equality if and only if all nodes are equispaced. Similarly, if , then equispaced cluster with arbitrary node configuration within each cluster exist. Moreover, the cluster separation needs to scale at least linearly in the biggest cluster size . If on the contrary, and for simplicity of the argument, then let nodes form a cluster (length at most ) and place one node as far as possible away. With fixed , we have and therefore, is equivalent to and thus . On the other hand, already implies .
Finally note, that the packing argument in **[14, Lemma 4.5]** yields
[TABLE]
see also Figure 2.1 (left). 3. iii)
The cluster complexity can be upper bounded by the normalized minimal separation as follows. For , we have and equality for and . Refined for , it is easy to see that the cluster complexity is maximized by an equispaced cluster with nodes separated by and taking distances from the center node, see Figure 2.1 (right). By logarithmic convexity, direct calculation, and Stirling’s approximation, we thus have
[TABLE]
and similarly
[TABLE]
where the maximum is taken over all clustered node configurations with normalized minimal separation and the largest cluster containing nodes.
3 Auxiliary functions
Lemma 3.1** (Modified Dirichlet kernel).**
For the modified Dirichlet kernel is defined as ,
[TABLE]
We define the powers of the multivariate modified Dirichlet kernel by
[TABLE]
If and , then
- i)
, 2. ii)
, 3. iii)
, 4. iv)
.
Proof.
First, note that
[TABLE]
and the point-wise bound follows in the univariate case by
[TABLE]
Second, in the multivariate case, setting , and using i) and the univariate bound yield
[TABLE]
Note that and therefore, the third assertion is proven for the univariate case as follows. For , Parseval’s identity and direct calculation show
[TABLE]
For and , the estimates in [18, Proof of Lemma 2] yield
[TABLE]
and thus, for , the remaining estimate
[TABLE]
In order to prove the fourth assertion, note and hence, i) and ii) yield
[TABLE]
and . Moreover, direct computation gives
[TABLE]
and with and Parseval’s identity also
[TABLE]
Finally, let be the coordinate with , then the Cauchy–Schwarz inequality, iii), and the above yield (noting that and omitting the second last line if )
[TABLE]
∎
Lemma 3.2** (Lagrange-like basis with decay, cf. [15, v2, Lem. 3]).**
Let , be even, be a clustered node configuration and . Then for each with for some , there exists an , such that
- i)
* for all ,* 2. ii)
* for all , and * 3. iii)
**
Proof.
We define the functions as product of a Lagrange polynomial within the cluster and a fast decaying function . Let be fixed and define the -th Lagrange polynomial within its cluster , , as follows. If , we simply set . Otherwise, let
[TABLE]
denote the ’blow-up-factor’ and for let be the index of the vector component that realizes the distance . We immediately have and thus
[TABLE]
fulfills and by inequality (2.2)
[TABLE]
We proceed by setting
[TABLE]
and . Lemma 3.1 yields ,
[TABLE]
Finally, we define . This yields since , , and
[TABLE]
Moreover, this function has the desired property for all and the two remaining inequalities follow by and by using , also . ∎
Remark 3.3**.**
Following the calculation in the second version of [15, p. 36], we can improve (3.2) to
[TABLE]
with for and where the first two bracketed terms are due to (2.3) and (3.1), respectively.
4 A lower bound on the smallest singular value
In this chapter we work out the multivariate extension of Theorem 1 in the second version of [15]. Additionally, we do an improvement on the cluster separation condition, especially make the cluster separation independent on the number of nodes . Furthermore, we provide an improved estimate on the smallest singular value only depending on the biggest cluster size and not on the number of all nodes .
Theorem 4.1**.**
Let , even, be a clustered node configuration and . Moreover, assume the cluster separation
[TABLE]
Then the smallest singular value of the Vandermonde matrix from Definition 2.1 is bounded by
[TABLE]
Proof.
We apply the robust duality from Lemma 2.2, with , , such that , and
[TABLE]
where the Lagrange-like basis functions are given by Lemma 3.2. The interpolation errors fulfill , where has the entries
[TABLE]
We proceed by , where the second inequality follows from monotonicity of the norm [10, p. 520] (or [13, Lem. A.2]) and Lemma 3.2 i) and ii) with
[TABLE]
Since is symmetric, we bound the spectral norm by the maximum norm and apply the packing argument from Definition 2.3 ii) and Remark 2.4 ii) to get
[TABLE]
Condition (4.1) and imply . To bound the -norm of , let . The triangle inequality, symmetry of , Lemma 3.2 iii), and the packing argument from Definition 2.3 ii) and Remark 2.4 ii) yield
[TABLE]
Condition (4.1) implies
[TABLE]
and Lemma 2.2 finally the result. ∎
For , Remark 2.4 iii) applied to the cluster complexity yields:
Corollary 4.2**.**
*Under the assumptions of Theorem 4.1 with and , we have *
[TABLE]
Example 4.3** (Specific choices of ).**
Specific choices of in Theorem 4.1 yield the following:
- i)
By choosing or for being odd or even, respectively, and some additional cosmetics, the condition
[TABLE]
implies our best estimate
[TABLE] 2. ii)
By choosing and noting that for even and , our weakest condition
[TABLE]
implies
[TABLE]
Example 4.4** (Well separated nodes).**
For , we have and the nodes are well separated. For , Example 4.3 i) yields
[TABLE]
Note that Theorem 4.1 always assumes . This compares to [12], where already suffices for . Using Theorem 4.1 directly for and , then implies
[TABLE]
This compares to [2, 19], which provide under the same condition on , .
Example 4.5** (Pair clusters).**
For , we have and at most pairs of nodes form clusters. Example 4.3 i) with
[TABLE]
implies
[TABLE]
Example 4.6** (Pair clusters, comparison).**
Let and . We apply Theorem 4.1 with , and , respectively. These results are compared to [15, Thm. 1] (with minor corrections and where we simplified slightly ), to [13, Thm. 4.9] (under the additional assumption that all nodes inside the clusters have the same separation), and to [8, Cor. 4.2] (with a minor improvement for and in estimating [8, Eq. (8)]).
These comparisons are also presented in section 6.1 numerically.
Example 4.7**.**
(Comparison with [15]) Let and , then and imply
[TABLE]
where we set for the moment. This can be compared to [15, Thm. 1], where after minor corrections and imply
[TABLE]
According to Remark 3.3, depending on and . In total, we have a stronger condition on but our condition on is always weaker and our estimate on is sharper if . This comparison is also presented in Figure 6.2.
Example 4.8** (All nodes cluster).**
Let and . If , then Corollary 4.2 implies
[TABLE]
This compares to [3], where the restriction of the nodes to an interval of length and imply
[TABLE]
but, note that the definition of a clustered node configuration in [3] is in principle more flexible than ours.
5 Upper bounds and beyond distances
In this section, we show that the obtained lower bounds are sharp for and for , respectively. Moreover, we show for and nodes in generic position (e.g. not all nodes on a line for ), that the cluster complexity is not the optimal quantity to understand the situation here. If we assume a normalized minimal separation between nodes, then the estimate in Theorem 4.1 is sub-optimal with respect to the order in we can derive from the cluster complexity. For this, we give an example with one cluster of three nodes in the bivariate case, .
Example 5.1** (Matching bounds for ).**
*In the second version of [15, Prop. 3] an upper bound on is given for a clustered node configuration that consists of at least one cluster of equispaced, separated nodes. After further simplifications, we can derive
[TABLE]
Together with Remark 3.3 and Corollary 4.2 this assures that for sufficiently large , small and , there exist constants such that
[TABLE]
where the minimum is taken over all clustered node configurations with at least one cluster of nodes with normalized minimal separation .
This was also expected in [3, Rem. 3.5]. In particular note that the lower bound in Remark 2.4 iii) implies that the term in Theorem 4.1 cannot be avoided.
Example 5.2** (Matching bounds for ).**
Let , , and be such that , then the Cauchy interlacing theorem for eigenvalues ([10, Thm. 4.3.28]) and the binomial formula yield
[TABLE]
Together with Example 4.5, there exists constants such that
[TABLE]
where the minimum is taken over all clustered node configurations with at least one cluster of nodes with normalized minimal separation .
Example 5.3** (Triple cluster).**
Let , , and with
[TABLE]
and hence, the normalized minimal separation of is . Then the smallest singular value of the corresponding Vandermonde matrix fulfills
[TABLE]
and this can be seen as follows: Define the real matrix
[TABLE]
note that , and use the explicit formula
[TABLE]
The univariate Taylor expansion
[TABLE]
and yield
[TABLE]
and similar expressions for the other quantities. By direct computation, we see that the entries in the matrix on the right hand side of (5.1) are all and for example the diagonal entry is independent of and . Hence, the norm of that matrix is . Similarly, the denominator of (5.1) can be computed to be
[TABLE]
Finally, this yields
[TABLE]
and together with Theorem 4.1 the assertion.
6 Numerics
In this section we do four different experiments. Two of them are to compare our results with recent results from the literature () and two of them underline our results from section 5. All computations were carried out using MATLAB R2017b.
6.1 Pair clusters
In order to compare our results (see Example 4.6) with the ones from the second version of [15, Thm. 2], [8] and [13], we set , ([13] requires odd without further considerations), and take and nodes, respectively. The node configuration consists of uniformly placed clusters (at , ) that include two nodes each. The first cluster realizes the minimal separation , which is picked logarithmically uniformly at random from , i.e. and . The further clusters have nodes and for , where (parameter in [13, Thm. 4.7]) is picked uniformly randomly. Afterwards, we compute , where is the Vandermonde matrix defined in (2.1) corresponding to the node configuration. For each we pick instances of and the results are presented in Figure 6.1.
This clustered node configuration fulfills independently of . Theorem 4.1 and the second version of [15, Thm. 1] make restrictions to through the condition on . Therefore, choosing logarithmically as in Example 4.3 ii) requires , which is below for both and . The second version of [15] and our result with requires respectively
[TABLE]
[TABLE]
6.2 Bigger clusters
In this numerical example, we confirm our results in the univariate case, , for bigger clusters of size and compare them with the results from the second version of [15]. The polynomial degree is set to . We build up clustered node configurations with and clusters placed equispaced at for . At each cluster position the cluster nodes start to lie equispaced with separation , where (the right hand interval bound is due to cluster lying in an interval of length ) is picked logarithmically uniformly at random. Afterwards the smallest singular value is computed. This procedure is repeated 100 times for the respective choice of and the results are presented in Figure 6.2. We use the statements from Example 4.7 with . Since , the worst case cluster complexity is estimated by (2.4) to .
6.3 Pair clusters, bivariate
We present a numerical experiment in order to confirm our results for the higher dimensional case and set . Randomized clustered node configurations of , and clusters with nodes each are constructed for different minimal separations , respectively. Then the smallest singular values of the corresponding Vandermonde matrices are computed and the upper bound from Example 5.2 and the lower bound from Example 4.5 are shown. The results are presented in Figure 6.3. The node configurations are built as follows. The minimal separation is picked logarithmically uniformly at random in . We set so that the condition on in Example 4.5 together with the left interval bound for make (value shown in the figure) necessary. Two clusters realize the cluster separation and for the remaining clusters, we pick a position in uniformly at random. The positions are fixed for the respective choice of and do not change for different . Each cluster is constructed randomly by setting one node to and one to either or for some . Then we scale the clusters by and move them to their respective cluster positions.
6.4 One triple cluster, bivariate
Here we present a numerical experiment for Example 5.3. We set , and build the triple cluster consisting of the nodes , and (see Figure 6.4, left), where is picked logarithmically uniformly at random. Then we compute the smallest singular value of the Vandermonde matrix . This is repeated times for and each. The results are presented in Figure 6.4 (right). We see the asymptotic behavior with respect to calculated in Example 5.3. Furthermore, for nodes not being antipodal, we observe that the asymptotic starts when becomes smaller than the displacement parameter .
Acknowledgements. The authors thank Jürgen Prestin for discussions on Lemma 3.1 and gratefully acknowledge support by the projects DFG-GK1916 and DFG-SFB944.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Akinshin, G. Goldman, and Y. Yomdin. Geometry of error amplification in solving Prony system with near-colliding nodes. Ar Xiv e-prints , Jan. 2017.
- 2[2] C. Aubel and H. Bölcskei. Vandermonde matrices with nodes in the unit disk and the large sieve. Appl. Comput. Harmon. Anal. , 47(1):53–86, jul 2019.
- 3[3] D. Batenkov, L. Demanet, G. Goldman, and Y. Yomdin. Stability of partial Fourier matrices with clustered nodes. Ar Xiv e-prints , Sept. 2018.
- 4[4] D. Batenkov, G. Goldman, and Y. Yomdin. Super-resolution of near-colliding point sources. ar Xiv e-prints , page ar Xiv:1904.09186, Apr 2019.
- 5[5] F. S. V. Bazán. Conditioning of rectangular Vandermonde matrices with nodes in the unit disk. SIAM J. Matrix Anal. Appl. , 21(2):679–693, 1999.
- 6[6] B. G. R. de Prony. Essai éxperimental et analytique: sur les lois de la dilatabilité de fluides élastique et sur celles de la force expansive de la vapeur de l’alkool, a différentes températures. Journal de l’école polytechnique , 1(22):24–76, 1795.
- 7[7] L. Demanet and N. Nguyen. The recoverability limit for superresolution via sparsity. Ar Xiv e-prints , Feb. 2015.
- 8[8] B. Diederichs. Well-Posedness of Sparse Frequency Estimation. ar Xiv e-prints , May 2019.
