On the smallest singular value of multivariate Vandermonde matrices with clustered nodes

Stefan Kunis; Dominik Nagel

arXiv:1907.07119·math.NA·July 17, 2019


TL;DR

This paper establishes bounds on the smallest singular value of multivariate Vandermonde matrices with clustered nodes on the complex unit circle, revealing how clustering affects matrix stability and invertibility.

Contribution

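It provides lower bounds for the smallest singular value of multivariate Vandermonde matrices with clustered nodes in terms of the biggest cluster size, the cluster separation, and the cluster complexity, together with matching upper bounds for the univariate case and for pair clusters.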

Findings

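01 Lower bounds for the smallest singular value with clustered nodes

02 Upper bounds that match the lower bounds in the univariate case and for pair clusters

03 Dependence of the smallest singular value on the geometric configuration within larger clusters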

Abstract

We prove lower bounds for the smallest singular value of rectangular, multivariate Vandermonde matrices with nodes on the complex unit circle. The nodes are “off the grid”, groups of nodes cluster, and the studied minimal singular value is bounded below by the product of inverted distances of a node to all other nodes in the specific cluster. By providing also upper bounds for the smallest singular value, this completely settles the univariate case and pairs of nodes in the multivariate case, both including reasonable sharp constants. For larger clusters, we show that the smallest singular value depends also on the geometric configuration within a cluster.

Tables

Ref.	Thm. 4.1, $β = 2$	Thm. 4.1, $β = 2 ⌈ \frac{1}{2} \log ( \frac{π^{2}}{3} λ^{λ} \mathcal{C} ) ⌉$	Thm. 4.1, $β = 2 λ$	[15]	[13]	[8]
$ρ \geq$	$\frac{17.3}{\sqrt{τ}}$	$34.9 + 6.6 | \log τ |$	$\frac{29}{\sqrt[4]{τ}}$	$\frac{42.5 \sqrt[4]{M}}{\sqrt[4]{τ}}$	$25 (\log ⌊ \frac{M}{4} ⌋ + 1)$	$3$
$σ_{\min} (𝐀) \geq$	$\frac{τ \sqrt{N}}{7.2}$	$\frac{τ \sqrt{N}}{6 \sqrt[4]{5.3 + | \log τ |}}$	$\frac{τ \sqrt{N}}{8.6}$	$\frac{τ \sqrt{N}}{4.5 \sqrt{M}}$	$\frac{τ \sqrt{N}}{3.5}$	$\frac{τ \sqrt{N}}{1.7}$

Equations

$$\boldsymbol{A}:=\boldsymbol{A}(\Omega,n):=\left(\boldsymbol{z}_{j}^{\boldsymbol{\alpha}}\right)_{j=1,\dots,M;\ \boldsymbol{\alpha}\in\mathbb{N}_{0}^{d},\ \left\|\boldsymbol{\alpha}\right\|_{\infty}\leq n}\in\mathbb{C}^{M\times N^{d}},$$

$$\sigma_{\min}(\boldsymbol{A}):=\min_{\boldsymbol{v}\in\mathbb{C}^{M},\ \left\|\boldsymbol{v}\right\|_{2}=1}\left\|\boldsymbol{A}^{*}\boldsymbol{v}\right\|_{2}.$$

$$f\in\mathcal{P}(n):=\left\{g\colon\mathbb{T}^{d}\to\mathbb{C}\ :\ g(\boldsymbol{t})=\sum_{\boldsymbol{\alpha}\in\mathbb{N}_{0}^{d},\ \left\|\boldsymbol{\alpha}\right\|_{\infty}\leq n}\hat{g}_{\boldsymbol{\alpha}}\,\textnormal{e}^{2\pi\textnormal{i}\boldsymbol{\alpha}\cdot\boldsymbol{t}},\ \hat{g}_{\boldsymbol{\alpha}}\in\mathbb{C}\right\},$$

$$\left\|\boldsymbol{A}^{*}\boldsymbol{v}\right\|_{2}\geq\left(1-\left\|\boldsymbol{\epsilon}\right\|_{2}\right)\left\|f\right\|_{L^{2}(\mathbb{T}^{d})}^{-1}.$$

$$\hat{\mu}(\boldsymbol{\alpha})=\int_{\mathbb{T}^{d}}\textnormal{e}^{-2\pi\textnormal{i}\boldsymbol{t}\cdot\boldsymbol{\alpha}}\,\mathrm{d}\mu(\boldsymbol{t})=\sum_{j=1}^{M}v_{j}\boldsymbol{z}_{j}^{-\boldsymbol{\alpha}}=\left(\boldsymbol{A}^{*}\boldsymbol{v}\right)_{\boldsymbol{\alpha}},\quad\boldsymbol{\alpha}\in\mathbb{N}_{0}^{d},\ \left\|\boldsymbol{\alpha}\right\|_{\infty}\leq n.$$

$$\left|\int_{\mathbb{T}^{d}}\overline{f}\,\mathrm{d}\mu\right|=\left|\sum_{j=1}^{M}\overline{f(\boldsymbol{t}_{j})}\,v_{j}\right|=\left|\left\|\boldsymbol{v}\right\|_{2}^{2}+\sum_{j=1}^{M}\overline{\epsilon_{j}}\,v_{j}\right|\geq\left\|\boldsymbol{v}\right\|_{2}^{2}-\left\|\boldsymbol{v}\right\|_{2}\left\|\boldsymbol{\epsilon}\right\|_{2}=\left(1-\left\|\boldsymbol{\epsilon}\right\|_{2}\right),$$

$$\left|\int_{\mathbb{T}^{d}}\overline{f}\,\mathrm{d}\mu\right|=\left|\sum_{\boldsymbol{\alpha}\in\mathbb{N}_{0}^{d},\ \left\|\boldsymbol{\alpha}\right\|_{\infty}\leq n}\overline{\hat{f}_{\boldsymbol{\alpha}}}\,\hat{\mu}(\boldsymbol{\alpha})\right|\leq\left\|\hat{f}\right\|_{2}\left\|\boldsymbol{A}^{*}\boldsymbol{v}\right\|_{2}=\left\|f\right\|_{L^{2}(\mathbb{T}^{d})}\left\|\boldsymbol{A}^{*}\boldsymbol{v}\right\|_{2}.$$

$$\left|\boldsymbol{t}-\boldsymbol{t}^{\prime}\right|_{\mathbb{T}^{d}}:=\min_{\boldsymbol{r}\in\mathbb{Z}^{d}}\left\|\boldsymbol{t}-\boldsymbol{t}^{\prime}+\boldsymbol{r}\right\|_{\infty}.$$

$$\operatorname{dist}(\Lambda^{\prime},\Lambda^{\prime\prime}):=\min\left\{\left|\boldsymbol{t}^{\prime}-\boldsymbol{t}^{\prime\prime}\right|_{\mathbb{T}^{d}}\colon\boldsymbol{t}^{\prime}\in\Lambda^{\prime},\ \boldsymbol{t}^{\prime\prime}\in\Lambda^{\prime\prime}\right\}.$$

$$\Omega=\bigcup_{l=1}^{L}\Lambda_{l},$$

$$\rho:=N\min_{1\leq l<l^{\prime}\leq L}\operatorname{dist}(\Lambda_{l},\Lambda_{l^{\prime}})>1.$$

$$J_{m}:=J_{m}(\Omega,N,\rho):=\left\{\boldsymbol{t}\in\mathbb{T}^{d}\colon m\rho\leq N\left|\boldsymbol{t}\right|_{\mathbb{T}^{d}}<(m+1)\rho\right\},\quad m=0,\dots,\left\lfloor\frac{N}{2\rho}\right\rfloor.$$

$$\mathcal{C}:=\mathcal{C}(\Omega,N):=\max_{j=1,\dots,M}\ \prod_{\boldsymbol{t}^{\prime}\in\Omega\colon 0<\left|\boldsymbol{t}_{j}-\boldsymbol{t}^{\prime}\right|_{\mathbb{T}^{d}}\leq 1/N}\frac{1}{N\left|\boldsymbol{t}_{j}-\boldsymbol{t}^{\prime}\right|_{\mathbb{T}^{d}}}$$

$$\tau:=N\min_{1\leq j<j^{\prime}\leq M}\left|\boldsymbol{t}_{j}-\boldsymbol{t}_{j^{\prime}}\right|_{\mathbb{T}^{d}}.$$

$$\left|z-z^{\prime}\right|=2\sin\left(\pi\left|t-t^{\prime}\right|_{\mathbb{T}}\right)\geq 4\left|t-t^{\prime}\right|_{\mathbb{T}},\quad z:=\textnormal{e}^{2\pi\textnormal{i}t},\ z^{\prime}:=\textnormal{e}^{2\pi\textnormal{i}t^{\prime}}\in\mathbb{T}.$$

$$\left|z-z^{\prime}\right|\geq 2\pi\left(1-\frac{\pi^{2}\left|t-t^{\prime}\right|_{\mathbb{T}}^{2}}{3}\right)^{1/2}\left|t-t^{\prime}\right|_{\mathbb{T}}.$$

$$\left|J_{m}\cap\Omega\right|\leq 2^{d}(2^{d}-1)m^{d-1}\lambda,$$

$$\mathcal{C}\leq\frac{1}{\tau^{\lambda-1}}\left(\left\lfloor\frac{\lambda-1}{2}\right\rfloor!\cdot\left\lceil\frac{\lambda-1}{2}\right\rceil!\right)^{-1}\leq\frac{1}{\tau^{\lambda-1}\,\Gamma\left(\frac{\lambda+1}{2}\right)^{2}}\leq\frac{(2\textnormal{e})^{\lambda-1}}{\lambda^{\lambda}}\cdot\frac{1}{\tau^{\lambda-1}}$$

$$\max_{\Omega}\mathcal{C}=\frac{1}{\tau^{\lambda-1}}\left(\left\lfloor\frac{\lambda-1}{2}\right\rfloor!\cdot\left\lceil\frac{\lambda-1}{2}\right\rceil!\right)^{-1}\geq\frac{(2\textnormal{e})^{\lambda-1}}{\lambda^{\lambda+1}}\cdot\frac{1}{\tau^{\lambda-1}},$$

$$d_{m}(t):=\frac{1}{m+1}\sum_{k=0}^{m}\textnormal{e}^{2\pi\textnormal{i}kt}=\begin{cases}1,&t=0,\\ \frac{\textnormal{e}^{\pi\textnormal{i}mt}}{m+1}\cdot\frac{\sin(\pi(m+1)t)}{\sin(\pi t)},&t\neq 0.\end{cases}$$

$$d_{m}^{\beta}\colon[0,1)^{d}\to\mathbb{C},\quad d_{m}^{\beta}(\boldsymbol{t}):=\left(\prod_{\ell=1}^{d}d_{m}\left((\boldsymbol{t})_{\ell}\right)\right)^{\beta}\in\mathcal{P}(m\beta).$$

$$\left|d_{m}(\boldsymbol{t})\right|\leq\left(\frac{1}{m+1}\sum_{k=0}^{m}\left|\textnormal{e}^{2\pi\textnormal{i}kt}\right|\right)^{d}=1=d_{m}(\boldsymbol{0})$$

$$\left|d_{m}(t)\right|=\frac{1}{m+1}\left|\frac{\sin(\pi(m+1)t)}{\sin(\pi t)}\right|\leq\frac{1}{(m+1)\left|\sin(\pi t)\right|}\leq\frac{1}{2(m+1)\left|t\right|_{\mathbb{T}}}.$$

$$\left|d_{m}(\boldsymbol{t})\right|=\prod_{\ell=1}^{d}\left|d_{m}\left((\boldsymbol{t})_{\ell}\right)\right|\leq\left|d_{m}(t)\right|\leq\frac{1}{2(m+1)\left|t\right|_{\mathbb{T}}}=\frac{1}{2(m+1)\left|\boldsymbol{t}\right|_{\mathbb{T}^{d}}}.$$

$$\left\|d_{m}\right\|_{L^{2}(\mathbb{T})}^{2}$$

$$\left\|d_{m}^{3}\right\|_{L^{2}(\mathbb{T})}^{2}$$

$$\frac{\sin\pi x}{(m+1)\sin\frac{\pi}{m+1}x}$$

$$\begin{aligned}\left\|d_{m}^{\beta}\right\|_{L^{2}(\mathbb{T})}^{2}&=\frac{2}{m+1}\cdot\frac{1}{(m+1)^{2\beta}}\left[\int_{0}^{1}\left(\frac{\sin(\pi x)}{\sin\left(\frac{\pi}{m+1}x\right)}\right)^{2\beta}\mathrm{d}x+\int_{1}^{\frac{m+1}{2}}\left(\frac{\sin(\pi x)}{\sin\left(\frac{\pi}{m+1}x\right)}\right)^{2\beta}\mathrm{d}x\right]\\ &\leq\frac{2}{m+1}\left[\int_{0}^{\infty}\exp\left(-\frac{8\beta\pi^{2}x^{2}}{25}\right)\mathrm{d}x+\int_{1}^{\infty}\left(\frac{1}{2x}\right)^{2\beta}\mathrm{d}x\right]\\ &=\frac{1}{m+1}\left[\frac{5}{2\sqrt{2\pi}}\cdot\frac{1}{\sqrt{\beta}}+\frac{2^{1-2\beta}}{2\beta-1}\right]\leq\frac{1}{m+1}\cdot\frac{1}{\sqrt{\beta}}.\end{aligned}$$

$$\left|d_{m}(t^{\prime})\right|\left|d_{m}(t^{\prime}-t)\right|\leq\frac{1}{2(m+1)}\min\left\{\frac{1}{\left|t^{\prime}-t\right|_{\mathbb{T}}},\frac{1}{\left|t^{\prime}\right|_{\mathbb{T}}}\right\}\leq\frac{1}{(m+1)\left|t\right|_{\mathbb{T}}}$$


Full text

Stefan Kunis, Dominik Nagel (Osnabrück University, Institute of Mathematics, {skunis,dnagel}@uos.de)


Key words and phrases: Vandermonde matrix, colliding nodes, cluster, condition number, restricted Fourier matrices, frequency analysis, super resolution.

2010 AMS Mathematics Subject Classification : 15A18, 65T40, 42A15.

1 Introduction

Vandermonde matrices appear, for example, in the stability analysis of super-resolution algorithms like Prony’s method [6, 12], the matrix pencil method [11, 19], the ESPRIT algorithm [22, 21, 16], and the MUSIC algorithm [23, 17]. We are interested in the case of nodes on the complex unit circle and a large polynomial degree. The matrices then generalize the classical discrete Fourier matrices to non-equispaced nodes, and the involved polynomial degree is also called bandwidth. If all nodes are well-separated, bounds on the condition number are established for example in [5, 14, 19, 2, 8] for the univariate case and in [14, 12] at least partially for the multivariate case. For node sets in which some distances are below the inverse bandwidth, the behavior of the smallest singular value is a subject of current research. The seminal paper [9] coined the term (inverse) super-resolution factor for the product of the bandwidth and the minimal separation of the nodes. For $M$ nodes on a grid, the results in [9, 7] imply that the smallest singular value is at most as small as the inverse super-resolution factor raised to the power of $M-1$ if the super-resolution factor is greater than $1$. More recently, the practically relevant situation of clustered nodes was studied in [20, 1, 15, 3, 13, 4, 8]. In the univariate case and for different setups, all of these refinements are able to replace the exponent $M-1$ by the smaller number $m-1$, where $m$ denotes the number of nodes in the largest cluster of nodes.

Here, we refine the proof technique developed in the second version of [15] and extend it to arbitrary dimensions. In contrast to [15], we only use the information on the biggest cluster size, the minimal separation between clusters, and the worst-case cluster complexity (or a minimal separation between nodes) instead of taking the structure of each cluster into account. In summary, our contributions are:

i) a refined analysis of the univariate case, cf. [15], eliminating the dependence on the total number of nodes, weakening a technical condition on the cluster separation, and improving constants, mainly by

(a) a geometric packing argument and

(b) an improved estimate of Dirichlet kernels and Lagrange-like basis functions;

ii) a multidimensional generalization, including

(a) a quantitative estimate for the well-separated case,

(b) a sharp estimate for pair clusters in higher dimensions, and

(c) an example on the limitations for larger clusters in higher dimensions.

The outline of this paper is as follows: Section 2 fixes notation, states the problem and gives some definitions. Furthermore, we generalize the so-called robust duality lemma from the second version of [15] to the multivariate case. In Section 3, we introduce some auxiliary functions which are used to prove our main results in Section 4. Additionally, we give examples with specified parameters, present implications of our result for special node configurations like pair clusters and well-separated nodes, and compare them with existing results. In Section 5, upper bounds on the smallest singular value for the univariate case and for pair clusters in higher dimensions are presented; these match the lower bounds from our main theorem. Furthermore, an example of a triple cluster in two dimensions is given which shows that geometric properties beyond pairwise distances are needed for understanding the multivariate case. Finally, in Section 6, numerical experiments are presented that support statements and comparisons from preceding sections.

2 Preliminaries

Definition 2.1 (Setting).

We denote the component of a vector by bracketing and setting a subscript, unless its components are defined differently. Let $d\in\mathbb{N}$ be a given dimension and $\Omega:=\left\{\mathbf{\boldsymbol{t}}_{1},\dots,\mathbf{\boldsymbol{t}}_{M}\right\}\subset\left[0,1\right)^{d}$ a set of points. The corresponding nodes are given by $\mathbf{\boldsymbol{z}}_{j}:=\textnormal{e}^{2\pi\textnormal{i}{\mathbf{\boldsymbol{t}}_{j}}}\in\mathbb{T}^{d},j=1,\dots,M$ , where $\mathbb{T}:=\left\{z\in\mathbb{C}\colon\left|z\right|=1\right\}$ denotes the complex unit circle. We identify the unit interval with the unit circle and therefore, we do not make a difference between the $\mathbf{\boldsymbol{t}}_{j}$ and $\mathbf{\boldsymbol{z}}_{j}$ and call them both nodes. Throughout the paper, $\left\|\cdot\right\|_{2}$ denotes the euclidean norm for vectors and also its induced norm for matrices, and analogously $\left\|\cdot\right\|_{\infty}$ the max-norm. Let $n\in\mathbb{N}$ be a degree, set $N:=n+1$ and assume $M<N^{d}$ . We are interested in the multivariate, rectangular Vandermonde matrix

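$$\boldsymbol{A}:=\boldsymbol{A}(\Omega,n):=\left(\boldsymbol{z}_{j}^{\boldsymbol{\alpha}}\right)_{j=1,\dots,M;\ \boldsymbol{\alpha}\in\mathbb{N}_{0}^{d},\ \left\|\boldsymbol{\alpha}\right\|_{\infty}\leq n}\in\mathbb{C}^{M\times N^{d}},$$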

and its smallest singular value

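$$\sigma_{\min}(\boldsymbol{A}):=\min_{\boldsymbol{v}\in\mathbb{C}^{M},\ \left\|\boldsymbol{v}\right\|_{2}=1}\left\|\boldsymbol{A}^{*}\boldsymbol{v}\right\|_{2}.$$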
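As a small numerical illustration of Definition 2.1, the following sketch assembles the rectangular Vandermonde matrix for a handful of nodes and reads off its smallest singular value from an SVD. The helper names `vandermonde` and `sigma_min`, the node positions, and the degree are assumptions chosen for illustration only; they do not appear in the paper.

```python
import itertools
import numpy as np

def vandermonde(nodes, n):
    """Multivariate Vandermonde matrix with entries z_j^alpha.

    nodes: array of shape (M, d) with entries in [0, 1)  (hypothetical input layout)
    n:     max-degree, so alpha ranges over {0, ..., n}^d and N = n + 1
    """
    nodes = np.asarray(nodes, dtype=float)
    d = nodes.shape[1]
    # All multi-indices alpha with max-norm at most n, shape (N^d, d).
    alphas = np.array(list(itertools.product(range(n + 1), repeat=d)))
    # z_j^alpha = exp(2 pi i alpha . t_j)
    return np.exp(2j * np.pi * nodes @ alphas.T)

def sigma_min(A):
    """Smallest singular value; for M <= N^d this equals min ||A^* v||_2 over unit v."""
    return np.linalg.svd(A, compute_uv=False)[-1]

# Three nodes in d = 2: two of them close together, one far away (illustrative values).
nodes = np.array([[0.10, 0.20], [0.12, 0.20], [0.60, 0.70]])
A = vandermonde(nodes, n=7)
s = sigma_min(A)
print(A.shape, s)
```

Since every row of the matrix has Euclidean norm $N^{d/2}$, the smallest singular value can never exceed $N^{d/2}$; the close pair of nodes in this sketch pushes it well below that value.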

The following lemma builds the core of the proof technique developed in the second version of [15] which we adapt here to the multivariate setting.

Lemma 2.2 (Robust duality, cf. [15, v2, Prop. 2]).

Let $\Omega$ and $\mathbf{\boldsymbol{A}}$ be given as in Definition 2.1. If for any unit norm vector $\mathbf{\boldsymbol{v}}=(v_{1},\dots,v_{M})^{\top}\in\mathbb{C}^{M}$, and $\mathbf{\boldsymbol{\epsilon}}=(\epsilon_{1},\dots,\epsilon_{M})^{\top}\in\mathbb{C}^{M}$ with $\left\|\mathbf{\boldsymbol{\epsilon}}\right\|_{2}\leq 1$, there exists a trigonometric polynomial of max-degree at most $n\in\mathbb{N}$, i.e.,

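$$f\in\mathcal{P}(n):=\left\{g\colon\mathbb{T}^{d}\to\mathbb{C}\ :\ g(\boldsymbol{t})=\sum_{\boldsymbol{\alpha}\in\mathbb{N}_{0}^{d},\ \left\|\boldsymbol{\alpha}\right\|_{\infty}\leq n}\hat{g}_{\boldsymbol{\alpha}}\,\textnormal{e}^{2\pi\textnormal{i}\boldsymbol{\alpha}\cdot\boldsymbol{t}},\ \hat{g}_{\boldsymbol{\alpha}}\in\mathbb{C}\right\},$$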

such that $f(\mathbf{\boldsymbol{t}}_{j})=v_{j}+\epsilon_{j}$ for each $j=1,\dots,M$ , then

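$$\left\|\boldsymbol{A}^{*}\boldsymbol{v}\right\|_{2}\geq\left(1-\left\|\boldsymbol{\epsilon}\right\|_{2}\right)\left\|f\right\|_{L^{2}(\mathbb{T}^{d})}^{-1}.$$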

Proof.

Define the discrete measure $\mu:=\sum_{j=1}^{M}v_{j}\delta_{\mathbf{\boldsymbol{t}}_{j}}$ . Its Fourier coefficients are given by

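$$\hat{\mu}(\boldsymbol{\alpha})=\int_{\mathbb{T}^{d}}\textnormal{e}^{-2\pi\textnormal{i}\boldsymbol{t}\cdot\boldsymbol{\alpha}}\,\mathrm{d}\mu(\boldsymbol{t})=\sum_{j=1}^{M}v_{j}\boldsymbol{z}_{j}^{-\boldsymbol{\alpha}}=\left(\boldsymbol{A}^{*}\boldsymbol{v}\right)_{\boldsymbol{\alpha}},\quad\boldsymbol{\alpha}\in\mathbb{N}_{0}^{d},\ \left\|\boldsymbol{\alpha}\right\|_{\infty}\leq n.$$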

On the one hand, using the interpolation property of $f$ and the reverse triangle inequality for the absolute value, we have

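$$\left|\int_{\mathbb{T}^{d}}\overline{f}\,\mathrm{d}\mu\right|=\left|\sum_{j=1}^{M}\overline{f(\boldsymbol{t}_{j})}\,v_{j}\right|=\left|\left\|\boldsymbol{v}\right\|_{2}^{2}+\sum_{j=1}^{M}\overline{\epsilon_{j}}\,v_{j}\right|\geq\left\|\boldsymbol{v}\right\|_{2}^{2}-\left\|\boldsymbol{v}\right\|_{2}\left\|\boldsymbol{\epsilon}\right\|_{2}=\left(1-\left\|\boldsymbol{\epsilon}\right\|_{2}\right),$$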

and on the other hand, using $f\in\mathcal{P}(n)$ , the Cauchy–Schwarz inequality and Parseval’s identity, we have

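$$\left|\int_{\mathbb{T}^{d}}\overline{f}\,\mathrm{d}\mu\right|=\left|\sum_{\boldsymbol{\alpha}\in\mathbb{N}_{0}^{d},\ \left\|\boldsymbol{\alpha}\right\|_{\infty}\leq n}\overline{\hat{f}_{\boldsymbol{\alpha}}}\,\hat{\mu}(\boldsymbol{\alpha})\right|\leq\left\|\hat{f}\right\|_{2}\left\|\boldsymbol{A}^{*}\boldsymbol{v}\right\|_{2}=\left\|f\right\|_{L^{2}(\mathbb{T}^{d})}\left\|\boldsymbol{A}^{*}\boldsymbol{v}\right\|_{2}.$$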

∎
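The duality statement can be checked numerically in its simplest form, with exact interpolation ($\mathbf{\boldsymbol{\epsilon}}=\mathbf{\boldsymbol{0}}$) and $d=1$. The sketch below is an illustration under assumptions, not the construction used later in the paper: it takes the minimum-norm interpolating polynomial, whose coefficient vector is obtained from the pseudoinverse, and uses Parseval's identity to identify $\left\|f\right\|_{L^{2}(\mathbb{T})}$ with the Euclidean norm of the coefficients. Node positions, the degree, and all variable names are chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

n = 15
N = n + 1
t = np.array([0.10, 0.13, 0.50, 0.80])               # illustrative nodes in [0, 1)
A = np.exp(2j * np.pi * np.outer(t, np.arange(N)))   # M x N Vandermonde matrix

# Random unit norm vector v in C^M.
v = rng.standard_normal(t.size) + 1j * rng.standard_normal(t.size)
v /= np.linalg.norm(v)

# Coefficients of the minimum-norm f in P(n) with f(t_j) = v_j, i.e. A g_hat = v.
g_hat = np.linalg.pinv(A) @ v
interp_err = np.linalg.norm(A @ g_hat - v)           # epsilon = 0 up to rounding

lhs = np.linalg.norm(A.conj().T @ v)                 # ||A^* v||_2
rhs = 1.0 / np.linalg.norm(g_hat)                    # (1 - 0) / ||f||_{L^2}, by Parseval
print(interp_err, lhs, rhs)
```

In this exact-interpolation setting the inequality reduces to the Cauchy–Schwarz step in the proof above, and it becomes an equality when $\mathbf{\boldsymbol{v}}$ is a left singular vector belonging to the smallest singular value.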

The advantage of this lemma is the following: if $\mathbf{\boldsymbol{v}}\in\mathbb{C}^{M}$ is a unit norm vector such that $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})=\left\|\mathbf{\boldsymbol{A}}^{*}\mathbf{\boldsymbol{v}}\right\|_{2}$, it suffices to construct a function $f\in\mathcal{P}(n)$ that almost interpolates the values of $\mathbf{\boldsymbol{v}}$ in order to obtain a lower bound.

The following definition is similar to the ‘localized clumps’ model from the second version of [15]. We did some renaming in terms of [3] and use a normalization by $N$ rather than $n$ .

Definition 2.3 (Geometry of nodes).

The wrap-around distance between two nodes $\mathbf{\boldsymbol{t}},\mathbf{\boldsymbol{t}}^{\prime}\in[0,1)^{d}$ is defined by

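$$\left|\boldsymbol{t}-\boldsymbol{t}^{\prime}\right|_{\mathbb{T}^{d}}:=\min_{\boldsymbol{r}\in\mathbb{Z}^{d}}\left\|\boldsymbol{t}-\boldsymbol{t}^{\prime}+\boldsymbol{r}\right\|_{\infty}.$$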

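i) A subset of nodes is called a cluster if it is contained in a cube of side length $1/N$. For two clusters $\Lambda^{\prime},\Lambda^{\prime\prime}\subset\Omega$, we define

$$\operatorname{dist}(\Lambda^{\prime},\Lambda^{\prime\prime}):=\min\left\{\left|\boldsymbol{t}^{\prime}-\boldsymbol{t}^{\prime\prime}\right|_{\mathbb{T}^{d}}\colon\boldsymbol{t}^{\prime}\in\Lambda^{\prime},\ \boldsymbol{t}^{\prime\prime}\in\Lambda^{\prime\prime}\right\}.$$

ii) The node set $\Omega$ is called a clustered node configuration with $L$ clusters if it can be written as

$$\Omega=\bigcup_{l=1}^{L}\Lambda_{l},$$

where the $\Lambda_{l}$ are clusters and the (normalized) minimal cluster separation $\rho$ fulfills

$$\rho:=N\min_{1\leq l<l^{\prime}\leq L}\operatorname{dist}(\Lambda_{l},\Lambda_{l^{\prime}})>1.$$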

We order $\left|\Lambda_{1}\right|\geq\left|\Lambda_{2}\right|\geq\ldots\geq\left|\Lambda_{L}\right|$ and denote the cardinality of the biggest cluster by $\lambda:=\left|\Lambda_{1}\right|$ . In passing, we note that the node set $\Omega$ is called well separated with normalized separation $\rho$ if $\lambda=1$ . Moreover, we define the partitioning of $\mathbb{T}^{d}$ into shells by

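$$J_{m}:=J_{m}(\Omega,N,\rho):=\left\{\boldsymbol{t}\in\mathbb{T}^{d}\colon m\rho\leq N\left|\boldsymbol{t}\right|_{\mathbb{T}^{d}}<(m+1)\rho\right\},\quad m=0,\dots,\left\lfloor\frac{N}{2\rho}\right\rfloor.$$

iii) The cluster complexity is defined by

$$\mathcal{C}:=\mathcal{C}(\Omega,N):=\max_{j=1,\dots,M}\ \prod_{\boldsymbol{t}^{\prime}\in\Omega\colon 0<\left|\boldsymbol{t}_{j}-\boldsymbol{t}^{\prime}\right|_{\mathbb{T}^{d}}\leq 1/N}\frac{1}{N\left|\boldsymbol{t}_{j}-\boldsymbol{t}^{\prime}\right|_{\mathbb{T}^{d}}}$$

and finally, we define the (normalized) minimal separation

$$\tau:=N\min_{1\leq j<j^{\prime}\leq M}\left|\boldsymbol{t}_{j}-\boldsymbol{t}_{j^{\prime}}\right|_{\mathbb{T}^{d}}.$$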
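The quantities of Definition 2.3 are straightforward to evaluate for a concrete node set. The following sketch, with hypothetical helper names `wrap_dist`, `min_separation`, and `cluster_complexity`, computes the wrap-around distance, the normalized minimal separation $\tau$, and the cluster complexity $\mathcal{C}$; the example node set (a single pair cluster plus one far node in $d=1$) is an assumption made for illustration.

```python
import numpy as np

def wrap_dist(t, s):
    """Wrap-around max-norm distance between two points of [0, 1)^d."""
    diff = np.abs(np.asarray(t, dtype=float) - np.asarray(s, dtype=float))
    # Minimizing over integer shifts acts componentwise: choose |x| or 1 - |x|.
    return float(np.max(np.minimum(diff, 1.0 - diff)))

def min_separation(nodes, N):
    """Normalized minimal separation tau = N * min_{j < k} |t_j - t_k|."""
    M = len(nodes)
    return N * min(wrap_dist(nodes[j], nodes[k])
                   for j in range(M) for k in range(j + 1, M))

def cluster_complexity(nodes, N):
    """Max over j of the product of 1 / (N |t_j - t'|) over neighbours within 1/N."""
    best = 0.0
    for j in range(len(nodes)):
        prod = 1.0
        for k in range(len(nodes)):
            dist = wrap_dist(nodes[j], nodes[k])
            if 0.0 < dist <= 1.0 / N:
                prod *= 1.0 / (N * dist)
        best = max(best, prod)
    return best

N = 16
# One pair cluster with spacing 0.5 / N and one well separated node (d = 1).
nodes = np.array([[0.10], [0.10 + 0.5 / N], [0.60]])
tau = min_separation(nodes, N)
C = cluster_complexity(nodes, N)
print(tau, C)
```

For this pair cluster the sketch returns $\tau=0.5$ and $\mathcal{C}=2=1/\tau$ up to rounding, in line with the equality case $\lambda=2$ noted in Remark 2.4 iii).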

Remark 2.4.

(Geometry of nodes) With the notation of Definition 2.3, we note that

i) the inequality $\sin(x)\geq 2x/\pi$ for $0\leq x\leq\pi/2$ implies

$$\left|z-z^{\prime}\right|=2\sin\left(\pi\left|t-t^{\prime}\right|_{\mathbb{T}}\right)\geq 4\left|t-t^{\prime}\right|_{\mathbb{T}},\quad z:=\textnormal{e}^{2\pi\textnormal{i}t},\ z^{\prime}:=\textnormal{e}^{2\pi\textnormal{i}t^{\prime}}\in\mathbb{T}.$$

A higher order approximation is given in the second version of [15],

$$\left|z-z^{\prime}\right|\geq 2\pi\left(1-\frac{\pi^{2}\left|t-t^{\prime}\right|_{\mathbb{T}}^{2}}{3}\right)^{1/2}\left|t-t^{\prime}\right|_{\mathbb{T}}.$$

ii) A necessary condition on $N$ for the existence of a clustered node configuration with $L$ clusters is $L\rho^{d}\leq N^{d}$, with equality if and only if all nodes are equispaced. Similarly, if $N\geq L^{1/d}(\rho+1)$, then equispaced clusters with arbitrary node configuration within each cluster exist. Moreover, the cluster separation $\rho$ needs to scale at least linearly in the biggest cluster size $\lambda$. Indeed, if on the contrary $\rho<\lambda/4$, and $d=1$ for simplicity of the argument, then let $\lambda$ nodes form a cluster (length at most $1/N$) and place one node as far away as possible. With fixed $N$, we have $2\rho=N-1$ and therefore, $\rho<\lambda/4$ is equivalent to $N\leq M/2+1/2$ and thus $\operatorname{rank}(\mathbf{\boldsymbol{A}})\leq N<M$. On the other hand, $\rho>\lambda$ already implies $N\geq L(\rho+1)\geq M$.

Finally, note that the packing argument in [14, Lemma 4.5] yields

$$\left|J_{m}\cap\Omega\right|\leq 2^{d}(2^{d}-1)m^{d-1}\lambda,$$

see also Figure 2.1 (left).

iii) The cluster complexity can be upper bounded by the normalized minimal separation as follows. For $d\in\mathbb{N}$, we have $\mathcal{C}\leq\tau^{1-\lambda}$ and equality for $\lambda=1$ and $\lambda=2$. Refined for $d=1$, it is easy to see that the cluster complexity is maximized by an equispaced cluster with $\lambda$ nodes separated by $\tau/N$ and taking distances from the center node, see Figure 2.1 (right). By logarithmic convexity, direct calculation, and Stirling’s approximation, we thus have

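$$\mathcal{C}\leq\frac{1}{\tau^{\lambda-1}}\left(\left\lfloor\frac{\lambda-1}{2}\right\rfloor!\cdot\left\lceil\frac{\lambda-1}{2}\right\rceil!\right)^{-1}\leq\frac{1}{\tau^{\lambda-1}\,\Gamma\left(\frac{\lambda+1}{2}\right)^{2}}\leq\frac{(2\textnormal{e})^{\lambda-1}}{\lambda^{\lambda}}\cdot\frac{1}{\tau^{\lambda-1}}$$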

and similarly

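$$\max_{\Omega}\mathcal{C}=\frac{1}{\tau^{\lambda-1}}\left(\left\lfloor\frac{\lambda-1}{2}\right\rfloor!\cdot\left\lceil\frac{\lambda-1}{2}\right\rceil!\right)^{-1}\geq\frac{(2\textnormal{e})^{\lambda-1}}{\lambda^{\lambda+1}}\cdot\frac{1}{\tau^{\lambda-1}},$$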

where the maximum is taken over all clustered node configurations with normalized minimal separation $\tau$ and the largest cluster containing $\lambda$ nodes.
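The chain of estimates in Remark 2.4 iii) involves only the cluster size $\lambda$ once the common factor $\tau^{1-\lambda}$ is divided out, so it can be tabulated directly. The sketch below evaluates the factorial expression, the Gamma function bound, and the two Stirling-type bounds for small $\lambda$; the function name `complexity_constants` and the range of $\lambda$ are assumptions for illustration.

```python
import math

def complexity_constants(lam):
    """Constants multiplying tau^(1 - lam) in Remark 2.4 iii), for cluster size lam."""
    lo_half = (lam - 1) // 2                 # floor((lam - 1) / 2)
    hi_half = lam - 1 - lo_half              # ceil((lam - 1) / 2)
    exact = 1.0 / (math.factorial(lo_half) * math.factorial(hi_half))
    gamma_bound = 1.0 / math.gamma((lam + 1) / 2) ** 2
    upper = (2 * math.e) ** (lam - 1) / lam ** lam
    lower = (2 * math.e) ** (lam - 1) / lam ** (lam + 1)
    return lower, exact, gamma_bound, upper

table = {lam: complexity_constants(lam) for lam in range(1, 11)}
for lam, (lower, exact, gamma_bound, upper) in table.items():
    print(f"lambda={lam:2d}  lower={lower:.3e}  exact={exact:.3e}  "
          f"gamma={gamma_bound:.3e}  upper={upper:.3e}")
```

For odd $\lambda$ the factorial expression and the Gamma bound coincide, and for $\lambda=1$ all four constants equal $1$.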

3 Auxiliary functions

Lemma 3.1 (Modified Dirichlet kernel).

For $m,\beta\in\mathbb{N}$ the modified Dirichlet kernel is defined as $d_{m}\colon\left[0,1\right)\rightarrow\mathbb{C}$ ,

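$$d_{m}(t):=\frac{1}{m+1}\sum_{k=0}^{m}\textnormal{e}^{2\pi\textnormal{i}kt}=\begin{cases}1,&t=0,\\ \frac{\textnormal{e}^{\pi\textnormal{i}mt}}{m+1}\cdot\frac{\sin(\pi(m+1)t)}{\sin(\pi t)},&t\neq 0.\end{cases}$$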

We define the powers of the multivariate modified Dirichlet kernel by

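$$d_{m}^{\beta}\colon[0,1)^{d}\to\mathbb{C},\quad d_{m}^{\beta}(\boldsymbol{t}):=\left(\prod_{\ell=1}^{d}d_{m}\left((\boldsymbol{t})_{\ell}\right)\right)^{\beta}\in\mathcal{P}(m\beta).$$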

If $m\geq\beta$ and $\mathbf{\boldsymbol{t}}\in\mathbb{T}^{d}\setminus\left\{\mathbf{\boldsymbol{0}}\right\}$ , then

i) $\left|d_{m}(\mathbf{\boldsymbol{t}})\right|\leq d_{m}(\mathbf{\boldsymbol{0}})=1$,

ii) $\left|d_{m}(\mathbf{\boldsymbol{t}})\right|\leq\frac{1}{2(m+1)\left|\mathbf{\boldsymbol{t}}\right|_{\mathbb{T}^{d}}}$,

iii) $\left\|d_{m}^{\beta}\right\|_{L^{2}(\mathbb{T}^{d})}^{2}\leq\frac{1}{(m+1)^{d}\beta^{d/2}}$,

iv) $\left|\left<d_{m}^{\beta},d_{m}^{\beta}(\cdot-\mathbf{\boldsymbol{t}})\right>_{L^{2}(\mathbb{T}^{d})}\right|\leq\frac{1}{2(m+1)^{d}\beta^{(d-1)/2}}\cdot\frac{1}{(m+1)^{\beta}\left|\mathbf{\boldsymbol{t}}\right|_{\mathbb{T}^{d}}^{\beta}}$.
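Assertions i) to iii) can be checked numerically in the univariate case. The sketch below compares the defining sum of the modified Dirichlet kernel with its closed form, tests the two pointwise bounds on a grid, and evaluates the $L^{2}$ norm of $d_{m}^{\beta}$ by an equispaced quadrature rule, which is exact here because $|d_{m}^{\beta}|^{2}$ is a trigonometric polynomial of degree $m\beta$ and more than $m\beta$ quadrature points are used. The parameters $m=16$, $\beta=4$, the grids, and the function names are assumptions for illustration.

```python
import numpy as np

def dirichlet(m, t):
    """Modified Dirichlet kernel via its defining sum, for an array of points t."""
    t = np.atleast_1d(np.asarray(t, dtype=float))
    k = np.arange(m + 1)
    return np.exp(2j * np.pi * np.outer(t, k)).sum(axis=1) / (m + 1)

def dirichlet_closed(m, t):
    """Closed form of the kernel, valid for t not an integer."""
    t = np.atleast_1d(np.asarray(t, dtype=float))
    return (np.exp(1j * np.pi * m * t) * np.sin(np.pi * (m + 1) * t)
            / ((m + 1) * np.sin(np.pi * t)))

m, beta = 16, 4
t = np.linspace(0.01, 0.99, 99)          # grid avoiding t = 0
wrap = np.minimum(t, 1.0 - t)            # wrap-around distance to 0
vals = dirichlet(m, t)

closed_form_gap = np.max(np.abs(vals - dirichlet_closed(m, t)))
max_abs = np.max(np.abs(vals))                                  # assertion i)
decay_gap = np.max(np.abs(vals) - 1.0 / (2 * (m + 1) * wrap))   # assertion ii)

K = 4096                                 # K > m * beta, so the rule is exact
grid = np.arange(K) / K
norm_sq = np.mean(np.abs(dirichlet(m, grid)) ** (2 * beta))     # ||d_m^beta||_{L^2}^2
bound = 1.0 / ((m + 1) * np.sqrt(beta))                         # assertion iii), d = 1
print(closed_form_gap, max_abs, decay_gap, norm_sq, bound)
```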

Proof.

First, note that

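$$\left|d_{m}(\boldsymbol{t})\right|\leq\left(\frac{1}{m+1}\sum_{k=0}^{m}\left|\textnormal{e}^{2\pi\textnormal{i}kt}\right|\right)^{d}=1=d_{m}(\boldsymbol{0})$$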

and the point-wise bound follows in the univariate case by

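$$\left|d_{m}(t)\right|=\frac{1}{m+1}\left|\frac{\sin(\pi(m+1)t)}{\sin(\pi t)}\right|\leq\frac{1}{(m+1)\left|\sin(\pi t)\right|}\leq\frac{1}{2(m+1)\left|t\right|_{\mathbb{T}}}.$$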

Second, in the multivariate case, setting $t:=\left|\mathbf{\boldsymbol{t}}\right|_{\mathbb{T}^{d}}$ , and using i) and the univariate bound yield

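$$\left|d_{m}(\boldsymbol{t})\right|=\prod_{\ell=1}^{d}\left|d_{m}\left((\boldsymbol{t})_{\ell}\right)\right|\leq\left|d_{m}(t)\right|\leq\frac{1}{2(m+1)\left|t\right|_{\mathbb{T}}}=\frac{1}{2(m+1)\left|\boldsymbol{t}\right|_{\mathbb{T}^{d}}}.$$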

Note that $\left\|d_{m}\right\|_{L^{2}(\mathbb{T}^{d})}^{2}=\left\|d_{m}\right\|_{L^{2}(\mathbb{T})}^{2d}$ and therefore, the third assertion is proven for the univariate case as follows. For $m\geq\beta$ , Parseval’s identity and direct calculation show

[TABLE]

For $x\in[0,1]$ and $m\geq 4$ , the estimates in [18, Proof of Lemma 2] yield

[TABLE]

and thus, for $m\geq\beta\geq 4$ , the remaining estimate

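$$\begin{aligned}\left\|d_{m}^{\beta}\right\|_{L^{2}(\mathbb{T})}^{2}&=\frac{2}{m+1}\cdot\frac{1}{(m+1)^{2\beta}}\left[\int_{0}^{1}\left(\frac{\sin(\pi x)}{\sin\left(\frac{\pi}{m+1}x\right)}\right)^{2\beta}\mathrm{d}x+\int_{1}^{\frac{m+1}{2}}\left(\frac{\sin(\pi x)}{\sin\left(\frac{\pi}{m+1}x\right)}\right)^{2\beta}\mathrm{d}x\right]\\ &\leq\frac{2}{m+1}\left[\int_{0}^{\infty}\exp\left(-\frac{8\beta\pi^{2}x^{2}}{25}\right)\mathrm{d}x+\int_{1}^{\infty}\left(\frac{1}{2x}\right)^{2\beta}\mathrm{d}x\right]\\ &=\frac{1}{m+1}\left[\frac{5}{2\sqrt{2\pi}}\cdot\frac{1}{\sqrt{\beta}}+\frac{2^{1-2\beta}}{2\beta-1}\right]\leq\frac{1}{m+1}\cdot\frac{1}{\sqrt{\beta}}.\end{aligned}$$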

In order to prove the fourth assertion, note $\left|t\right|_{\mathbb{T}}\leq\left|t-t^{\prime}\right|_{\mathbb{T}}+\left|t^{\prime}\right|_{\mathbb{T}}\leq 2\max\{\left|t-t^{\prime}\right|_{\mathbb{T}},\left|t^{\prime}\right|_{\mathbb{T}}\}$ and hence, i) and ii) yield

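$$\left|d_{m}(t^{\prime})\right|\left|d_{m}(t^{\prime}-t)\right|\leq\frac{1}{2(m+1)}\min\left\{\frac{1}{\left|t^{\prime}-t\right|_{\mathbb{T}}},\frac{1}{\left|t^{\prime}\right|_{\mathbb{T}}}\right\}\leq\frac{1}{(m+1)\left|t\right|_{\mathbb{T}}}$$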

and $\left\|d_{m}d_{m}(\cdot-t)\right\|_{L^{\infty}(\mathbb{T})}\leq((m+1)\left|t\right|_{\mathbb{T}})^{-1}$ . Moreover, direct computation gives

[TABLE]

and with $z=\textnormal{e}^{2\pi\textnormal{i}{t}}$ and Parseval’s identity also

[TABLE]

Finally, let $t$ be the coordinate with $\left|t\right|_{\mathbb{T}}=\left|\mathbf{\boldsymbol{t}}\right|_{\mathbb{T}^{d}}$ , then the Cauchy–Schwarz inequality, iii), and the above yield (noting that $\textnormal{e}^{-2\pi\textnormal{i}mt^{\prime}}d_{m}^{2}(t^{\prime})\geq 0$ and omitting the second last line if $\beta=1$ )

[TABLE]

∎

Lemma 3.2 (Lagrange-like basis with decay, cf. [15, v2, Lem. 3]).

Let $\beta,d,M,n\in\mathbb{N}$ , $\beta$ be even, $\Omega=\left\{\mathbf{\boldsymbol{t}}_{1},\ldots,\mathbf{\boldsymbol{t}}_{M}\right\}\subset[0,1)^{d}$ be a clustered node configuration and $n\geq 2\beta^{2}\lambda$ . Then for each $\mathbf{\boldsymbol{t}}_{j}\in\Omega$ with $\mathbf{\boldsymbol{t}}_{j}\in\Lambda_{l}$ for some $l=l(j)$ , there exists an $I_{j}\in\mathcal{P}(n)$ , such that

i) $I_{j}(\mathbf{\boldsymbol{t}}_{k})=\delta_{jk}$ for all $\mathbf{\boldsymbol{t}}_{k}\in\Lambda_{l}$,

ii) $\left|I_{j}(\mathbf{\boldsymbol{t}})\right|\leq\frac{\beta^{\beta}\lambda^{\beta+\lambda-1}}{(2N\left|\mathbf{\boldsymbol{t}}-\mathbf{\boldsymbol{t}}_{j}\right|_{\mathbb{T}^{d}})^{\beta}}\mathcal{C}$ for all $\mathbf{\boldsymbol{t}}\neq\mathbf{\boldsymbol{t}}_{j}$, and

iii) $\left|\left<I_{k},I_{j}\right>_{L^{2}(\mathbb{T}^{d})}\right|\leq\frac{\lambda^{d}\beta^{d/2}}{N^{d}}\lambda^{2\lambda-2}\mathcal{C}^{2}\begin{cases}1,&\mathbf{\boldsymbol{t}}_{k}\in\Lambda_{l},\\ \frac{\sqrt{\beta}}{2}\left(\frac{\lambda\beta}{N\left|\mathbf{\boldsymbol{t}}_{j}-\mathbf{\boldsymbol{t}}_{k}\right|_{\mathbb{T}^{d}}}\right)^{\beta},&\text{otherwise.}\end{cases}$

Proof.

We define the functions $I_{j}$ as product of a Lagrange polynomial $G_{j}$ within the cluster and a fast decaying function $H_{j}$ . Let $j\in\{1,\ldots,M\}$ be fixed and define the $j$ -th Lagrange polynomial within its cluster $\Lambda_{l}$ , $l=l(j)$ , as follows. If $\left|\Lambda_{l}\right|=1$ , we simply set $G_{j}\equiv 1$ . Otherwise, let

[TABLE]

denote the ’blow-up-factor’ and for $\mathbf{\boldsymbol{t}}_{k}\in\Lambda_{l}\setminus\left\{\mathbf{\boldsymbol{t}}_{j}\right\}$ let $\ell(k)$ be the index of the vector component that realizes the distance $\left|\mathbf{\boldsymbol{t}}_{j}-\mathbf{\boldsymbol{t}}_{k}\right|_{\mathbb{T}^{d}}$ . We immediately have $\left|Q\mathbf{\boldsymbol{t}}_{j}-Q\mathbf{\boldsymbol{t}}_{k}\right|_{\mathbb{T}^{d}}=Q\left|\mathbf{\boldsymbol{t}}_{j}-\mathbf{\boldsymbol{t}}_{k}\right|_{\mathbb{T}^{d}}\neq 0$ and thus

[TABLE]

fulfills $G_{j}(\mathbf{\boldsymbol{t}}_{k})=\delta_{j,k}$ and by inequality (2.2)

[TABLE]

We proceed by setting

[TABLE]

and $H_{j}(\mathbf{\boldsymbol{t}}):=d_{P}^{\beta}(\mathbf{\boldsymbol{t}}-\mathbf{\boldsymbol{t}}_{j})$ . Lemma 3.1 yields $H_{j}(\mathbf{\boldsymbol{t}}_{j})=1$ ,

[TABLE]

Finally, we define $I_{j}(\mathbf{\boldsymbol{t}}):=G_{j}(\mathbf{\boldsymbol{t}})H_{j}(\mathbf{\boldsymbol{t}})$ . This yields $I_{j}\in\mathcal{P}(n)$ since $G_{j}\in\mathcal{P}(Q(\lambda-1))$ , $H_{j}\in\mathcal{P}(P\beta)$ , and

[TABLE]

Moreover, this function has the desired property $I_{j}(\mathbf{\boldsymbol{t}}_{k})=\delta_{jk}$ for all $\mathbf{\boldsymbol{t}}_{k}\in\Lambda_{l}$ and the two remaining inequalities follow by $\left|I_{j}(\mathbf{\boldsymbol{t}})\right|\leq\left\|G_{j}\right\|_{L^{\infty}(\mathbb{T}^{d})}\left|H_{j}(\mathbf{\boldsymbol{t}})\right|$ and by using $\textnormal{e}^{-\pi\textnormal{i}\beta P(\mathbf{\boldsymbol{t}}-\mathbf{\boldsymbol{t}}_{j})}H_{j}(t)\geq 0$ , also $\left|\left<I_{k},I_{j}\right>_{L^{2}(\mathbb{T}^{d})}\right|\leq\left\|G_{j}\right\|_{L^{\infty}(\mathbb{T}^{d})}^{2}\left|\left<H_{k},H_{j}\right>_{L^{2}(\mathbb{T}^{d})}\right|$ . ∎

Remark 3.3.

Following the calculation in the second version of [15, p. 36], we can improve (3.2) to

[TABLE]

with $C(n)\to 1$ for $n\to\infty$ and where the first two bracketed terms are due to (2.3) and (3.1), respectively.

4 A lower bound on the smallest singular value

In this section, we work out the multivariate extension of Theorem 1 in the second version of [15]. Additionally, we weaken the cluster separation condition; in particular, we make it independent of the number of nodes $M$. Furthermore, we provide an improved estimate on the smallest singular value $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})$ that depends only on the biggest cluster size $\lambda$ and not on the total number of nodes $M$.

Theorem 4.1.

Let $\beta,d,N,M\in\mathbb{N}$ , $\beta\geq d+1$ even, $\Omega=\left\{\mathbf{\boldsymbol{t}}_{1},\ldots,\mathbf{\boldsymbol{t}}_{M}\right\}\subset[0,1)^{d}$ be a clustered node configuration and $N>2\beta^{2}\lambda$ . Moreover, assume the cluster separation

[TABLE]

Then the smallest singular value of the Vandermonde matrix $\mathbf{\boldsymbol{A}}\in\mathbb{C}^{M\times N^{d}}$ from Definition 2.1 is bounded by

[TABLE]

Proof.

We apply the robust duality from Lemma 2.2, with $\mathbf{\boldsymbol{v}}\in\mathbb{C}^{M}$ , $\left\|\mathbf{\boldsymbol{v}}\right\|_{2}=1$ , such that $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})=\left\|\mathbf{\boldsymbol{A}}^{*}\mathbf{\boldsymbol{v}}\right\|_{2}$ , and

[TABLE]

where the Lagrange-like basis functions $I_{k}$ are given by Lemma 3.2. The interpolation errors $\epsilon_{j}=f(\mathbf{\boldsymbol{t}}_{j})-v_{j}$ fulfill $\mathbf{\boldsymbol{\epsilon}}=\mathbf{\boldsymbol{K}}\mathbf{\boldsymbol{v}}$ , where $\mathbf{\boldsymbol{K}}\in\mathbb{C}^{M\times M}$ has the entries

[TABLE]

We proceed by $\left\|\mathbf{\boldsymbol{\epsilon}}\right\|_{2}\leq\left\|\mathbf{\boldsymbol{K}}\right\|_{2}\leq\left\|\mathbf{\boldsymbol{\tilde{K}}}\right\|_{2}$ , where the second inequality follows from monotonicity of the norm [10, p. 520] (or [13, Lem. A.2]) and Lemma 3.2 i) and ii) with

[TABLE]

Since $\mathbf{\boldsymbol{\tilde{K}}}\in\mathbb{R}^{M\times M}$ is symmetric, we bound the spectral norm by the maximum norm and apply the packing argument from Definition 2.3 ii) and Remark 2.4 ii) to get

[TABLE]

Condition (4.1) and $\beta\geq 2$ imply $\left\|\mathbf{\boldsymbol{\epsilon}}\right\|_{2}\leq\frac{1}{4\sqrt{2}}$ . To bound the $L^{2}$ -norm of $f$ , let ${\mathbf{\boldsymbol{\hat{K}}}}:=\begin{pmatrix}\left|\left<I_{k},I_{j}\right>\right|\end{pmatrix}_{j,k=1,\ldots,M}\in\mathbb{R}^{M\times M}$ . The triangle inequality, symmetry of ${\mathbf{\boldsymbol{\hat{K}}}}$ , Lemma 3.2 iii), and the packing argument from Definition 2.3 ii) and Remark 2.4 ii) yield

[TABLE]

Condition (4.1) implies

[TABLE]

and Lemma 2.2 finally the result. ∎

For $d=1$ , Remark 2.4 iii) applied to the cluster complexity yields:

Corollary 4.2.

Under the assumptions of Theorem 4.1 with $d=1$ and $\beta=2$, we have

[TABLE]

Example 4.3 (Specific choices of $\beta$ ).

Specific choices of $\beta$ in Theorem 4.1 yield the following:

i)

By choosing $\beta=d+1$ or $\beta=d+2$ for $d$ being odd or even, respectively, and some additional cosmetics, the condition

[TABLE]

implies our best estimate

[TABLE]

ii)

By choosing $\beta=2\left\lceil\frac{1}{2}\log\left(2^{d}(2^{d}-1)\lambda^{\lambda}\zeta(2)\mathcal{C}\right)\right\rceil$ and noting that $\sqrt[2\beta]{\beta}\leq 1.2$ for $\beta$ even and $\sqrt[\log C]{C}=\textnormal{e}$ , our weakest condition

[TABLE]

implies

[TABLE]

Example 4.4 (Well separated nodes).

For $\lambda=1$ , we have $\mathcal{C}=1$ and the nodes are well separated. For $\rho\geq 6d$ , Example 4.3 i) yields

[TABLE]

Note that Theorem 4.1 always assumes $\rho\geq\beta\geq d+1$ . This compares to [12], where $\rho\geq 3+2\log d$ already suffices for $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})>0$ . Using Theorem 4.1 directly for $d=1$ and $\beta=2$ , then $\rho\geq 4.4$ implies

[TABLE]

This compares to [2, 19], which provide under the same condition on $\rho$ , $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})\geq\sqrt{N}\cdot\sqrt{1-1/\rho}\geq\sqrt{N}/1.14$ .

Example 4.5 (Pair clusters).

For $\lambda=2$ , we have $\mathcal{C}=1/\tau$ and at most pairs of nodes form clusters. Example 4.3 i) with

[TABLE]

implies

[TABLE]

Example 4.6 (Pair clusters, comparison).

Let $d=1$ and $\lambda=2$ . We apply Theorem 4.1 with $\beta=2$ , $\beta=2\left\lceil\frac{1}{2}\log\left(\frac{\pi^{2}}{3}\lambda^{\lambda}\mathcal{C}\right)\right\rceil$ and $\beta=2\lambda$ , respectively. These results are compared to [15, Thm. 1] (with minor corrections and where we simplified slightly $\left\lfloor{n}/{\lambda}\right\rfloor\approx{n}/{\lambda}$ ), to [13, Thm. 4.9] (under the additional assumption that all nodes inside the clusters have the same separation), and to [8, Cor. 4.2] (with a minor improvement for $\tau\leq 1$ and in estimating [8, Eq. (8)]).

These comparisons are also presented numerically in Section 6.1.

Example 4.7.

(Comparison with [15]) Let $d=1$ and $\beta=2\lambda$ , then $N>2\lambda^{3}$ and $\rho\geq 4.4\lambda^{5/2}\mathcal{C}^{\frac{1}{2\lambda}}$ imply

[TABLE]

where we set $C_{0}=1$ for the moment. This can be compared to [15, Thm. 1], where after minor corrections $N>2\lambda^{2}$ and $\rho\geq 10\lambda^{5/2}(M\mathcal{C})^{\frac{1}{2\lambda}}$ imply

[TABLE]

According to Remark 3.3, $C_{0}\in\left(\pi^{-1},1\right]$ depending on $\lambda$ and $n$ . In total, we have a stronger condition on $N$ but our condition on $\rho$ is always weaker and our estimate on $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})$ is sharper if $M>2$ . This comparison is also presented in Figure 6.2.

Example 4.8 (All nodes cluster).

Let $d=1$ and $\lambda=M$ . If $N>8M$ , then Corollary 4.2 implies

[TABLE]

This compares to [3], where the restriction of the nodes to an interval of length $1/(2M^{2})$ and $N\geq 4M^{3}$ imply

[TABLE]

Note, however, that the definition of a clustered node configuration in [3] is in principle more flexible than ours.

5 Upper bounds and beyond distances

In this section, we show that the obtained lower bounds are sharp for $d=1$ and for $\lambda=2$ , respectively. Moreover, we show for $d>1$ and nodes in generic position (e.g. not all nodes on a line for $d=2$ ), that the cluster complexity $C$ is not the optimal quantity to understand the situation here. If we assume a normalized minimal separation $\tau$ between nodes, then the estimate in Theorem 4.1 is sub-optimal with respect to the order in $\tau$ we can derive from the cluster complexity. For this, we give an example with one cluster of three nodes in the bivariate case, $d=2$ .

Example 5.1 (Matching bounds for $d=1$ ).

In the second version of [15, Prop. 3] an upper bound on $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})$ is given for a clustered node configuration that consists of at least one cluster of $\lambda$ equispaced, $\tau$ separated nodes. After further simplifications, we can derive

[TABLE]

Together with Remark 3.3 and Corollary 4.2 this assures that for sufficiently large $N\in\mathbb{N}$ , small $\tau$ and $\lambda\geq 2$ , there exist constants $c_{1}\leq c_{2}$ such that

[TABLE]

where the minimum is taken over all clustered node configurations $\Omega$ with at least one cluster of $\lambda$ nodes with normalized minimal separation $\tau$ .

This was also expected in [3, Rem. 3.5]. In particular note that the lower bound in Remark 2.4 iii) implies that the term $\lambda^{\lambda}$ in Theorem 4.1 cannot be avoided.

Example 5.2 (Matching bounds for $\lambda=2$ ).

Let $d\in\mathbb{N}$ , $\lambda\geq 2$ , and $\tau\leq 1$ be such that $\left|\mathbf{\boldsymbol{t}}_{1}-\mathbf{\boldsymbol{t}}_{2}\right|_{\mathbb{T}^{d}}=\tau/N$ , then the Cauchy interlacing theorem for eigenvalues ([10, Thm. 4.3.28]) and the binomial formula yield

[TABLE]

Together with Example 4.5, there exist constants $c_{1}(d)\leq c_{2}(d)$ such that

[TABLE]

where the minimum is taken over all clustered node configurations $\Omega$ with at least one cluster of $\lambda=2$ nodes with normalized minimal separation $\tau$ .
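The linear dependence on $\tau$ can be made concrete in the simplest instance, a configuration of only $M=2$ nodes. The following Python sketch is an illustration under stated assumptions, not the computation used in the paper: it assumes nodes $z_j=\textnormal{e}^{2\pi\textnormal{i}t_j}$, exponents $0,\dots,N-1$ in each coordinate, and a hypothetical difference direction `direction` of max-norm one. For two rows the Gram matrix $\mathbf{\boldsymbol{A}}\mathbf{\boldsymbol{A}}^{*}$ is $2\times 2$ with diagonal $N^{d}$ and off-diagonal entry $s$, so that $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})^{2}=N^{d}-|s|$.

```python
import cmath
import math


def dirichlet_sum(h, N):
    """Sum_{k=0}^{N-1} exp(2*pi*i*k*h): inner product of two 1D Vandermonde rows."""
    return sum(cmath.exp(2j * math.pi * k * h) for k in range(N))


def sigma_min_pair(tau, N=32, direction=(1.0, 0.5)):
    """Smallest singular value of the 2 x N^d Vandermonde matrix of two nodes.

    Assumption: the node difference is (tau / N) * direction, where
    `direction` is a hypothetical vector of max-norm 1 (here d = 2).
    """
    d = len(direction)
    s = 1.0 + 0.0j
    for u in direction:
        # The d-variate inner product factorizes into univariate sums.
        s *= dirichlet_sum(tau * u / N, N)
    # A A^* = [[N^d, s], [conj(s), N^d]] has eigenvalues N^d - |s| and N^d + |s|.
    return math.sqrt(max(N ** d - abs(s), 0.0))


if __name__ == "__main__":
    N, d = 32, 2
    for tau in (1e-1, 1e-2, 1e-3):
        ratio = sigma_min_pair(tau, N) / (tau * N ** (d / 2))
        print(f"tau={tau:g}  sigma_min / (tau * N^(d/2)) = {ratio:.4f}")
```

The printed ratio should stay roughly constant as $\tau$ decreases, in line with the matching upper and lower bounds of order $\tau N^{d/2}$. The closed form avoids a singular value decomposition, but it loses accuracy by cancellation for very small $\tau$.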

Example 5.3 (Triple cluster).

Let $d=2$ , $N\in\mathbb{N}$ , and $\Omega=\left\{\mathbf{\boldsymbol{t}}_{1},\mathbf{\boldsymbol{t}}_{2},\mathbf{\boldsymbol{t}}_{3}\right\}\subset\left[0,1\right)^{2}$ with

[TABLE]

and hence, the normalized minimal separation of $\Omega$ is $\nu/\sqrt{2}\leq\tau\leq\nu$ . Then the smallest singular value of the corresponding Vandermonde matrix $\mathbf{\boldsymbol{A}}$ fulfills

[TABLE]

and this can be seen as follows: Define the real matrix

[TABLE]

note that $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})^{2}=\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}}\mathbf{\boldsymbol{A}}^{*})=\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{M}})=\left\|\mathbf{\boldsymbol{M}}^{-1}\right\|_{2}^{-1}$ , and use the explicit formula

[TABLE]

The univariate Taylor expansion

[TABLE]

and $a_{1}^{2}+a_{2}^{2}=1=b_{1}^{2}+b_{2}^{2}$ yield

[TABLE]

and similar expressions for the other quantities. By direct computation, we see that the entries in the matrix on the right hand side of (5.1) are all $\mathcal{O}\left(\nu^{2}\right)$ and for example the diagonal entry $u^{2}-1$ is $\Theta(\nu^{2})$ independent of $\mathbf{\boldsymbol{a}}$ and $\mathbf{\boldsymbol{b}}$ . Hence, the norm of that matrix is $\Theta(\nu^{2})$ . Similarly, the denominator of (5.1) can be computed to be

[TABLE]

Finally, this yields

[TABLE]

and together with Theorem 4.1 the assertion.

6 Numerics

In this section we report four experiments. Two of them compare our results with recent results from the literature ( $d=1$ ) and two of them illustrate our results from Section 5. All computations were carried out using MATLAB R2017b.

6.1 Pair clusters

In order to compare our results (see Example 4.6) with the ones from the second version of [15, Thm. 2], [8] and [13], we set $d=1$, $N=2^{15}+1$ ([13] requires odd $N$ without further considerations), and take $M=4$ and $M=20$ nodes, respectively. The node configuration consists of uniformly placed clusters (at $l/N$, $l=1,\dots,M/2$) that include two nodes each. The first cluster realizes the minimal separation $\tau$, which is picked logarithmically uniformly at random from $[10^{-12},1]$, i.e. $t_{1}=0$ and $t_{2}=\tau/N$. The further clusters have nodes $t_{2l}=l/N$ and $t_{2l+1}=(l+\delta)/N$ for $l=1,\dots,(M-1)/2$, where $\delta\in[\tau,2\tau]$ (parameter $c=2$ in [13, Thm. 4.7]) is picked uniformly at random. Afterwards, we compute $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})$, where $\mathbf{\boldsymbol{A}}$ is the Vandermonde matrix defined in (2.1) corresponding to the node configuration. For each $M$ we pick $50$ instances of $\tau$ and the results are presented in Figure 6.1.

This clustered node configuration fulfills $\rho\geq\frac{2N}{M}-1$ independently of $\tau$. Theorem 4.1 and the second version of [15, Thm. 1] impose restrictions on $\tau$ through the condition on $\rho$. Choosing $\beta$ logarithmically as in Example 4.3 ii) therefore requires $\tau\geq\textnormal{e}^{-\frac{2N/M-35.9}{6.6}}$, which is below $10^{-200}$ for both $M=4$ and $M=20$. The second version of [15] and our result with $\beta=4$ require, respectively,

[TABLE]
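The thresholds on $\tau$ discussed here follow from inserting $\rho\geq\frac{2N}{M}-1$ into the conditions on $\rho$ listed in the comparison table of Example 4.6. The following Python sketch evaluates them under that reading; the function name `tau_thresholds` is hypothetical. The logarithmic threshold is returned as $\log_{10}\tau$, since the value itself underflows in double precision.

```python
import math

N = 2 ** 15 + 1  # polynomial degree used in this experiment


def tau_thresholds(M):
    """Smallest tau admitted by each condition on rho, assuming rho = 2N/M - 1."""
    rho = 2 * N / M - 1
    # beta chosen logarithmically: rho >= 34.9 + 6.6 * |log(tau)| with tau <= 1,
    # i.e. log(tau) >= (34.9 - rho) / 6.6; converted to a base-10 logarithm.
    log10_tau_log = (34.9 - rho) / 6.6 / math.log(10)
    # beta = 4: rho >= 29 / tau**(1/4)
    tau_beta4 = (29 / rho) ** 4
    # [15] as quoted in the table: rho >= 42.5 * (M / tau)**(1/4)
    tau_ref15 = M * (42.5 / rho) ** 4
    return log10_tau_log, tau_beta4, tau_ref15


if __name__ == "__main__":
    for M in (4, 20):
        log10_tau_log, tau_beta4, tau_ref15 = tau_thresholds(M)
        print(f"M={M}: log10(tau) >= {log10_tau_log:.1f} (logarithmic beta), "
              f"tau >= {tau_beta4:.2e} (beta=4), tau >= {tau_ref15:.2e} ([15])")
```

Under these assumptions the admissible range for $\beta=4$ extends to smaller $\tau$ than the one obtained from the condition of [15], for both values of $M$.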

6.2 Bigger clusters

In this numerical example, we confirm our results in the univariate case, $d=1$, for bigger clusters of size $\lambda=5$ and compare them with the results from the second version of [15]. The polynomial degree is set to $N=2^{15}$. We build clustered node configurations with $L=2$ $(M=10)$ and $L=10$ $(M=50)$ clusters placed equispaced at $\frac{l}{L}$ for $l=0,\dots,L-1$. Starting at each cluster position, the cluster nodes lie equispaced with separation $\frac{\tau}{N}$, where $\tau\in[10^{-4},1/4]$ is picked logarithmically uniformly at random (the right-hand interval bound is due to each cluster lying in an interval of length $1/N$). Afterwards the smallest singular value $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})$ is computed. This procedure is repeated 100 times for each choice of $L$ and the results are presented in Figure 6.2. We use the statements from Example 4.7 with $C_{0}=(1-\frac{\pi^{2}}{3\lambda^{2}})^{-1/2}N/\lambda\left\lfloor n/\lambda\right\rfloor^{-1}$. Since $d=1$, the worst case cluster complexity is estimated by (2.4) as $\mathcal{C}\leq\tau^{-4}/4$.
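The node construction of this experiment can be sketched as follows. This is a minimal Python illustration of the description above, not the authors' MATLAB code; the function name `clustered_nodes` and the fixed seed are assumptions. The upper bound $1/4$ on $\tau$ arises because $\lambda=5$ equispaced nodes span $(\lambda-1)\tau/N$, which must not exceed $1/N$.

```python
import math
import random


def clustered_nodes(L, lam=5, N=2 ** 15, rng=None):
    """Return (tau, nodes) for L equispaced clusters of lam equispaced nodes each."""
    rng = rng or random.Random(0)  # fixed seed only for reproducibility
    tau_max = 1.0 / (lam - 1)      # (lam - 1) * tau / N <= 1 / N
    # Log-uniform draw of tau in [1e-4, tau_max].
    tau = 10.0 ** rng.uniform(math.log10(1e-4), math.log10(tau_max))
    # Cluster l starts at l / L; its nodes are separated by tau / N.
    nodes = [l / L + j * tau / N for l in range(L) for j in range(lam)]
    return tau, nodes


if __name__ == "__main__":
    for L in (2, 10):
        tau, nodes = clustered_nodes(L)
        print(f"L={L}: M={len(nodes)} nodes, tau={tau:.3e}")
```

The smallest singular value of the resulting $M\times N$ Vandermonde matrix would then be computed with a standard singular value routine.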

6.3 Pair clusters, bivariate

We present a numerical experiment in order to confirm our results for the higher dimensional case and set $d=2$. Randomized clustered node configurations of $L=2$, $L=20$ and $L=40$ clusters with $2$ nodes each are constructed for $100$ different minimal separations $\tau$, respectively. Then the smallest singular values of the corresponding Vandermonde matrices $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})$ are computed and the upper bound from Example 5.2 and the lower bound from Example 4.5 are shown. The results are presented in Figure 6.3. The node configurations are built as follows. The minimal separation $\tau$ is picked logarithmically uniformly at random in $[10^{-3},1]$. We set $N=10^{3}$ so that the condition on $\rho$ in Example 4.5 together with the left interval bound for $\tau$ makes $\rho\geq\rho_{\min}$ (value shown in the figure) necessary. Two clusters realize the cluster separation $\rho_{\min}$ and for the remaining clusters, we pick a position in $[0,1]^{2}$ uniformly at random. The positions are fixed for the respective choice of $L$ and do not change for different $\tau$. Each cluster is constructed randomly by setting one node to $(0,0)$ and one to either $(a,1)$ or $(1,a)$ for some $a\in\left[0,1\right]$. Then we scale the clusters by $\tau$ and move them to their respective cluster positions.

6.4 One triple cluster, bivariate

Here we present a numerical experiment for Example 5.3. We set $N=100$, $d=2$ and build the triple cluster consisting of the nodes $\mathbf{\boldsymbol{t}}_{1}=(0,0)^{T}$, $\mathbf{\boldsymbol{t}}_{2}=(-\sqrt{1-a^{2}}\nu/N,a\nu/N)^{T}$ and $\mathbf{\boldsymbol{t}}_{3}=(\nu/N,0)^{T}$ (see Figure 6.4, left), where $\tau=\nu\sqrt{1-a^{2}}\in[10^{-6},1/2]$ is picked logarithmically uniformly at random. Then we compute the smallest singular value of the Vandermonde matrix $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})$. This is repeated $100$ times for each of $a=0.1$ and $a=0$. The results are presented in Figure 6.4 (right). We observe the asymptotic behavior with respect to $\tau$ derived in Example 5.3. Furthermore, for nodes that are not antipodal, the asymptotic regime sets in once $\tau$ becomes smaller than the displacement parameter $a$.
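A reduced version of this experiment can be sketched in Python with NumPy. This is an illustration rather than the MATLAB code used for Figure 6.4, and it rests on assumptions: nodes $z_j=\textnormal{e}^{2\pi\textnormal{i}t_j}$, exponents $0,\dots,N-1$ in each coordinate, and two fixed values of $\nu$ instead of random draws. The helper names are hypothetical. The sketch estimates the slope of $\operatorname*{\sigma_{\min}}(\mathbf{\boldsymbol{A}})$ in a log-log plot against $\nu$.

```python
import numpy as np


def vandermonde_2d(nodes, N):
    """Rows (z_j^alpha) for alpha in {0, ..., N-1}^2, assuming z = exp(2*pi*i*t)."""
    k = np.arange(N)
    rows = []
    for tx, ty in nodes:
        zx = np.exp(2j * np.pi * k * tx)
        zy = np.exp(2j * np.pi * k * ty)
        rows.append(np.kron(zx, zy))  # all products zx[k1] * zy[k2]
    return np.array(rows)


def sigma_min_triple(nu, a, N=100):
    """Smallest singular value for the triple cluster t1, t2, t3 described above."""
    nodes = [
        (0.0, 0.0),
        (-np.sqrt(1.0 - a ** 2) * nu / N, a * nu / N),
        (nu / N, 0.0),
    ]
    A = vandermonde_2d(nodes, N)
    # Singular values are returned in descending order.
    return np.linalg.svd(A, compute_uv=False)[-1]


def loglog_slope(a, nu_hi=1e-3, nu_lo=1e-4):
    """Empirical exponent p in sigma_min ~ nu**p between two values of nu."""
    s_hi = sigma_min_triple(nu_hi, a)
    s_lo = sigma_min_triple(nu_lo, a)
    return np.log(s_hi / s_lo) / np.log(nu_hi / nu_lo)


if __name__ == "__main__":
    for a in (0.1, 0.0):
        print(f"a={a}: estimated slope {loglog_slope(a):.2f}")
```

For $a=0.1$ and $\nu$ well below $a$ the estimated slope should be close to one, whereas for $a=0$ the three nodes lie on a line and the slope should be close to two, as in the univariate case with $\lambda=3$.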

Acknowledgements. The authors thank Jürgen Prestin for discussions on Lemma 3.1 and gratefully acknowledge support by the projects DFG-GK1916 and DFG-SFB944.

Bibliography

[1] A. Akinshin, G. Goldman, and Y. Yomdin. Geometry of error amplification in solving Prony system with near-colliding nodes. arXiv e-prints, Jan. 2017.
[2] C. Aubel and H. Bölcskei. Vandermonde matrices with nodes in the unit disk and the large sieve. Appl. Comput. Harmon. Anal., 47(1):53–86, Jul. 2019.
[3] D. Batenkov, L. Demanet, G. Goldman, and Y. Yomdin. Stability of partial Fourier matrices with clustered nodes. arXiv e-prints, Sept. 2018.
[4] D. Batenkov, G. Goldman, and Y. Yomdin. Super-resolution of near-colliding point sources. arXiv e-prints, arXiv:1904.09186, Apr. 2019.
[5] F. S. V. Bazán. Conditioning of rectangular Vandermonde matrices with nodes in the unit disk. SIAM J. Matrix Anal. Appl., 21(2):679–693, 1999.
[6] B. G. R. de Prony. Essai éxperimental et analytique: sur les lois de la dilatabilité de fluides élastique et sur celles de la force expansive de la vapeur de l’alkool, a différentes températures. Journal de l’école polytechnique, 1(22):24–76, 1795.
[7] L. Demanet and N. Nguyen. The recoverability limit for superresolution via sparsity. arXiv e-prints, Feb. 2015.
[8] B. Diederichs. Well-Posedness of Sparse Frequency Estimation. arXiv e-prints, May 2019.
