The multidimensional truncated Moment Problem: Carathéodory Numbers
Philipp J. di Dio
Universität Leipzig, Mathematisches Institut, Augustusplatz 10/11, D-04109 Leipzig, Germany
Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, D-04103 Leipzig, Germany
[email protected]
and
Konrad Schmüdgen
Universität Leipzig, Mathematisches Institut, Augustusplatz 10/11, D-04109 Leipzig, Germany
[email protected]
Abstract.
Let A be a finite-dimensional subspace of C(X;\mathdsR), where X is a locally compact Hausdorff space, and A={f1,…,fm} a basis of A. A sequence s=(sj)j=1m is called a moment sequence if sj=∫fj(x)dμ(x), j=1,…,m, for some positive Radon measure μ on X. Each moment sequence s has a finitely atomic representing measure μ. The smallest possible number of atoms is called the Carathéodory number CA(s). The largest number CA(s) among all moment sequences s is the Carathéodory number CA. In this paper the Carathéodory numbers CA(s) and CA are studied. In the case of differentiable functions methods from differential geometry are used. The main emphasis is on real polynomials. For a large class of spaces of polynomials in one variable the number CA is determined. In the multivariate case we obtain some lower bounds and we use results on zeros of positive polynomials to derive upper bounds for the Carathéodory numbers.
AMS Subject Classification (2000).
44A60, 14P10.
Key words: truncated moment problem, Carathéodory number, convex cone, positive polynomials
1. Introduction
The present paper continues the study of the truncated moment problem began in our previous papers [Sch15] and [dDS]. Here we investigate the Carathéodory number of moment sequences and moment cones.
Throughout this paper, we assume that X is a locally compact topological Hausdorff space, A is a finite-dimensional real linear subspace of C(X;\mathdsR) and A={f1,…,fm} is a fixed basis of the vector space A.
Let s=(sj)j=1m be a real sequence and let Ls be the linear functional on A defined by Ls(fj)=sj,j=1,…,m. We say that s is a moment sequence, equivalently, Ls is a moment functional on A, if there exists a (positive) Radon measure μ on X such that fj is μ-integrable and
[TABLE]
equivalently,
[TABLE]
Such a measure μ is called a representing measure of s resp. Ls. The Richter–Tchakaloff Theorem (see Proposition 1 below) implies that each moment sequence has a k-atomic representing measure, where k≤m=dimA. The smallest number k is called the Carathéodory number CA(s) and the smallest number K such that each moment sequence s has a k-atomic representing measure with k≤K is the Carathéodory number CA.
Let Ls be a moment functional. Determining a k-atomic representing measure ν for Ls is closely related to the problem of finding quadrature or cubature formulas in numerical integration, see for instance [DR84], [SW97]. The Carathéodory number CA(s) corresponds then to the smallest possible number of nodes.
A large part of our considerations is developed in this general setup. Nevertheless we are mainly interested in the case when A consists of real polynomials and X is a closed subset of \mathdsRn or of the projective real space \mathdsP(\mathdsRn). In this case moment sequences are usually called truncated moment sequences in the literature.
This paper is organized as follows.
In Section 2, we define and investigate Carathéodory numbers and the cone SA of moment sequences in the case when A⊆C(X,\mathdsR). In Section 3, we assume that the functions of A are differentiable and apply differential geometric methods to study the moment cone and Carathéodory numbers. Important technical tools are the total derivative DSk,A(C,X) associated with a k-atomic measure μ=∑i=1kciδxi and the smallest number NA of atoms such that DSk,A(C,X) has full rank m=∣A∣. This number NA is a lower bound of the Carathéodory number CA.
The remaining four sections are concerned with polynomials. Section 4 deals with polynomials in one variable. For A={1,x,…,xm} it is a classical fact
that CA=⌈2m⌉. We investigate a set A and its homogenization B with gaps, that is,
[TABLE]
where 0=d1<...<dm=2d. Our main result (Theorem 45)
gives sufficient conditions for the validity of the formula CA=CB=⌈2m⌉.
Sections 5–7 are devoted to the multivariate case. Except from a few simple cases the Carathéodory number CA is unknown for polynomials in several variables. In Section 5 we give a new lower bound of CA and relate the number NA to the Alexander–Hirschowitz Theorem. Another group of main results of this paper is obtained in Section 6. Here we use known results on zeros of non-negative polynomials to derive upper bounds for Carathéodory numbers (Theorems 57, 59, and 62). Section 7 deals with signed Carathéodory numbers and the real Waring rank.
The multidimensional truncated moment problem was first studied in the Thesis of J. Matzke [Mat92] and independently by R. Curto and L. Fialkow [CF96a], [CF96b]. It is an active research topic, see e.g. [Ric57], [Kem68], [Rez92], [Sch15], [Lau09], [FN10], [CF13], [Fiaa], [Fiab], [dDS].
Carathéodory numbers of multivariate polynomials have been investigated by C. Riener and M. Schweighofer [RS]. Carathéodory numbers of general convex cones are studied in [Tun01].
For r∈\mathdsR let ⌈r⌉ denote the smallest integer larger or equal to r.
2. Carathéodory Numbers: Continuous Functions
Let δx be the delta measure at x∈\mathdsRn, that is, δx(M)=1 if x∈M and δx(M)=0 if x∈/M. By a signed k-atomic measure μ we mean a signed measure μ=∑j=1kcjδxj, where x1,…,xk are pairwise different points of \mathdsRn and c1,…,ck are nonzero real numbers. If all numbers c1,…,ck are positive, then μ is a positive measure and is called simply a k-atomic measure. The points xj are called the atoms of μ. The zero measure is considered as [math]-atomic measure.
A crucial result for our considerations is the Richter–Tchakaloff Theorem proved in [Ric57]. In the present context it can be stated as follows.
Proposition 1**.**
Each truncated moment sequence s of A has a k-atomic representing measure with k≤m=∣A∣.
Definition 2**.**
The moment cone SA≡S(A,X) is the set of all truncated X-moment sequences.
Obviously, SA is a convex cone in \mathdsRm. Since the functions f1,…,fm form a vector space basis of A,
it follows easily that \mathdsRm=SA−SA.
Definition 3**.**
The Carathéodory number CA(s)≡CA,X(s) of s∈S(A,X) is the smallest k such that s has a k-atomic representing measure with all atoms in X. The Carathéodory number CA≡CA,X of the moment cone S(A,X) is the smallest number CA such that each moment sequence s∈S(A,X) has a k-atomic representing measure with all atoms in X and k≤CA.
Definition 4**.**
The signed Carathéodory number CA,±(s)≡CA,X,±(s) of s∈\mathdsRm is the smallest number k such that s has a signed k-atomic representing measure with all atoms in X. The signed Carathéodory number CA,±≡CA,X,± is the smallest number CA,± such that every sequence s has a signed k-atomic representing measure with all atoms in X and k≤CA,±.
Since \mathdsRm=SA−SA as noted above, Proposition 1 implies each vector s′∈\mathdsRm has a signed k-atomic representing measure, where k≤2m, and we have
[TABLE]
Remark 5**.**
The above definitions of moment sequences, moment cones and Carathéodory numbers make sense for Borel functions rather than continuous functions. For instance, let x1,…,xm be pairwise different points of \mathdsRn and let A be the set of characteristic functions of the points xj. Then it is easily verified that the Carathéodory number CA is equal to m=∣A∣.
Definition 6**.**
The moment curve of A in \mathdsRm is defined by
[TABLE]
and we set
[TABLE]
where C=(c1,...,ck), X=(x1,...,xk).
Clearly, sA(x) is the moment sequence of the delta measure δx and Sk,A(C,X) is the moment sequence with representing measure μ=∑i=1kciδxi:
[TABLE]
By Proposition 1, each moment sequence s∈SA is of the form Sm,A(C,X) for some (C,X)∈(\mathdsR≥0)m×Xm. Further, let us introduce a convenient notation:
[TABLE]
The following proposition restates a known result (see e.g. Lemma 3 and Proposition 27(i) in [dDS]).
Proposition 7**.**
Suppose that s∈SA is a boundary point of SA. Then there exists p∈Pos(A,K), p=0, such that Ls(p)=0 and each representing measure of s is supported on the set of zeros Z(p) of p.
The next proposition is a crucial technical ingredient of many proofs given below.
The following condition is used at several places of this paper:
[TABLE]
Proposition 8**.**
Let s∈SA and x∈X. Suppose that condition (6) is satisfied. Define
[TABLE]
Then cs(x)≤e(x)−1Ls(e) and (s−cs(x)sA(x))∈∂SA.
If K is compact, then the supremum in (7) is attained, the moment cone SA is closed in \mathdsRm, and we have
[TABLE]
Proof.
Let c∈\mathdsR. If (s−csA(x))∈SA, then Ls−clx is a moment functional on A and therefore (Ls−clx)(e)≥0, so that c≤e(x)−1Ls(e). Hence cs(x)≤e(x)−1Ls(e). The definition of cs(x) implies that s−cs(x)sA(x) belongs to the boundary of SA.
Since X is compact, it was shown in [FN10] that the moment cone SA is closed in \mathdsRm. We choose a sequence (cn)n∈\mathdsN such that s−cnsA(x)∈SA for all n and limncn=cs(x). Then s−cnsA(x)→s−cs(x)sA(x). Since SA is closed, we have (s−cs(x)sA(x))∈SA, that is, the supremum (7) is attained.
Note that (s−cs(x)sA(x))∈∂SA∩SA. Obviously, CA(s)≤CA(s−cs(x)sA(x))+1. This implies the inequality (8).
∎
The following example shows that the number cs(x) is not equal to
[TABLE]
However, if s∈int SA, then cs(x)=cs(x) by
Proposition 10(vi) below.
Example 9**.**
Set X=[−1,π],
[TABLE]
and gi=fi∣[−1,π) for i=1,2,3. Set A={f1,f2,f3} and B={g1,g2,g3}. Then SA is closed, but SB is not closed. In fact, SB=SA. Let s=sA(−1)=(1,0,0)T,
Then s′=s−sA(0)/2=(1/2,0,−1/2)T=sA(π)/2∈∂SA=∂SB, but sA(π)∈SA. Thus cs(0)=0 and cs(0)=1/2.
Recall from [Sch15] the maximal mass function ρL(x) of a moment functional L:
[TABLE]
Proposition 10**.**
Suppose that condition (6) holds and retain the notation from Proposition 8.
s−c⋅sA(x)∈SA* for all c>cs(x).*
If s∈intSA, then s−c⋅sA(x)∈intSA for all c<cs(x).
The map int SA∋s↦cs(x)∈\mathdsR is concave and continuous for all x∈X.
The map X∋x↦cs(x)∈\mathdsR is continuous for all s∈intSA.
cs(x)=ρLs(x).
If s∈int SA, then cs(x)=cs(x).
Proof.
(i) is clear from the definition (7).
(ii): Since s is an inner point, there exists ε>0 such that Bε(s)⊂intSA. From the convexity of SA it follows that
[TABLE]
(iii): Let s,t∈SA and λ∈(0,1). Choose c,c′∈\mathdsR such that c<cs(x) and c′<ct(x). Then s−csA(x) and t−c′sA(x) are in SA. Since SA is convex, we have
[TABLE]
i.e., λc+(1−λ)c′≤cλs+(1−λ)t(x). Taking the suprema over c and c′ it follows that λcs(x)+(1−λ)ct(x)≤cλs+(1−λ)t(x). Hence s↦cs(x) is a concave function and therefore continuous on int SA by [Sch14, Thm. 1.5.3].
(iv): Let x∈X. Let K be a compact neighborhood of x and (xi)i∈I a net in K such that limi∈Ixi=x. Since K is compact, we have e(y)≥δ>0 and ∥sA(y)∥≥δ for y∈K. Hence cs(y) is bounded on K, say by k, by Proposition 8.
Since sA(y) is continuous, there exist M>0
such that ∥cs(y)sA(y)∥≤M on K.
Further, from (i) and (ii) it follows that ∂SA∩(s+\mathdsR⋅sA(y))={s−cs(y)sA(y)} for y∈K.
Define sy′:=s−cs(y)sA(y). Then sy′∈BM(s)∩∂SA for all y∈K. Since ∂SA is closed and BM(s) is compact, BM(s)∩∂SA is also compact. Therefore, (sxi′)i∈I⊆BM(s)∩∂SA has an accumulation point, say a. Since ∂SA is closed, a∈∂SA. Since cs(xi) is bounded by k and sA is continuous,
[TABLE]
for all v⊥sA(x), i.e., a−s∈[−k,k]⋅sA(x), so that a∈s+[−k,k]⋅sA(x). Then
[TABLE]
so (sxi′)i∈I has a unique accumulation point sx′. Thus limi∈Isxi′=sx′. This proves that the map y↦sy′ is continuous at x. Therefore,
[TABLE]
is continuous at x. Since x∈X was arbitrary, x↦cs(x) is continuous on X.
(v): Let c∈\mathdsR be such that s~:=s−c⋅sA(x)∈SA. Then Ls=Ls~+c⋅δx. Hence there is a representing measure μ of s such that c≤μ({x})≤ρLs(x). Taking the supremum over c yields cs(x)≤ρLs(x).
Assume that cs(x)<ρLs(x). By the definition of ρLs(x), there exist a c∈(cs(x),ρLs(x)) and a representing measure μ of s such that μ({x})=c. Then μ~:=μ−c⋅δx is a positive Radon measure representing s~=s−c⋅sA(x). But s~∈SA by (i), a contradiction. This proves that cs(x)<ρLs(x). Thus, cs(x)=ρLs(x).
(vi): Since s∈int SA, it follows from (i) and (ii) that
[TABLE]
Both numbers s−cs(x)sA(x) and s−cs(x)sA(x) belong to the set on left hand side set. Hence they are equal and therefore cs(x)=cs(x).
∎
From Proposition 10(iii) we easily derive that the supremum in (10) is attained if X is compact. This was proved in [Sch15, Prop. 6] by using the weak topology on the set of representing measures and the Portmanteau Theorem.
The following example shows that (iv) is false in general if s∈∂SA.
Example 11**.**
Let {x1,...,x10} be the zero set of the Robinson polynomial, A the homogeneous polynomials of degree 6 on \mathdsP(\mathdsR2), and s:=∑i=110sA(xi). By Theorem 18 and Example 18 in [dDS], s is determinate. Therefore,
[TABLE]
If K is not compact, then the supremum in (7) is not attained in general. This is shown by the following simple example.
Example 12**.**
Let X=\mathdsR, A={1,x,x2}. Set s=(1,0,1)T=21(sA(−1)+sA(1)). Clearly, sA(0)=(1,0,0)T. Then cs(0)=1, but s′=s−cs(0)sA(0)=(0,0,1)T is not in SA.
The following theorem improves the first equality in (1) and Proposition 1.
Theorem 13**.**
Suppose that condition (6) holds. If m≥2 and X has at most m−1 path-connected components, then CA≤m−1.
Proof.
Obviously, the Carathéodory number CA depends only on the linear span A, but not on the particular basis A of A=LinA. Hence we can assume without loss of generality that e=fm. Since e(x)>0 on X by assumption, bj:=fje−1∈C(X) for j=1,…,m. Set B={b1,…,bm}.
Let s be a moment sequence of B. First we prove that s has a finitely atomic representing measure of a most m−1 atoms. Upon normalization we can assume that sm=1. By Proposition 1, s has a k-atomic measure μ=∑j=1kcjδxj, where k≤m and xj∈X and cj>0 for all j. If k<m, we are done, so we can assume that k=m. Since X consists of at most m−1 path-connected components, it follows that at least two points xi, say x1 and x2, are in the same component, say X1, of X. Then there is a connecting path γ:[0,1]→X1 such that γ(0)=x1 and γ(1)=x2. For t∈[0,1] we denote by Δt the simplex in \mathdsRm−1×{1} spanned by the points sB(x1),sB(γ(t)),sB(x3),…,sB(xm). Since sm=1, we have ∑j=1mcj=1. Hence s:=(s1,…,sm) belongs to the convex hull of sB(x1),sB(x2),sB(x3),…,sB(xm), that is, s is in the simplex Δ1. By decreasing t to [math] it follows from the continuity of bi that there exists a t0∈[0,1] such that s belongs to the boundary of the simplex Δt0. Then s is a convex combination of at most m−1 vertices. This yields a k-representing measure μ~ of s with k≤m−1.
Now we show that each moment sequence of A has a k-atomic representing measure with k≤m−1. This in turn implies the assertion CA≤m−1. Let s′ be a moment sequence of A and let μ′ be a finitely atomic representing measure of s′. Let s be the moment sequence of B given by the measure e(x)dμ. As shown in the preceding paragraph, s has a k-atomic representing measure ν, where k≤m−1. Then e(x)−1dν is a k-atomic representing measure of s′.
∎
Corollary 14**.**
Let A={p∈\mathdsR[x1,…,xn]:deg(p)≤d} and X=\mathdsRn. Then
[TABLE]
We give two somewhat pathological examples. Example 15 shows that the assertion of Theorem 13 is not true if the assumption on the function e(x) is omitted.
Example 15**.**
Set
[TABLE]
φ1(x):=φ(x), φ2(x):=φ(x−1), φ3(x):=φ(x−2). Then A:={φ1,φ2,φ3}⊂C(\mathdsR). Using the moment sequence s=(1,1,1) we find that CA=3.
Example 16 gives a three-dimensional moment cone with CA=1. A slight modification of this idea yields for m∈\mathdsN an m-dimensional space A such that CA=1.
Example 16**.**
Let xL and yL be the coordinate functions of a space filling curve [Sag94, Ch. 5], i.e., xL,yL:[0,1]→[0,1] are continuous, nowhere differentiable on the Cantor set C, differentiable on [0,1]∖C, and the curve
[TABLE]
is surjective. Set A:={xL,yL,1} and X=[0,1]. Then
[TABLE]
and the moment cone SA={(x,y,z):z≥0,0≤x≤z,0≤y≤z} is full-dimensional. Clearly, (∗) implies that CA=1.
Remark 17**.**
In this paper the vector space A is finite-dimensional. However the definitions of the moment cone and the Carathéodory number can be extended to infinite-dimensional vector spaces A. The following example shows that even in this case it is possible that CA=1.
Let A={φn}n∈\mathdsN be the coordinate functions of the ℵ0-dimensional Schönberg space filling curve [Sag94, Ch. 7], i.e., φn is continuous and nowhere differentiable on [0,1] for all n, and set φ0=1. Then
[TABLE]
is surjective. The moment cone SA={(xn)n∈\mathdsN0:0≤xn≤x0} is full dimensional, closed, and CA=1 from (∗).
Theorem 18**.**
Let p∈A and x1,…,xk∈X,k∈\mathdsN. Suppose that p(x)≥0 for x∈X, Z(p)={x1,...,xk} and the set {sA(xi):i=1,...,k} is linearly independent. Then CA≥k.
Proof.
Let s=∑i=1ksA(xi). Clearly, Ls(p)=0 and hence suppμ⊆Z(p)={x1,…,xk} for any representing measure μ of s by Proposition 7. Assume there is an at most (k−1)-atomic representing measure μ. Without loss of generality we assume that x1∈/suppμ, so μ is of the form μ=∑i=2kciδxi, ci≥0. Then
[TABLE]
Since the set {sA(xi):i=1,...,k} is linear independent, this is a contradiction. Therefore, k=CA(s)≤CA.
∎
Applications of the previous theorem will be given in Examples 31 and 63. Deeper results on the connections between the Carathéodory number and the zeros of positive polynomials are treated in Section 6.
We derive some useful facts which will be used several times. We investigate some properties of the set
[TABLE]
of moment sequences which are given by measures of at most k atoms.
Lemma 19**.**
For fixed k∈\mathdsN the following are equivalent:
Sk* is convex, or equivalently, Sk+Sk⊆Sk.*
Sk=Sk+1.
k≥CA.
Proof.
(i)⇒(ii): Let s=(1−λ)s1+λsA(x)∈Sk+1 with s1,sA(x)∈Sk. Since Sk is convex , s∈Sk. Hence Sk=Sk+1.
(ii)⇒(iii): Let s=s0+λ1sA(x1)+...+λlsA(xl)∈Sk+l be an arbitrary moment sequence. Set si:=s0+λ1sA(x1)+...+λlsA(xi). Then
[TABLE]
Thus CA≤k.
(iii)⇒(ii): Since CA≤k, we have SCA⊆Sk⊆SCA. Here the last inclusion follows from the mimimality of CA. Hence, Sk=SCA is convex.
∎
An immediate consequence of the preceding lemma are the following inclusions:
[TABLE]
Proposition 20**.**
CA=min{k:Sk is convex}=min{k:Sk=Sk+1}.
For each k=0,1,...,CA there is a moment sequence s such that CA(s)=k.
Proof.
(i) follows at once from the minimality of CA in Lemma 19.
(ii): By (11), we have Sk−1⫋Sk for k=0,...,CA, where we set S−1:=∅. Therefore, Sk∖Sk−1=∅.
∎
Proposition 21**.**
Suppose that condition (6) is satisfied.
The cone S is pointed, that is, S∩(−S)={0}.
If S1 is closed, then Sk is closed for all k.
If the set X is compact, then Sk is closed for all k.
Proof.
(i): Suppose that s,−s∈S. Using that e(x)>0 on X we conclude that Ls(e)≥0 and L−s(e)=−Ls(e)≥0, so Ls(e)=0 and therefore s=0.
(ii): The proof follows by induction. Assume S1 and Sk is closed for some k. We show that Sk+1 is also closed.
Let (sn)n∈\mathdsN be a sequence of Sk+1 such that sn→s∈Sk+1. We can write sn=αnxn+βnyn such that xn∈Sk,yn∈S1, αn,βn∈[0,+∞), and ∥xn∥=∥yn∥=1 for all n. Since Sk and S1 are closed, the sets Sk∩B1(0) and S1∩B1(0) are both compact.
Hence we can find a subsequence (ni) such that
xni→x∈Sk∩B1(0), and yni→y∈S1∩B1(0). Let us assume for a moment that the sequences (αni) and (βni) are bounded. There is a subsequence nij such that αnij→α∈[0,+∞) and βnij→β∈[0,+∞). Then snij→s=αx+βy∈Sk+1. Thus, Sk+1 is closed.
We show that the sequence (βni) is unbounded if (αni) is unbounded. Taking the standard scalar product ⟨⋅,⋅⟩ in \mathdsRm, we can uniquely write yn=yn⊥+yn∥ with xn∥yn∥, xn⊥yn⊥. Then
[TABLE]
Since (sni) converges, the sequence (∥sni∥) is bounded by some k. Thus,
[TABLE]
and if (αni) is unbounded, so (βni) is unbounded. The same reasoning shows that (αni) is unbounded if (βni) is unbounded.
If the sequence (βni) is unbounded, (yni⊥) converges to [math] and hence y=−x. Since S is pointed by (i), this implies x=y=0, a contradiction to ∥x∥=∥y∥=1. This completes the proof.
(iii): By (ii) it suffices to prove that S1 is closed. Clearly, condition (6) implies that sA(x)=0 for all x∈X. Since A⊆C(X,\mathdsR) and X is compact, we have ∥sA∥−1sA∈C(X,Sm−1) (Sm−1 denotes the unit sphere in \mathdsRm) and range∥sA∥−1sA is closed. Hence, S1≡\mathdsR≥0⋅range∥sA∥−1sA is closed.
∎
More on the moment cone can be found in Proposition 30.
3. Caratheodory Numbers: Differentiable Functions
In the rest of this paper we assume that X=\mathdsRn or \mathdsP(\mathdsRn) and A is a finite-dimensional linear subspace of Cr(\mathdsRn;\mathdsR), r∈\mathdsN.
Clearly, Sk,A in Definition 6 is a Cr-map of \mathdsR≥0k×\mathdsRkn into \mathdsRm. Let DSk,A denote its total derivative. We can write
[TABLE]
The following number is crucial in what follows.
Definition 22**.**
[TABLE]
i.e., NA is the smallest number k of atoms such that DSk,A has full rank m=∣A∣.
A lower bound for NA is given by the following proposition.
Proposition 23**.**
We have ⌈n+1∣A∣⌉≤NA. If all functions fi are homogeneous of the same degree, then ⌈n∣A∣⌉≤NA.
Proof.
Since DSk,A has ∣A∣ rows and each atom contributes n+1 columns, we need at least k≥n+1∣A∣ atoms for full rank. Thus, NA≥⌈n+1∣A∣⌉.
If all functions fi are homogeneous of degree r, then fi(λx)=λrfi(x) and so δλx=λrδx. Hence DS1,A has rank at most d and kernel dimension at least 1. Therefore, at least k≥n∣A∣ atoms are needed, so that NA≥⌈n∣A∣⌉.
∎
Example 24**.**
Let φ∈C0∞(\mathdsR),φ=0, and supp(φ)⊆(0,1). Set φi(x):=φ(x−i+1) for i=1,...,m∈\mathdsN and A:={φ1,...,φm}. Then ∂sA(x)=φi′(x)ei for x∈(i−1,i) or [math] otherwise. Then NA=CA=m.
Theorem 25**.**
Suppose that A⊆C1(\mathdsRn,\mathdsR). Then
[TABLE]
Set C=(1,...,1)∈\mathdsRNA.
There exists X∈\mathdsRNA⋅n and an open neighborhood U of (C,X) such that for every ε>0 there are (Cε,Xε)∈U and λε∈\mathdsR such that
[TABLE]
Proof.
It clearly suffices to prove the second part of the theorem. The first assertion follows then from the second.
Since DSNA has full rank, there is a (C,X)∈\mathdsR>0NA×\mathdsRNAn such that DSNA(C,X) has full rank. Since scaling the columns of DSNA(C,X) does not change the rank, we can assume without loss of generality that C=(1,...,1). Since the determinant is continuous there is an open neighborhood U of (C,X) such that
[TABLE]
Let s∈\mathdsRm. By (∗) there is a (Cε,Xε)∈U such that SA,NA(C,X)−SA,NA(Cε,Xε) is a multiple of s, i.e., (15) holds for some λε∈\mathdsR.
∎
Definition 26**.**
Let n,k∈\mathdsN with k≥NA. A k-atomic measure (C,X) on \mathdsRn is called regular (for Sk,A) iff DSk,A(C,X) has full rank. Otherwise the measure (C,X) is called singular (for Sk,A).
A real sequence s=(sα)α∈A is called regular iff Sk,A−1(s) is empty (that is, s is not a moment sequence) or consists solely of regular measures. Otherwise, s is called singular.
Theorem 27**.**
Suppose that A⊂Cr(\mathdsRn;\mathdsR) and r>NA⋅(n+1)−m. Then
[TABLE]
Further, the set of moment sequences s which can be represented by less than NA atoms has ∣A∣-dimensional Lebesgue measure zero in \mathdsRm.
Proof.
By Proposition 23 we have r>NA⋅(n+1)−m≥0, so that r≥1.
The moment sequences which can be represented by less than NA atoms are singular values. Hence the second assertion follows from Sard’s Theorem [Sar42].
To prove (15) assume to the contrary that CA<NA. Then every moment sequence in the moment cone is singular. This is a contradiction to Sard’s Theorem since the moment cone has non-empty interior.
∎
Remark 28**.**
Theorem 27 also holds for the signed Carathéodory number with verbatim the same proof. With Theorem 25 we get
[TABLE]
for A⊂Cr(\mathdsRn;\mathdsR) and r>NA⋅(n+1)−m. Without these conditions the lower bound needs not to hold, neither for CA nor for CA,±, see [Fed69, pp. 317–318].
Proposition 29**.**
Suppose that A⊆C1(\mathdsRn,\mathdsR) and {x∈\mathdsRn:sA(x)=0} is bounded. Let γ∈C1(\mathdsRn,\mathdsR) be such that γ(x)≥1 and lim∣x∣→∞γ(x)fi(x)=0 for i=1,…,m and let s be a moment sequence of A. Set
[TABLE]
with l∈\mathdsN0 and c≥0. Then:
- (i)
Γl,c(s)* is closed for all l∈\mathdsN0 and c≥0.*
2. (ii)
Γ0,c(s)* is compact for all c≥0.*
If, in addition, s is regular, then:
- (iii)
∃c≥0:Γl,c(s)* unbounded ⇔ l≥1.*
2. (iv)
Γl,c(s)* compact ∀c≥0 ⇔ l=0.*
Proof.
(i): If f∈C(\mathdsRn,\mathdsRm) and K⊆\mathdsRm is closed, then f−1(K) is closed by the continuity of f. Since both intersecting sets in (17) are of the form f−1(K), they are closed and so is their intersection.
(ii): Suppose Γ0,c is non-empty. Since Γ0,c is closed by (i), it suffices to show that it is bounded. Assume to the contrary that it is unbounded and let (C(i),X(i)) be a sequence such that limi→∞∥(C(i),X(i))∥=∞. Since
[TABLE]
the sequence (C(i)) is bounded. The sequence (X(i)) is unbounded. After renumbering and passing to subsequences we can assume that cj(i)→cj∗ for all j, xj(i)→xj∗ for j=1,...,k and ∥xj(i)∥→∞ for j=k+1,...,CA(s) as i→∞. Since
[TABLE]
for all i, it follows that
[TABLE]
Therefore, μ∗=((c1∗,...,ck∗),(x1∗,...,xk∗)) is a k-atomic representing measure of s with k≤CA(s). By the minimality of CA(s), k=CA(s). Hence all sequence (xj(i)) are bounded which is a contradiction. Thus Γ0,c(s) is bounded.
It is clear that (iii) and (iv) are equivalent. Thus it suffices to prove (iii).
(iii) “⇒”: By (ii), if Γl,c(s) is unbounded, we find a k-atomic representing measure with k<CA(s)+l, i.e., l≥1.
(iii) “⇐”: We will show that there is a c>0 such that for every x∈\mathdsRn there is a representing measure μ in Γ1,c(s) which has x as an atom. This will prove that Γ1,c, hence also Γl,c, is unbounded for l≥1.
Let μ0=(C0,X0)=((c0,1,...,c0,CA(s)),(x0,1,...,x0,CA(s))) be a representing measure of s. Set
[TABLE]
Since s is regular, all representing measures have full rank. Hence there exist variables y1,...,ym from c1,...,cCA(s),x1,1,...,xCA(s),n such that DyS(μ0) is a square matrix with full rank. Then
[TABLE]
is a C1-function such that F((C0,X0),0)=0 and DyF((C0,X0))=DyS(μ0) is bijective. Thus, F fulfills all assumptions of the implicit function theorem, hence there are an ε>0 and a C1–function (C(t),X(t)) such that F((C(t),X(t)),t)=0 for all t∈(−ε,ε). Since ci,0>0, there is t0∈(0,ε) such that ci(t0)>0 for all i, so
[TABLE]
is a (CA(s)+1)-atomic representing measure of s which has x as an atom.
∎
(iii) and (iv) no longer hold if s is singular. E.g., let s be moment sequence of the measure μ=∑i=110δxi where xi are the ten zeros of the Robinson polynomial, then Sk,A−1(s)⊆[0,10]k×{x1,...,x10}k is compact for all k≥10.
The next proposition summarizes a number of basic properties of the sets Sk and the Carathéodory number. Recall that Bρ(t) is the ball with center t and radius ρ in \mathdsRm.
Proposition 30**.**
Suppose that Sk−1 is closed for some k, NA≤k≤CA. Then there exist a moment sequence s and an ε>0 such that CA(t)=k for all t∈Bε(s).
s∈intSCA* if and only if
there exists (C,X) such that SA(C,X)=s and rankDSA(C,X)=∣A∣.*
s∈∂SCA* if and only if rankDSA(C,X)<∣A∣ for all (C,X) such that SA(C,X)=s.*
Suppose that NA<CA and Sk is closed for all k=NA,...,CA. Then for each such k there exists s∈intSCA such that all k-atomic representing measures of s are singular, but s has a regular representing measure with at least k+1 atoms.
Suppose that NA<CA, Sk is closed for k=CA−1,CA and SCA=\mathdsR∣A∣. If \mathdsR∣A∣∖SCA−1 is path-connected, there exists s∈∂SCA such that CA(s)=CA.
Proof.
(i): Fix such a number k and assume (intSk)∖Sk−1=∅. Then we have Sk−1⊇intSk⊇intSk−1. Taking the closure, Sk−1=Sk−1⊇Sk⊇Sk−1=Sk−1, so that Sk⊆Sk−1 which contradicts (11). Thus, (intSk)∖Sk−1=∅.
(ii): “⇐”: Let (C,X) be a full rank measure of s. Then a neighborhood U of (C,X) is mapped onto a neighborhood of s, that is, s is an inner point.
“⇒”: Let s be an inner point. Choose ν such that SA(ν) has full rank. Since s is an inner point, there exists ε>0 such that s′:=s−ε⋅SA(ν) is also an inner point. In particular, s′ is a moment sequence. Let μ′ be a representing measure of s′. Then μ=μ′+ε⋅ν is a representing measure of s and has full rank, since already DSA(ν) has full rank.
(iii) follows from (ii).
(iv): Let s∈intSk⊆intSCA. By (i), there exists t∈(intSCA)∖Sk for all k=NA,...,CA−1. Then [s,t]:={λs+(1−λ)t∣λ∈[0,1]}⊆intSCA by the convexity of SCA. Therefore, since s∈intSk but s∈intSk, we have
[TABLE]
Hence there exists s∈intSCA∩∂Sk. Then all k-atomic representing measures of s are singular. Otherwise, a full rank k-atomic measures implies that s is an inner point of Sk. But, by (iv), s has a regular l-atomic measure with l>k.
(v): Let s∈intSCA∖SCA−1 by (i) and t∈\mathdsR∣A∣∖SCA. Since s,t∈\mathdsR∣A∣∖SCA−1, they are path-connected, so there exists a continuous path γ:[0,1]→\mathdsR∣A∣∖SCA−1 with γ(0)=s and γ(1)=t. But since s=γ(0)∈intSCA, t=γ(1)∈intSCA, and γ([0,1])⊆\mathdsR∣A∣∖SCA−1, we have γ([0,1])∩(∂SCA∖SCA−1)=∅. Therefore, ∂SCA∖SCA−1=∅.
∎
In Sections 4 and 6 we derive upper bounds of CA by using Proposition 8 and the inequality (8). As (v) implies, this inequality can be strict, since the Carathéodory number CA can be attained at a boundary point, see the following example.
Example 31**.**
The (homogeneous) Motzkin polynomial
[TABLE]
has the 6 projective roots
[TABLE]
We consider the truncated moment problem on the projective space \mathdsP(\mathdsR2) for
[TABLE]
Clearly, M∈linB. Since M is non-negative and has a discrete set of roots, s=∑ξ∈Z(M)sB(ξ) is a boundary point of the closed moment cone. The matrix
[TABLE]
has rank 6, i.e., the set {sB(ξ)}ξ∈Z(M) is linearly independent. Hence CB(s)=6 by Theorem 18 and CB≤6=∣B∣−1 by Theorem 13. Thus, CB=CB(s)=6, that is, the Carathéodory number is attained at the boundary moment sequence s.
Next we derive an upper bound for the Carathéodory number in terms of zeros of positive elements of A. For the rest of this section we assume that X is a closed subset of
\mathdsRn or \mathdsP(\mathdsRn) and A⊆C1(X,\mathdsR). By the latter we mean that there exists an open subset U of \mathdsRn or \mathdsP(\mathdsRn) such that X⊆U and A⊆C1(U,\mathdsR).
Definition 32**.**
*Let MA be the largest number k obeying the following property:
(∗)k: There exist f∈A and x1,…,xk∈Z(f) such that f(x)≥0 on X and {sA(xi)}i=1,...,k is linearly independent (DSk,A((1,...,1),(x1,...,xk)) does not have full rank).*
From the definition it is clear that MA is the largest dimension an exposed face of SA.
Proposition 33**.**
For each s∈∂SA∩SA we have CA(s)≤MA.
Proof.
In this proof we abbreviate N:=CA(s).
Let μ=∑i=1Nciδxi be an N-atomic representing measure of s.
Since s∈∂SA∩SA, there exists f∈A,f=0, such that f(x)≥0 on X and Ls(f)=0. From the latter it follows that suppμ⊆Z(f) and hence x1,…,xN∈Z(f). Further, by Proposition 30(iii), s∈∂SA∩SA implies that DSN,A(C,X) does not have full rank ∣A∣. Since ci>0 for all i, we have rankDSN,A(C,X)=rankDSN,A((1,…,1),X).
Finally, by Theorem 18 the set {sA(xi)}i=1,...,N is linearly independent. Thus, property (∗)N in Definition 32 holds, so that CA(s)=N≤MA.
∎
Theorem 34**.**
Suppose that X is a compact subset of \mathdsRn or \mathdsP(\mathdsRn), condition (6) is satisfied, and A⊆C1(X,\mathdsR). Then
[TABLE]
Proof.
The assumptions of this theorem ensure that Proposition 8 applies. Hence the assertion follows by combining Proposition 33 with the inequality (8).
∎
4. Carathéodory Numbers: One-dimensional Monomial Case
For the one-dimensional truncated moment problem the number NA can be calculated from the formula for the Vandermonde determinant.
Lemma 35**.**
Let A:={1,x,...,xn}, where n∈\mathdsN.
If n=2k−1,k∈\mathdsN, then
[TABLE]
If n=2k,k∈\mathdsN, then
[TABLE]
NA=⌊2n⌋+1=⌈2n+1⌉.
Proof.
We carry out the proofs in the odd case n=2k−1. The even case n=2k is derived in a similar manner.
We have ∂ciSk(C,X)=sA(xi) and ∂xiSk(C,X)=cisA′(xi). Therefore,
[TABLE]
and we compute
[TABLE]
Combined with (20), this yields (18).
We choose the numbers xi pairwise different and all ci positive. Then the determinants in (18) and (19) are non-zero. Hence detDSk,A=0 and therefore NA=k=⌊2n⌋+1.
∎
Example 36**.**
H. Richter [Ric57] has shown that for the one-dimensional truncated moment problem A={1,x,...,xd} the Carathéodory number is CA=⌈2d+1⌉. This result will also follow from Theorem 45 below. If we take this equality for granted and combine it with Lemma 35(iii), then we obtain
[TABLE]
Now we turn to the general case and assume that
[TABLE]
Then we compute
[TABLE]
From the latter equation it follows that each linear polynomial xj−xi, j=i, divides the polynomial fA. Hence there exists a polynomial pA such that
[TABLE]
The polynomial pA is uniquely determined by (4). It is homogeneous with degree
[TABLE]
Such polynomials pA are called Schur polynomials. They are well studied in the literature, see e.g. [Mac95]. For these Schur polynomials it is known that
[TABLE]
where α ranges over some Young tableaux. In particular, (24) implies that all non-zero coefficients of pA are positive.
Example 37**.**
- (1)
A={x,x4,x7}. Then we compute
[TABLE]
where
[TABLE]
2. (2)
A={x,x2,x6}. Then
[TABLE]
where
[TABLE]
3. (3)
A={1,x,x2,x6}. Then
[TABLE]
with
[TABLE]
4. (4)
A={1,x2,x3,x5,x6}. Then
[TABLE]
with
[TABLE]
Ω={all permuations of (2,2,1,1,0)}, Φ={all permuations of (2,1,1,1,1)}.
Definition 38**.**
Assume that A is as in (21) and pA is defined by (4). Set
[TABLE]
if m=2k is even and
[TABLE]
for all i=1,...,k if m=2k−1 is odd.
Lemma 39**.**
- (i)
If m is even then qA is symmetric.
2. (ii)
If m is odd then qA,i(x1,...,xk)=qA,k(x1,...,xi−1,xi+1,...,xk,xi) for all i=1,...,k.
Proof.
(i): Since the Schur polynomial pA is symmetric, so is qA.
(ii): We derive
[TABLE]
In the odd case it suffices to prove formula (26) below. All other determinants are then obtained by interchanging variables and Lemma 39(ii).
Lemma 40**.**
Suppose that A is of the form (21).
If m=2k is even, then
[TABLE]
If m=2k−1 is odd, then
[TABLE]
NA=⌈2m⌉.
Proof.
(i): Clearly, ∂ciSk(C,X)=sA(xi) and ∂xiSk(C,X)=cisA′(xi). By the linearity of the determinant, the factor c1⋯ck can be taken out, so we can assume without loss of generality that c1=...=ck=1.
Let m=2k. We proceed in a similar manner as in the proof of Lemma 35. Using (4) we derive
[TABLE]
(ii): The proof in the odd case n=2k−1 is similar.
(iii): Since qA is not the zero polynomial and all nonzero coefficients are positive, there are x1,...,xk such that det(DSk,A)(x1,...,xk)=0. Then DSk,A has full rank, so that NA=k=⌈m/2⌉.
∎
Now we turn to the homogeneous case and set
[TABLE]
Example 41**.**
- (1)
In the case B={xy7,x4y4,x7y} we have
[TABLE]
with
[TABLE]
2. (2)
B={xy5,x2y4,x6}. Then we have
[TABLE]
with
[TABLE]
Definition 42**.**
For even m=2k we define
[TABLE]
For odd m=2k−1 we set
[TABLE]
Lemma 43**.**
Let B be of the form (27).
If m=2k is even, then
[TABLE]
If m=2k−1 is odd, then
[TABLE]
qB* in (28) and qB,k in (29) are in \mathdsN0[x1,y1,...,xk,yk].*
NB=⌈2m⌉.
Proof.
(i): Again it suffices to prove the formulas in the case c1=...=ck=1. We set u=yx and ui=yixi. Using the relation ∂x=y−1∂u and equations (18) and (28) we compute
[TABLE]
(ii): We proceed in a similar manner and derive
[TABLE]
(iii): First we show that qB and qB,k are polynomials. That they are polynomials in x1,...,xk is clear, since qA and qA,k are polynomials in the coordinates and all xi appear only with non-negative exponent in the definitions. Therefore, it suffices to show that they are also polynomials in all yi. We will only prove the statement for qB, for qB,k the same chain of arguments holds.
Assume the contrary. Then qB contains a term with yk−l for some l>0 with non-zero coefficient. Let l be the largest such l and let f(x,y):=∑iaixαiyβi be the factor of yk−l in qB. Since f is non-zero by assumption, there are Z=(x1,y1,...,xk,yk)∈\mathdsR2k and ε>0 such that f is non-zero on the ball Bε(Z) centered at Z with radius ε.
On the other hand, we expand ∏1≤i<j≤k(xjyi−xiyj)4 and let g(x,y):=∑ibixγiyδi be the sum of all terms therein which contain no yk. Then g is a polynomial in all xi and yi and g is not the zero-polynomial. Hence g is not identically zero on Bε(Z) and so is
[TABLE]
From the Laplace expansion it follows that the determinant
[TABLE]
is a polynomial in xi,yj. Further, fg appears in the expansion of the product
[TABLE]
and by the maximality of l it does not cancel. Hence (∗) does not contain a term with yk−l but (∗∗) does. Since both are equal by (i), we get a contradiction. Thus, qB is a polynomial in all yi.
It remains to show that all coefficients of qB are in \mathdsN0. Since qA comes from the Schur polynomial pA (see (24)), its coefficients are in \mathdsN0. This is not changed by multiplication with (y1⋯yk)2(dm−d1−m)+3, so qB has \mathdsN0-coefficients as well.
(iv): By (iii) all nonzero coefficients of qB are positive integers. Hence, by (i) and (ii), we can find real numbers x1,y1,…,xk,yk such that the corresponding determinants are non-zero. Hence NB=k=⌈2m⌉.
∎
Example 44**.**
- (1)
Let B={xy7,x4y4,x7y}. Then we have
[TABLE]
2. (2)
B={xy5,x2y4,x6}. Then
[TABLE]
The following theorem is the main result of this section. It gives sufficient conditions for the validity of formula (31) concerning the Carathéodory number CB.
Theorem 45**.**
Let m,d1,d2,…,dm,d∈\mathdsN be such that 0=d1<...<dm=2d, put A={1,xd2,...,xdm}, B={y2d,xd2y2d−d2,...,xdm−1y2d−dm−1,x2d}, and Z:=Z(qA) if m is even or Z:=Z(qA,1)∩...∩Z(qA,k) if m=2k−1 is odd, where qA and qA,j are as in Definition 38. Suppose that
[TABLE]
Then
[TABLE]
Proof.
Recall that SA and SB denote the moment cones of A and B, respectively. We set ∂∗SA:=∂SA∩SA.
By Lemmas 40(iii) and 43(iv) we have NA=NB=⌈2m⌉. Further, NA≤CA and NA≤CB by Theorem 27. Therefore, it suffices to show that CA≤NA and
CB≤NA.
First we prove that CB≤NA.
Let s∈SB. Since X=\mathdsP(\mathdsR) is compact and condition (6) is satisfied (with e(x,y):=x2d+y2d∈B), Proposition 8 applies with x=(1,0). Hence the supremum cs(1,0):=sup{c∈\mathdsR:s−csB(1,0)∈SB} is attained and s′:=s−cs(1,0)sB(1,0)∈∂SB. By Proposition 30(iii) all representing measures (C′,X′) of s′ are singular. They do not contain (1,0) as an atom. (Indeed, otherwise cs(1,0) could be increased which contradicts to the maximality of cs(1,0).) Since the polynomials of B are homogeneous, we can assume without loss of generality that Xi′=(xi′,1) with xi′ pairwise different, say x1′<x2′<...<xl′, and s′∈∂∗SA, i.e., s′ is a boundary moment sequence of SA. But from (30) and Lemma 43,(i) and (ii), it follows that l<NA, that is, CB(s)≤l+1≤NB=NA. This completes the proof of the inequality CB≤NA.
Next we show that CA≤NA.
If s∈∂∗SA, then CA(s)<NA by the preceding proof. Now let s∈intSA=intSB. Then CB(s)≤NA by the preceding paragraph and it suffices to show that s has an at most NA-atomic representing measure which does not have an atom at (1,0). We choose ε>0 such that Bε(s)⊆int SA.
Let ct(x) be defined by (7).
Since t↦Lt is a continuous map of \mathdsRm→A∗, t↦Lt(e) is continuous. Hence, ct(x)≤e(x)−1Lt(e) (by Proposition 8) is bounded from above on Bε(s). Then the supremum C of ct(1,0) on Bε(s) is finite. Let
[TABLE]
be the ε-tube around the line γ:=s−[0,C+1]⋅sB(1,0). Write T=T1∪T2∪T3 with T2:=T∩∂SB, T1:=T∩int SB, and T3:=T∖(T1∪T2), i.e., T1 is the part inside SB, T3 is the part outside SB, and T2 is the boundary part of SB in T. Since SB is closed and convex, T2 is closed and every path in T starting in T1 and ending in T3 contains at least one point in T2. By construction, t′:=t−ct(1,0)sB(1,0)∈T2 for all t∈T1 and no representing measure of t′ contains (1,0) as an atom, i.e., T2⊂∂∗SA. Then γ=s−[0,1]⋅(C+1)sB(1,0)⊂T and s∈γ∩T1 and s−(C+1)sB(1,0)∈γ∩T3, so that s′=s−cs(1,0)sB(1,0)∈T2. Since sB is continuous and C<∞ there is a δ>0 such that
[TABLE]
Thus, s−(C+1)sB(1,δ)∈T3. Then γδ:=s−[0,1]⋅(C+1)sB(1,δ)⊂T and s∈γδ∩T1 and s−(C+1)sB(1,δ)∈γδ∩T3, i.e.,
[TABLE]
Summarizing, s=sδ′+cs(1,δ)sB(1,δ) and sδ′ has a k-atomic representing measure (k<NA) which has no atom at (1,0). Therefore, s has an l-atomic presenting measure (l≤NA) which has no atom at (1,0). This proves CA(s)≤NA.
∎
We illustrate the preceding by the following examples.
Example 46**.**
Let A={1,x2,x3,x5,x6} and B={y6,x2y4,x3y3,x5y,x6}, that is, m=5. Then we have
[TABLE]
where
[TABLE]
This implies NB=NA=3 as also proved in Lemma 35 and 43. Hence CA≥3. From the Richter–Tchakaloff Theorem (Proposition 1) we find CA≤m=5, while Theorem 13 gives a better bound CA≤m−1=4.
To apply Theorem 45 we have to check that the assumptions are satisfied. Clearly, d1=0 and dm=6 is even. It remains to show that (30) is true. By symmetry it suffices to verify (30) for
[TABLE]
Set Z:=Z(f1)∩Z(f2)∩Z(f3) and let X=(x,y,z)∈Z. If X=0, then (30) holds. Now let X=0. Since f is homogeneous, we can scale X such that x2+y2+z2=1. Then we derive (for instance, by using spherical coordinates)
[TABLE]
so (30) is fulfilled. Therefore, by Theorem 45 we have CA=3.
A nice application of the preceding example is the following corollary.
Corollary 47**.**
Let p(x)=a+bx2+cx3+dx5+ex6 be a non-negative polynomial which is not the zero polynomial. Then p has at most 2 distinct real zeros.
Proof.
Assume to the contrary that p has three distinct zeros, say x,y,z. Let s be the moment sequence of the measure μ=δx+δy+δz. Then Ls(p)=0, so s is a boundary point of the moment cone. But from (34) it follows that the determinant (32) is non-zero, so s is an inner point, a contradiction.
∎
In the following example the assumption (30) of Theorem 45 is not satisfied and the assertion (31) does not hold.
Example 48**.**
Let A={1,x,x2,x6} and B={y6,xy5,x2y4,x6}. From Theorem 13, CB≤m−1=3, while Theorem 27 and
[TABLE]
yield 2=NB≤CA, so that CB∈{2,3}. We prove that CB=3.
Let ν:=41(δ−2+δ−1+δ1+δ2). Then
[TABLE]
By some straightforward computations it can be shown that s has no k-atomic representing measure with k≤2. Therefore, since CB∈{2,3}, we have CB=3=⌈23⌉.
*Note that NB=2=⌈23⌉. Thus, the equality (31) fails.
*
5. Carathéorody Numbers: Multidimensional Monomial Case
Definition 49**.**
For n,d∈\mathdsN set
[TABLE]
Note that ∣An,d∣=(n+dd) and ∣Bn,d∣=(n+d−1d).
Throughout this section, we assume the following:
For the polynomials An,d:=LinAn,d we consider the truncated moment problem on X=\mathdsRn, while for the homogeneous polynomials Bn,d:=LinBn,d the moment problem is treated on the real projective space X:=\mathdsP(\mathdsRn−1). Let Sn−1 be the unit sphere in \mathdsRn,n≥2, and S+n−1 the set of points x∈Sn−1 for which the first non-vanishing coordinate is positive. We consider S+n−1 as a realization of the projective space \mathdsP(\mathdsRn−1).
The following simple fact is often used without mention: A polynomial of Bn,2d is non-negative on S+n−1, equivalently on \mathdsP(\mathdsRn−1), if and only if it is on \mathdsRn−1.
The following example shows how differential geometric methods can be used for the truncated moment problem.
Example 50**.**
Let n=d=k=2, xα=(x(1))α1(x(2))α2 and
[TABLE]
Then
[TABLE]
where C=(c1,c2) and X=(x1,x2), xi=(xi(1),xi(2)). From this we find that
[TABLE]
Hence rankDS2,A2,2=5 at each point (x1,x2), x1=x2, so the local rank theorem of differential geometry applies. Fix (C,X) as above. The local rank theorem [Hil03, Proposition 1, p. 309] implies that there is a one-parameter family (C(t),X(t)) which has the same moments as (C,X) satisfying the differential equations γ˙(t)=v(C(t),X(t)) with initial condition (C(0),X(0))=(C,X). This system is
[TABLE]
and its solution is given by
[TABLE]
*with t∈(−2c2,0,2c1,0). Here C=(c1,0,c2,0) and X=((γ1,1,γ1,2),(γ2,1,γ2,2)) are the initial values at t=0. It should be noted that the corresponding moment sequence is indeterminate, but it is a boundary point of the moment cone.
*
Recall that NA≤CA by Theorem 27. There are various other lower bounds for Carathéodory numbers in the literature, see e.g. [DR84, p. 366]. In the case A2,2k−1, M. Möller [Möl76] obtained the lower bound
[TABLE]
The following result improves Möller’s lower bound.
Proposition 51**.**
[TABLE]
For k≥4 we have
[TABLE]
Proof.
The second inequality of (37) has been stated in Proposition 23. It reamins to prove the first inequality of (37). In the cases k=1,2,3 it is verified by direct computations; we omit the details. For k≥4 it follows from the following computation:
[TABLE]
Before we turn to our next result we restate the Alexander–Hirschowitz Theorem [AH95]. We denote by Vn,d,r the vector space of polynomials in n variables of degree at most d having singularities at r general points in \mathdsRn.
Proposition 52**.**
The subspace Vn,d,r has the expected codimension
[TABLE]
except for the following cases:
d=2; 2≤r≤n, codimVn,2,r=r(n+1)−r(r−1)/2;
d=3; n=4, r=7, dimV4,3,7=1;
d=4; (n,r)=(2,5),(3,9),(4,14), dimVn,4,r=1.
Theorem 53**.**
We have
[TABLE]
except for the following cases
d=2: NAn,2=n+1.
n=4, d=3: NA4,3=8.
n=2, d=4: NA2,4=6.
n=3, d=4: NA3,4=10.
n=d=4: NA4,4=15.
Proof.
From the corresponding definitions of Vn,d,r and DSk,An,d we obtain
[TABLE]
Therefore, apart from exceptional cases, (39) follows at once from the Alexander–Hirschowitz Theorem. Next we treat the exceptions.
(i): Note that NAn,2≥⌈n/2⌉+1. Since for all k satisfying ⌈n/2⌉+1≤k≤n the matrix DSk,An,2((1,...,1),X) has not the expected full rank for any X, the first k with full rank is k=n+1.
(ii): Since NA4,3≥7 and DS7,A4,3((1,...,1),X) has not the expected full rank for any X, NA4,3=8.
(iii): We have NA2,4≥5 and DS5,A2,4((1,...,1),X) has not the expected full rank for any X. Hence NA2,4=6.
(iv): Then NA3,4≥9 and DS9,A3,4((1,...,1),X) has not the expected full rank for any X. Thus NA3,4=10.
(v): Then NA4,4≥14 and DS14,A4,4((1,...,1),X) has not the expected full rank for any X. Therefore, NA4,4=15.
∎
For the homogeneous case we have
Corollary 54**.**
NBn+1,d=NAn,d.
Proof.
Let X=(X1,...,Xk)∈\mathdsRnk, k=NAn,d, be such that DSk,An,d(1,X) has full rank. Then DSk,Bn+1,d(1,Y) with Y=((X1,1),...,(Xk,1)) has full rank, so that NAn,d=k≥NBn+1,d.
On the other hand, let Y=(Y1,...,Yk)∈\mathdsR(n+1)k, k=NBn+1,d, be such that DSk,Bn+1,d(1,Y) has full rank. We can assume that all (n+1)-th coordinates of Yi are non-zero by the continuity of the determinant and therefore they can be chosen to be 1, since we are in \mathdsP(\mathdsRn).
The column ∂n+1sBn+1,d(Yi) depends linearly on sBn+1,d(Yi) and ∂jsBn+1,d(Yi), j=1,...,n. Therefore, omitting this column does not change the rank. Hence DSk,An,d(1,X) with Yi=(Xi,1) has full rank, that is, NAn,d≤k=NBn+1,d.
∎
6. Carathéodory Numbers and Zeros of positive Polynomials
For f∈B3,2d, Z\mathdsP(f) denotes the projective zero set of f. Set
[TABLE]
In this section we use the following proposition of Choi, Lam, and Reznick [CLR80].
Proposition 55**.**
Let f∈B3,2d. Suppose that f∈Pos(\mathdsR3) and ∣Z\mathdsP(f)∣>α(2d). Then ∣Z\mathdsP(f)∣ is infinite and there are polynomials p∈B3,2d1, q∈B3,d2 such that f=pq2, where d1+d2=d, p∈Pos(\mathdsR3), ∣Z\mathdsP(p)∣<∞, q is indefinite, and ∣Z\mathdsP(q)∣ is infinite. (It is possible that p is a positive real constant; in this case d1=0 and we set B3,0:=\mathdsR.)
The main aim of this section is to derive upper bounds for the Carathéodory number CBn,2d, n=3. The first approach (Theorem 57) applies also to cases with n>3 (see Theorem 59). The second approach (Theorem 62) is based on Bezout’s Theorem and gives better bounds.
For d∈\mathdsN let β(2d) denote the maximum of ∣Z\mathdsP(f)∣, where f∈B3,2d, f∈Pos(\mathdsR2) and Z\mathdsP(f) is finite. By the Choi–Lam–Reznick Theorem (Proposition 55), β(d)≤α(d) for d∈\mathdsN. We abbreviate C2d:=CB3,2d.
Theorem 56**.**
[TABLE]
Proof.
Let s∈S. Since the projective space \mathdsP(\mathdsRn−1) is compact and condition (6) holds with e:=x12d+x22d+x32d, it follows from Proposition 8 that C2d≤maxs∈∂SC2d(s)+1. Therefore, it is sufficient to show
[TABLE]
for all s∈∂S.
Let s=∑i=1lcisB3,2d(xi) be an l-atomic representating measure of s∈∂S. Since s∈∂S, there exists a polynomial p∈B3,2d,p=0, such that p(x)≥0 on \mathdsP(\mathdsR2) and Ls(p)=0. Then suppμ⊆Z(p), that is, x1,…,xl∈Z(p).
We can assume without loss of generality that the set {sB3,2d(xi)}i=1,...,l is linearly independent. Indeed, assume that these vectors are linearly dependent and let ∑i=1ldisB3,2d(xi)=0 be a non-trivial linear combination. Since all ci>0, there exists ε>0 such that ci+εdi≥0 for all i and cj+εdj=0 for one j. Hence μ′=∑i=1l(ci+εdi)⋅sB3,2d(xi) is a (l−1)-atomic representing measure of s.
The polynomial p∈B3,2d is non-negative on \mathdsP(\mathdsR2), hence on \mathdsR3, so the Choi–Lam–Reznick Theorem (Proposition 55) applies. There are two cases:
∣Z(p)∣≤β(2d).
p=h2q, where k:=deg(h)≥1.
In the case a) we have ∣Z(p)∣≤β(2d) by the definition of β(2d) and therefore C2d(s)≤β(2d). This is the case d=k in (∗).
Now we turn to case b). Then k=1,…,d−1.
Let D(k) denote the largest l for which there exist y1,…,yl∈Z(h) such that the vector sB3,2d(y1),....,sB3,2d(yl) are linearly independent. Then, by the paragraph before last, we have
[TABLE]
Let y1,...,yl∈Z(h). We define
[TABLE]
and hα:=xαh for α∈\mathdsN03,∣α∣=2d−k. Let h~α be the coefficient vector of hα, that is, hα(⋅)=⟨h~α,sB3,2d(⋅)⟩. Since sB3,2d(yi)T⋅h~α=⟨h~α,sB3,2d(yi)⟩=yiαh(yi)=0, we have h~α∈kerM(y1,...,yl). Clearly, the vectors h~α are linearly independent.
Therefore, using (42) we derive
[TABLE]
which is the k-th term in (∗).
Summarizing, we have k=d in case a) and k=1,...,d−1 in case b). Thus we have proved (∗) for arbitrary s∈∂S which completes the proof.
∎
As far as the authors know, the numbers β(2d) are not yet known for d≥4, but we have β(2d)≤α(2d) by Proposition 55.
Theorem 57**.**
For d∈\mathdsN we have
[TABLE]
Proof.
Since β(2d)≤α(2d)=23d(d−1)+1 and (d−k)(k+3)−1≥0 for all d∈\mathdsN and k=0,...,d−1, we have for (41)
[TABLE]
Inserting the latter into (41) we obtain the assertion.
∎
In Table 1 we collect some numerical cases of Carathéodory bounds.
The next proposition is also due to Choi–Lam–Reznick [CLR80]. We will use it to derive a bound for the Carathéodory number CB4,4.
Proposition 58**.**
If p∈B4,4 and ∣Z\mathdsP(p)∣>11, then p is a sum of at most six squares of quadratics.
Theorem 59**.**
CB4,4≤26.
Proof.
Let s be a boundary moment sequence. Then there exists p∈B4,4,p=0, such that p∈Pos(\mathdsR3) and Ls(p)=0. By Proposition 58, ∣Z(p)∣≤11 or we have p=f12+...+f62 for some f1,...,f6∈B4,2.
In the following proof we give an upper bound on the maximal number l of linearly independent vectors sB4,4(x1),...,sB4,4(xl) with xi∈Z(p). By Theorem 18, this number l is an upper bound of CB4,4(s). We proceed in a similar manner as in the proof of Theorem 57.
By Proposition 58 we have two cases:
∣Z(p)∣≤11,
p=f12+...+fk2, k≤6.
In the case a) we clearly have l≤∣Z(p)∣≤11.
Now we treat
case b). Clearly, Z(f12+...+fk2)⊆Z(f12)=Z(f1).
Hence it suffices to determine the maximal number l for a single square p=f2, where f∈B4,2,f=0. Let x1,...,xl∈Z(f) be such that the set {sB4,4(xi)}i=1,...,l is linearly independent. Define
[TABLE]
fα:=xαf for α∈\mathdsN04,∣α∣=2, and f~α by fα(⋅)=⟨f~α,sB4,4(⋅)⟩. Then we have f~α∈kerM(x1,...,xl), since sB4,4(xi)T⋅f~α=⟨f~α,sB4,4(xi)⟩=xiαf(xi)=0. The vectors f~α are linearly independent. Therefore, dimkerM(x1,...,xl)≥#fα=∣B4,2∣ and
[TABLE]
This proves that the moment sequence s can be represented by at most 25 atoms.
Summarizing both cases, we have shown that each s∈∂S has a k-atomic representing measure with k≤25. Therefore, by Proposition 8, CB4,4≤25+1=26.
∎
Proposition 1 yields CB4,4≤35, while Theorem 13 gives CB4,4≤34. Combining the upper bound of Theorem 59 with the lower bound from Theorem 27 we get
[TABLE]
Now we give another approach to obtain estimates of the Carathéodory number CB3,2d from above. It is based on Bezout’s Theorem.
Let f1∈B3,d1 and f2∈B3,d2.
For each point t∈Z\mathdsP(f1)∩Z\mathdsP(f2) the intersection multiplicity It(f1,f2)∈\mathdsN of the projective curves f1=0 and f2=0 at t is defined in [Wal78, III, Section 2.2].
We do not restate the precise definition here. In what follows we use only the fact that It(f1,f2)≥2 if t is a singular point of one of the curves f1=0 or f2=0.
We use the following version of Bezout’s Theorem. The symbol ∣Z∣ denotes the number of points of a set Z.
Lemma 60**.**
*If f1∈B3,d1 and f2∈B3,d2 are relatively prime in \mathdsR[x1,x2,x3], then
*
[TABLE]
Proof.
See e.g. [Wal78, p. 59].
∎
Lemma 61**.**
Let s be a moment sequence for B3,2d. Suppose p∈B3,k is irreducible in \mathdsR[x1,x2,x3], k≤d, and Ls(p2(x12+x22+x32)d−k)=0. Then
[TABLE]
Proof.
Consider the moment cone S~:=S(B3,2d,Z(p)). Then S~ is an exposed face of the moment cone S=S(B3,2d,\mathdsP(\mathdsR2)) and s∈S~. By Proposition 21, S is closed and so is S~. Clearly, each point of S~ is the limit of relative inner points of S~. Therefore, since the sets S~k are closed by Proposition 21, it is sufficient to prove the assertion for all relatively inner points of the cone S~.
Let s be a relatively inner point of S~ and x∈Z(p). Setting e:=x12d+x22d+x32d, condition (6) holds. Since Z(p) is compact, Proposition 8 applies, so the supremum cs(x):=sup{c:s−c⋅sB3,2d(x)∈S~} is attained and s′:=s−cs(x)⋅sB3,2d(x)∈∂S~.
Thus there exists a supporting hyperplane of the cone S~ at s′. Hence there exists a polynomial q∈B3,2d such that Ls′(q)=0, Ls(q)>0, and q≥0 on Z(p).
From Ls′(q)=Ls(q)−cs(x)q(x)=0 it follows that q(x)=0. (Indeed, otherwise Ls(q)=0, so s would be a boundary point of S~, a contradiction.) Since p(x)=0 and q(x)=0, the irreducible polynomial p is not a factor of q, so p and q are relatively prime and Bezout’s Theorem applies.
Since q(y)≥0 on Z(p), for each intersection point of q and p has the intersection multiplicity of at least 2. Therefore, by Lemma 60,
[TABLE]
Since each representing measure of s′ is supported on Z(p)∩Z(q), (45) implies that C2d(s′)≤dk. Hence C2d(s)≤C2d(s′)+1≤dk+1.
∎
Our main result in this section is the following theorem.
Theorem 62**.**
C2d≤α(2d)+1=23d(d−1)+2* for d∈\mathdsN, d≥5.*
Proof.
Let us consider the moment cone S:=S(B3,2d,\mathdsP(\mathdsR2)). We proceed in a similar manner as in the proof of
Lemma 61. By Proposition 21, the sets Sk are closed. Hence it suffices to prove the inequality C2d(s)≤α(2d)+1 for all relatively inner points of the cone S.
Let s be an inner point of S and x∈\mathdsP(\mathdsR2). By Proposition 8, the supremum cs(x):=sup{c:s−c⋅sB3,2d(x)∈S} is attained and s′:=s−cs(x)⋅sB3,2d(x)∈∂S. Then there exists a supporting hyperplane of S at s, hence there is a polynomial f∈B3,2d such that Ls′(f)=0 and f≥0 on \mathdsP(\mathdsR2). We apply Proposition 55 to f. Then, we can write f=p⋅q12⋯qr2 (r≤d), where p∈Pos(\mathdsP(\mathdsR2)), all qi are indefinite and irreducible in \mathdsR[x1,x2,x3], Z(p)<∞ and all ∣Z(qi)∣ are infinite.
Since
[TABLE]
we find a disjoint decomposition Z∪Z1∪⋯∪Zr of Z(f) with Z⊆Z(p) and Zi⊆Z(qi). Let μ′=∑j=1mcjδxj be a representing measure of s′ and set
[TABLE]
Clearly, s′=s0+s1+⋯+sr. Setting di=deg(qi) and 2k=deg(p), we have d=k+d1+⋯+dr and r≤d−k. Using Proposition 55 and Lemma 61 we derive
[TABLE]
Therefore, CB3,2d(s)≤CB3,2d(s′)+1≤α(2d)+1=23d(d−1)+2 for all d≥5.
∎
Example 63** (d=5).**
W. R. Harris [Har99] discovered a polynomial h∈B3,10 that is nonnegative on \mathdsP(\mathdsR2) with projective zero set
[TABLE]
where (a,b,c)∗ denotes all permutations of (a,b,c) including sign changes. Hence h has exactly 30 projective zeros zi,i=1,…,30. A computer calculation shows that the matrix (sB3,10(z))z∈Z\mathdsP(h) has rank 30, i.e., the set {sB3,10(zi):i=1,…,30} is linearly independent. Therefore, CB3,10≥30 by Theorem 18. Further, we compute NB3,10=15 and have CB3,10≤α(10)+1=37 by Theorem 62. Summarizing,
[TABLE]
The following corollary reformulates Theorem 18 in the present context.
Corollary 64**.**
Let d∈\mathdsN and p∈B3,2d. Suppose that p∈Pos(\mathdsR3), ∣Z(p)∣=β(2d), and the set {sB3,2d(z):z∈Z(p)} is linearly independent. Then
[TABLE]
It seems natural to ask whether or not the assumption on the linear independence of the set {sB3,2d(z):z∈Z(p)} in Corollary 64 can be omitted. This leads to the
Question:* Suppose p∈B3,2d, p∈Pos(\mathdsR3), and ∣Z(p)∣<∞ (or ∣Z(p)∣=β(2d)).
Is the set {sB3,2d(z):z∈Z(p)} linearly independent?*
Note that for the Robinson polynomial R∈B3,6 the answer is “Yes”.
Recall that β(2d)≤α(2d) by the Choi–Lam–Reznick Theorem (Proposition 55). It seems likely to conjecture that
[TABLE]
The Robinson polynomial has 10 projective zeros, so that α(6)=β(6)=10. Therefore, since CB3,6=11 as shown in [Kun14], this conjecture is true for d=3.
As noted above, the Harris polynomial R∈B3,10 has 30 projective zeros. Hence 30≤β(10)≤α(10)=31.
From the proof of Theorem 62 it follows that (47) holds if
[TABLE]
7. Carathéodory Numbers and Real Waring Rank
In Definition 4 we introduced the signed Carathéodory number CA,±. In this section we connect it to the real Waring rank w(n,2d), that is, to the smallest number w(n,2d) such that each f∈Bn,2d can be written as real linear combination
[TABLE]
of 2d-powers of linear forms x⋅λi=λi,1x1+⋯+λi,nxn, where k≤w(n,2d), ci∈\mathdsR, λi∈\mathdsRn.
Let us recall some basics on the apolar scalar product [⋅,⋅], see e.g. [Rez92]. For α=(α1,...,αn)∈\mathdsN0n with ∣α∣:=α1+⋯+αn=2d we set γα:=α1!⋯αn!(2d)!. Let p,q∈Bn,2d. We write p(x)=∑αγαaαxα and q(x)=∑αγαbαxα and define
[TABLE]
Then (Bn,2d,[⋅,⋅]) becomes a finite-dimensional real Hilbert space. Setting fλ(x):=(λ⋅x)2d, we obtain
[TABLE]
Let f be of the form (48). Then, for p∈Bn,2d it follows from (49) that
[TABLE]
that is, the linear functional Lf on Bn,2d is the integral with respect to the signed measure μ:=∑i=1kciδλi. Conversely, each signed atomic measure yields a function f of the form (48) such that (50) holds. By the Riesz Theorem all linear functionals on Bn,2d are of the form Lf, where f is as in (48).
Theorem 65**.**
w(n,2d)=CBn,2d,±.
NBn,2d≤w(n,2d)≤2NBn,2d.
Set N:=NBn,2d. Then there exists λ=(λ1,...,λN)∈\mathdsRN⋅n such that for all ε>0 and p∈Bn,2d we have
[TABLE]
for some λε=(λ1ε,...,λNε) with ∥λ−λε∥<ε, ∣1−ci∣<ε, c∈\mathdsR.
The set of vectors λ as in (iii) is open and dense in \mathdsRNBn,2d⋅n.
Proof.
(i) is clear from the preceding considerations on the apolar scalar product.
Remark 28 and (i) imply (ii), while (iii) follows from Theorem 25 combined with (i). (iv) is a consequence of Sard’s Theorem as in Theorem 27.
∎
With Theorem 53 the upper bound in (ii) was already obtained in [BT15, Cor. 9].
Acknowledgment
The authors are grateful to G. Blekherman, M. Schweighofer, and C. Riener for valuable discussions on the subject of this paper.
K.S. thanks also J. Stückrad to helpful discussions.
P.dD. was supported by the Deutsche Forschungsgemeinschaft (SCHM1009/6-1).