Finite Blaschke products with prescribed critical points, Stieltjes polynomials, and moment problems
Gunter Semmler, Elias Wegert

TL;DR
This paper explores the equivalence between determining finite Blaschke products from critical points, charge interaction models, and moment problems, revealing new theoretical insights and algorithmic possibilities.
Contribution
It establishes the equivalence of three problems involving Blaschke products, charge configurations, and moment problems, and links these to polynomial solutions and energy minimization.
Findings
Proves the equivalence of three key problems in complex analysis and mathematical physics.
Connects Blaschke products with Stieltjes polynomials and moment problems.
Suggests new algorithmic approaches based on energy minimization.
Abstract
The determination of a finite Blaschke product from its critical points is a well-known problem with interrelations to other topics. Though existence and uniqueness of solutions are established for long, we present several new aspects which have not yet been explored to their full extent. In particular, we show that the following three problems are equivalent: (i) determining a finite Blaschke product from its critical points, (ii) finding the equilibrium position of moveable point charges interacting with a special configuration of fixed charges, (iii) solving a moment problem for the canonical representation of power moments on the real axis. These equivalences are not only of theoretical interest, but also open up new perspectives for the design of algorithms. For instance, the second problem is closely linked to the determination of certain Stieltjes and Van Vleck polynomials for a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMathematical functions and polynomials · Quantum chaos and dynamical systems · Matrix Theory and Algorithms
Finite Blaschke products with prescribed critical points,
Stieltjes polynomials, and moment problems
Gunter Semmler and Elias Wegert
Abstract
The determination of a finite Blaschke product from its critical points is a well-known problem with interrelations to several other topics. Though existence and uniqueness of solutions are established for long, we present new aspects which have not yet been explored to their full extent. In particular, we show that the following three problems are equivalent: (i) determining a finite Blaschke product from its critical points, (ii) finding the equilibrium position of moveable point charges interacting with a special configuration of fixed charges, and (iii) solving a moment problem for the canonical representation of power moments on the real axis. These equivalences are not only of theoretical interest, but also open up new perspectives for the design of algorithms. For instance, the second problem is closely linked to the determination of certain Stieltjes and Van Vleck polynomials for a second order ODE and characterizes solutions as global minimizers of an energy functional.
1 Introduction
A finite Blaschke product of degree is a rational function of the form
[TABLE]
whose (not necessarily distinct) zeros are in the unit disc and . A point where is called critical point of the Blaschke product. Every Blaschke product of degree has exactly critical points in (counted with multiplicities), and another critical points , symmetric to with respect to the unit circle (see e.g. [24]). Note that if is a -fold zero then and are both criticals points of order , and if [math] is a critical point then we also count as a critical point of the same multiplicity.
While it is straightforward to determine the critical points of a Blaschke product from its zeros (by computing the zeros of a polynomial), the reverse problem is much more challenging. The basic existence and uniqueness result is summarized in the following theorem.
Theorem 1**.**
Let be points in . Then there is a Blaschke product of degree with critical points . is unique up to post-composition with a conformal automorphism of .
This theorem has been proved (in chronological order) by Heins [15], Wang and Peng [39], Bousch [3], and Zakeri [40], using topological arguments. Stephenson [36, Theorem 21.1] obtains as the limit of discrete finite Blaschke products, i.e., he considers sequences of circle packings with prescribed branch set. Kraus and Roth [19], [20] describe an approach based on a solution of the Gaussian curvature equation (that works equally well for infinite Blaschke products), and ask for a procedure to actually compute from its critical points.
In this paper we show that the determination of Blaschke products with prescribed critical points is equivalent to two other classical problems of analysis. The first one is obtained after transforming the problem from the unit disk to the upper half plane in form of a second order ODE. We will have to look for its polynomial solutions known as Stieltjes and Van Vleck polynomials. Like in the case originally considered by Stieltjes this allows an electrostatic interpretation and the characterization of solutions as minimum points of an energy functional. This approach yields a new and (as we hope) very transparent proof of Theorem 1. Moreover, the polynomial encoding the given critical points is positive on the real axis and therefore an inner point of the convex cone of positive polynomials on . By mapping it to an inner point of the cone of power moments we demonstrate that the problem is also equivalent to the classical problem of finding canonical representations of given moments.
Algorithmic aspects of the different approaches and the results of numerical experiments will be discussed in a forthcoming paper.
2 Transformations
Let be Blaschke product of degree . As in [34, Lemma 3] it is convenient to transform the problem using the (inverse) Cayley transform
[TABLE]
which maps and onto the upper half-plane and the extended real line , respectively, and satisfies , . Consequently, is a rational function that is real-valued on (except at its poles). The transition between the unit disc and the upper half plane was also a key tool in the work of Gorkin and Rhoades [8] on boundary interpolatioin by finite Blaschke products.
Let us first assume that satisfies the normalization . Then has a zero at infinity and therefore with real polynomials of degree and of degree . Since, for an appropriate branch of the argument function, the mapping , is continuous and strictly monotone from onto some interval , has simple poles at real numbers corresponding to the values of with . In each of the intervals as well as in and , the function is strictly increasing. Therefore, the partial fraction decomposition of has the form
[TABLE]
with positive numbers . Conversely, if is a rational function of the form (1) with ordered poles and , then is a rational function that maps and onto themselves and satisfies . Hence is a Blaschke product. By the argument principle it has exactly zeros in , i.e., has degree . Thus we have shown:
Lemma 1**.**
The mapping is a bijection between all Blaschke products of degree satisfying and all rational functions of the form (1) with ordered poles and numbers for .
It will turn out useful to also consider Blaschke products with the side condition . Then has a pole at infinity and can be written in the form with real polynomials of degree and of degree , respectively. As in the case considered above, one derives the existence of finite poles corresponding to the points , where . In each of the intervals as well as in and the function is strictly increasing, hence the partial fraction decomposition of has the form
[TABLE]
with , and . Proceeding as above, we get the following lemma.
Lemma 2**.**
The mapping is a bijection between all Blaschke products of degree satisfying and all rational functions of the form (2) with , , and for .
We first investigate the problem for functions of the form (1). It follows from the chain rule that is a critical point of if and only if is a critical point of . Since Möbius maps preserve symmetries in circles and lines, also the points are critical points of corresponding to the critical points of . The derivative of has the form
[TABLE]
with a monic real polynomial of degree , and
[TABLE]
We conclude that has the factorization
[TABLE]
where , . The polynomial is entirely determined by the location of the critical points and (3) shows that it satisfies the equation
[TABLE]
Evaluating (6) at we get
[TABLE]
Inserting (7) into (6), and introducing the Lagrange interpolation polynomials
[TABLE]
which satisfy , we can rewrite equation (6) as
[TABLE]
Since has degree , the Lagrange-Hermite interpolation formula (cf. Chapter 14.1 of [38]) implies that for any pairwise distinct points ,
[TABLE]
where for ,
[TABLE]
are the fundamental polynomials of the first and second kind of Hermite interpolation. From formulas (10), (11) we get
[TABLE]
The polynomials are linearly independent, and comparing (9) and (12) we see that the representation (9) holds if and only if
[TABLE]
Since the are exactly the (simple) roots of , this condition means that the polynomial is divisible by , i.e., there is a real polynomial such that
[TABLE]
Because has exact degree we see that has degree . One easily checks that these considerations can be reversed to obtain the following equivalence statement.
Lemma 3**.**
Let , , , , and let , , be defined by (1), (4), (5), respectively. Then the rational function has critical points if and only if there is a real polynomial of (exact) degree such that
[TABLE]
and the are given by (7) for some .
The corresponding constructions for the function in (2) are similar: Its derivative is
[TABLE]
where is defined in (5) as before, and
[TABLE]
Now we obtain the equation
[TABLE]
in particular
[TABLE]
With the help of the Lagrange interpolation polynomials with nodes , given by
[TABLE]
equation (16) can be rewritten as
[TABLE]
Since and are monic, is at most of degree , and hence the Lagrange-Hermite interpolation formula applied to this polynomial tells us that
[TABLE]
where, for ,
[TABLE]
A straightforward computation yields
[TABLE]
and comparing this with (19) we get
[TABLE]
which is equivalent to the divisibility of by (see (15)). The quotient is a polynomial of exact degree . We summarize these results in the next lemma.
Lemma 4**.**
Let , , , , , , and let , and be given by (2), (5), and (15), respectively. Then the rational function has critical points if and only if there is a real polynomial of degree such that
[TABLE]
and the are given by (17).
In this way the determination of a Blaschke product with given critical points is reduced to the question for which real polynomials (resp. ) the second order ODE (13) (resp. (20)) has a polynomial solution of degree (resp. of degree ) with simple real roots. This is a classical problem which will be considered in the next section.
3 Stieltjes and Van Vleck polynomials
Let and be given polynomials of degrees and , respectively. A polynomial of degree is called Van Vleck polynomial, if the generalized Lamé equation
[TABLE]
has a polynomial solution of preassigned degree . The solutions are called Stieltjes polynomials or Heine-Stieltjes polynomials. Under certain condition given below, Stieltjes [37] proved the existence of the polynomials that carry now his name (see also [38, Section 6.8]). Stieltjes assumed that
[TABLE]
has real roots and that the coefficients in the partial fraction decomposition
[TABLE]
are all positive. Now consider partitions of into non-negative integer summands . There are such partitions, and each partition corresponds to exactly one monic polynomial with distinct real roots such that (21) holds for a suitable Van Vleck polynomial . This polynomial has exactly roots in for and can be characterized as follows: Put positive charges at the fixed positions and movable unit charges on the real line such that each interval contains exactly of them. As usual in plane electrostatics, the force (repulsion or attraction) between two charges is assumed to be proportional to their magnitudes and to the inverse of their distance. Stieltjes proved that there is a unique equilibrium position of these positive movable unit charges; it is attained when the charges are located at the zeros of and corresponds to the global minimum of the potential energy of each possible charge configuration.
In our equations (13) and (20) the polynomial does not have real zeros so that Stieltjes’ results are not directly applicable. The necessary adaptations to the situation at hand will be done in this section.
Assume first that is a Van Vleck polynomial for the equation (13) and that has the form (4). The identity
[TABLE]
is well-known (see for instance formula (2.9) in [17]). Recall that the logarithmic derivative of has the representation
[TABLE]
Dividing (13) by and evaluating at using (22) and we get
[TABLE]
Conversely, if the equations (23) are satisfied, the polynomial has the zeros , so that is a polynomial which satisfies (13). Summarizing we get the following result:
Lemma 5**.**
Let be given and define by (5). Then the polynomial from (4) with is a Stieltjes polynomial for the equation (13) if and only if (23) holds. Similarly, the polynomial from (15) with is a Stieltjes polynomial for the equation (20) if and only if
[TABLE]
Equation (23) has the following electrostatic interpretation: Fix negative charges of size at each of the points and . Then (moveable) positive unit charges at the positions on the real axis are in equilibrium in the field generated by all charges. Analogously, (24) describes the equilibrium positions of positive unit charges at on the real axis in the presence of the same negative charges at the points .
The equilibrium of moveable positive unit charges in the presence of fixed negative charges in was studied by Orive and García [29]. Their results are applicable to (24), and we will include a proof only to make our exposition self-contained (see Lemma 6 below). However, the equilibrium problem described by (23) is not covered by their results, since the number of movable unit charges surpasses the total of negative charges by .
Other equilibrium problems involving fixed charges have recently been considered by Grünbaum [10], [11], Dimitrov and Van Assche [5], [6], and Grinshpan [9]; see Marcellán, Martínez-Finkelshtein, and Martínez-González [22] for an overview until 2007. More recent literature includes Martínez-Finkelshtein and Rakhmanov [23], McMillen, Bourget, and Agnew [25], Orive and Sánchez-Lara [30], [31], and Shapiro [35].
Example 1. Let , i.e., there is only one critical point given in . Then (24) with simplifies to
[TABLE]
The only solution of this equation is , the expected equilibrium position of one unit charge. The equations (23) read as follows
[TABLE]
From the first equation in (25) we get
[TABLE]
and after some elementary manipulations we arrive at
[TABLE]
Since this equation is invariant with respect to interchanging and , the second equation of (25) yields the same conditon. Equation (26) has a simple geometric meaning; it says that, by Thales’ theorem, the points and lie on a circle.
\operatorname{Re}$$\operatorname{Im}$$t$$\zeta$$\overline{\zeta}$$x_{1}$$x_{2}$$t
The equilibrium positions are therefore not uniquely determined: for each there is a corresponding , and vice versa. As we will see later (Theorem 2), the situation is similar for .
As pointed out in [19], the uniqueness statement in Theorem 1 follows from Nehari’s generalization of Schwarz’ Lemma (see [26, corollary to Theorem 1]). While all proofs of the existence part of Theorem 1 in the literature are quite hard, the electrostatic interpretation allows us to provide a simple and transparent proof of existence and uniqueness.
The main argument for proving uniqueness is originally due to Sarason and Suarez [33] and has also been used in [9] and [29]. The formulation of [29, Theorem 2] can easily be extended to cover our situation here. In our exposition the proof is naturally based on the fundamental relation (19).
We start with the problem of movable unit charges at positions on the real line and introduce their energy, which is (neglecting some physically motivated factor)
[TABLE]
Since does not change upon permutations of its variables, we can confine our considerations to the open subset of where .
Lemma 6**.**
The function attains a global minimum at a point in . This point is the only critical point of in and corresponds to the unique solution of (24) with .
Proof.
-
A point is a critical point of if and only if for . A straightforward computation shows that this is equivalent to (24).
-
In order to prove that attains a global minimum in , we observe that the first sum in (27) is bounded from below, while the second sum tends to whenever approaches a finite boundary point of (which implies that for some ). So it only remains to study the behavior of when in some norm of . Let be a constant (depending on ) such that
[TABLE]
Using the inequality
[TABLE]
we estimate
[TABLE]
and hence as . We conclude that attains a global minimum at a point which represents a solution of (24).
- In order to show uniqueness (of the solution of (24) and hence the critical point of in ), we assume that and are two solutions of (24). For let
[TABLE]
By Lemma 5, both and are Stieltjes polynomials, i.e. they satify (20) for suitable Van Vleck polynomials. Going back from this equation in the calculations in Section 2 we infer they also satisfy equation (19), and hence we have
[TABLE]
The polynomials are of degree , hence they must be linearly dependent, i.e., there are real numbers , not all vanishing, such that
[TABLE]
Multiplying this by and adding it to (28) we get
[TABLE]
for all . Since we can choose such that (at least) one of the numbers , with vanishes while all the others remain non-negative. Assume e.g. for some . Since S\big{(}t_{k_{0}}\big{)}=0 and S_{k}\big{(}t_{k_{0}}\big{)}=0 for , the left hand side of (29) vanishes at . But the right hand side of this equation cannot have other zeros than , hence for some .
If we are done. Otherwise the polynomials of degree
[TABLE]
are also linearly dependent, so that we have a non-trivial relation of the form
[TABLE]
With (28) we have for
[TABLE]
As above we find now another pair of equal zeros of and . Proceeding inductively we finally get that both solutions of (24) are identical. ∎∎
The solution of (23) can now be reduced to (24) using an appropriate Möbius transformation. We summarize the results for both equations in the following theorem.
Theorem 2**.**
For arbitrary we have:
- (i)
The equation (24) has a unique solution with . It realizes the global minimum of the potential energy defined in (27), and is (up to a rearrangement of the ) the only critical point of . 2. (ii)
Let additionally , and fix . For every there are unique points for such that equation (23) is satisfied. All depend continuously and monotonously on , and we have if and if .
Proof.
Assertion (i) has been proved in Lemma 6. In order to show the existence part of (ii) let , be arbitrary and define by (17). According to Lemma 4 and Lemma 5, the function from (2) has critical points . Since as ( and is strictly increasing in each interval (, any such interval contains a unique solution of . It is clear that these points have the claimed properties concerning continuity, monotonicity and limit behavior. The function defined by
[TABLE]
is rational and can be represented as a quotient , where is a real polynomial of degree , and is a real polynomial of degree , respectively. Since has simple poles in , the partial fraction decomposition of has the form (1). From the monotonicity of we obtain that is increasing between two poles and thus . Because and have the same critical points , we can combine Lemma 3 and Lemma 5 to obtain (23).
In order to show the uniqueness assertion of (ii) we let be any solution of (23) with and define by (7) for some and replaced by . Let the function be of the form (1) with poles at and residues . According to Lemma 3 and Lemma 5, has critical points . Since is real-valued and increasing between any two of its poles, it has zeros for . Hence has the form
[TABLE]
The critical points of are the same as those of , hence we can again invoke Lemma 4 and Lemma 5 to conclude that solve (24). From the uniqueness stated in (i) it follows that for . Consequently, by (14) and (15),
[TABLE]
Hence for some , and therefore with . Using (30) we obtain
[TABLE]
and since and both have poles at it follows that . Thus , in particular for . ∎∎
The proof of Theorem 2 also indicates how to construct rational functions and of the form (1) or (2) with given critical points from the solutions of (23) and (24), respectively. Setting we obtain by Lemma 1 a Blaschke product with critical points , and the same is true for by Lemma 2. This shows again the existence of a Blaschke product of degree with given critical points in . In the same way the uniqueness statement of Theorem 1 can be inferred from the uniqueness statements in Theorem 1 and we have thus provided an independent proof of this result.
Example 2. Consider the case where . Since , we have , i.e., this problem is equivalent to finding a Blaschke product of degree with critical point of order at . An obvious solution with is , and the solutions of are the th roots of unity (). The unique solution of (24) is then given by the images of under the mapping ,
[TABLE]
The case where all coincide at another point of can be reduced to this cases by an appropriate automorphism with .
By now we know that the Stieltjes polynomials (4) (corresponding to the one-parameter family of solutions of (23)) and (15) (corresponding to the solution of (24)) are associated with certain Van Vleck polynomials satisfying (13) and (20), respectively. It turns out that all these polynomials are the same, so that we can speak about the Van Vleck polynomial for the critical points .
Lemma 7**.**
Let and let and be the solutions described in Theorem 2. Then the Van Vleck polynomials and , corresponding to solutions (4) and (15) of (13) and (20), respectively, coincide.
Proof.
Let be defined by (1) and (7) with and let be given by (2) and (17) with and . As we have seen in the proof of Theorem 2, and are connected by the equation
[TABLE]
with constants and (i.e. they differ by an automorphism of that maps [math] to ). Differentiating this equation we obtain
[TABLE]
and plugging in (3) and (14) we arrive at
[TABLE]
so that . Since , , as , we have shown the remarkable identity
[TABLE]
Differentiating (31) we get
[TABLE]
Recall from (3) that , and thus
[TABLE]
Using (in this order) (20), (32), (13), (33), and (31), we get
[TABLE]
so that . ∎∎
As a consequence of this lemma we obtain that for the Van Vleck polynomial the solution space of the Lamé equation
[TABLE]
consists only of polynomials, and that its general solution is given by
[TABLE]
Fixing and letting run through , the zeros of the polynomials (35) run through all equilibrium positions of points. Solutions with correspond to the limit configuration where one charge escaped to infinity and only charges remain on the real line. The preceding result also confirms what was said in [29, Remark 3] about possible polynomial solutions of the Lamé equation.
Once two Stieltjes polynomials and as solutions of (34) are known, the polynomial and the Van Vleck polynomial can be reconstructed as shows the following result.
Lemma 8**.**
Let and be monic polynomial solutions of degree and , respectively, of the Lamé equation (34). Then we have
[TABLE]
and
[TABLE]
Proof.
Since and are linearly independent they form a fundamental system for the differential equation. Their Wronskian
[TABLE]
is therefore non-vanishing and Abel’s formula applied to (13) implies that for
[TABLE]
Since and are easily seen to be monic, the two polynomials coincide and we have proved (36). Differentiating this equation we get
[TABLE]
and using (34), (36), we finally arrive at
[TABLE]
and (37) follows. ∎∎
4 Convex cones and moment problems
In this section we will show that a well-known problem in moment theory is also equivalent to the determination of a Blaschke product with given critical points. We start with some notions and facts from convex analysis, our main sources are [1], [2], [4], [16], [18], [21].
A convex cone is a subset of a real vector space, such that implies for all positive and . One standard example is the convex cone of real non-negative polynomials of degree at most ,
[TABLE]
where we identified a polynomial with the vector of its coefficients. The interior of this convex cone consists of all positive polynomials; hence the polynomial from (5) belongs to the interior of provided that for . Recall that encodes the given data, i.e., the critical points of the Blaschke product.
Another important example is the convex cone of symmetric positive semi-definite matrices,
[TABLE]
The interior of is the convex cone of symmetric positive definite matrices,
[TABLE]
Furthermore, we consider the moment cone
[TABLE]
where is the set of nonnegative measures on such that
[TABLE]
The set is the conic hull of the moment curve
[TABLE]
i.e., is the smallest convex cone containing , cf. Theorem 2.1 in chapter V of [18]. Note that is not a closed subset of since points in the closure of this set can involve representations with “mass at infinity”. More precisely, belongs to the closure of if and only if it has the representation
[TABLE]
with and non-negative (representing a mass at infinity). An alternative characterization of is
[TABLE]
where
[TABLE]
denotes the Hankel matrix associated with . Also, a point is an inner point of if and only if is positive definite. Using the linear mapping
[TABLE]
we can therefore write
[TABLE]
A representation (39) of a point in (the closure of) the moment cone is usually not unique. Therefore one searches for canonical representations of where the measure is concentrated at a finite number of points. For example, if is concentrated at points with masses and mass at infinity we have
[TABLE]
If is concentrated at points with masses and has no mass at infinity then we have
[TABLE]
For given moments , the moment problem consists in finding the roots and the corresponding weights , , featuring in the representations (42), (43), respectively. The following theorem is well known (see e.g. [1], [18]).
Theorem 3**.**
For any the following assertions hold:
- (i)
There is a representation (42) with uniquely determined roots , weights and . 2. (ii)
Let additionally , , and fix . Then for every there are unique roots for and weights such that equation (43) is satisfied. All depend continuously and monotonically on .
Before going on, we observe that the moment problem (43) can be rephrased as a matrix factorization problem. Let
[TABLE]
denote the Vandermonde matrix of the points . With the abbreviations , , and from (40), the equations (43) can be rewritten as
[TABLE]
This representation is known as the Vandermonde factorization of the (positive definite) Hankel matrix (see Heinig and Rost [13], [14]).
There is also a nice way to obtain the roots . The polynomial defined by
[TABLE]
is of (exact) degree and its roots are . To see this we observe that in view of the first columns of the matrix in (45) are linearly independent and in view of (42) each of them is a linear combination of the first columns of . Hence the spaces spanned by the first columns of these two matrices coincide, such that the two determinants and vanish for the same values of .
Similarly, in view of the first columns of the matrix in
[TABLE]
are linearly independent and by (43) we find that the zeros of coincide with the zeros of . Hence the zeros of are the roots described in Theorem 3 (ii).
There is an evident similarity between Theorem 2 and Theorem 3. As we will see in a moment, the solutions described in both theorems coincide if we map the polynomial from (5) to an appropriate point . This mapping was investigated by Nesterov [27] and will be described below. Hachez and Nesterov [12] used it together with the Vandermonde factorization (44) to represent a positive polynomial as a weighted sum of squares of Lagrange interpolation polynomials as in formula (9). This yields another equivalent reformulation of the original Blaschke product problem.
In order to describe a mapping between and we have to recall the notion of duality. If is a convex cone in a real Hilbert space with scalar product , its dual cone is . In the sequel we use scalar products for vectors and matrices defined by
[TABLE]
With respect to these scalar products we have the following duality relations:
[TABLE]
Let
[TABLE]
be the mapping which sends a matrix to the vector of its anti-diagonal sums. The mapping is dual to the mapping defined in (41) in the sense that
[TABLE]
Moreover, the cone of non-negative (positive) polynomials can be obtained as the image of the cone of non-negative (positive) definite matrices,
[TABLE]
The mapping has an interesting interpretation when we identify a vector with the polynomial , and a matrix with the polynomial in two variables
[TABLE]
where is the function defined in (38). Then is the operator of equating variables that maps to the polynomial
[TABLE]
Nesterov [27] introduced the mapping
[TABLE]
which is defined for whenever is nonsingular. If is even positive definite, the inverse matrix is also positive definite and we have for
[TABLE]
i.e., is (the coefficient vector of) a positive polynomial. Thus the Nesterov mapping maps to . Nesterov [27] also remarked that is the (negative) gradient of the function
[TABLE]
which is a (strongly non-degenerate self-concordant) barrier functional for , see [28] or [32] for definitions and basic properties. Hence the gradient of is a bijection of onto the interior of its dual cone . We will give an alternative proof of this fact, demonstrating how the Nesterov mapping connects Theorem 2 and Theorem 3.
Theorem 4**.**
Let be a positive polynomial with zeros in the upper half plane. Then there is a unique such that . If and satisfy (23) and (24), respectively, then the canonical representations (42) and (43) hold with the positive numbers
[TABLE]
where is the leading coefficient of . Conversely, if (42) and (43) are satisfied for , and positive numbers , then also the equilibrium conditions (23) and (24) are true. Moreover, , , and satisfy (47).
The proof of Theorem 4 is split into several lemmas. Without loss of generality we always assume that and .
In the following we denote by the Bezoutian of (interpretable as polynomial of degree ), which is the matrix defined by
[TABLE]
see [7], [13], [14] for more information on this topic. Recall that the inverse of a non-singular Hankel matrix is a non-singular Bezoutian, and vice versa.
Lemma 9**.**
For each there exists a vector with .
Proof.
Let be a positive polynomial with zeros and for . Then with and the monic polynomial from (5). Let and be the polynomials (4) and (15) with zeros at the equilibrium points and according to Theorem 2. Let further be the Bezoutian of and , i.e.,
[TABLE]
The application of yields in view of (46)
[TABLE]
We have shown in equation (36) of Lemma 8 that is the Wronskian of and , from which we conclude that . Since is a non-singular Bezoutian, its inverse is a Hankel matrix, hence there is with . This vector satisfies . Defining we get . ∎∎
We will soon see that even . Before that we show:
Lemma 10**.**
If satisfy (23) and are given by (47), then the canonical representation (43) holds.
Proof.
We show that for
[TABLE]
is equal to . In view of (36) and we have and hence
[TABLE]
Putting in (48) we find
[TABLE]
where are the Lagrange interpolation polynomials defined in (8). Letting we obtain
[TABLE]
i.e., the matrix with entries
[TABLE]
is a right inverse of the transposed Vandermonde matrix . Since is a square matrix, is also left inverse to , so that for
[TABLE]
So the Hankel matrix is inverse to the Bezoutian and thus for . ∎∎
As a consequence of this lemma and (44) we get that is similar to a diagonal matrix with positive diagonal elements and therefore positive definite. We have thus proved that and hence the surjectivity of the Nesterov mapping .
Lemma 11**.**
If satisfy (24) and are given by (47) (where is the leading coefficient of ), then the canonical representation (42) holds.
Proof.
We set
[TABLE]
and show that for all . Note that in view of (36) and
[TABLE]
Setting in (48) we get
[TABLE]
where are the Lagrange interpolation polynomials defined in (18). Putting and using (50) we obtain
[TABLE]
which can be rewritten as
[TABLE]
where
[TABLE]
Since the are polynomials of degree , comparing coefficients of in (51) yields that
[TABLE]
Defining for this can be written as
[TABLE]
We also have (see (53) and (54))
[TABLE]
Since is the leading coefficient of the monic polynomial we get
[TABLE]
This, together with the equations (52), (55), and (56), implies that the matrix with entries (, ) is right inverse to the matrix
[TABLE]
Since both matrices are square, is also a left inverse. Inserting the definitions of , taking into account that , and recalling the definition (49) of , we find for
[TABLE]
Hence the Hankel matrix is the inverse of the Bezoutian and thus for . ∎∎
Lemma 12**.**
Let be any moment vector such that . If (42) and (43) are satisfied for and positive numbers , then also (23), (24), and (47) hold.
Proof.
Since is a Bezoutian, there are monic polynomials and with for some constant (that will turn out to be the leading coefficient of ). Since the Bezoutian does not change if we add a multiple of to , we can assume that . Now we have
[TABLE]
Starting from
[TABLE]
plugging in
[TABLE]
and reversing the above computations we arrive at
[TABLE]
By (59), the polynomial vanishes for and has leading coefficient , hence
[TABLE]
For fixed , the polynomial
[TABLE]
vanishes for and has leading coefficient , hence we find
[TABLE]
By (61), the coefficient of in the polynomial vanishes, hence
[TABLE]
for , so that
[TABLE]
By (63) we therefore have
[TABLE]
and in particular (see (57))
[TABLE]
Hence is the leading coefficient of and we can write . If we put now
[TABLE]
and recall that , we find
[TABLE]
i.e., is a real rational function of the form (2) having the critical points . Also and in (2) since and thus is strictly increasing in each interval without poles. Using Lemma 4 and Lemma 5 we conclude that (24) is satisfied. Consequently, the function is of the form (1) with , and has the same critical points as . By Lemma 3 and Lemma 5, the equilibrium equation (23) is satisfied. From equation (60) with we get
[TABLE]
and from (59) with we obtain
[TABLE]
Finally, by (62) we have , so that (47) has been verified. ∎∎
Lemma 13**.**
The Nesterov mapping is bijective.
Proof.
The surjectivity of has already been shown. To prove its injectivity we assume that and show that is unique. By Theorem 3 there are numbers and such that
[TABLE]
By Lemma 12, is a solution of the equilibrium equation (24). Since this equation has a unique solution by Lemma 6, we conclude that are uniquely determined. Lemma 12 tells us that (47) holds, from which we get unique values of and . Finally, (64) determines uniquely. ∎∎
With the preceding lemma the proof of Theorem 4 has been completed.
5 Concluding remarks
In this paper we have established a one-to-one relation between three problems:
- (i)
determining a finite Blaschke product from its critical points,
- (ii)
finding the equilibrium position of moveable unit charges on the real line in an electric field generated by a special configuration of negative point charges,
- (iii)
solving the moment problems (42), (43).
Algorithmically, the last problem requires the Vandermonde factorization of the associated Hankel matrix, but it is difficult to exploit this for solving problems (i) or (ii). While the transition between (i) and (ii) in both directions is based on simple transformations, the translation of (ii) into (iii) needs the construction of with for a positive polynomial with zeros at the given critical points . Hence we have to invert the Nesterov mapping . This can be interpreted as finding a positive definite Bezoutian with prescribed anti-diagonal sums . Though this problem has a unique solution for every , we are not aware of an efficient procedure to find it. On the other hand, can be computed indirectly, if problem (ii) can be solved: starting with (a coefficient vector of) a positive polynomial , find the corresponding equilibrium positions , and compute and the weights from (47). The solution is then obtained by evaluating (42).
We shall discuss these algorithmic and numerical aspects in more detail in a forthcoming paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. I. Ahiezer and M. Kreĭn, Some Question in the Theory of Moments , Translations of mathematical monographs, vol. 2, American Mathematical Society, 1962.
- 2[2] A. Barvinok, A Course in Convexity , Graduate Studies in Mathematics, vol. 54, American Mathematical Society, 2002.
- 3[3] T. Bousch, Sur quelques problèmes de dynamique holomorphe , Ph.D. thesis, Université Paris 11, Orsay, 1992.
- 4[4] St. Boyd and L. Vandenberghe, Convex Optimization , Cambridge University Press, 2004.
- 5[5] D. K. Dimitrov and W. Van Assche, Lamé differential equations and electrostatics , Proc. Amer. Math. Soc. 128 (2000), 3621–3628.
- 6[6] , Erratum to ”Lamé differential equations and electrostatics” , Proc. Amer. Math. Soc. 131 (2003), no. 7, 2303.
- 7[7] P. A. Fuhrmann, A Polynomial Approach to Linear Algebra , Springer, 2012.
- 8[8] P. Gorkin and R.C. Rhoades, Boundary interpolation by finite Blaschke products , Constr. Approx. 27 (2008), 75–98.
